<p>Mechanize combined with Beautiful Soup are often a good combo. YMMV</p>
<p>- Roger</p>
<div class="gmail_quote">On Apr 23, 2011 9:51 PM, <<a href="mailto:trideceth12@gawab.com">trideceth12@gawab.com</a>> wrote:<br type="attribution">> Hi all,<br>> <br>> Can anyone recommend me a python package for handling webscraping<br>
> operations. I need to be able to log-in to an https site and crawl from<br>> there.<br>> <br>> I have been trying to use HtmlUnit for java and have seen some people<br>> using HtmlUnit and Jython, but so far HtmlUnit seems a bit flaky -<br>
> retaining logged-in status on some sites, not on others.<br>> <br>> Is this really so hard???? I'm sure this must be a common operation.<br>> <br>> Thanks in advance,<br>> Jake<br>> <br>> <br>
> <br>> _______________________________________________<br>> python-au maillist - <a href="mailto:python-au@starship.python.net">python-au@starship.python.net</a><br>> <a href="http://starship.python.net/mailman/listinfo/python-au">http://starship.python.net/mailman/listinfo/python-au</a><br>
</div>