Use lxml instead of ugly HTMLParser #3
Comments
I'm a big fan of BeautifulSoup, even nicer than lxml. I could show you ;-)
lxml is gloriously victorious in all benchmarks I read yesterday, beating
On 24 June 2014 10:00, Christophe Gueret [email protected] wrote:
Ok! Enjoy it then :-)
Although, from what I read now, lxml is the parser of BeautifulSoup, A.
On 24 June 2014 10:02, Albert Meroño Peñuela [email protected]
Here is a one-liner to find the first h1 in a file with BeautifulSoup:
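The snippet itself did not survive in this thread. A minimal sketch of what such a lookup could look like, assuming the `bs4` package is installed; `first_h1` and `sample` are hypothetical names used only for illustration, not taken from the original comment:

```python
from bs4 import BeautifulSoup


def first_h1(html):
    """Return the text of the first <h1> in an HTML string, or None if there is none."""
    # "html.parser" is the standard-library backend; "lxml" can be passed
    # instead when that package is installed.
    h1 = BeautifulSoup(html, "html.parser").find("h1")
    return h1.get_text(strip=True) if h1 is not None else None


# Hypothetical sample document, for illustration only.
sample = "<html><body><h1>Hello</h1><h1>Second</h1></body></html>"
print(first_h1(sample))
```

For a file on disk, the same call should work with an open file handle in place of the string, since `BeautifulSoup` accepts file objects as well as strings.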
As seen on
On 24 June 2014 10:04, Albert Meroño-Peñuela [email protected]
+1 for BeautifulSoup! It’s beautiful.. eh.. soup!
On 24 Jun 2014, at 10:07, Christophe Gueret [email protected] wrote:
No description provided.