Cannot parse HTML5 #146
Comments
I made an issue in HTML5ever. If they provide LibXML2 bindings, we can use that. Otherwise, we can use Gumbo.
It might be easier to use Gumbo.jl instead and convert that!
@aminya
Yes! Please check
I still don't know how to do it. How can I convert a
It is not possible directly. It needs some work as mentioned in the issue.
I believe the solution to the "unknown tag name" problem is to pass HTML_PARSE_RECOVER to htmlParseMemory. I believe this is what
Correction: they are warnings, not errors, so they can be ignored by passing noerror=true.
I am trying to parse this HTML using readhtml, but it throws some warnings: a.zip