-
Notifications
You must be signed in to change notification settings - Fork 151
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
StackOverflowError when parsing certain html #135
Comments
I'm getting this as well. Digging through logs now to find some example data... |
Can you provide a failing gist please? |
https://gist.github.com/retnuh/9747891f2d1fb74e787b I've stripped down the clojure to more or less bare bones, but haven't had time to dig through the HTML file. I at first thought it might be the STYLE tag outside the HTML tag, but a stripped down version (i.e. most of the body removed) works okay. bad2.html also triggers StackOverflowError, and it happens much more quickly. |
Thanks Hunter.
The snippet seems alright, but the html file is too large for us to investigate it. It'd be greatly helpful if you could track down where exactly it blows up. Alternatively, try using JSoup as a parser as it is more robust than TagSoup. |
Using enlive when reading certain urls gives me a StackOverflowError, with these parts of the stacktrace repeated over and over:
Any way to avoid this? Are we just naively recurring somewhere? Can this be turned into a
loop/recur
?Thank you!
The text was updated successfully, but these errors were encountered: