-
Notifications
You must be signed in to change notification settings - Fork 38
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
How to allow extracting YouTube videos or <iframe> tags? #93
Comments
Hi @cayolblake - welcome 👋 . Are you using the Python library (uses our HTML simplification code) or calling from the command line with default options (uses Mozilla's Readability.js package to simplify the HTML)? Thanks. |
I'm using the Python library :) |
@cayolblake I'm afraid that, if this isn't fixed by updating to the latest version of Mozilla's I think for your use case, adding support in our python HTML simplifier won't be enough, as we're not currently as good as Readability.js in stripping out non-content elements so I think would not be suitable for you even if it did contain images and video tags. I'm tagging this with a |
Linked to issue #31, which considers |
That's perfectly understood. I'm planning to take a dive into your project and understand how it works - any docs that can help explaining/simplifying things further would be appreciated - hopefully after doing so I'm be able to find that best candidate for applying modification if possible. I think the Have you thought about splitting your own simplifier and the readability wrapper/utilizer in two different projects? I guess that could highlight more healthy focus on your own simplifier while still having something that works on its own dependably and may be use it as a reference or a benchmark? Just a humble thought 🤔 |
Hello,
Is there a way to allow extracting YouTube video and iframe tags similar to how image extraction is done?
The text was updated successfully, but these errors were encountered: