Add MaxDocumentLength and custom UserAgent support #13

Open
wants to merge 6 commits into
base: master

nightbloos

URLs sometimes point to very large pages or files, so we want to be able to abort reading the response body for them.

Alexandr Filioglo added 5 commits August 7, 2020 13:18
- added support for og:type
- fixed incorrect handling of relative paths
- changed the Content-Length check in HEAD requests to be "silent"
…o 2 different functions `GetDocument` and `ParseDocument` (#2)
Because we "re-create" the link from only the scheme, host, and path, there is a risk of losing other data from the original link.
Previously, `/some/path.png?param=value` was transformed into `http://mydomain.com/some/path.png`.
Now this issue should be fixed, and the output should be `http://mydomain.com/some/path.png?param=value`.
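
For illustration, the standard library's `net/url` can resolve a relative link against a base URL without dropping the query string. This is a minimal sketch, not necessarily how this PR implements the fix; `resolveLink` and the base URL `http://mydomain.com/page` are assumptions made for the example.

```go
package main

import (
	"fmt"
	"net/url"
)

// resolveLink resolves ref against base. The query string and fragment
// of ref are preserved because the whole reference is resolved, instead
// of rebuilding the link from scheme, host, and path alone.
func resolveLink(base, ref string) (string, error) {
	b, err := url.Parse(base)
	if err != nil {
		return "", err
	}
	r, err := url.Parse(ref)
	if err != nil {
		return "", err
	}
	return b.ResolveReference(r).String(), nil
}

func main() {
	out, err := resolveLink("http://mydomain.com/page", "/some/path.png?param=value")
	if err != nil {
		fmt.Println("error:", err)
		return
	}
	fmt.Println(out) // http://mydomain.com/some/path.png?param=value
}
```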
For some URLs we found that, for an unclear reason, we could not get the `og:type` data.
YouTube links were one example.
It turned out that YouTube keeps its metadata in the body (and not in the head, as most other services do).
Previously, the condition for breaking out of the token-processing loop was "we have title + description + ogImage and we have passed the head", so we could not process any other optional meta tags once we had passed the head.

Now we can control how many tokens are processed before breaking out of the loop (or break earlier if the required optional fields have already been found).
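
The stopping rule described above can be sketched roughly as follows. This is a simplified illustration and not the PR's code: `metaTag`, `preview`, `collect`, and `maxTokens` are hypothetical names, and a real implementation would walk an HTML tokenizer rather than a slice.

```go
package main

import "fmt"

// metaTag is a hypothetical, simplified stand-in for a parsed <meta> token.
type metaTag struct {
	Property string
	Content  string
}

// preview holds the fields mentioned in the discussion above.
type preview struct {
	Title, Description, Image, Type string
}

// complete reports whether every field, including the optional og:type, is set.
func (p preview) complete() bool {
	return p.Title != "" && p.Description != "" && p.Image != "" && p.Type != ""
}

// collect scans tokens until maxTokens have been processed or all fields
// are found, whichever comes first. It returns the preview and the number
// of tokens that were actually processed.
func collect(tokens []metaTag, maxTokens int) (preview, int) {
	var p preview
	processed := 0
	for _, t := range tokens {
		if processed >= maxTokens || p.complete() {
			break
		}
		processed++
		switch t.Property {
		case "og:title":
			p.Title = t.Content
		case "og:description":
			p.Description = t.Content
		case "og:image":
			p.Image = t.Content
		case "og:type":
			p.Type = t.Content
		}
	}
	return p, processed
}

func main() {
	tokens := []metaTag{
		{"og:title", "A video"},
		{"og:description", "Some description"},
		{"og:image", "http://mydomain.com/some/path.png"},
		{"other", ""},
		{"og:type", "video"},
		{"other", ""},
	}
	p, n := collect(tokens, 100)
	fmt.Println(p.Type, n)
}
```

With a budget like this, metadata that appears after the head can still be picked up, while very long documents stop being scanned once the budget runs out.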
2 participants