Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Infinite crawling #41

Open
PriyaJainDev opened this issue May 15, 2020 · 1 comment
Open

Infinite crawling #41

PriyaJainDev opened this issue May 15, 2020 · 1 comment
Labels
waiting for response waiting for issue owner response

Comments

@PriyaJainDev
Copy link

PriyaJainDev commented May 15, 2020

If I need to scroll through, say 10 LinkedIn profile URLs, the crawler after successfully crawling through the 10 URL links provided into the rootProfiles, keep on executing. It keeps on returning the logger as:

2020-05-15T19:53:36.399Z info: starting scraping: undefined show urls undefined 2020-05-15T19:53:36.401Z error: error on crawling profile: undefined TypeError: Cannot read property 'indexOf' of undefined 2020-05-15T19:53:37.400Z info: starting scraping: undefined show urls undefined 2020-05-15T19:53:37.401Z error: error on crawling profile: undefined TypeError: Cannot read property 'indexOf' of undefined 2020-05-15T19:53:38.400Z info: starting scraping: undefined show urls undefined
again and again even after profiles has been saved.

Is there any mechanism to stop the script automatically after the number of profile matching and fetching has been done?

Because this infinite crawling might consume up the resource much more than required.

@leonardiwagner
Copy link
Member

Yes, you could implement code on crawler.js to check if it has more profiles to crawl

PR is welcomed, thanks

@leonardiwagner leonardiwagner added the waiting for response waiting for issue owner response label Jun 9, 2020
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
waiting for response waiting for issue owner response
Projects
None yet
Development

No branches or pull requests

2 participants