ValueError: empty vocabulary #79
Comments
I'm trying to reproduce this issue on Windows 10, but haven't been able to. JobFunnel seems to work fine on my Windows 10 installation. |
Yea, no problem.
settings.yaml
|
Awesome! Thanks so much for all these details! |
As a quick and dirty fix, could you comment out the GlassDoor scraper from the settings file?
in the settings file look like this:
Does that fix it? |
I was able to reproduce! Can confirm this bug only affects the
You will change the
Now your entire
Following these steps should fix your problem. It is unlikely that the
Sorry we don't have clear documentation on these changes. I will make sure to update the README in the next PR to make this clear to users. Hope this works! |
Awesome! I will try this as soon as possible. |
Issue Template
Description
Standard search produces web scrape error
Steps to Reproduce
Standard search with
Expected behavior
Results of query
Actual behavior
No loglevel
query_words is empty, so the vectorizer cannot fit_transform it
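To illustrate the failure mode, here is a minimal sketch of a pre-check that could run before the vectorizer is fitted. It assumes the vectorizer uses scikit-learn's default token pattern (tokens of two or more word characters); `tokens` and `has_vocabulary` are hypothetical helper names and are not part of JobFunnel.

```python
import re
from typing import Iterable, List

# Assumption: this mirrors scikit-learn's default token_pattern, which
# keeps only tokens made of two or more word characters.
TOKEN_PATTERN = re.compile(r"(?u)\b\w\w+\b")


def tokens(text: str) -> List[str]:
    """Return the lowercase tokens a default vectorizer would likely keep."""
    return TOKEN_PATTERN.findall(text.lower())


def has_vocabulary(documents: Iterable[str]) -> bool:
    """Return True if at least one document yields at least one token."""
    return any(tokens(doc) for doc in documents)


if __name__ == "__main__":
    query_words: List[str] = []  # what the scraper produced in this report
    if not has_vocabulary(query_words):
        # Skipping the vectorizer here avoids "ValueError: empty vocabulary".
        print("No scraped text to vectorize; skipping duplicate filtering.")
```

This sketch does not account for stop-word removal, which can also leave the vocabulary empty, so it would only catch the case where scraping returned no usable text at all.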
Debug Loglevel
webdriver manager returning 404 errors?
Variable Contents
prev_dict
cur_dict.values()
query_ids
query_words
Environment
beautifulsoup4>=4.6.3 (4.9.1)
lxml>=4.2.4 (4.5.1)
requests>=2.19.1 (2.23.0)
python-dateutil>=2.8.0 (2.8.1)
PyYAML>=5.1 (5.3.1)
scikit-learn>=0.21.2 (0.23.1)
nltk>=3.4.1 (3.5)
scipy>=1.4.1 (1.4.1)
selenium>=3.141.0 (3.141.0)
webdriver-manager>=2.4.0 (3.1.0)
soupsieve>1.2 (2.0.1)
certifi>=2017.4.17 (2020.4.5.2)
urllib3!=1.25.0,!=1.25.1,<1.26,>=1.21.1 (1.25.9)
chardet<4,>=3.0.2 (3.0.4)
idna<3,>=2.5 (2.9)
six>=1.5 (1.15.0)
threadpoolctl>=2.0.0 (2.1.0)
joblib>=0.11 (0.15.1)
numpy>=1.13.3 (1.18.5)
click (7.1.2)
tqdm (4.46.1)
atomicwrites>=1.0 (1.4.0)
packaging (20.4)
pluggy<1.0,>=0.12 (0.13.1)