Yellow Pages Business Details Scraper 🚀

Scraping Business Details with multi processing concept from https://www.yellowpages.com where the (Keyword, place, Count) using Python and LXML to CSV file.

Getting Started

These instructions will get you a copy of the project up and running on your local machine for development and testing purposes.

Fields

Extracting possible fields from the search page and specific business card details in search page:

From search page	From business page
ID	Email
Business name	Years in business
Phone	General info
page(href)	Category
Address	Neighborhoods
Website	Services
Rating

Libraries

This script built using Python 3 and:

requests -- For calling Yellow Pages URLs
lxml -- To convert the HTML to string
unicodecsv -- Export the data to CSV file
argparse -- Handling arguments passes to script
math -- Calculate to get page number
urllib3 -- Remove https error
multiprocessing -- To use multi process to finish the script faster
time -- Calculate to getting time spent to finish

How to run the script

You Need to run the script name followed by the positional arguments keyword and place and count, the script working well with small/capital cases 👍 count argument is count of business cards in the search page example used to looping on all business cards related the keyword and place Here is an example to find the business details for a digital agency in Los Angeles, CA.

python yellow_pages.py digital+agency Los+Angeles,+CA 64

Sample Output

This will create a CSV file: Sample output

Name		Name	Last commit message	Last commit date
Latest commit History 16 Commits
.github/ISSUE_TEMPLATE		.github/ISSUE_TEMPLATE
.gitignore		.gitignore
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
README.md		README.md
digital+agency-Los+Angeles,+CA.csv		digital+agency-Los+Angeles,+CA.csv
yellow_pages.py		yellow_pages.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Yellow Pages Business Details Scraper 🚀

Getting Started

Fields

Libraries

How to run the script

Sample Output

Copyright and license

About

Releases

Packages

Languages

License

abdelrhman-arnos/yellowpages-scraper

Folders and files

Latest commit

History

Repository files navigation

Yellow Pages Business Details Scraper 🚀

Getting Started

Fields

Libraries

How to run the script

Sample Output

Copyright and license

About

Topics

Resources

License

Code of conduct

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages