This project is designed to scrape data from the FSRA Public Register in Abu Dhabi Global Market.
The scraped data is parsed and then stored in a DataFrame, and finally exported to a CSV file.
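As a rough illustration of the last two steps (DataFrame, then CSV), here is a minimal sketch. It assumes the DataFrame is a pandas DataFrame; the record fields and values are hypothetical placeholders, not the actual columns scraped from the register.

```python
import io

import pandas as pd

# Hypothetical parsed records; the real scraper's fields may differ.
records = [
    {"company_name": "Example Firm Ltd", "status": "Active"},
    {"company_name": "Sample Advisors Ltd", "status": "Withdrawn"},
]

# Store the parsed records in a DataFrame, one row per record.
df = pd.DataFrame(records)

# Export to CSV. An in-memory buffer is used here so the sketch leaves no
# files behind; passing a file path to to_csv() writes to disk instead.
buffer = io.StringIO()
df.to_csv(buffer, index=False)
csv_text = buffer.getvalue()
```

Passing `index=False` keeps the row index out of the exported file, so the CSV contains only the scraped columns.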
Make sure you have a recent version of Python 3 installed.
git clone https://github.com/cr2007/adgm-web-scraping
cd adgm-web-scraping
Optionally, you can also download the repository as a ZIP file.
To keep dependencies isolated, it's recommended to create a virtual environment:
# For macOS/Linux
python3 -m venv .venv
source .venv/bin/activate
# For Windows
python -m venv .venv
.venv\Scripts\activate
Once the virtual environment is activated, install the required Python packages by running:
pip install -r requirements.txt
Create a .env file in the root directory and add the necessary environment variables.
Check out sample.env for the variables required.
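For reference, a .env file is a plain-text list of KEY=VALUE lines. The sketch below shows one way such a file could be read using only the standard library. The variable names are hypothetical (see sample.env for the real ones), and the project itself may load the file differently, for example with a library such as python-dotenv.

```python
import io


def parse_env(stream):
    """Parse KEY=VALUE lines, skipping blank lines and # comments."""
    values = {}
    for raw_line in stream:
        line = raw_line.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        # Drop surrounding whitespace and optional quotes around the value.
        values[key.strip()] = value.strip().strip('"').strip("'")
    return values


# Hypothetical file contents; the real variable names are in sample.env.
sample = io.StringIO(
    "# Example settings\n"
    'EXAMPLE_URL="https://example.com"\n'
    "EXAMPLE_TIMEOUT=30\n"
)
config = parse_env(sample)
```

To read an actual file, open it and pass the file object in place of the `io.StringIO` buffer. All values come back as strings, so numeric settings need converting before use.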
To run the scraping script, execute the following command:
# For macOS/Linux
python3 main.py
# For Windows
python main.py
After you are done, deactivate the virtual environment by running:
deactivate
- Modify the COMPANY_NAME_SPECIAL_CASES dictionary to handle any specific company names that need special formatting.
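
A minimal sketch of how such a dictionary might be applied is shown below. It assumes the dictionary maps a lowercased company name to its preferred spelling. The entries and the `format_company_name` helper are hypothetical, so check the project source for the actual structure before editing.

```python
# Hypothetical entries; the real dictionary is defined in the project source.
COMPANY_NAME_SPECIAL_CASES = {
    "example holdings llc": "Example Holdings LLC",
    "abc capital (me) limited": "ABC Capital (ME) Limited",
}


def format_company_name(raw_name):
    """Return the special-case spelling if one exists, else title-case the name."""
    # Collapse repeated whitespace so lookups are not sensitive to spacing.
    cleaned = " ".join(raw_name.split())
    return COMPANY_NAME_SPECIAL_CASES.get(cleaned.lower(), cleaned.title())


special = format_company_name("  abc capital (me)  limited ")
default = format_company_name("sample advisors ltd")
```

Names without an entry fall back to plain title-casing, so only the exceptions (acronyms, unusual capitalisation) need to be listed in the dictionary.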