Skip to content

Web Scraping to get data from the FSRA Public Register in ADGM

Notifications You must be signed in to change notification settings

cr2007/ADGM-Web-Scraping

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

19 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

ADGM Web Scraping Project

Open in GitHub Codespaces

This project is designed to scrape data from the FSRA Public Register in Abu Dhabi Global Market.

The scraped data is parsed and then stored in a DataFrame, and finally exported to a CSV file.

Installation

Make sure that you have the latest version of Python installed.

Clone the repository

git clone https://github.com/cr2007/adgm-web-scraping
cd adgm-web-scraping

Optionally, you can also download the repository as a ZIP file.

Create and activate a virtual environment

To keep dependencies isolated, it's recommended to create a virtual environment:

On macOS/Linux

python3 -m venv .venv
source .venv/bin/activate

On Windows

python -m venv .venv
.venv\Scripts\activate

Install Dependencies

Once the virtual environment is activated, install the required Python packages by running:

pip install -r requirements.txt

Environment Variables

Create a .env file in the root directory and add the necessary environment variables.

Check out sample.env for the variables required.

Running the Code

To run the scraping script, execute the following command:

# For macOS/Linux
python3 main.py

# For Windows
python main.py

Deactivating the Virtual Environment

After you are done, deactivate the virtual environment by running:

decativate

Customizing the Code

  • Modify the COMPANY_NAME_SPECIAL_CASES dictionary to handle any specific company names that need special formatting.

About

Web Scraping to get data from the FSRA Public Register in ADGM

Topics

Resources

Stars

Watchers

Forks

Languages