ADGM Web Scraping Project

This project is designed to scrape data from the FSRA Public Register in Abu Dhabi Global Market.

The scraped data is parsed and then stored in a DataFrame, and finally exported to a CSV file.
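
As a rough illustration of that last stage, the sketch below builds a DataFrame from a few already-parsed records and writes it out as CSV. It assumes pandas is the DataFrame library in use. The column names, the sample rows, and the `export_records` helper are hypothetical and are not taken from main.py.

```python
import io

import pandas as pd


def export_records(records, destination):
    """Build a DataFrame from parsed records and write it out as CSV."""
    df = pd.DataFrame(records)
    # index=False keeps the DataFrame's row index out of the CSV.
    df.to_csv(destination, index=False)
    return df


# Hypothetical parsed rows; the real register fields may differ.
records = [
    {"name": "Example Capital Limited", "status": "Active"},
    {"name": "Sample Advisors Ltd", "status": "Withdrawn"},
]

buffer = io.StringIO()  # stands in for a file path such as "output.csv"
df = export_records(records, buffer)
csv_text = buffer.getvalue()
```

Passing a file path instead of the in-memory buffer would write the same content to disk.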

Installation

Make sure you have a recent version of Python 3 installed.

Clone the repository

git clone https://github.com/cr2007/adgm-web-scraping
cd adgm-web-scraping

Alternatively, you can download the repository as a ZIP file and extract it.

Create and activate a virtual environment

To keep dependencies isolated, it's recommended to create a virtual environment:

On macOS/Linux

python3 -m venv .venv
source .venv/bin/activate

On Windows

python -m venv .venv
.venv\Scripts\activate

Install Dependencies

Once the virtual environment is activated, install the required Python packages by running:

pip install -r requirements.txt

Environment Variables

Create a .env file in the root directory and add the necessary environment variables.

Check out sample.env for the variables required.
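
For readers unfamiliar with .env files, here is a minimal sketch of how such a file could be read using only the standard library. It assumes simple `KEY=VALUE` lines. The `load_env_file` helper is hypothetical, and the project itself may rely on a library such as python-dotenv instead.

```python
import os


def load_env_file(path=".env"):
    """Read KEY=VALUE lines from a file into os.environ and return them."""
    loaded = {}
    with open(path, encoding="utf-8") as handle:
        for raw_line in handle:
            line = raw_line.strip()
            # Skip blank lines, comments, and lines without an "=" sign.
            if not line or line.startswith("#") or "=" not in line:
                continue
            key, _, value = line.partition("=")
            key = key.strip()
            # Drop surrounding whitespace and optional quotes.
            value = value.strip().strip("'\"")
            # Variables already set in the shell are not overridden.
            os.environ.setdefault(key, value)
            loaded[key] = value
    return loaded
```

Calling `load_env_file()` early in a script would make the values available through `os.environ`; the actual variable names are the ones listed in sample.env.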

Running the Code

To run the scraping script, execute the following command:

# For macOS/Linux
python3 main.py

# For Windows
python main.py

Deactivating the Virtual Environment

After you are done, deactivate the virtual environment by running:

deactivate

Customizing the Code

Modify the COMPANY_NAME_SPECIAL_CASES dictionary to handle any specific company names that need special formatting.
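
The structure of this dictionary is not shown here, so the following is only a sketch of one way such a lookup could work. It assumes the keys are lowercased raw names and the values are the desired spellings. The entries and the `format_company_name` helper are hypothetical.

```python
# Hypothetical entries: keys are lowercased raw names, values the desired form.
COMPANY_NAME_SPECIAL_CASES = {
    "abc capital llc": "ABC Capital LLC",
    "xyz (me) limited": "XYZ (ME) Limited",
}


def format_company_name(raw_name):
    """Return the special-case spelling if one exists, else a title-cased name."""
    cleaned = " ".join(raw_name.split())  # collapse repeated whitespace
    return COMPANY_NAME_SPECIAL_CASES.get(cleaned.lower(), cleaned.title())
```

A lookup like this is useful because plain title-casing would turn an abbreviation such as "LLC" into "Llc".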
