Bronya is a crawler for ecommerce data.
The project is a work in progress, so expect major changes and additions. The master branch should always be ready to use; major changes and features are introduced in feature branches.
- Python 3.11+
- Poetry for dependency management
- SQLAlchemy ORM with alembic migrations
- RabbitMQ integrated via pika
- configuration via ENV variables and/or a .env file
- single file for each class
- Docker-ready (see here)
- PM2-ready
- supports single-IP/rotating proxy config out of the box (see here)
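The ENV/.env configuration mentioned in the list above could look roughly like the following. This is a minimal, standard-library-only sketch, not the project's actual loader: the variable names `DB_HOST`, `RABBITMQ_HOST` and `PROXY_URL`, the `Settings` class, and the rule that real environment variables take precedence over the .env file are all assumptions made for illustration. Check .env.example for the names the project really uses.

```python
import os
from dataclasses import dataclass
from pathlib import Path
from typing import Mapping, Optional


def read_dotenv(path):
    """Parse simple KEY=VALUE lines, skipping blank lines and # comments."""
    values = {}
    dotenv = Path(path)
    if not dotenv.is_file():
        return values
    for raw in dotenv.read_text(encoding="utf-8").splitlines():
        line = raw.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        # Drop optional surrounding quotes around the value.
        values[key.strip()] = value.strip().strip("'\"")
    return values


@dataclass(frozen=True)
class Settings:
    # Hypothetical setting names, chosen only for this example.
    db_host: str
    rabbitmq_host: str
    proxy_url: Optional[str]


def load_settings(dotenv_path=".env", environ: Optional[Mapping[str, str]] = None):
    """Merge .env values with the environment; the environment wins (assumed)."""
    environ = os.environ if environ is None else environ
    merged = {**read_dotenv(dotenv_path), **environ}
    return Settings(
        db_host=merged.get("DB_HOST", "localhost"),
        rabbitmq_host=merged.get("RABBITMQ_HOST", "localhost"),
        proxy_url=merged.get("PROXY_URL") or None,  # empty string means "no proxy"
    )
```

Calling `load_settings()` with no arguments reads `.env` from the current directory and falls back to the defaults when neither source defines a value.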
To create and run a new Scrapy project using this boilerplate, you need to:
- Clone the repository.
- `cp .env.example .env`
- No docker:
  - Have the following prerequisites: Python 3.11+, Poetry, mysqlclient libraries, etc.
  - `poetry install`
  - `poetry shell`
  - `scrapy`
- Docker:
  - Have the following prerequisites: Docker, Docker Compose
  - `docker compose up -d database python`
  - `docker compose exec python bash`
  - `cd /var/app/python/src/`
  - `poetry shell`
  - `scrapy`
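Before following either path above, it can help to confirm that the prerequisites are actually installed. The helper below is a hypothetical sketch and is not shipped with the project: the function name `missing_prerequisites` and its parameters are made up for illustration, and it only checks the interpreter version and whether the named executables are on `PATH`. It does not check for the mysqlclient libraries.

```python
import shutil
import sys


def missing_prerequisites(tools, min_python=(3, 11), which=shutil.which, version=None):
    """Return a list of human-readable names for whatever is missing."""
    version = sys.version_info if version is None else version
    problems = []
    # Compare only (major, minor) against the required minimum.
    if tuple(version[:2]) < tuple(min_python):
        problems.append(f"python {min_python[0]}.{min_python[1]}+")
    # A tool counts as missing when it cannot be found on PATH.
    problems.extend(tool for tool in tools if which(tool) is None)
    return problems


# No-docker path: missing_prerequisites(["poetry"])
# Docker path:    missing_prerequisites(["docker"], min_python=(0, 0))
```

An empty list means everything checked was found. The `which` and `version` parameters exist only so the check can be exercised without touching the real system.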