- Scrapes articles on multiple topics at once;
- Saves a formatted Word file containing the scraped headers and text;
- Uses Python 3, wikipedia, BeautifulSoup, requests, python-docx and more to scrape and save documents efficiently;
- Uses AWS for processing of documents.
- Save in multiple formats (PDF, etc.).
- Multithreading across topics to reduce scraping time.
- Output reference links from Wikipedia.
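To illustrate the multithreading item in the list above, here is a minimal sketch of fetching several topics in parallel with the standard library. It is not Articulate's actual implementation: `fetch_article` and `fetch_all` are hypothetical names, and `fetch_article` is a stub standing in for the real scraping step.

```python
from concurrent.futures import ThreadPoolExecutor


def fetch_article(topic: str) -> str:
    """Hypothetical stand-in for the real scraping step (not Articulate's code)."""
    return f"Article text for {topic}"


def fetch_all(topics: list[str], max_workers: int = 4) -> dict[str, str]:
    """Fetch each topic on a worker thread and return a topic -> text mapping."""
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        # pool.map returns results in the same order as the input topics.
        texts = list(pool.map(fetch_article, topics))
    return dict(zip(topics, texts))


if __name__ == "__main__":
    print(fetch_all(["Python", "Web scraping"]))
```

Threads suit this kind of work because scraping is mostly waiting on network responses, so the requests for different topics can overlap.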
https://akashnotes.com/articulate.html
Articulate is a tool for preparing articles on one or more topics at once. It automatically saves the articles to an MS Word file with light formatting. Novice developers are welcome to modify the code and contribute to the repository.
- Install dependencies
pip install -r requirements.txt
This software can be run in two ways:
- Using CLI
- Using API
Run the following command after installing the requirements:
python driver.py --config CLI
This starts the software in CLI mode.
The output file is saved in the current directory as articulate.docx.
The following snippet shows how the Articulate API is used in the web interface:
from driver import MainDriver
md_obj = MainDriver()
# Put your string in place of "input_string"
returned_file = md_obj.article_maker("input_string")
Here, returned_file is the Word file containing the articles.