Google Scholar Literature Scraper

This library is designed to scrape, store, and process bibliographic data from Google Scholar. It consists of two main components:

data_scraper.py: Scrapes academic data and saves each entry as a pickle file.
data_handler.py: Reads and processes the stored pickle files, extracting relevant metadata and generating structured outputs.

Installation

Ensure you have Python 3 installed and install scholarly.

Modify config.json to add the desired queries before running the scraper. This file should contain the search terms or parameters you want to use when collecting data.
Run data_scraper.py script to collect and store academic data into pickle files.

python data_scraper.py

python data_handler.py

This will generate structured outputs in multiple formats:

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
README.md		README.md
config.json		config.json
data_handler.py		data_handler.py
data_scraper.py		data_scraper.py