This is a webscraper to download the FREE Springer textbooks from the Springer website. These textbooks are available until July.
This program differs from other projects. Namely, that it does not download the .epub
files -- only the PDF. It also is a webscraper, unlike the leading popular Python port of this type of program which downloads the scripts based on their url.
-
Currently the books inside here are all the
.pdf
books in the Excel spreadsheet. If you would like to only download certain books, add links from theOpenURL
tab in the Excel spreadsheet into thebooks.txt
file. -
run
pip install -r requirements.txt
to install BeautifulSoup and the wget dependency. -
run
python3 download.py
to begin the download and see the output. -
open the
ebooks
folder that is created and your downloads should be present.
Windows, macOS, or Linux
On each download, this program renames the filename to the title of the respective textbook.
This is fine, but when a /
exists there is an error. This scraper modifies the title to a /
to remove the error.
- continuing to the next link after failure:
Thanks to Teal Dulcet for coming up with a better /
solution than mine, as well as alerting me to an issue regarding duplicate book titles + authors
which caused some books to be overwritten.