Standford Encyclopedia of Philosopfy scraper

Scrap SEP entry

Download the article content of a entry in SEP with link of the entry

What I need

You need the folowing librarys instaled via pip:

requests validators

Also need:

lxml with html

Usage:

python sep.py <standford entry link>

Whit name only:

If you use a *nix system make executable and ruin it.

chmod +x sep.py

./sep.py <standford entry link>

Results:

The script puts the content of the article in a sigle file named out.html. Is html raw without header or body content, only the <div> section with the article alone.

Example:

python sep.py https://plato.stanford.edu/entries/descartes/

pandoc -t latex --latex-engine=xelatex -f html -o Descartes.pdf out.html

What I cand do with this?

You can convert the html file to other formats vìa pandoc. Nice results with pdf latex output format. If you can't convert use xelatex engine.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

Standford Encyclopedia of Philosopfy scraper

What I need

Also need:

Usage:

Whit name only:

Results:

Example:

What I cand do with this?

Files

README.md

Latest commit

History

README.md

File metadata and controls

Standford Encyclopedia of Philosopfy scraper

What I need

Also need:

Usage:

Whit name only:

Results:

Example:

What I cand do with this?