All notable changes to this project will be documented in this file.
The format is based on Keep a Changelog, and this project adheres to Semantic Versioning.
- Add
exclude_lemmas
option when querying compounds
setup.sh
to install all package requirements- methods for querying the nouns file (see
german_nouns/query
)
- directory structure
- README.md
- wiktionary_de_parser requirement
- updated
create_csv/main.py
to use with latest version of wiktionary_de_parser - updated
nouns.csv
- README
- wiktionary_de_parser as parser script for Wiktionary xml dump
- 'Genus 1' - 'Genus 4' columns
- overall better parsing results due to submodule wiktionary_de_parser
- Python 3.7+ requirement
- compound column due to unprecise results
- remove dashes in empty cells
- info about compound words
- improved parser script (+11 thousand more nouns)
- prefill declination values for adjective nouns
- sorting order in nouns.csv
- refactor parser script
- updated readme
- column 'WikiTitel' to 'lemma'
- refactor code
- improved noun parsing
- fixed dash chracter for non-available singulars or plurals
- initial release