Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

First steps towards a CG-based UD parser; point to the lexicon-proofreading-effort in the docs; some corrections in puupankki #16

Open
wants to merge 151 commits into
base: main
Choose a base branch
from

Conversation

IlnarSelimcan
Copy link
Member

No description provided.

…n preceedes the digit

In UD treebank, in the phrase 'Еуровидение 2010 ән конкурсы' ('Eurovison 2020 song contest'),
2020 received <num><ord> analysis. Apparently that's what expected, hence this change.
…h A1 and ADV unless you know what you're doing; add few more disambiguation rules to fully disambiguage sent apertium#4 of kazakh ud treebank
… the analyser (doing that seemed useful for UD parsing, but now I realize that this change will cause a lot of trouble for machine-translators
біз noun is 'шило' in russian, but pronoun reading is much more frequent so made default
…pankki it is followed by да<postadv> a lot (whose head is сондықтан), but the validator won't allow an advmod child on a SCONJ :/
…nd), disambiguation and parsing. Will need to be cleaned up some before replacing the apertium-kaz.kaz.rlx. All to be used with apertiumpp-kaz's kaz.lexc and Makefile.am
…but probably other vadjes should have that subtag as well
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

5 participants