Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

No morphology on numbers/punctuation #2

Open
MemduhG opened this issue May 10, 2018 · 2 comments
Open

No morphology on numbers/punctuation #2

MemduhG opened this issue May 10, 2018 · 2 comments
Labels

Comments

@MemduhG
Copy link
Contributor

MemduhG commented May 10, 2018

quoting @koguzhan from the wiki:

Suffixes after Numbers or characters like " << >> are currently not analyzed at all

Copying the CRH or TUR solution might work, though I'm not sure how uyghur usually puts numbers and morphology.

@MemduhG MemduhG added the FST label May 10, 2018
@MemduhG
Copy link
Contributor Author

MemduhG commented Aug 8, 2018

The standard way seems to leave a space after the number, punctuation or similar "word." I believe English organization names such as the BBC have a similar issue, what else can you think of @koguzhan?

@koghuzhan
Copy link
Collaborator

%, abbreviations in general. Uyghur mostly uses « » instead of " " btw so we'd need to add that too.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

2 participants