-
Notifications
You must be signed in to change notification settings - Fork 2
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
- Loading branch information
Showing
1 changed file
with
10 additions
and
10 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,10 +1,10 @@ | ||
Running the following version of UD tools: | ||
commit 78ce4b21495c6e4c17a7b07925bec1267d833d14 | ||
commit c1984d97df0ecdcc1b50fbeaa8c96419c6321432 | ||
Author: Dan Zeman <[email protected]> | ||
Date: Sun May 5 09:21:16 2024 +0200 | ||
Date: Sun Nov 10 10:33:45 2024 +0100 | ||
Evaluating the following revision of UD_English-PUD: | ||
commit 76440f59849f4e7ff5aab20aef8b5b320b297285 | ||
Merge: fd6010b 994bf7d | ||
commit 08b35519fb286bbae97b743c49c0f3d3043816ac | ||
Merge: b2378f2 6e328e8 | ||
Author: Dan Zeman <[email protected]> | ||
Size: counted 21180 of 21180 words (nodes). | ||
Size: min(0, log((N/1000)**2)) = 6.10611468034652. | ||
|
@@ -15,13 +15,13 @@ Split: Found at least 10000 test words. | |
Lemmas: source of annotation (from README) factor is 0.4. | ||
Universal POS tags: 17 out of 17 found in the corpus. | ||
Universal POS tags: source of annotation (from README) factor is 1. | ||
Features: 14265 out of 21180 total words have one or more features. | ||
Features: 14323 out of 21180 total words have one or more features. | ||
Features: source of annotation (from README) factor is 0.4. | ||
Universal relations: 35 out of 37 found in the corpus. | ||
Universal relations: source of annotation (from README) factor is 1. | ||
Udapi: | ||
TOTAL 225 | ||
Udapi: found 225 bugs. | ||
TOTAL 214 | ||
Udapi: found 214 bugs. | ||
Udapi: worst expected case (threshold) is one bug per 10 words. There are 21180 words. | ||
Genres: found 2 out of 17 known. | ||
/net/work/people/zeman/unidep/tools/validate.py --lang en --max-err=10 UD_English-PUD/en_pud-ud-test.conllu | ||
|
@@ -33,8 +33,8 @@ Validity: 1 | |
(weight=0.256410256410256) * (score{size}=0.441975318590489) = 0.113327004766792 | ||
(weight=0.0512820512820513) * (score{split}=0.34) = 0.0174358974358974 | ||
(weight=0.0769230769230769) * (score{tags}=1) = 0.0769230769230769 | ||
(weight=0.307692307692308) * (score{udapi}=0.893767705382436) = 0.27500544780998 | ||
(weight=0.307692307692308) * (score{udapi}=0.898961284230406) = 0.276603472070894 | ||
(weight=0.0769230769230769) * (score{udeprels}=0.945945945945946) = 0.0727650727650728 | ||
(TOTAL score=0.626044734994937) * (availability=1) * (validity=1) = 0.626044734994937 | ||
(TOTAL score=0.627642759255851) * (availability=1) * (validity=1) = 0.627642759255851 | ||
STARS = 3 | ||
UD_English-PUD 0.626044734994937 3 | ||
UD_English-PUD 0.627642759255851 3 |