Commit d7bce82

Update README.md
1 parent bf81d34 commit d7bce82

1 file changed: +2 -0 lines changed

README.md

@@ -2,3 +2,5 @@ split-ne
 ========

 Tool that divides complex named entities in Czech into their parts.
+
+Our targets are sequences like "hokejista New York Rangers Jaromír Jágr" (hokejista = hockey player). In this case, it is clear to a human that the name of the hockey player is not the whole sequence. It is less clear to a computer, so we are trying to develop a quantitative method, based on frequencies in a large corpus, that could find the parts automatically.
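
The README does not say how the frequency-based method works, so the following is only a minimal sketch of one possible approach, not the project's implementation. It assumes a table of n-gram counts (the `COUNTS` values, `TOTAL`, and `UNSEEN_PENALTY` below are all made-up illustration values, not corpus data) and uses dynamic programming to pick the split whose segments are jointly most frequent. The function names `segment_score` and `split_entity` are hypothetical.

```python
from math import log

# Hypothetical toy counts standing in for n-gram frequencies from a large corpus.
COUNTS = {
    ("hokejista",): 500,
    ("New", "York", "Rangers"): 120,
    ("Jaromír", "Jágr"): 300,
    ("New", "York"): 900,
    ("New",): 1000,
    ("York",): 950,
    ("Rangers",): 60,
    ("Jaromír",): 40,
    ("Jágr",): 350,
}
TOTAL = 10_000          # assumed corpus size used to turn counts into relative frequencies
UNSEEN_PENALTY = -20.0  # assumed per-token log-score for segments absent from COUNTS


def segment_score(segment):
    """Log relative frequency of a segment, or a heavy penalty if it was never seen."""
    count = COUNTS.get(tuple(segment), 0)
    if count > 0:
        return log(count / TOTAL)
    return UNSEEN_PENALTY * len(segment)


def split_entity(tokens):
    """Return the segmentation of `tokens` with the highest total segment score."""
    # best[i] holds (score, segments) for the best split of tokens[:i].
    best = [(0.0, [])]
    for end in range(1, len(tokens) + 1):
        candidates = []
        for start in range(end):
            segment = tokens[start:end]
            prev_score, prev_segments = best[start]
            candidates.append((prev_score + segment_score(segment),
                               prev_segments + [segment]))
        best.append(max(candidates, key=lambda c: c[0]))
    return best[-1][1]


if __name__ == "__main__":
    print(split_entity("hokejista New York Rangers Jaromír Jágr".split()))
```

With these toy counts the sequence is split into `hokejista`, `New York Rangers`, and `Jaromír Jágr`. With real corpus frequencies the scoring function and the handling of unseen segments would need tuning.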
