Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

step2 对解析出的文章处理,取得名字 #3

Open
findsomeoneyys opened this issue May 16, 2016 · 0 comments
Open

step2 对解析出的文章处理,取得名字 #3

findsomeoneyys opened this issue May 16, 2016 · 0 comments
Labels

Comments

@findsomeoneyys
Copy link
Collaborator

处理工具为HanLp

工具在github链接为https://github.com/hankcs/HanLP

初始处理目标

  1. 打开目录下test文本,获取所有内容
  2. 对文本处理,去除<p>,<br>等标签并分隔,得到一个列表类似于["王总和小丽结婚了"."编剧邵钧林和稽道青说","这里有关天培的有关事迹"...]
  3. 调用HanLP进行处理,只做输出测试观察是否取得人名

后续目标

  • 功能按需写在类里面,方便在另一程序中直接调用
  • 取得结果应存入数据库中
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

2 participants