forked from HIT-SCIR/ltp
-
Notifications
You must be signed in to change notification settings - Fork 0
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
- Loading branch information
Showing
5 changed files
with
133 additions
and
67 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,11 +1,12 @@ | ||
新闻 | ||
===== | ||
|
||
语言技术平台3.2.0版 发布 | ||
语言技术平台3.3.0版 发布 | ||
|
||
* [增加] 分词模块增量模型训练工具。使用户可以在语言技术平台基线模型的基础之上增加训练数据,从而获得特定领域性能更好的模型。 | ||
* [修改] Boost.Regex到1.56.0,由于旧版本Boost.Regex的 `match_results` 类存在竞争问题,这一修改修复了 `multi_cws_cmdline` 随机出错的问题。 | ||
* [修改] 使自动化测试脚本支持Windows运行以及多线程测试 | ||
* [修改] 将原 `examples` 文件夹下的示例文件转移到 `test` 文件夹下并纳入语言技术平台的编译链 | ||
* [测试] 新版语言技术平台通过 `cygwin` 编译测试 | ||
* [测试] 多线程程序 `multi_ltp_test` , `multi_cws_cmdline` 以及 `multi_pos_cmdline` 在Windows通过测试 | ||
* [增加] 词性标注模型加入微博数据,使得在开放域上的词性标注性能更好(+3.3 precision) | ||
* [增加] 依存句法分析模型加入微博数据,使得在开放域上的句法分析性能更好(+3 UAS) | ||
* [增加] 依存句法分析算法切换到transition-based neural network parser,速度从40 tokens/s提升到8000 tokens/s。同时通过加入聚类特征以及优化训练算法,(在新闻领域)准确率也得到小幅提升(+0.2 UAS) | ||
* [增加] `ltp_test`默认支持多线程,线程数可配置。 | ||
* [增加] 新加入子模块命令行程序,`cws_cmdline`,`pos_cmdline`,`par_cmdline`,`ner_cmdline`,使用户可以很容易替换中间模块,从而实现语言分析的组合。 | ||
* [修改] 优化了训练套件的交互方式 | ||
* [增加] 添加模型验证,单元测试模块。 |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters