113 Commits (master)

Author SHA1 Message Date
fxsjy aa65031788 fix file mode 6 years ago
fxsjy 2eb11c8028 fix issue #810 6 years ago
JesseyXujin d703bce302 paddle coredump exception fix (#807) 6 years ago
fxsjy 97c32464e1 fix issue #798 6 years ago
pkpk 27910094ac Fix bugs in Paddle seg and Paddle postag (#789) 6 years ago
fxsjy 478c3b9bb4 lazy import paddle 6 years ago
imzhengzx ca444fb4da fix the error about importing ChineseAnalyzer 7 years ago
sunjunyi01 b4dd5b58f3 bug fix, issue: #511, #512 8 years ago
huntzhan 60acefd9b1 Bugfix for HMM=False in parallelism. 9 years ago
Dingyuan Wang 99d0fb1a8a use regex and fix encoding related issues in load_userdict 10 years ago
Dingyuan Wang ceb5c26be4 fix self.FREQ in cut_for_search; make pair object iterable 10 years ago
Dingyuan Wang 3b76328f2a allow ignoring word frequency while providing pos tag 10 years ago
Dingyuan Wang 94840a734c wraps most globals in classes 10 years ago
Dingyuan Wang 4a552ca94f suggest word frequency, support passing str to add_word 11 years ago
Dingyuan Wang f808ea0ebb use only one dict to store words and prefixes 11 years ago
fxsjy 5bfa43a781 fix test scripts 11 years ago
Dingyuan Wang f3a53dd2da fix print() in tests 11 years ago
fxsjy 8cbb26a7b6 fix test_file.py 11 years ago
Dingyuan Wang 3dad899ec8 backport 2to3 scripts and changelog 11 years ago
Dingyuan Wang c6b386f65b update jieba3k 11 years ago
Dingyuan Wang a5ecf70f71 update to v0.35 11 years ago
Dingyuan Wang 4a6140081e fix problems in auto2to3 11 years ago
Dingyuan Wang 7a6caa0c3c port extract_tags, etc to jieba3k; add auto2to3 script 11 years ago
walkskyer 6772f0282e fix wrong call order when the weighted test script outputs results 11 years ago
Dingyuan Wang fd9f1f2c0e update README, textrank, etc. 11 years ago
fxsjy f5ca87e088 merge change of @fukuball 11 years ago
Dingyuan Wang bb1e6000c6 fix version; fix spaces at end of line 11 years ago
Dingyuan Wang 51df77831b use prefix dict instead of trie, add a command line interface, and a few small improvements 11 years ago
Dingyuan Wang 6fad5fbb2c update to v0.33 11 years ago
Fukuball Lin b658ee69cb allow jieba to add a custom stop words corpus 11 years ago
Fukuball Lin 7198d562f1 allow jieba to switch the idf corpus 11 years ago
Dingyuan Wang c04ccd0d12 Update to v0.32 according to the master branch. 11 years ago
fxsjy 18678d50c6 fix bug issue #132 12 years ago
gan 31d5845535 add better support for English, e.g. input 'this is interesting and interested me' --> output 'this interest interest', where 'interest' matches 'interesting' and 'interested' 12 years ago
Sun Junyi 7e7fcc1184 add an option to disable HMM 12 years ago
ZoeyYoung d49542c06e fix bug 12 years ago
ZoeyYoung 6024497917 update 12 years ago
ZoeyYoung dce353f88b merge from master 12 years ago
Sun Junyi 81390a2d23 test_file.py: close the file object 12 years ago
fxsjy b77645b3aa modify test_file.py; use less memory 12 years ago
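Linker Lin 5d83855088 automatically detect the number of CPUs and start a suitable number of processes 12 years ago
Linker Lin 2ceb981da0 automatically detect the number of CPUs and start a suitable number of processes 12 years ago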
Cheng wei 6035bb6320 fix invalid syntax for python3 12 years ago
Sun Junyi 9d0ea771a5 fix bug; decimals & digit-english mixed 12 years ago
Sun Junyi ba5114dc95 update whoosh example 12 years ago
Sun Junyi f424862222 clean the files in tmp 12 years ago
Sun Junyi b9b1f1a418 fix conflict of merging 12 years ago
miao.lin becd32b178 made test_whoosh.py happy. 12 years ago
Sun Junyi b62f052927 PEP8 12 years ago
Sun Junyi 45daf561c7 follow PEP8: change tab to 4 white spaces 12 years ago