simtext, the most accurate Chinese text similarity calculation tool.(最准的中文文本相似度计算工具)
- pip3 install simtext
or
git clone https://github.com/shibing624/simtext.git
cd simtext
python3 setup.py install
import simtext
a = '湖北人爱吃鱼'
b = '甘肃人不爱吃鱼'
s = simtext.score(a, b)
print(s)
output:
0.7783981956422068