MulDi Chinese (IPA: [ˌmʌl'daɪ] [ˌtʃaɪˈniːz]) is a multidimensional analysis tagger of Mandarin Chinese.
- Installation:
pip install muldichinese
from muldichinese.MulDiChinese import MulDiChinese
mdc=MulDiChinese('/write/path/to/your/file(s)/')
mdc.files()
#print a list of your input files
mdc.tag()
#Segmentation and pos tagging completed.
mdc.features()
#Standardised frequencies of all 60 features written.
mdc.dimensions()
#Dimension scores written.
This programme is based on the ICTCLAS, and it is advised to reference ICTCLAS when MulDi Chinese is used. Please refer to https://dl.acm.org/citation.cfm?id=1119280.
Python packages needed are:
The manual contains a detailed description of the 60 features.