-
not done yet
-
done
-
Automate search for Zhonga ZH-RU Dictionary. This cannot be done by simple URL manipulation, since each character has its own ID, which is present in the URL and makes automatic searching of individual characters impossible. For words that consist of more than one character this is not a problem though.
-
Create a script that can convert Traditional Chinese Characters in the clipboard to Simplified Characters and then send the simplified version to the clipboard again.
-
Automate search for Mini Taiwanese Mandarin Dictionary. This resource is specially useful for searching individual characters, and the problem is the same as described for Zhonga ZH-RU: each character has its own ID, which is present in the URL.
-
Automate pronunciation field in anki cards, the idea is to add pronuncation effortlessy, by minimizing user-input as much as possible yet we have thought about creating a python script that identifies the TW and donwload the pronunciation in forvo, and automatically puts in the right place. The challenge is making it more "buky" , so we dont have to go card by card manually.
-
Create a script that converts a .srt subtitle file in Simplified Chinese to Traditional Chinese.
-
Create a script that takes a Chinese dictionary file (like the ones for GoldenDict, Yomichan, etc.), identifies the Pinyin in the entries and change it for the correspondent Zhuyin (phonetic system used in Taiwan). Romanization is bad.
-
Create a script in order to quickly and efficiently use this OCR engine for Mandarin Chinese. Choose a keybind, press it, select an area with Chinese text with your mouse and the OCR will work, the recognized text will be sent to your clipboard. ImageMagick might be useful.
-
Analyze this tool's code and change its built-in resources for Mandarin Chinese ones.
Dictionaries can NOT be imported dynamically. They are built into the app itself. One can build the dictionary database and compile the app himself if needed.
- Word segmentation can be achieved by using a trained version of mecab and a modification in gd-mecab . It's also worth noticing that those scripts used to create a csv can also be used to create a cantonese version of the same program, by taking the data in cc-canto DONE
it was taken from this repo