Additional Data for eSpeak

Some languages don't have simple spelling-to-sound rules and eSpeak needs more data than usual to perform this task. To avoid increasing the size of the main eSpeak package, these files are provided here separately for those who need them.

zh   Chinese - Mandarin

zh_listx contains a list of Chinese ideograms and their Pinyin translations for Mandarin Chinese.

The zh voice includes a basic set of Chinese characters. This zh_listx is a supplimental list with additional characters.

Download this file and extract the zh_listx file into eSpeak's dictsource directory. From within the dictsource directory, do:
  espeak --compile=zh

Download: zh_listx.zip   Updated 04.Feb.2013


zhy   Chinese - Cantonese

zhy_list contains a list of Chinese ideograms and their phonetic translations for Cantonese. You need to download and compile this data before the zhy voice can recognise Chinese characters.

Download this file and extract the zhy_list file into eSpeak's dictsource directory. From within the dictsource directory, do:
  espeak --compile=zh-yue

Download: zhy_list.zip   Updated 05.Oct.2012


ru   Russian

The position of the stressed syllable within a Russian word can't be detertmined by rules. eSpeak makes a guess, but it's usually wrong. It needs additional dictionary information of the Russian words and their stress position which don't match eSpeak's guess.

Here is a copy of the Russian data which is compiled for eSpeak 1.47. Extract ru_dict from the archive and copy it into the espeak-data directory to replace the smaller version of ru_dict.
If you use the compiled data in ru_dict, it must be the same version as the version of eSpeak which you are using.
Download: ru_dict-48.zip, for eSpeak 1.48
Download: ru_dict-47.zip, for eSpeak 1.47.09
Download: ru_dict-47.zip, for eSpeak 1.47
Download: ru_dict-46.zip, for eSpeak 1.46
Download: ru_dict-45.zip, for eSpeak 1.45

Alternatively, here is a source copy of the additional Russian data which can be used to compile a new ru_dict file. Download ru_listx.zip and extract the ru_listx file into eSpeak's dictsource directory, (which should also contain the ru_list and ru_rules) and do:
  espeak --compile=ru

to produce a new, enlarged, version of the compiled dictionary: ru_dict. Copy this into the espeak-data directory.

Download: ru_listx.zip