OALib Journal期刊
ISSN: 2333-9721
费用：99美元

投递稿件

查看量	下载量

相关文章
更多...

计算机应用研究 2013

Research on out-of-vocabulary words'recognition in Uyghur-Chinese machine translation
维汉机器翻译未登录词识别研究

MI Cheng-gang,WANG Lei,YANG Ya-ting,CHEN Ke-hai,
米成刚,王　磊,杨雅婷,陈科海

Keywords: Uyghur-Chinese machine translation,phrase table,string similarity algorithms,out-of-vocabulary words,word segmentation,edit distance
维汉机器翻译,短语表,字符串相似度算法,未登录词,词切分,编辑距离

Full-Text Cite this paper Add to My Lib

Abstract:

Aimed at the phenomenon that there are so many out-of-vocabulary words in Uyghur-Chinese machine translation and the situation that the Uyghur language resources are very scarce, combined the features of Uyghur and string similarity algorithms, the paper presented an out-of-vocabulary word recognition model of Uyghur-Chinese machine translation which based on string similarity algorithms. With the help of phrase based model's phrase table, and the external dictionary, the model computed the maximum strings similarity between the out-of-vocabulary word and the Uyghur words' in phrase table and dictionary, got the translation corresponding to the Uyghur word. The experiments show that compared with the out-of-vocabulary words recognition method which based on word segmentation, this model is better retaining the words' information, and also improves the quality of the translation.

Full-Text

comments powered by Disqus

Contact Us

service@oalib.com

QQ:3279437679

WhatsApp +8615387084133

WeChat 1538708413

Research on out-of-vocabulary words'recognition in Uyghur-Chinese machine translation维汉机器翻译未登录词识别研究

Research on out-of-vocabulary words'recognition in Uyghur-Chinese machine translation
维汉机器翻译未登录词识别研究