Arabic Content Classification System Using Statistical Bayes Classifier with Words Detection and Correction

Keywords: text mining, classification, Arabic text classification, Arabic language processing

Abstract:

Automatic Arabic content classification is an important text mining task, especially given the rapid growth in the number of online Arabic documents. This system enhances an implemented machine learning classification algorithm by applying an algorithm that detects and corrects non-words in Arabic text. The detection and correction algorithm is built on morphological knowledge in the form of consistent root-pattern relationships, together with some morpho-syntactic knowledge based on affixation and morpho-graphic rules, which specify the word recognition and non-word correction process. Many researchers have focused on Arabic content classification from a purely morphological view, using word roots and stemming techniques (prefixes and suffixes), with varying results. This work approaches classification from a very different direction, the syntactic approach. The paper presents the results of document classification experiments on ten Arabic domains (Economy, History, Family Studies, Islamic, Sport, Health, Law, Stories, Astronomy, and Food articles) using a statistical methodology. The classification system showed encouraging results compared with other existing systems.
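
As a rough illustration of what a statistical Bayes classifier over tokenized documents can look like, the following is a minimal sketch of multinomial naive Bayes with add-one (Laplace) smoothing. It is not the system described in the abstract: the smoothing choice, the function names `train` and `classify`, and the toy training data are all assumptions made for illustration, and the non-word detection and correction step is not implemented here. In a pipeline like the one described, such a correction step would presumably run on the tokens before they reach the classifier.

```python
import math
from collections import Counter, defaultdict


def train(docs):
    """Build count tables from (label, tokens) pairs.

    Hypothetical helper: returns per-class document counts,
    per-class token counts, and the training vocabulary.
    """
    class_counts = Counter()
    word_counts = defaultdict(Counter)
    vocab = set()
    for label, tokens in docs:
        class_counts[label] += 1
        word_counts[label].update(tokens)
        vocab.update(tokens)
    return class_counts, word_counts, vocab


def classify(model, tokens):
    """Return the label with the highest log posterior score."""
    class_counts, word_counts, vocab = model
    total_docs = sum(class_counts.values())
    best_label, best_score = None, float("-inf")
    for label in class_counts:
        # Log prior: fraction of training documents in this class.
        score = math.log(class_counts[label] / total_docs)
        total_words = sum(word_counts[label].values())
        for tok in tokens:
            if tok not in vocab:
                continue  # tokens never seen in training are skipped
            # Add-one smoothing (an assumption, not taken from the paper).
            count = word_counts[label][tok]
            score += math.log((count + 1) / (total_words + len(vocab)))
        if score > best_score:
            best_label, best_score = label, score
    return best_label


# Toy, made-up training data using two of the ten domains.
TRAIN_DOCS = [
    ("Sport", ["مباراة", "فريق", "هدف"]),     # match, team, goal
    ("Sport", ["فريق", "لاعب"]),              # team, player
    ("Economy", ["بنك", "سوق", "أسعار"]),     # bank, market, prices
    ("Economy", ["سوق", "استثمار"]),          # market, investment
]
model = train(TRAIN_DOCS)
print(classify(model, ["فريق", "هدف"]))
print(classify(model, ["سوق", "بنك"]))
```

Scores are summed in log space so that long documents do not underflow to zero probability, and smoothing keeps a single unseen word-class pair from ruling out a class entirely.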
