全部 标题 作者
关键词 摘要

OALib Journal期刊
ISSN: 2333-9721
费用:99美元

查看量下载量

相关文章

更多...

Portuguese corpus-based learning using ETL

DOI: 10.1007/BF03192569

Keywords: entropy guided transformation learning, transformation-based learning, decision trees, natural language processing.

Full-Text   Cite this paper   Add to My Lib

Abstract:

we present entropy guided transformation learning models for three portuguese language processing tasks: part-of-speech tagging, noun phrase chunking and named entity recognition. for part-of-speech tagging, we separately use the mac-morpho corpus and the tycho brahe corpus. for noun phrase chunking, we use the snr-clic corpus. for named entity recognition, we separately use three corpora: harem, miniharem and learnnec06. for each one of the tasks, the etl modeling phase is quick and simple. etl only requires the training set and no handcrafted templates. etl also simplifies the incorporation of new input features, such as capitalization information, which are sucessfully used in the etl based systems. using the etl approach, we obtain state-of-the-art competitive performance in all six corpora-based tasks. these results indicate that etl is a suitable approach for the construction of portuguese corpus-based systems.

Full-Text

comments powered by Disqus

Contact Us

service@oalib.com

QQ:3279437679

WhatsApp +8615387084133

WeChat 1538708413