|
A SEMANTIC SIMILARITY MEASUREKeywords: semantic similarity , data mining , information retrieval Abstract: Similarity measures are the most important tools in information retrieval and natural language processing. Sentence similarities are of capital importance in online translation. Words-to-document similarity is the key factor to compute query relevance. Text similarities play a big role in data mining. A lot of similarity measures have been used in different domains. Most of the measures are corpus dependent or language dependent. When some measures are good to compute word-to-document matching, they are unable to compute document-to-document similarity and vice versa. In this paper we present a method that can be used to compute any kind of semantic similarity. The method is neither corpus dependent nor language dependent, and gives a way to compare more accurately semantic relatedness.
|