Publications
Publication details [#7326]
Nenadic, Goran, Irena Spasic and Sofia Ananiadou. 2004. Mining term similarities from corpora. In Daille, Béatrice, Kyo Kageura, Hiroshi Nakagawa and Lee-Feng Chien, eds. Recent trends in computational terminology. Special issue of Terminology. International Journal of Theoretical and Applied Issues in Specialized Communication 10 (1): 55–81.
Publication type
Article in Special issue
Publication language
English
Keywords
Journal DOI
10.1075/term
Abstract
In this article, the authors present an approach to the automatic discovery of term similarities, which may serve as a basis for a number of term-oriented knowledge mining tasks. The method for term comparison combines internal (lexical similarity) and two types of external criteria (syntactic and contextual similarities). Lexical similarity is based on sharing lexical constituents (i.e. term heads and modifiers). Syntactic similarity relies on a set of specific lexico-syntactic co-occurrence patterns indicating the parallel usage of terms (e.g., within an enumeration or within a term coordination/conjunction structure), while contextual similarity is based on the usage of terms in similar contexts. Such contexts are automatically identified by a pattern mining approach, and a procedure is proposed to assess their domain-specific and terminological relevance. Although automatically collected, these patterns are domain dependent and identify contexts in which terms are used.
Source : Based on abstract in journal