Publications

Publication details [#559]

Mima, Hideki and Sofia Ananiadou. 2000. An application and evaluation of the C/NC-value approach for the automatic term recognition of multi-word units in Japanese. In Kageura, Kyo and Teruo Koyama, eds. Revising and editing for translators. Special issue of Terminology. International Journal of Theoretical and Applied Issues in Specialized Communication 6 (2): 175–194.
Publication type
Article in Special issue
Publication language
English
Journal DOI
10.1075/term

Abstract

Technical terms are important for knowledge mining, especially as vast amounts of multi-lingual documents are available over the Internet. Thus, a domain and language-independent method for term recognition is necessary to automatically recognize terms from Internet documents. The C-/NC-value method is an efficient domain-independent multi-word term recognition method which combines linguistic and statistical knowledge. Although the C-value/NC-value method is originally based on the recognition of nested terms in English, the aim of this paper is to evaluate the application of the method to other languages and to show its feasibility for multi-language environments. In this article, the authors describe the application of the C/NC-value method to Japanese texts. Several experiments analyzing the performance of the method using the NACSIS Japanese AI-domain corpus demonstrate that the method can be utilized to realize a practical domain-and language-independent term recognition system.
Source : Based on abstract in journal