Publications

Publication details [#1071]

Hisamitsu, Toru, Yoshiki Niwa, Shingo Nishioka, Hirofumi Sakurai, Osamu Imaichi, Makoto Iwayama and Akihiko Takano. 2000. Extracting terms by a combination of term frequency and a measure of term representativeness. In Kageura, Kyo and Teruo Koyama, eds. Revising and editing for translators. Special issue of Terminology. International Journal of Theoretical and Applied Issues in Specialized Communication 6 (2): 211–232.
Publication type
Article in Special issue
Publication language
English
Journal DOI
10.1075/term

Abstract

This article describes a method for extracting terms that combines term frequency with a novel measure of term representativeness (i.e. informativeness or domain specificity). The measure is defined as the normalized distance between the word distribution in the documents which contain the term and the word distribution in the whole corpus. The measure is particularly effective in discarding uninformative terms that frequently appear and has a well-defined threshold value for judging the representativeness of a term. The authors combined the new measure with term frequency and applied it to the extraction of terms from abstracts of artificial intelligence papers. This article introduces the measure and reports on its effectiveness in term extraction.
Source : Based on abstract in journal