Automatic recognition of complex terms
Problems and the TERMINO solution
Andy Lauriston | University of Manchester, Institute of Science and Technology
While the term-extraction decisions made by terminologists are based primarily on semantic and pragmatic criteria, automated processes have barely started operating at these levels of linguistic analysis. This paper discusses the graphic, lexical, syntactic and semantic difficulties encountered in automated text processing in general and emphasizes in particular certain specific problems that arise in the automatic recognition of complex terms. In order to illustrate the current limitations of existing systems, the article goes on to describe TERMINO, a morphosyntactic text-analysis system developed to help in French-language term extraction. A quantitative and qualitative assessment is made of the system's performance in recognizing complex terms.
Keywords: Candidate Term, Complex Term, Computer-Aided Term Extraction, Categorical Pattern Matching, Graphic Ambiguity, Graphic-Level Processing, Disambiguation, Morphosyntactic Text Analysis, Nested Construction, Morphosyn-Tactic Analyzer, Synapsy, Nonterm, Stemming, Tagging, Syntactic Noise, Syntactic Structure, Term-Formation Pattern, Term Extraction, Term Recognition, TERMINO, Terminological Noise, Automated Text Processing, Categorical Ambiguity, Automatic Term Recognition
Published online: 01 January 1994
https://doi.org/10.1075/term.1.1.11lau
https://doi.org/10.1075/term.1.1.11lau
Cited by
Cited by 9 other publications
Ananiadou, Sophia & John McNaught
Jacquemin, Christian & Evelyne Tzoukermann
Jalabert, Fabien, Sylvie Ranwez, Vincent Derozier & Michel Crampes
Losee, Robert M.
Marshall, Peter & Zuhair Bandar
Montero-Martı́nez, Silvia & Mercedes Garcı́a de Quesada
Montero‐Martínez, Silvia & Mercedes García de Quesada
Qiang Zhan & Chunhong Wang
Savary, Agata & Christian Jacquemin
This list is based on CrossRef data as of 07 february 2022. Please note that it may not be complete. Sources presented here have been supplied by the respective publishers. Any errors therein should be reported to them.