Automatic recognition of complex terms
Problems and the TERMINO solution
Andy Lauriston | University of Manchester, Institute of Science and Technology
Terminologists base their term-extraction decisions primarily on semantic and pragmatic criteria, yet automated processes have barely started operating at these levels of linguistic analysis. This paper discusses the graphic, lexical, syntactic and semantic difficulties encountered in automated text processing in general and emphasizes the specific problems that arise in the automatic recognition of complex terms. To illustrate the limitations of existing systems, the paper then describes TERMINO, a morphosyntactic text-analysis system developed to help in French-language term extraction, and assesses the system's performance in recognizing complex terms both quantitatively and qualitatively.
Keywords: Candidate Term, Complex Term, Computer-Aided Term Extraction, Categorical Pattern Matching, Graphic Ambiguity, Graphic-Level Processing, Disambiguation, Morphosyntactic Text Analysis, Nested Construction, Morphosyntactic Analyzer, Synapsy, Nonterm, Stemming, Tagging, Syntactic Noise, Syntactic Structure, Term-Formation Pattern, Term Extraction, Term Recognition, TERMINO, Terminological Noise, Automated Text Processing, Categorical Ambiguity, Automatic Term Recognition