Term Variation in Specialised Corpora

Characterisation, automatic discovery and applications

| University of Nantes - LINA
HardboundAvailable
ISBN 9789027223432 | EUR 99.00 | USD 149.00
 
e-Book
ISBN 9789027265357 | EUR 99.00 | USD 149.00
 
This book addresses term variation which has been a very important topic in terminology, computational terminology and natural language processing for up to twenty years. This book presents the first complete inventory of term variants and the linguistic procedures that lead to their formation. It also takes into account issues raised by multilingual applications and presents ways to detect variants in five different languages: French, English, German, Spanish and Russian.

The book provides insights into the following issues: What is a variant? What are the main linguistic mechanisms involved in the transformation of base terms into variants? How can variants be automatically detected in texts? Should variation be taken into account in natural language processing applications?

This book is targeted at terminologists and linguists interested in term variation as well as researchers in natural language processing and computer science that must handle term variants in different kinds of applications.
Publishing status: Available
Table of Contents
Acknowledgements
xi–xii
Chapter 1. Introduction
2–8
Part I. Characterisation
12–80
Chapter 2. Definitions
12–31
Chapter 3. Conceptualisation of terminological variants
34–65
Chapter 4. Semantics of conceptual variants
68–80
Part II. Automatic discovery
84–168
Chapter 5. Primitive exploration of variants using comparable corpora
84–108
Chapter 6. Processing methods for the detection of variants from corpora
110–124
Chapter 7. Grammar of variants
126–146
Chapter 8. Synonymic variants
148–168
Part III. Applications and tools
172–221
Chapter 9. Terminology extraction
172–203
Chapter 10. End-user applications and tools
206–221
Part IV. Conclusions
226–233
Chapter 11. Term variants and their discovery
226–233
Bibliography
235–246
Appendix A. Notation
Appendix B. Multext categories
250
Appendix C. Search with Antconc
252–255
Appendix D. GGRV
258–269
Index
271–272
References

Bibliography

S. Abney
. Partial parsing via finite-state cascades. Natural Language Engineering, 2(4): 337–344 1996. CrossrefGoogle Scholar
K. Ahmad, A. Davies, H. Fulford, and M. Rogers
. What is a term? The semiautomatic extraction of terms from text. Translation Studies: An Interdiscipline, pages 267–278 1994.Google Scholar
D. Amiot and G. Dal
. Integrating neoclassical combining forms into a lexeme-based morphology. In Proceedings of the 5th Mediterranean Morphology Meeting (MMM5) 2007.Google Scholar
. La composition néoclassique en français et l'ordre des constituants. La composition dans les langues, pages 89–113 2008.Google Scholar
A. Aronson
. Effective mapping of biomedical text to the UMLS metathesaurus: The metamap program. In Proceedings of the 25th Annual Symposium of the American Medical Informatics Association (AMIA 2001), pages 17–21 2001.Google Scholar
M. Baroni and S. Bernardini
. BootCaT: Bootstrapping corpora and terms from the web. In Proceedings of the 4th conference on International Language Resources and Evaluation (LREC 2004), pages 1313–1316 2004.Google Scholar
M. Baroni and A. Lenci
. Distributional memory: A general framework for corpus-based semantics. Computational Linguisics, 36(4):673–721 2010. CrossrefGoogle Scholar
C. Barrière
. Recherche contextuelle d'équivalents en banque de terminologie. In Actes de la 17ème conférence sur le Traitement Automatique des Langues Naturelles (TALN 2010) 2010.Google Scholar
L. Bauer
. English word-formation. Cambridge University Press, Cambridge 1983. CrossrefGoogle Scholar
. Morphological productivity. Cambridge University Press, Cambridge 2001. CrossrefGoogle Scholar
L. Bauer and A. Renouf
. A corpus-based study of compounding in english. Journal of English Linguistics, 29:101–123 2001. CrossrefGoogle Scholar
H.-D. Béchade
. Phonétique et morphologie du français moderne et contemporain. Presses Universitaires de France, Paris 1992.Google Scholar
O. Ben-Kiki, C. Evans, and B. Ingerson
. Yaml ain't markup language (yamlTM) version 1.1. yaml. org, Tech. Rep 2005.Google Scholar
M. Benson
. Collocations and general-purpose dictionaries. International Journal of Lexicography, 3(1):23–25 1990. CrossrefGoogle Scholar
É. Benveniste
. Problèmes de linguistique générale, chapter Formes nouvelles de la composition nominale, pages 163–173. Gallimard, Paris 1966.Google Scholar
. Problèmes de linguistique générale. Gallimard, Paris 1974.Google Scholar
D. Bergmark, C. Lagoze, and A. Sbityakov
. Focused crawls, tunneling, and digital libraries. In M. Agosti and C. Thanos editors Research and Advanced Technology for Digital Librairies – 6th European Conference, ECDL 2002, volume 2458 of Lecture notes in computer science, pages 91–106. Springer, New York 2002.Google Scholar
D. Biber
. A Typology of English Texts. Linguistics, 27(1):3–43 1989. CrossrefGoogle Scholar
D. Biber, S. Johansson, G. Leech, S. Conrad, and E. Finegan. Longman Grammar of Spoken and Written English. Longman, London 1999.Google Scholar
M.-B. Billami, J. Camacho-Collados, E. Jacquey, and L. Kister
. Semantic annotation and terminology validation in full scientific articles in social sciences and humanities (annotation sémantique et validation terminologique en texte intégral en shs). In Actes de la 21e conférence sur le Traitement Automatique des Langues Naturelles (TALN 2014), pages 363–376 2014.Google Scholar
L. Bloomfield
. Language. Allen and Unwin, London 1935.Google Scholar
N. N. Bobyreva
. Syntactic way of term-building in terminological system of rhytmic gymnastics in the Russian and the English languages. TCHGPU, 12 2009.Google Scholar
E. Bolshakova
. Common scientific lexicon for automatic discourse analysis of scientific and technical texts. International Journal “Information Theories & pplications”, 15: 189–195 2008.Google Scholar
B. Bommier-Pincemin
. Diffusion ciblée automatique d'informations : conception et mise en œuvre d'une linguistique textuelle pour la caractérisation des destinataires et des documents. PhD thesis, Université Paris IV, Sorbonne 1999.Google Scholar
A. Bosredon
. Un adjectif de trop : l'adjectif de relation. Information grammaticale, 37: 3–7 1989. CrossrefGoogle Scholar
. Prédicativité et lexicalisation. Studia Romanica Posnaniensia, 16:85–100 1991.Google Scholar
A. Bougouin, F. Boudin, and B. Daille
. TopicRank: Graph-Based Topic Ranking for Keyphrase Extraction. In Proceedings of the 8th International Joint Conference on Natural Language Processing (IJCNLP 2013), pages 543–551 2013.Google Scholar
D. Bourigault
. LEXTER : un logiciel d'EXtraction de TERminologie. Application à l'extraction des connaissances à partir de textes. Mathématiques, informatique ap-pliquée aux sciences de l'Homme, École des Hautes Études en Sciences Sociales, Paris 1994.Google Scholar
D. Bourigault and C. Jacquemin
. Construction de ressources terminologiques. In J.-M. Pierrel editor Ingénierie des langues, pages 215–234. Hermes, Paris 2000.Google Scholar
D. Bourigault and M. Slodzian
. Pour une terminologie textuelle. Terminologies nou-velles, 19:29–32 1999.Google Scholar
L. Bowker and S. Hawkins
. Variation in the organization of medical terms – exploring some motivations of term choice. Terminology, 12(1):79–110 2006. CrossrefGoogle Scholar
L. Bowker and J. Pearson
. Working with Specialized Language: A Practical Guide to Using Corpora. Routledge, London 2002. CrossrefGoogle Scholar
D. A. Burgos Herera
. Towards an Image-Term Co-occurrence Model for Multilingual Terminolog Alignment and Cross-Language Image Indexing. PhD thesis, Universitat Pompeu Fabra 2014.Google Scholar
G. Burnage
. CELEX: A Guide for Users. Center for Lexical Information, Univerty of Nijmegen 1990.Google Scholar
H. Bussmann
. Routledge Dictionary of Language and Linguistics. Routledge, London 1996.Google Scholar
M. T. Cabré
. La terminologie: théorie, méthodes et applications. Armand Colin, Paris 1998.Google Scholar
. La terminologia. Representación y communicación. Una teoría de base communicativa. Universitat Pompeu Fabra, Barcelona 1999.Google Scholar
. Do we need an autonomous theory of terminology? Terminology, 5(1): 2–20 2000.Google Scholar
M. T. Cabré and J. Sager
. Terminology: Theory, Methods, and Applications. John Benjamins, Amsterdam 1999. CrossrefGoogle Scholar
M. T. Cabré, R. E. Bagot, and J. V. Platresi
. Automatic term detection: A review of current systems. In D. Bourigault, C. Jacquemin, and M.-C. L'Homme editors Recent Advances in Computational Terminology, volume 2 of Natural Language Processing, pages 53–88. John Benjamins, Amsterdam 2001.Google Scholar
E. Carey. L'énergie éolienne. Master's thesis, Maitrise LEA. Université Paris III 2004.Google Scholar
R. Chaffin and D. J. Herrmann
. The similarity and diversity of semantic relations. Memory and Cognition, 12(2):134–141 1984. CrossrefGoogle Scholar
A. X. Chang and C. D. Manning
. TokensRegex: Defining cascaded regular expressions over tokens. Technical Report CSTR 2014–02, Department of Computer Science, Stanford University 2014.Google Scholar
S. Chaudiron
. L'évaluation des systèmes de traitement de l'information textuelle : vers un changement de paradigme. Mémoire pour HDR, Université de Paris X, Paris 2001.Google Scholar
G. Ciapuscio
. Textos especializados y terminología. PhD thesis, Institut Universitari de Lingüística Aplicada, Universitat Pompeu Fabra, Barcelona 2003.Google Scholar
T. Collet
. La réduction des unités terminologiques complexes de type syntagmatique. Meta : journal des traducteurs / Meta: Translators' Journal, 42(1):193–206 1997. CrossrefGoogle Scholar
CollinsDictionary. Collins dictionary
2013.Google Scholar
G. Colpron
. Les anglicismes au québec : essai de classification et de numération comparative. Master's thesis, Université de Montreal, Faculté des lettres 1965.Google Scholar
A. Condamines
. Corpus analysis and conceptual relation patterns. Terminology, 8(1): 141–162 2002. CrossrefGoogle Scholar
. L'expression de la méronymie dans les petites annonces immobil-ières ; comparaison français, anglais, espagnol. Journal of French Language Studies (JFLS), 19:3–23 2009. CrossrefGoogle Scholar
M. Creutz and K. Lagus
. Unsupervised morpheme segmentation and morphology induction from text corpora using morfessor 1.0. In Computer and Information Science, Report A81, Helsinki University of Technology 2005.Google Scholar
M. Crochemore, C. Hancart, and T. Lecroq
. Algorithmique du texte. Vuibert, Paris 2001.Google Scholar
K. Crowston, B. Kwaśnik, and J. Rubleske
. Problems in the use-centered development of a taxonomy. In A. Meher, S. Sharoff, and M. Santini editors Genres on the Web, volume 42 of Text, Speech and Language Technologies series, Chapter 4. Springer, New York 2010.Google Scholar
D. A. Cruse
. Lexical Semantics. Cambridge University Press, Cambridge 1995.Google Scholar
. Meaning in Language: An Introduction to Semantics and Pragmatics. Oxford University Press, Oxford 2004.Google Scholar
J. R. Curran and M. Moens
. Improvements in automatic thesaurus extraction. In Proceedings of the Workshop on Unsupervised Lexical Acquisition associated to the 40th Annual Meeting of the Association for Computational Linguistics (ACL 2002), pages 59–66 2002.Google Scholar
B. Daille
. Variations and application-oriented terminology engineering. Terminology, 11 (1): 181–197 2005. CrossrefGoogle Scholar
B. Daille, B. Habert, C. Jacquemin, and J. Royauté
. Empirical observation of term variations and principles for their description. Terminology, 3(2):197–257 1996. CrossrefGoogle Scholar
B. Daille, C. Enguehard, C. Jacquin, R. Raharinirina, B. Ralalaoherivony, and C. Lehman
. Traitement automatique de la terminologie en langue malgache. In K. Chibout, J. Mariani, N. Masson, and F. Néel editors Ressources et évaluation en ingénierie des langues, Actualité scientifique – Universités Francophones, pages 225–242. De Boek & Larcier, Bruxelles 2000.Google Scholar
J. Dancette and M.-C. L'Homme
. Building Specialized Dictionaries using Lexical Functions. Linguistica Antverpiensa, 3:113–132 2004.Google Scholar
C. de
Groc. Babouk: Focused web crawling for corpus compilation and automatic terminology extraction. In Proceedings of the 2011 IEEE/WIC/ACM International Conferences on Web Intelligence and Intelligent Agent Technology (WI-IAT 2011), pages 497–498 2011.Google Scholar
L. Déléger and P. Zweigenbaum. Extracting lay paraphrases of specialized expressions from monolingual comparable medical corpora. In Proceedings of the 2nd Workshop on Building and Using Comparable Corpora: from Parallel to Non-parallel Corpora, pages 2–10, Suntec, Singapour 2009. CrossrefGoogle Scholar
A. Depierre
. Souvent HAEMA varie …: Les dérivés du grec HAEMA en anglais : Étude de cas de variation. Terminology, 13(2):155–176 2007. CrossrefGoogle Scholar
P. Drouin
. Term extraction using non-technical corpora as a point of leverage. Terminology, 9(1):99–117 2003. CrossrefGoogle Scholar
T. Dunning
. Accurate methods for the statistics of surprise and coincidence. Computational Linguistics, 19(1):61–74 1993.Google Scholar
M. Dupuch, T. Hamon, and N. Grabar
. Groupement de termes basé sur des régularités linguistiques et sémantiques dans un contexte cross-langue. In Actes de la 20e conférence sur le Traitement Automatique des Langues Naturelles (TALN 2013), pages 62–75 2013.Google Scholar
C. Durieux
. Fondement didactique de la traduction technique. La maison du dictionnaire, Paris 2010.Google Scholar
P. Dury
. Les indices d'évolution diachronique dans les dictionnaires de langue générale : une étude de cas autour de carbon. In F. Maniez and P. Dury editors Lexicologie et terminologie : histoire des mots, Travaux du CRTT, pages 269–284. Presse universitaire de l'Université Lumière-Lyon 2, Lyon 2007.Google Scholar
P. Dury and P. Drouin
. L'obsolescence des termes en langues de spécialité : une étude semi-automatique de la “nécrologie” en corpus informatisés, appliquée au domaine de l'écologie. In Proceedings of the XVII European LSP Symposium 2009, pages 1–11 2010.Google Scholar
DWIA
. Wind Energy Reference Manual. Danish Wind Industry Association 2003.Google Scholar
C. Dyer
. Using a maximum entropy model to build segmentation lattices for MT. In Proceedings of the Human Language Technologies: The 11th Annual Conference of the North American Chapter of the ACL (HLT-NAACL 2009), pages 406–414 2009.Google Scholar
R. Estopà, J. Vivaldi, and M. T. Cabré
. Use of greek and latin forms for term detection. In Proceedings of the 2nd conference on International Language Resources and Evaluation (LREC 2000) 2000.Google Scholar
R. Estopà Bagot
. Les unités de signification spécialisées élargissant l'objet du travail en terminologie. Terminology, 7(2):217–237 2001. CrossrefGoogle Scholar
B. S. Everitt
. The Analysis of Contingency Tables. Chapman & Hall, London 1992.Google Scholar
S. Evert
. Corpora and Collocations. In A. Lüdeling and M. Kytö editors Corpus Linguistics: An International Handbook, pages 1212–1248. Mouton de Gruyter, Berlin 2008.Google Scholar
S. Evert and B. Krenn
. Using small random samples for the manual evaluation of statistical association measures. Computer Speech & Language, 19(4):450–466 2005. CrossrefGoogle Scholar
L. Faigley and S. Witte
. Analyzing revision. College Composition and Communication, 32(4):400–414 1981. CrossrefGoogle Scholar
R. M. Fano
. Transmission of Information: A statistical Theory of Communications. MIT Press, Cambridge, MA 1961.Google Scholar
C. Fellbaum
editor WordNet: An Electronic Lexical Database. MIT Press, Cambridge, MA 1998.Google Scholar
O. Ferret
. Testing semantic similarity measures for extracting synonyms from a corpus. In Proceedings of the 7th conference on International Language Resources and Evaluation (LREC 2010) 2010.Google Scholar
O. Ferret. Identifying bad semantic neighbors for improving distributional thesauri. In Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics (ACL 2013), pages 561–571 2013.Google Scholar
D. Ferrucci and A. Lally
. UIMA: an architectural approach to unstructured information processing in the corporate research environment. Natural Language Engineering, 10:327–348 2004. CrossrefGoogle Scholar
J. Firth
. A synopsis of linguistic theory 1930–1955. In F. R. Palmer editor Selected Papers of J. R. Firth 1952–1959, pages 168–205. Longman, London 1968.Google Scholar
T. Fontenelle
. Dictionnaires et outils de correction linguistique. Revue française de linguistique appliquée, X(2):119–128 2005.Google Scholar
P. Forchini and A. Murphy
. N-grams in comparable specialized corpora: Perspectives on phraseology, translation, and pedagogy. International Journal of Corpus Linguistics, 13(3):351–367 2008. CrossrefGoogle Scholar
B. Fradin
. Nouvelles approches en morphologie. Presses Universitaires de France, Paris 2003. CrossrefGoogle Scholar
K. Frantzi and S. Ananiadou
. The C-Value/NC-Value domain independent method for multi-word term extraction. Journal of Natural Language Processing, 6(3):145–179 2000. CrossrefGoogle Scholar
J. Freixa
. La variació terminològica: Anàlisi de la variació denominativa en textos de diferent grau d'especialització de l'àrea de medi ambient. PhD thesis, Institut Universitari de Lingüística Aplicada, Universitat Pompeu Fabra, Barcelona 2002.Google Scholar
. Causes of denominative variation – A typology proposal. Terminology, 12 (1):51–77 2006. CrossrefGoogle Scholar
O. Frunza and D. Inkpen
. Identification and Disambiguation of Cognates, False Friends, and Partial Cognates Using Machine Learning Techniques. International Journal of Linguistics, 1(1) 2009.Google Scholar
A. Fujita and P. Isabelle
. Expanding paraphrase lexicons by exploiting lexical variants. In Proceedings of the 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pages 630–640. Association for Computational Linguistics 2015.Google Scholar
W. A. Gale and K. W. Church
. Identifying word correspondences in parallel texts. In Proceedings of the Workshop Speech and Natural Language, pages 152–157 1991. CrossrefGoogle Scholar
F. Gaudin
. Socioterminologie. Une approche socioterminologique de la terminologie. De Boeck/Duculot, Bruxelles 2003.Google Scholar
A. Gelbukh and G. Sidorov
. Approach to construction of automatic morphological analysis systems for inflective languages with little effort. In A. Gelbukh editor Computational Linguistics and Intelligent Text Processing – 4th International Conference, CICLing 2003, volume 2588 of Lecture Notes in Computer Science, pages 215–220. Springer, New York 2003.Google Scholar
P. Gipe
. Wind power: renewable energy for home, farm, and business. Chelsea Green, Chelsea, VT 2004.Google Scholar
G. Grefenstette
. Corpus-derived first, second and third-order word affinities. In Proceedings of the 6th International Congress of the European Association for Lexicography (EURALEX 1994), pages 279–290 1994a.Google Scholar
. Explorations in Automatic Thesaurus Discovery. Kluwer, Boston, MA 1994b. CrossrefGoogle Scholar
G. Gross. Degré de figement des noms composés. Langages, 90:57–72 1988. CrossrefGoogle Scholar
M. Guégan and C. de Loupy
. Knowledge-poor approach to shallow parsing: Contribution of unsupervised part-of-speech induction. In G. Angelova, K. Bontcheva, R. Mitkov, and N. Nicolov editors Proceedings the Recent Advances in Natural Language Processing (RANLP 2011), pages 33–40 2011.Google Scholar
M. Hagiwara
. A supervised learning approach to automatic synonym identification based on distributional features. In Proceedings of the HLT Student Research Workshop associated to Workshop on the 46th Annual Meeting of the Association for Computational Linguistics (ACL 2008), pages 1–6 2008.Google Scholar
T. Hamon and A. Nazarenko
. Detection of synonymy links between terms: experiments and results. In D. Bourigault, C. Jacquemin, and M.-C. L'Homme editors Recent Advances in Computational Terminology, volume 2 of Natural Language Processing, pages 185–208. John Benjamins, Amsterdam 2001.Google Scholar
Y. Haralambous and E. Lavagnino
. La réduction de termes complexes dans les langues de spécialité. Traitement automatique des langues (TAL), 52(1):37–68 2011.Google Scholar
R. Harastani, B. Daille, and E. Morin
. Neoclassical compound alignments from comparable corpora. In Proceedings of the 13th International Conference Computational Linguistics and Intelligent Text Processing (CICLing 2012), pages 72–82 2012.Google Scholar
Z. S. Harris
. Distributional structure. Word, 10(23):146–162 1954. CrossrefGoogle Scholar
B. Hauer and G. Kondrak
. Clustering semantically equivalent words into cognate sets in multilingual lists. In Proceedings of the 5th International Joint Conference on Natural Language Processing (IJCNLP 2005), pages 865–873 2011.Google Scholar
E. Haugen
. The analysis of linguistic borrowing. Language: Journal of the Linguistic Society of America, 26:210–231 1950. CrossrefGoogle Scholar
F. J. Hausmann
. Kollokationen im deutschen wörterbuch. ein beitrag zur theorie des lexikographischen beispiels. In H. Bergenholtz and J. Mudgan editors Lexikographie und Grammatik. Akten des Essener Kolloquiums zur Grammatik im Worterbuch, volume 3 of Lexicographica Series Maior, pages 118–129. Niemeyer, Tübingen 1985.Google Scholar
A. Hazem and B. Daille
. Semi-compositional Method for Synonym Extraction of Multi-Word Terms. In Proceedings of the 9th edition of the Language Resources and Evaluation Conference (LREC 2014) 2014.Google Scholar
U. Heid
. On the representtion of collocational phenomena in sublanguage lexicons. In Proceedings of the Third International Congress on Terminology and Knowledge Engineering (TKE 1993), pages 343–353 1993.Google Scholar
D. Hindle
. Noun classification from predicate-argument structures. In Proceedings of the 28th Annual Meeting of the Association for Computational Linguistics (ACL 1990), pages 268–275 1990. CrossrefGoogle Scholar
D. Hull
. A practical approach to terminology alignment. In Proceedings of the First Workshop on Computational Terminology (COMPUTERM 1998), pages 1–7 1998.Google Scholar
F. Ibekwe-SanJuan
. Fouille de textes. Hermès Sciences – Lavoisier, Paris 2007.Google Scholar
C. Jacquemin
. Spotting and Discovering Terms through Natural Language Processing. MIT Press, Cambridge, MA 2001.Google Scholar
C. Jacquemin, B. Daille, J. Royauté, and X. Polanco
. In vitro evaluation for a program to machine-aided indexing. Information Processing & Management (IPM), 38(6):765–792 2002. CrossrefGoogle Scholar
M.-P. Jacques. Emploi de termes réduits comme révélateur de la centralité dans le domaine. In Actes des spetièmes journées scientifiques du réseau Lexicologie, Terminologie, Traduction (LTT) : Mots, termes et contextes, pages 299–308, Paris 2005 Archives Contemporaines.Google Scholar
E. Jacquey, A. Tutin, L. Kister, M.-P. Jacques, S. Hatier, and S. Ollinger
. Filtrage terminologique par le lexique transdisciplinaire scientifique : une expérimentation en sciences humaines. In Proceedings of the 10th International Conference on Terminology and Artificial Intelligence (TIA 2013), pages 121–128 2013.Google Scholar
G. J. F. Jones, M. Burke, J. Judge, A. Khasin, A. Lam-Adesina, and J. Wagner
. Experiments in monolingual, bilingual and multilingual retrieval. In C. Peters, P. Clough, J. Gonzalo, G. J. F. Jones, M. Kluck, and B. Magnini editors Multilingual Information Access for Text, Speech and Images: 5th Workshop of the Cross-Language Evaluation Forum, CLEF 2004, pages 207–220. Springer, Berlin 2005. CrossrefGoogle Scholar
A.-L. Jousse and M. Bouveret
. Lexical functions to represent derivational relations in specialized dictionaries. Terminology, 9(1):71–98 2003. CrossrefGoogle Scholar
A. Judea, H. Schütze, and S. Bruegmann
. Unsupervised training set generation for automatic acquisition of technical terminology in patents. In Proceedings of the 25th International Conference on Computational Linguistics (COLING 2014), pages 290–300 2014.Google Scholar
D. Jurafsky and J. H. Martin
. Speech and Language Processing: An Introduction to Natural Language Processing, Speech Recognition, and Computational Linguistics. Prentice-Hall, Upper Saddle River, NJ 2009.Google Scholar
J. S. Justeson and S. M. Katz
. Technical terminology: some linguistic properties and an algorithm for identification in text. Natural Language Engineering, 1(1):9–27 1995. CrossrefGoogle Scholar
K. Kageura
. The Quantitative Analysis of the Dynamics and Structure of Terminologies. John Benjamins, Amsterdam 2012. CrossrefGoogle Scholar
K. Kageura and B. Umino
. Methods of automatic term recognition: a review. Terminology, 3(2):259–289 1996. CrossrefGoogle Scholar
K. Kageura, M. Yoshioka, K. Takeuchi, K. Koyama, K. Tsuji, and F. Yoshikane
. Recent advances in automatic term recognition: Experiences from the NTCIR workshop on information retrieval and term recognition. Terminology, 6(2):151–173 2000. CrossrefGoogle Scholar
J.-D. Kim, T. Ohta, Y. Tateisi, and J. Tsujii
. Genia corpus – a semantically annotated corpus for bio-textmining. Bioinformatics, 19(suppl 1):1180–1182 2003. CrossrefGoogle Scholar
L. Kister
. Is it possible to predetermine a referent included in a French n de n structure? In S. P. Botley and A. M. McEnnery editors Corpus-Based and Computation Approches to Discourse Anaphora. John Benjamins, Amsterdam 2000.Google Scholar
G. Kleiber and I. Tamba
. L'hyperonymie revisitée : inclusion et hièrarchie. Langages, 98:7–32 1990. CrossrefGoogle Scholar
P. Kluegl, M. Toepfer, P. Beck, G. Fette, and F. Puppe
. UIMA ruta: Rapid development of rule-based information extraction applications. Natural Language Engineering, 22 (1):1–40 2016. CrossrefGoogle Scholar
R. Kocourek
. Textes et termes. Meta, 36(1) 1991. CrossrefGoogle Scholar
P. Koehn and K. Knight
. Empirical methods for compound splitting. In Proceedings of Eleventh Conference of the European Chapter of the Association for Computational Linguistics (EACL 2003), pages 187–193 2003.Google Scholar
M. Kraft
. Compositionality: The very idea. Research on Language and Computation, 5 (3):287–308 2007. CrossrefGoogle Scholar
O. Kraif. Exploration des cognats pour l'alignement. Traitement automatique des langues (TAL), 42(3):2–35 2002.Google Scholar
G. Kremer, K. Erk, S. Padó, and S. Thater
. What substitutes tell us – analysis of an “all-words” lexical substitution corpus. In Proceedings of the 14th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2014), pages 540–549 2014. CrossrefGoogle Scholar
A. Lehmann and F. Martin-Berthet
. Introduction à la lexicologie : sémantique et morphologie. Armand Colin, Paris 2008.Google Scholar
P. Lerat
. Les langues spécialisées. Presses Universitaires de France, Paris 1995.Google Scholar
J. Lethuillier
. La synonymie en langue de spécialité. META, 34(3):443–449 1989. CrossrefGoogle Scholar
J. Levi
. The Syntax and the Semantics of Complex Nominals. Academic Press, London 1978.Google Scholar
M.-C. L'Homme
. La terminologie : principes et techniques. Presses de l'Université de Montréal, Montréal 2004.Google Scholar
. Sur la notion de “terme”. META, 4:1112–1131 2005. CrossrefGoogle Scholar
D. Lin
. Automatic retrieval and clustering of similar words. In Proceedings of the 36th Annual Meeting of the Association for Computational Linguistics and 17th International Conference on Computational Linguistics (ACL-COLING 1998), pages 768–774 1998. CrossrefGoogle Scholar
D. Lin, S. Zhao, L. Qin, and M. Zhou
. Identifying synonyms among distributionally similar words. In Proceedings of the 8th International Joint Conference on Artificial Intelligence (IJCAI 2003) 2003.Google Scholar
E. Loginova, A. Gojun, H. Blancafort, M. Guégan, T. Gornostay, and U. Heid
. Reference lists for the evaluation of term extraction tools. In Proceedings of the 10th International Congress of Terminology and Knowledge Engineering (TKE 2012) 2012.Google Scholar
E. Loginova Clouet and B. Daille
. Splitting of Compound Terms in non-Prototypical Compounding Languages. In Proceedings of the Workshop on Computational Approaches to Compound Analysis associated to the 25th International Conference on Computational Linguistics (COLING 2014), pages 11–19, Dublin, Ireland, Aug. 2014.Google Scholar
A. Ludeling
. Neoclassical word-formation. Humboldt-Universität Berlin, Berlin 2012.Google Scholar
J. Léon
. Lexies, synapsies, synthèmes : le renouveau des études lexicales en France au début des années 1960. In S. Scalise and I. Vogel editors History of Linguistics in Texts and Concepts, pages 405–418. Nodus Publikationen, Münster 2004.Google Scholar
N. Madmani and B. J. Dorr
. Generating phrasal and sentential paraphrases: A survey of data-driven methods. Computational Linguistics, 36(3):341–387 2010. CrossrefGoogle Scholar
B. Maia
. What are comparable corpora? In Workshop on Multilingual Corpora: Linguistic requirements and technical perspectives at Corpus Linguistics 2003, pages 27–34, Lancaster, UK 2003.Google Scholar
D. Malrieu and F. Rastier
. Genres et variations morphosyntaxiques. Texto 2002a.Google Scholar
C. D. Manning and H. Schütze
. Foundations of Statistical Natural Language Processing. MIT Press, Cambridge, MA 1999.Google Scholar
D. C. Manning, P. Raghavan, and H. Schütze
. Introduction to Information Retrieval. Cambridge University Press, Cambridge 2008. CrossrefGoogle Scholar
E. Marshman, T. Morgan, and I. Meyer
. French patterns for expressing concept relations. Terminology, 8(1):1–29 2002. CrossrefGoogle Scholar
W. Martin
. Remarks on collocations in sublanguages. Terminology and Traduction (T & T), pages 157–164 1992.Google Scholar
C. Melloni and A. Bisetto. Parasynthetic compounds. In S. Scalise and I. Vogel editors Cross-disciplinary issues in compounding, volume 311 of Current issues in linguistic theory, pages 199–217. John Benjamins, Amsterdam 2010. CrossrefGoogle Scholar
I. Mel'čuk, N. Arbatchewsky-Jumarie, L. Elnitsky, L. Iordanskaja, and A. Lessard
. Dictionnaire explicatifet combinatoire du français contemporain. Presses de L'Université de Montréal, Montréal 1984.Google Scholar
I. Meyer
. Extracting knowledge-rich contexts for terminography: A conceptual and methodological framework. In D. Bourigault, C. Jacquemin, and M.-C. L'Homme editors Recent Advances in Computational Terminology, pages 279–302. John Benjamins, Amsterdam 2001. CrossrefGoogle Scholar
O. Mitrofanova and V. Zakharov
. Automatic analysis of terminology in the russian corpus on corpus linguistics. In Proceedings of the Fifth International Conference on NLP, Corpus Linguistics, Corpus Based Grammar Research (DIALOG 2009), pages 249–254 2009.Google Scholar
M. Montesi and T. Navarrete
. Classifying web genres in context: A case study documenting the web genres used by a software engineer. Information Processing Management, 44:1410–1430 2008. CrossrefGoogle Scholar
M. Morel and J. François
. Le dictionnaire électronique des synonymes du CRISCO : un outil de plus en plus interactif. Revue française de linguistique appliquée, XX(1): 9–28 2015.Google Scholar
E. Morin and B. Daille
. Revising the compositional method for terminology acquisition from comparable corpora. In Proceedings of the 24th International Conference on Computational Linguistics (COLING 2012), pages 1797–1810 2012.Google Scholar
F. Morlane-Hondère
. Une approche linguistique de l'évaluation des ressources extraites par l'analyse distributionnelle automatique. PhD thesis, Université Toulouse II Le Mirail 2013.Google Scholar
J. Morris and G. Hirst
. Non-classical lexical semantic relations. In Proceeding of the Workshop on Computational Lexical semantics (CLS 2004) associated to Human Language Technologies: The Fifth Annual Conference ofthe North American Chapter of the ACL (HLT-NAACL 2004), pages 46–51 2004.Google Scholar
M. L. Murphy
. Semantic relations and the Lexicon: Antonymy, Synonymy and other Paradigms. Cambridge University Press, Cambridge 2003. CrossrefGoogle Scholar
N. Flaux and D. Van del Velde
Les noms en Français, esquisse de classement. Orphrys, Paris 2000.Google Scholar
F. Namer
. Flemm : un analyseur flexionnel de français à base de règles. Traitement automatique des langues (TAL), 41(2):523–547 2000.Google Scholar
. Automatiser l'analyse morphosémantique non affixale : le système DériF. Cahiers de Grammaire, 28:31–48 2003.Google Scholar
F. Namer and R. H. Baud
. Defining and relating biomedical terms: Towards a cross-language morphosemantics-based system. I. J. Medical Informatics, pages 226–233 2007. CrossrefGoogle Scholar
V. Nastase and S. Szpakowicz
. Exploring noun-modifier semantic relations. In Fifth International Workshop on Computational Semantics (IWCS-5), page 285–301, Tilburg, The Netherlands 2003.Google Scholar
R. Nazar, J. Vivaldi, and L. Wanner
. Automatic taxonomy extraction for specialized domains using distributional semantics. Terminology, 18(2):188–225 2012. CrossrefGoogle Scholar
A. Nazarenko, H. Zargayouna, and O. Hamon J. van Puymbrouck
. Evaluation des outils terminologiques : enjeux, difficultés et propositions. Traitement automatique des langues (TAL), 50(1):257–281 2009.Google Scholar
F. Neveu
. Dictionnaire des sciences du langage. Armand Colin, Paris 2004.Google Scholar
A. Névéol, S. Shooshan, J. Mork, and A. Aronson. Fine-grained indexing of teh biomedical literature: Mesh subhading attachment for a medline indexing tool. In Proceedings of 31th Annual Symposium of the American Medical Informatics Association (AMIA 2007), pages 553–557 2007.Google Scholar
C. Olston and M. Najork
. Web Crawling. Foundations and Trends in Information Retrieval, 4(3):175–246 2010. CrossrefGoogle Scholar
G. Otman
. Les représentations sémantiques en terminologie. Masson, Paris 1996.Google Scholar
M. Pageau
. étude descriptive des emprunts à l'anglais dans le domaine de la biologie cellulaire. Master's thesis, Université de Montréal, Faculté des arts et des sciences, Département de linguistique et de traduction 2003.Google Scholar
A. Panchenko and O. Morozova
. A study of hybrid similarity measures for semantic relation extraction. In Proceedings of the Workshop on Innovative Hybrid Approaches to the Processing of Textual Data (HYBRID 2012) associated to the 13th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2012), pages 10–18 2012.Google Scholar
M. Paquot
. Academic Vocabulary in Learner Writing: From Extraction to Analysis. Bloomsbury Academic, London 2010.Google Scholar
B. H. Parnee, A. ter Meulen, and R. E. Wall
. Mathematical Methods in Linguistics, volume 30. Kluwer, Dordrecht 1990.Google Scholar
B. Partee
. Compositionality. In F. Landman and F. Veltman editors Varieties of Formal Semantics, pages 281–312. Foris, Dordrecht 1984.Google Scholar
S. Pavel and D. Nolet
. Précis de terminologie, Terminologie et Normalisation. Bureau de la traduction, Ministère des travaux publics et services au gouvernement, Canada 2001.Google Scholar
M. T. Pazienza, M. Pennacchiotti, and F. M. Zanzotto
. Terminology extraction: An analysis of linguistic and statistical approaches. In S. Sirmakessis editor Proceedings of the final conference of the Network of Excellence in Text Mining and its applications in Statistics conference (NEMIS 2004), volume 185 of Studies in Fuzziness and Soft Computing, pages 225–279. Springer Berlin Heidelberg 2005.Google Scholar
M. Pecman
. Variation as a cognitive device: How scientists construct knowledge through term formation. Terminology, 20(1):1–24 2014. CrossrefGoogle Scholar
S. Pereira, A. Névéol, O. Kerdelhué, E. Serrot, M. Joubert, and S. Darmoni
. Using multi-terminology indexing for the assignment of mesh descriptors to health resources in a french online calalogue. In Proceedings of 32th Annual Symposium of the American Medical Informatics Association (AMIA 2008) 2008.Google Scholar
V. Pirrelli, E. Guevara, and M. Baroni
. Computational issues in compound processing. In S. Scalise and I. Vogel editors Cross-disciplinary issues in compounding, volume 311 of Current issues in linguistic theory, pages 271–285. John Benjamins, Amsterdam 2010. CrossrefGoogle Scholar
A. Polguère
. Lexicologie et sémantique lexicale. Presses de l'Université de Montréal, Montréal 2003.Google Scholar
A. Périnet and T. Hamon
. Hybrid acquisition of semantic relations based on context normalization in distributional analysis. In Proceedings of the 10th International Conference on Terminology and Artificial Intelligence (TIA 2013), pages 113–122 2013.Google Scholar
P. Rayson and R. Garside
. Comparing corpora using frequency profiling. Presented at Comparing Corpora workshop, held in conjunction with the 38th annual meeting of the Association for Computational Linguistics (ACL 2000) 2000.
P. Resnik
. Selection and Information: A class-based approach to lexical relationships. PhD thesis, University of Pennsylvania 1993.Google Scholar
T. Roeper and M. E. A. Siegel. A lexical transformation for verbal compounds. Linguistic Inquiry, 9(2):199–260 1978.Google Scholar
J. Royauté
. Les groupes nominaux complexes et leur propriétés : application à l analyse de l'information. PhD thesis, Université Nancy 1 1999.Google Scholar
J. Sager
. A Practical Course in Terminology Processing. John Benjamins, Amsterdam 1990. CrossrefGoogle Scholar
B. Sagot
. The Lefff, a freely available and large-coverage morphological and syntactic lexicon for french. In Proceedings of the Seventh conference on International Language Resources and Evaluation (LREC 2010) 2010.Google Scholar
M. Sahlgren
. The distributional hypothesis. Rivista di Linguistica, 20(1)33–53 2008.Google Scholar
G. Salton and M. E. Lesk
. Computer evaluation of indexing and text processing. Journal of the Association for Computational Machinery, 15(1)8–36 1968. CrossrefGoogle Scholar
E. SanJuan and F. Ibekwe-SanJuan
. Text mining without documen context. Information Processing and Management, 26(6):1532–1552 2006. CrossrefGoogle Scholar
X. Saralegi, I. Vicente, and A. Gurrutxaga
. Automatic Extraction of Bilingual Terms from Comparable Corpora in a Popular Science Domain. In Proceedings of the Workshop on Comparable Corpora associated to the 6th international conference on Language Resources and Evaluation (LREC 2008) 2008.Google Scholar
A. Savary
. Computational inflection of multi-word units, a contrastive study of lexical approaches. Linguistic Issues in Language Technology (LiLT) CSLI Publications, 1(2): 1–55 2008.Google Scholar
S. Scalise and I. Vogel
editors Cross-disciplinary issues in compounding. John Benjamins, Amsterdam 2010. CrossrefGoogle Scholar
H. Schmid, A. Fitschen, and U. Heid
. Smor: A german computational morphology covering derivation, composition, and inflection. In Proceedings of the fourth international conference on Language Resources and Evaluation (LREC 2004), pages 1263–1266 2004.Google Scholar
G. Scurtu
. Traduire le vocabulaire juridique francais en roumain. Meta : journal des traducteurs /Meta: Translator's Journal, 53(4):884–898 2008. CrossrefGoogle Scholar
J. Sinclair
. Preliminary Recommendations on Corpus Typology. ) 2003.Google Scholar
G. A. Stephen
. String Searching Algorithms. World Scientific, Singapore 1994. CrossrefGoogle Scholar
K. Takeuchi, K. Kageura, B. Daille, and L. Romary
. Construction of grammar based term extraction model for Japanese. In Proceedings of the 3rd International Workshop on Computational Terminology (COMPUTERM 2004) associated to the 20th International Conference on Computational Linguistics (COLING 2004), pages 91–94 2004.Google Scholar
A. Tartier
. A Method for Observing Terminological Evolution. In G. Angelova, K. Bontcheva, R. Mitkov, N. Nicolov, and N. Nikolov editors Proceedings of the 4th Conference of Recent Advances in Natural Language Processing (RANLP 2003), pages 467–471 2003.Google Scholar
R. Temmerman
. Towards new ways of terminology description. The sociocognitive approach. John Benjamins, Amsterdam 2000. CrossrefGoogle Scholar
P. D. Turney and P. Pantel
. From frequency to meaning: Vector space models of semantics. Journal of Artificial Intelligence Research (JAIR), 37:141–188 2010.Google Scholar
A. Tutin
. Autour du lexique et de la phraséologie des écrits scientifiques. Revue française de linguistique appliquée, XII(2):5–14 2007.Google Scholar
K. Uchiyama
. A study for identifying domain-specific introductory terms in research papaers. In Proceedings of the 9th International Conference on Terminology and Artificial Intelligence (TIA 2011), pages 147–150 2011.Google Scholar
L. van der Plas and J. Tiedemann. Finding synonyms using automatic word alignment and measures of distributional similarity. In Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics (ACL 2006), pages 866–873 2006.Google Scholar
F. Ville-Ometz, J. Royauté, and A. Zasadzinski
. Enhancing in automatic recognition and extraction of term variants with linguistic features. Terminology, 13(1):61–84 2007. CrossrefGoogle Scholar
J. Vivaldi and H. Rodriguez
. Evaluation of terms and term extraction systems: a practical approach. Terminology, 13(2):225–248 2007. CrossrefGoogle Scholar
R. A. Wagner and M. J. Fischer
. The String-to-String Correction Problem. Association for Computing Machinery, 21(1):168–173, Jan. 1974. CrossrefGoogle Scholar
J.
. Measures and Applications of Lexical Distributional Similarity. PhD thesis, University of Sussex 2003.Google Scholar
M. Weller, A. Gojun, U. Heid, B. Daille, and R. Harastani
. Simple methods for dealing with term variation and term alignment. In Proceedings of the 9th International Conference on Terminology and Artificial Intelligence (TIA 2011), pages 87–93 2011.Google Scholar
G. C. Williams
. Les réseaux collocationnels dans la construction et l'exploitation d un corpus dans le cadre d'une communauté de discours scientifique. PhD thesis, Université de Nantes 1999.Google Scholar
M. Winston, R. Chaffin, and D. Herrmann
. A taxonomy of part-whole relations. Cognitive Science, 11(4):417–444 1987. CrossrefGoogle Scholar
E. Wüster
. The Machine Tool. An Interlingual Dictionary of Basic Conceps. Technical Press, London 1968.Google Scholar
F. Zanettin
. Bilingual comparable corpora and the training of translators. Meta : journal des traducteurs/Meta: Translator's Journal, 43(4):616–630 1998. CrossrefGoogle Scholar
Q. Zou, W. Chu, C. Morioka, G. Leazer, and H. Kangarloo
. IndexFinger: A method of extracting key concepts from clinical texts for indexing. In Proceedings of 27th Annual Symposium of the American Medical Informatics Association (AMIA 2003), pages 763–767 2003.Google Scholar
Cited by

Cited by 10 other publications

No author info given
2017. Publications received. Terminology. International Journal of Theoretical and Applied Issues in Specialized Communication 23:2  pp. 293 ff. Crossref logo
No author info given
2020.  In Lexical Semantics for Terminology [Terminology and Lexicography Research and Practice, 20], Crossref logo
Biel, Łucja & Agnieszka Doczekalska
2020. How do supranational terms transfer into national legal systems?. Terminology. International Journal of Theoretical and Applied Issues in Specialized Communication 26:2  pp. 184 ff. Crossref logo
Budin, Gerhard
2019.  In Terminologie : Epochen – Schwerpunkte – Umsetzungen [Kommunikation und Medienmanagement, ],  pp. 7 ff. Crossref logo
Faber, Pamela & Melania Cabezas-García
2019. Specialized Knowledge Representation: From Terms to Frames. Research in Language 17:2  pp. 197 ff. Crossref logo
Fernández-Silva, Sabela
2019. The Cognitive and Communicative Functions of Term Variation in Research Articles: A Comparative Study in Psychology and Geology. Applied Linguistics 40:4  pp. 624 ff. Crossref logo
Humbert-Droz, Julie, Aurélie Picton & Anne Condamines
2019. How to build a corpus for a tool-based approach to determinologisation in the field of particle physics. Research in Corpus Linguistics 7  pp. 1 ff. Crossref logo
Loukachevitch, Natalia & Ekaterina Parkhomenko
2018.  In Artificial Intelligence [Communications in Computer and Information Science, 934],  pp. 112 ff. Crossref logo
Loukachevitch, Natalia & Ekaterina Parkhomenko
2018.  In Text, Speech, and Dialogue [Lecture Notes in Computer Science, 11107],  pp. 126 ff. Crossref logo
Miyata, Rei & Kyo Kageura
2018. Building controlled bilingual terminologies for the municipal domain and evaluating them using a coverage estimation approach. Terminology. International Journal of Theoretical and Applied Issues in Specialized Communication 24:2  pp. 149 ff. Crossref logo

This list is based on CrossRef data as of 06 may 2021. Please note that it may not be complete. Sources presented here have been supplied by the respective publishers. Any errors therein should be reported to them.

Subjects

Terminology & Lexicography

Lexicography
Terminology
BIC Subject: CFM – Lexicography
BISAC Subject: LAN009000 – LANGUAGE ARTS & DISCIPLINES / Linguistics / General
U.S. Library of Congress Control Number:  2017018285 | Marc record