Term Variation in Specialised Corpora

Characterisation, automatic discovery and applications

| University of Nantes - LINA
HardboundAvailable
ISBN 9789027223432 | EUR 99.00 | USD 149.00
 
e-Book
ISBN 9789027265357 | EUR 99.00 | USD 149.00
 
This book addresses term variation which has been a very important topic in terminology, computational terminology and natural language processing for up to twenty years. This book presents the first complete inventory of term variants and the linguistic procedures that lead to their formation. It also takes into account issues raised by multilingual applications and presents ways to detect variants in five different languages: French, English, German, Spanish and Russian.

The book provides insights into the following issues: What is a variant? What are the main linguistic mechanisms involved in the transformation of base terms into variants? How can variants be automatically detected in texts? Should variation be taken into account in natural language processing applications?

This book is targeted at terminologists and linguists interested in term variation as well as researchers in natural language processing and computer science that must handle term variants in different kinds of applications.
Publishing status: Available
Table of Contents
Acknowledgements
xi–xii
Chapter 1. Introduction
2–8
Part I. Characterisation
12–80
Chapter 2. Definitions
12–31
Chapter 3. Conceptualisation of terminological variants
34–65
Chapter 4. Semantics of conceptual variants
68–80
Part II. Automatic discovery
84–168
Chapter 5. Primitive exploration of variants using comparable corpora
84–108
Chapter 6. Processing methods for the detection of variants from corpora
110–124
Chapter 7. Grammar of variants
126–146
Chapter 8. Synonymic variants
148–168
Part III. Applications and tools
172–221
Chapter 9. Terminology extraction
172–203
Chapter 10. End-user applications and tools
206–221
Part IV. Conclusions
226–233
Chapter 11. Term variants and their discovery
226–233
Bibliography
235–246
Appendix A. Notation
Appendix B. Multext categories
250
Appendix C. Search with Antconc
252–255
Appendix D. GGRV
258–269
Index
271–272
References

Bibliography

S. Abney
. Partial parsing via finite-state cascades. Natural Language Engineering, 2(4): 337–344 1996. Crossref link
K. Ahmad, A. Davies, H. Fulford, and M. Rogers
. What is a term? The semiautomatic extraction of terms from text. Translation Studies: An Interdiscipline, pages 267–278 1994.
D. Amiot and G. Dal
. Integrating neoclassical combining forms into a lexeme-based morphology. In Proceedings of the 5th Mediterranean Morphology Meeting (MMM5) 2007.
. La composition néoclassique en français et l'ordre des constituants. La composition dans les langues, pages 89–113 2008.
A. Aronson
. Effective mapping of biomedical text to the UMLS metathesaurus: The metamap program. In Proceedings of the 25th Annual Symposium of the American Medical Informatics Association (AMIA 2001), pages 17–21 2001.
M. Baroni and S. Bernardini
. BootCaT: Bootstrapping corpora and terms from the web. In Proceedings of the 4th conference on International Language Resources and Evaluation (LREC 2004), pages 1313–1316 2004.
M. Baroni and A. Lenci
. Distributional memory: A general framework for corpus-based semantics. Computational Linguisics, 36(4):673–721 2010. Crossref link
C. Barrière
. Recherche contextuelle d'équivalents en banque de terminologie. In Actes de la 17ème conférence sur le Traitement Automatique des Langues Naturelles (TALN 2010) 2010.
L. Bauer
. English word-formation. Cambridge University Press, Cambridge 1983. Crossref link
. Morphological productivity. Cambridge University Press, Cambridge 2001. Crossref link
L. Bauer and A. Renouf
. A corpus-based study of compounding in english. Journal of English Linguistics, 29:101–123 2001. Crossref link
H.-D. Béchade
. Phonétique et morphologie du français moderne et contemporain. Presses Universitaires de France, Paris 1992.
O. Ben-Kiki, C. Evans, and B. Ingerson
. Yaml ain't markup language (yamlTM) version 1.1. yaml. org, Tech. Rep 2005.
M. Benson
. Collocations and general-purpose dictionaries. International Journal of Lexicography, 3(1):23–25 1990. Crossref link
É. Benveniste
. Problèmes de linguistique générale, chapter Formes nouvelles de la composition nominale, pages 163–173. Gallimard, Paris 1966.
. Problèmes de linguistique générale. Gallimard, Paris 1974.
D. Bergmark, C. Lagoze, and A. Sbityakov
. Focused crawls, tunneling, and digital libraries. In M. Agosti and C. Thanos editors Research and Advanced Technology for Digital Librairies – 6th European Conference, ECDL 2002, volume 2458 of Lecture notes in computer science, pages 91–106. Springer, New York 2002.
D. Biber
. A Typology of English Texts. Linguistics, 27(1):3–43 1989. Crossref link
D. Biber, S. Johansson, G. Leech, S. Conrad, and E. Finegan. Longman Grammar of Spoken and Written English. Longman, London 1999.
M.-B. Billami, J. Camacho-Collados, E. Jacquey, and L. Kister
. Semantic annotation and terminology validation in full scientific articles in social sciences and humanities (annotation sémantique et validation terminologique en texte intégral en shs). In Actes de la 21e conférence sur le Traitement Automatique des Langues Naturelles (TALN 2014), pages 363–376 2014.
L. Bloomfield
. Language. Allen and Unwin, London 1935.
N. N. Bobyreva
. Syntactic way of term-building in terminological system of rhytmic gymnastics in the Russian and the English languages. TCHGPU, 12 2009.
E. Bolshakova
. Common scientific lexicon for automatic discourse analysis of scientific and technical texts. International Journal “Information Theories & pplications”, 15: 189–195 2008.
B. Bommier-Pincemin
. Diffusion ciblée automatique d'informations : conception et mise en œuvre d'une linguistique textuelle pour la caractérisation des destinataires et des documents. PhD thesis, Université Paris IV, Sorbonne 1999.
A. Bosredon
. Un adjectif de trop : l'adjectif de relation. Information grammaticale, 37: 3–7 1989. Crossref link
. Prédicativité et lexicalisation. Studia Romanica Posnaniensia, 16:85–100 1991.
A. Bougouin, F. Boudin, and B. Daille
. TopicRank: Graph-Based Topic Ranking for Keyphrase Extraction. In Proceedings of the 8th International Joint Conference on Natural Language Processing (IJCNLP 2013), pages 543–551 2013.
D. Bourigault
. LEXTER : un logiciel d'EXtraction de TERminologie. Application à l'extraction des connaissances à partir de textes. Mathématiques, informatique ap-pliquée aux sciences de l'Homme, École des Hautes Études en Sciences Sociales, Paris 1994.
D. Bourigault and C. Jacquemin
. Construction de ressources terminologiques. In J.-M. Pierrel editor Ingénierie des langues, pages 215–234. Hermes, Paris 2000.
D. Bourigault and M. Slodzian
. Pour une terminologie textuelle. Terminologies nou-velles, 19:29–32 1999.
L. Bowker and S. Hawkins
. Variation in the organization of medical terms – exploring some motivations of term choice. Terminology, 12(1):79–110 2006. Crossref link
L. Bowker and J. Pearson
. Working with Specialized Language: A Practical Guide to Using Corpora. Routledge, London 2002. Crossref link
D. A. Burgos Herera
. Towards an Image-Term Co-occurrence Model for Multilingual Terminolog Alignment and Cross-Language Image Indexing. PhD thesis, Universitat Pompeu Fabra 2014.
G. Burnage
. CELEX: A Guide for Users. Center for Lexical Information, Univerty of Nijmegen 1990.
H. Bussmann
. Routledge Dictionary of Language and Linguistics. Routledge, London 1996.
M. T. Cabré
. La terminologie: théorie, méthodes et applications. Armand Colin, Paris 1998.
. La terminologia. Representación y communicación. Una teoría de base communicativa. Universitat Pompeu Fabra, Barcelona 1999.
. Do we need an autonomous theory of terminology? Terminology, 5(1): 2–20 2000.
M. T. Cabré and J. Sager
. Terminology: Theory, Methods, and Applications. John Benjamins, Amsterdam 1999. Crossref link
M. T. Cabré, R. E. Bagot, and J. V. Platresi
. Automatic term detection: A review of current systems. In D. Bourigault, C. Jacquemin, and M.-C. L'Homme editors Recent Advances in Computational Terminology, volume 2 of Natural Language Processing, pages 53–88. John Benjamins, Amsterdam 2001.
E. Carey. L'énergie éolienne. Master's thesis, Maitrise LEA. Université Paris III 2004.
R. Chaffin and D. J. Herrmann
. The similarity and diversity of semantic relations. Memory and Cognition, 12(2):134–141 1984. Crossref link
A. X. Chang and C. D. Manning
. TokensRegex: Defining cascaded regular expressions over tokens. Technical Report CSTR 2014–02, Department of Computer Science, Stanford University 2014.
S. Chaudiron
. L'évaluation des systèmes de traitement de l'information textuelle : vers un changement de paradigme. Mémoire pour HDR, Université de Paris X, Paris 2001.
G. Ciapuscio
. Textos especializados y terminología. PhD thesis, Institut Universitari de Lingüística Aplicada, Universitat Pompeu Fabra, Barcelona 2003.
T. Collet
. La réduction des unités terminologiques complexes de type syntagmatique. Meta : journal des traducteurs / Meta: Translators' Journal, 42(1):193–206 1997. Crossref link
CollinsDictionary. Collins dictionary
2013.
G. Colpron
. Les anglicismes au québec : essai de classification et de numération comparative. Master's thesis, Université de Montreal, Faculté des lettres 1965.
A. Condamines
. Corpus analysis and conceptual relation patterns. Terminology, 8(1): 141–162 2002. Crossref link
. L'expression de la méronymie dans les petites annonces immobil-ières ; comparaison français, anglais, espagnol. Journal of French Language Studies (JFLS), 19:3–23 2009. Crossref link
M. Creutz and K. Lagus
. Unsupervised morpheme segmentation and morphology induction from text corpora using morfessor 1.0. In Computer and Information Science, Report A81, Helsinki University of Technology 2005.
M. Crochemore, C. Hancart, and T. Lecroq
. Algorithmique du texte. Vuibert, Paris 2001.
K. Crowston, B. Kwaśnik, and J. Rubleske
. Problems in the use-centered development of a taxonomy. In A. Meher, S. Sharoff, and M. Santini editors Genres on the Web, volume 42 of Text, Speech and Language Technologies series, Chapter 4. Springer, New York 2010.
D. A. Cruse
. Lexical Semantics. Cambridge University Press, Cambridge 1995.
. Meaning in Language: An Introduction to Semantics and Pragmatics. Oxford University Press, Oxford 2004.
J. R. Curran and M. Moens
. Improvements in automatic thesaurus extraction. In Proceedings of the Workshop on Unsupervised Lexical Acquisition associated to the 40th Annual Meeting of the Association for Computational Linguistics (ACL 2002), pages 59–66 2002.
B. Daille
. Variations and application-oriented terminology engineering. Terminology, 11 (1): 181–197 2005. Crossref link
B. Daille, B. Habert, C. Jacquemin, and J. Royauté
. Empirical observation of term variations and principles for their description. Terminology, 3(2):197–257 1996. Crossref link
B. Daille, C. Enguehard, C. Jacquin, R. Raharinirina, B. Ralalaoherivony, and C. Lehman
. Traitement automatique de la terminologie en langue malgache. In K. Chibout, J. Mariani, N. Masson, and F. Néel editors Ressources et évaluation en ingénierie des langues, Actualité scientifique – Universités Francophones, pages 225–242. De Boek & Larcier, Bruxelles 2000.
J. Dancette and M.-C. L'Homme
. Building Specialized Dictionaries using Lexical Functions. Linguistica Antverpiensa, 3:113–132 2004.
C. de
Groc. Babouk: Focused web crawling for corpus compilation and automatic terminology extraction. In Proceedings of the 2011 IEEE/WIC/ACM International Conferences on Web Intelligence and Intelligent Agent Technology (WI-IAT 2011), pages 497–498 2011.
L. Déléger and P. Zweigenbaum. Extracting lay paraphrases of specialized expressions from monolingual comparable medical corpora. In Proceedings of the 2nd Workshop on Building and Using Comparable Corpora: from Parallel to Non-parallel Corpora, pages 2–10, Suntec, Singapour 2009. Crossref link
A. Depierre
. Souvent HAEMA varie …: Les dérivés du grec HAEMA en anglais : Étude de cas de variation. Terminology, 13(2):155–176 2007. Crossref link
P. Drouin
. Term extraction using non-technical corpora as a point of leverage. Terminology, 9(1):99–117 2003. Crossref link
T. Dunning
. Accurate methods for the statistics of surprise and coincidence. Computational Linguistics, 19(1):61–74 1993.
M. Dupuch, T. Hamon, and N. Grabar
. Groupement de termes basé sur des régularités linguistiques et sémantiques dans un contexte cross-langue. In Actes de la 20e conférence sur le Traitement Automatique des Langues Naturelles (TALN 2013), pages 62–75 2013.
C. Durieux
. Fondement didactique de la traduction technique. La maison du dictionnaire, Paris 2010.
P. Dury
. Les indices d'évolution diachronique dans les dictionnaires de langue générale : une étude de cas autour de carbon. In F. Maniez and P. Dury editors Lexicologie et terminologie : histoire des mots, Travaux du CRTT, pages 269–284. Presse universitaire de l'Université Lumière-Lyon 2, Lyon 2007.
P. Dury and P. Drouin
. L'obsolescence des termes en langues de spécialité : une étude semi-automatique de la “nécrologie” en corpus informatisés, appliquée au domaine de l'écologie. In Proceedings of the XVII European LSP Symposium 2009, pages 1–11 2010.
DWIA
. Wind Energy Reference Manual. Danish Wind Industry Association 2003.
C. Dyer
. Using a maximum entropy model to build segmentation lattices for MT. In Proceedings of the Human Language Technologies: The 11th Annual Conference of the North American Chapter of the ACL (HLT-NAACL 2009), pages 406–414 2009.
R. Estopà, J. Vivaldi, and M. T. Cabré
. Use of greek and latin forms for term detection. In Proceedings of the 2nd conference on International Language Resources and Evaluation (LREC 2000) 2000.
R. Estopà Bagot
. Les unités de signification spécialisées élargissant l'objet du travail en terminologie. Terminology, 7(2):217–237 2001. Crossref link
B. S. Everitt
. The Analysis of Contingency Tables. Chapman & Hall, London 1992.
S. Evert
. Corpora and Collocations. In A. Lüdeling and M. Kytö editors Corpus Linguistics: An International Handbook, pages 1212–1248. Mouton de Gruyter, Berlin 2008.
S. Evert and B. Krenn
. Using small random samples for the manual evaluation of statistical association measures. Computer Speech & Language, 19(4):450–466 2005. Crossref link
L. Faigley and S. Witte
. Analyzing revision. College Composition and Communication, 32(4):400–414 1981. Crossref link
R. M. Fano
. Transmission of Information: A statistical Theory of Communications. MIT Press, Cambridge, MA 1961.
C. Fellbaum
editor WordNet: An Electronic Lexical Database. MIT Press, Cambridge, MA 1998.
O. Ferret
. Testing semantic similarity measures for extracting synonyms from a corpus. In Proceedings of the 7th conference on International Language Resources and Evaluation (LREC 2010) 2010.
O. Ferret. Identifying bad semantic neighbors for improving distributional thesauri. In Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics (ACL 2013), pages 561–571 2013.
D. Ferrucci and A. Lally
. UIMA: an architectural approach to unstructured information processing in the corporate research environment. Natural Language Engineering, 10:327–348 2004. Crossref link
J. Firth
. A synopsis of linguistic theory 1930–1955. In F. R. Palmer editor Selected Papers of J. R. Firth 1952–1959, pages 168–205. Longman, London 1968.
T. Fontenelle
. Dictionnaires et outils de correction linguistique. Revue française de linguistique appliquée, X(2):119–128 2005.
P. Forchini and A. Murphy
. N-grams in comparable specialized corpora: Perspectives on phraseology, translation, and pedagogy. International Journal of Corpus Linguistics, 13(3):351–367 2008. Crossref link
B. Fradin
. Nouvelles approches en morphologie. Presses Universitaires de France, Paris 2003. Crossref link
K. Frantzi and S. Ananiadou
. The C-Value/NC-Value domain independent method for multi-word term extraction. Journal of Natural Language Processing, 6(3):145–179 2000. Crossref link
J. Freixa
. La variació terminològica: Anàlisi de la variació denominativa en textos de diferent grau d'especialització de l'àrea de medi ambient. PhD thesis, Institut Universitari de Lingüística Aplicada, Universitat Pompeu Fabra, Barcelona 2002.
. Causes of denominative variation – A typology proposal. Terminology, 12 (1):51–77 2006. Crossref link
O. Frunza and D. Inkpen
. Identification and Disambiguation of Cognates, False Friends, and Partial Cognates Using Machine Learning Techniques. International Journal of Linguistics, 1(1) 2009.
A. Fujita and P. Isabelle
. Expanding paraphrase lexicons by exploiting lexical variants. In Proceedings of the 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pages 630–640. Association for Computational Linguistics 2015.
W. A. Gale and K. W. Church
. Identifying word correspondences in parallel texts. In Proceedings of the Workshop Speech and Natural Language, pages 152–157 1991. Crossref link
F. Gaudin
. Socioterminologie. Une approche socioterminologique de la terminologie. De Boeck/Duculot, Bruxelles 2003.
A. Gelbukh and G. Sidorov
. Approach to construction of automatic morphological analysis systems for inflective languages with little effort. In A. Gelbukh editor Computational Linguistics and Intelligent Text Processing – 4th International Conference, CICLing 2003, volume 2588 of Lecture Notes in Computer Science, pages 215–220. Springer, New York 2003.
P. Gipe
. Wind power: renewable energy for home, farm, and business. Chelsea Green, Chelsea, VT 2004.
G. Grefenstette
. Corpus-derived first, second and third-order word affinities. In Proceedings of the 6th International Congress of the European Association for Lexicography (EURALEX 1994), pages 279–290 1994a.
. Explorations in Automatic Thesaurus Discovery. Kluwer, Boston, MA 1994b. Crossref link
G. Gross. Degré de figement des noms composés. Langages, 90:57–72 1988. Crossref link
M. Guégan and C. de Loupy
. Knowledge-poor approach to shallow parsing: Contribution of unsupervised part-of-speech induction. In G. Angelova, K. Bontcheva, R. Mitkov, and N. Nicolov editors Proceedings the Recent Advances in Natural Language Processing (RANLP 2011), pages 33–40 2011.
M. Hagiwara
. A supervised learning approach to automatic synonym identification based on distributional features. In Proceedings of the HLT Student Research Workshop associated to Workshop on the 46th Annual Meeting of the Association for Computational Linguistics (ACL 2008), pages 1–6 2008.
T. Hamon and A. Nazarenko
. Detection of synonymy links between terms: experiments and results. In D. Bourigault, C. Jacquemin, and M.-C. L'Homme editors Recent Advances in Computational Terminology, volume 2 of Natural Language Processing, pages 185–208. John Benjamins, Amsterdam 2001.
Y. Haralambous and E. Lavagnino
. La réduction de termes complexes dans les langues de spécialité. Traitement automatique des langues (TAL), 52(1):37–68 2011.
R. Harastani, B. Daille, and E. Morin
. Neoclassical compound alignments from comparable corpora. In Proceedings of the 13th International Conference Computational Linguistics and Intelligent Text Processing (CICLing 2012), pages 72–82 2012.
Z. S. Harris
. Distributional structure. Word, 10(23):146–162 1954. Crossref link
B. Hauer and G. Kondrak
. Clustering semantically equivalent words into cognate sets in multilingual lists. In Proceedings of the 5th International Joint Conference on Natural Language Processing (IJCNLP 2005), pages 865–873 2011.
E. Haugen
. The analysis of linguistic borrowing. Language: Journal of the Linguistic Society of America, 26:210–231 1950. Crossref link
F. J. Hausmann
. Kollokationen im deutschen wörterbuch. ein beitrag zur theorie des lexikographischen beispiels. In H. Bergenholtz and J. Mudgan editors Lexikographie und Grammatik. Akten des Essener Kolloquiums zur Grammatik im Worterbuch, volume 3 of Lexicographica Series Maior, pages 118–129. Niemeyer, Tübingen 1985.
A. Hazem and B. Daille
. Semi-compositional Method for Synonym Extraction of Multi-Word Terms. In Proceedings of the 9th edition of the Language Resources and Evaluation Conference (LREC 2014) 2014.
U. Heid
. On the representtion of collocational phenomena in sublanguage lexicons. In Proceedings of the Third International Congress on Terminology and Knowledge Engineering (TKE 1993), pages 343–353 1993.
D. Hindle
. Noun classification from predicate-argument structures. In Proceedings of the 28th Annual Meeting of the Association for Computational Linguistics (ACL 1990), pages 268–275 1990. Crossref link
D. Hull
. A practical approach to terminology alignment. In Proceedings of the First Workshop on Computational Terminology (COMPUTERM 1998), pages 1–7 1998.
F. Ibekwe-SanJuan
. Fouille de textes. Hermès Sciences – Lavoisier, Paris 2007.
C. Jacquemin
. Spotting and Discovering Terms through Natural Language Processing. MIT Press, Cambridge, MA 2001.
C. Jacquemin, B. Daille, J. Royauté, and X. Polanco
. In vitro evaluation for a program to machine-aided indexing. Information Processing & Management (IPM), 38(6):765–792 2002. Crossref link
M.-P. Jacques. Emploi de termes réduits comme révélateur de la centralité dans le domaine. In Actes des spetièmes journées scientifiques du réseau Lexicologie, Terminologie, Traduction (LTT) : Mots, termes et contextes, pages 299–308, Paris 2005 Archives Contemporaines.
E. Jacquey, A. Tutin, L. Kister, M.-P. Jacques, S. Hatier, and S. Ollinger
. Filtrage terminologique par le lexique transdisciplinaire scientifique : une expérimentation en sciences humaines. In Proceedings of the 10th International Conference on Terminology and Artificial Intelligence (TIA 2013), pages 121–128 2013.
G. J. F. Jones, M. Burke, J. Judge, A. Khasin, A. Lam-Adesina, and J. Wagner
. Experiments in monolingual, bilingual and multilingual retrieval. In C. Peters, P. Clough, J. Gonzalo, G. J. F. Jones, M. Kluck, and B. Magnini editors Multilingual Information Access for Text, Speech and Images: 5th Workshop of the Cross-Language Evaluation Forum, CLEF 2004, pages 207–220. Springer, Berlin 2005. Crossref link
A.-L. Jousse and M. Bouveret
. Lexical functions to represent derivational relations in specialized dictionaries. Terminology, 9(1):71–98 2003. Crossref link
A. Judea, H. Schütze, and S. Bruegmann
. Unsupervised training set generation for automatic acquisition of technical terminology in patents. In Proceedings of the 25th International Conference on Computational Linguistics (COLING 2014), pages 290–300 2014.
D. Jurafsky and J. H. Martin
. Speech and Language Processing: An Introduction to Natural Language Processing, Speech Recognition, and Computational Linguistics. Prentice-Hall, Upper Saddle River, NJ 2009.
J. S. Justeson and S. M. Katz
. Technical terminology: some linguistic properties and an algorithm for identification in text. Natural Language Engineering, 1(1):9–27 1995. Crossref link
K. Kageura
. The Quantitative Analysis of the Dynamics and Structure of Terminologies. John Benjamins, Amsterdam 2012. Crossref link
K. Kageura and B. Umino
. Methods of automatic term recognition: a review. Terminology, 3(2):259–289 1996. Crossref link
K. Kageura, M. Yoshioka, K. Takeuchi, K. Koyama, K. Tsuji, and F. Yoshikane
. Recent advances in automatic term recognition: Experiences from the NTCIR workshop on information retrieval and term recognition. Terminology, 6(2):151–173 2000. Crossref link
J.-D. Kim, T. Ohta, Y. Tateisi, and J. Tsujii
. Genia corpus – a semantically annotated corpus for bio-textmining. Bioinformatics, 19(suppl 1):1180–1182 2003. Crossref link
L. Kister
. Is it possible to predetermine a referent included in a French n de n structure? In S. P. Botley and A. M. McEnnery editors Corpus-Based and Computation Approches to Discourse Anaphora. John Benjamins, Amsterdam 2000.
G. Kleiber and I. Tamba
. L'hyperonymie revisitée : inclusion et hièrarchie. Langages, 98:7–32 1990. Crossref link
P. Kluegl, M. Toepfer, P. Beck, G. Fette, and F. Puppe
. UIMA ruta: Rapid development of rule-based information extraction applications. Natural Language Engineering, 22 (1):1–40 2016. Crossref link
R. Kocourek
. Textes et termes. Meta, 36(1) 1991. Crossref link
P. Koehn and K. Knight
. Empirical methods for compound splitting. In Proceedings of Eleventh Conference of the European Chapter of the Association for Computational Linguistics (EACL 2003), pages 187–193 2003.
M. Kraft
. Compositionality: The very idea. Research on Language and Computation, 5 (3):287–308 2007. Crossref link
O. Kraif. Exploration des cognats pour l'alignement. Traitement automatique des langues (TAL), 42(3):2–35 2002.
G. Kremer, K. Erk, S. Padó, and S. Thater
. What substitutes tell us – analysis of an “all-words” lexical substitution corpus. In Proceedings of the 14th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2014), pages 540–549 2014. Crossref link
A. Lehmann and F. Martin-Berthet
. Introduction à la lexicologie : sémantique et morphologie. Armand Colin, Paris 2008.
P. Lerat
. Les langues spécialisées. Presses Universitaires de France, Paris 1995.
J. Lethuillier
. La synonymie en langue de spécialité. META, 34(3):443–449 1989. Crossref link
J. Levi
. The Syntax and the Semantics of Complex Nominals. Academic Press, London 1978.
M.-C. L'Homme
. La terminologie : principes et techniques. Presses de l'Université de Montréal, Montréal 2004.
. Sur la notion de “terme”. META, 4:1112–1131 2005. Crossref link
D. Lin
. Automatic retrieval and clustering of similar words. In Proceedings of the 36th Annual Meeting of the Association for Computational Linguistics and 17th International Conference on Computational Linguistics (ACL-COLING 1998), pages 768–774 1998. Crossref link
D. Lin, S. Zhao, L. Qin, and M. Zhou
. Identifying synonyms among distributionally similar words. In Proceedings of the 8th International Joint Conference on Artificial Intelligence (IJCAI 2003) 2003.
E. Loginova, A. Gojun, H. Blancafort, M. Guégan, T. Gornostay, and U. Heid
. Reference lists for the evaluation of term extraction tools. In Proceedings of the 10th International Congress of Terminology and Knowledge Engineering (TKE 2012) 2012.
E. Loginova Clouet and B. Daille
. Splitting of Compound Terms in non-Prototypical Compounding Languages. In Proceedings of the Workshop on Computational Approaches to Compound Analysis associated to the 25th International Conference on Computational Linguistics (COLING 2014), pages 11–19, Dublin, Ireland, Aug. 2014.
A. Ludeling
. Neoclassical word-formation. Humboldt-Universität Berlin, Berlin 2012.
J. Léon
. Lexies, synapsies, synthèmes : le renouveau des études lexicales en France au début des années 1960. In S. Scalise and I. Vogel editors History of Linguistics in Texts and Concepts, pages 405–418. Nodus Publikationen, Münster 2004.
N. Madmani and B. J. Dorr
. Generating phrasal and sentential paraphrases: A survey of data-driven methods. Computational Linguistics, 36(3):341–387 2010. Crossref link
B. Maia
. What are comparable corpora? In Workshop on Multilingual Corpora: Linguistic requirements and technical perspectives at Corpus Linguistics 2003, pages 27–34, Lancaster, UK 2003.
D. Malrieu and F. Rastier
. Genres et variations morphosyntaxiques. Texto 2002a.
C. D. Manning and H. Schütze
. Foundations of Statistical Natural Language Processing. MIT Press, Cambridge, MA 1999.
D. C. Manning, P. Raghavan, and H. Schütze
. Introduction to Information Retrieval. Cambridge University Press, Cambridge 2008. Crossref link
E. Marshman, T. Morgan, and I. Meyer
. French patterns for expressing concept relations. Terminology, 8(1):1–29 2002. Crossref link
W. Martin
. Remarks on collocations in sublanguages. Terminology and Traduction (T & T), pages 157–164 1992.
C. Melloni and A. Bisetto. Parasynthetic compounds. In S. Scalise and I. Vogel editors Cross-disciplinary issues in compounding, volume 311 of Current issues in linguistic theory, pages 199–217. John Benjamins, Amsterdam 2010. Crossref link
I. Mel'čuk, N. Arbatchewsky-Jumarie, L. Elnitsky, L. Iordanskaja, and A. Lessard
. Dictionnaire explicatifet combinatoire du français contemporain. Presses de L'Université de Montréal, Montréal 1984.
I. Meyer
. Extracting knowledge-rich contexts for terminography: A conceptual and methodological framework. In D. Bourigault, C. Jacquemin, and M.-C. L'Homme editors Recent Advances in Computational Terminology, pages 279–302. John Benjamins, Amsterdam 2001. Crossref link
O. Mitrofanova and V. Zakharov
. Automatic analysis of terminology in the russian corpus on corpus linguistics. In Proceedings of the Fifth International Conference on NLP, Corpus Linguistics, Corpus Based Grammar Research (DIALOG 2009), pages 249–254 2009.
M. Montesi and T. Navarrete
. Classifying web genres in context: A case study documenting the web genres used by a software engineer. Information Processing Management, 44:1410–1430 2008. Crossref link
M. Morel and J. François
. Le dictionnaire électronique des synonymes du CRISCO : un outil de plus en plus interactif. Revue française de linguistique appliquée, XX(1): 9–28 2015.
E. Morin and B. Daille
. Revising the compositional method for terminology acquisition from comparable corpora. In Proceedings of the 24th International Conference on Computational Linguistics (COLING 2012), pages 1797–1810 2012.
F. Morlane-Hondère
. Une approche linguistique de l'évaluation des ressources extraites par l'analyse distributionnelle automatique. PhD thesis, Université Toulouse II Le Mirail 2013.
J. Morris and G. Hirst
. Non-classical lexical semantic relations. In Proceeding of the Workshop on Computational Lexical semantics (CLS 2004) associated to Human Language Technologies: The Fifth Annual Conference ofthe North American Chapter of the ACL (HLT-NAACL 2004), pages 46–51 2004.
M. L. Murphy
. Semantic relations and the Lexicon: Antonymy, Synonymy and other Paradigms. Cambridge University Press, Cambridge 2003. Crossref link
N. Flaux and D. Van del Velde
Les noms en Français, esquisse de classement. Orphrys, Paris 2000.
F. Namer
. Flemm : un analyseur flexionnel de français à base de règles. Traitement automatique des langues (TAL), 41(2):523–547 2000.
. Automatiser l'analyse morphosémantique non affixale : le système DériF. Cahiers de Grammaire, 28:31–48 2003.
F. Namer and R. H. Baud
. Defining and relating biomedical terms: Towards a cross-language morphosemantics-based system. I. J. Medical Informatics, pages 226–233 2007. Crossref link
V. Nastase and S. Szpakowicz
. Exploring noun-modifier semantic relations. In Fifth International Workshop on Computational Semantics (IWCS-5), page 285–301, Tilburg, The Netherlands 2003.
R. Nazar, J. Vivaldi, and L. Wanner
. Automatic taxonomy extraction for specialized domains using distributional semantics. Terminology, 18(2):188–225 2012. Crossref link
A. Nazarenko, H. Zargayouna, and O. Hamon J. van Puymbrouck
. Evaluation des outils terminologiques : enjeux, difficultés et propositions. Traitement automatique des langues (TAL), 50(1):257–281 2009.
F. Neveu
. Dictionnaire des sciences du langage. Armand Colin, Paris 2004.
A. Névéol, S. Shooshan, J. Mork, and A. Aronson. Fine-grained indexing of teh biomedical literature: Mesh subhading attachment for a medline indexing tool. In Proceedings of 31th Annual Symposium of the American Medical Informatics Association (AMIA 2007), pages 553–557 2007.
C. Olston and M. Najork
. Web Crawling. Foundations and Trends in Information Retrieval, 4(3):175–246 2010. Crossref link
G. Otman
. Les représentations sémantiques en terminologie. Masson, Paris 1996.
M. Pageau
. étude descriptive des emprunts à l'anglais dans le domaine de la biologie cellulaire. Master's thesis, Université de Montréal, Faculté des arts et des sciences, Département de linguistique et de traduction 2003.
A. Panchenko and O. Morozova
. A study of hybrid similarity measures for semantic relation extraction. In Proceedings of the Workshop on Innovative Hybrid Approaches to the Processing of Textual Data (HYBRID 2012) associated to the 13th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2012), pages 10–18 2012.
M. Paquot
. Academic Vocabulary in Learner Writing: From Extraction to Analysis. Bloomsbury Academic, London 2010.
B. H. Parnee, A. ter Meulen, and R. E. Wall
. Mathematical Methods in Linguistics, volume 30. Kluwer, Dordrecht 1990.
B. Partee
. Compositionality. In F. Landman and F. Veltman editors Varieties of Formal Semantics, pages 281–312. Foris, Dordrecht 1984.
S. Pavel and D. Nolet
. Précis de terminologie, Terminologie et Normalisation. Bureau de la traduction, Ministère des travaux publics et services au gouvernement, Canada 2001.
M. T. Pazienza, M. Pennacchiotti, and F. M. Zanzotto
. Terminology extraction: An analysis of linguistic and statistical approaches. In S. Sirmakessis editor Proceedings of the final conference of the Network of Excellence in Text Mining and its applications in Statistics conference (NEMIS 2004), volume 185 of Studies in Fuzziness and Soft Computing, pages 225–279. Springer Berlin Heidelberg 2005.
M. Pecman
. Variation as a cognitive device: How scientists construct knowledge through term formation. Terminology, 20(1):1–24 2014. Crossref link
S. Pereira, A. Névéol, O. Kerdelhué, E. Serrot, M. Joubert, and S. Darmoni
. Using multi-terminology indexing for the assignment of mesh descriptors to health resources in a french online calalogue. In Proceedings of 32th Annual Symposium of the American Medical Informatics Association (AMIA 2008) 2008.
V. Pirrelli, E. Guevara, and M. Baroni
. Computational issues in compound processing. In S. Scalise and I. Vogel editors Cross-disciplinary issues in compounding, volume 311 of Current issues in linguistic theory, pages 271–285. John Benjamins, Amsterdam 2010. Crossref link
A. Polguère
. Lexicologie et sémantique lexicale. Presses de l'Université de Montréal, Montréal 2003.
A. Périnet and T. Hamon
. Hybrid acquisition of semantic relations based on context normalization in distributional analysis. In Proceedings of the 10th International Conference on Terminology and Artificial Intelligence (TIA 2013), pages 113–122 2013.
P. Rayson and R. Garside
. Comparing corpora using frequency profiling. Presented at Comparing Corpora workshop, held in conjunction with the 38th annual meeting of the Association for Computational Linguistics (ACL 2000) 2000.
P. Resnik
. Selection and Information: A class-based approach to lexical relationships. PhD thesis, University of Pennsylvania 1993.
T. Roeper and M. E. A. Siegel. A lexical transformation for verbal compounds. Linguistic Inquiry, 9(2):199–260 1978.
J. Royauté
. Les groupes nominaux complexes et leur propriétés : application à l analyse de l'information. PhD thesis, Université Nancy 1 1999.
J. Sager
. A Practical Course in Terminology Processing. John Benjamins, Amsterdam 1990. Crossref link
B. Sagot
. The Lefff, a freely available and large-coverage morphological and syntactic lexicon for french. In Proceedings of the Seventh conference on International Language Resources and Evaluation (LREC 2010) 2010.
M. Sahlgren
. The distributional hypothesis. Rivista di Linguistica, 20(1)33–53 2008.
G. Salton and M. E. Lesk
. Computer evaluation of indexing and text processing. Journal of the Association for Computational Machinery, 15(1)8–36 1968. Crossref link
E. SanJuan and F. Ibekwe-SanJuan
. Text mining without documen context. Information Processing and Management, 26(6):1532–1552 2006. Crossref link
X. Saralegi, I. Vicente, and A. Gurrutxaga
. Automatic Extraction of Bilingual Terms from Comparable Corpora in a Popular Science Domain. In Proceedings of the Workshop on Comparable Corpora associated to the 6th international conference on Language Resources and Evaluation (LREC 2008) 2008.
A. Savary
. Computational inflection of multi-word units, a contrastive study of lexical approaches. Linguistic Issues in Language Technology (LiLT) CSLI Publications, 1(2): 1–55 2008.
S. Scalise and I. Vogel
editors Cross-disciplinary issues in compounding. John Benjamins, Amsterdam 2010. Crossref link
H. Schmid, A. Fitschen, and U. Heid
. Smor: A german computational morphology covering derivation, composition, and inflection. In Proceedings of the fourth international conference on Language Resources and Evaluation (LREC 2004), pages 1263–1266 2004.
G. Scurtu
. Traduire le vocabulaire juridique francais en roumain. Meta : journal des traducteurs /Meta: Translator's Journal, 53(4):884–898 2008. Crossref link
J. Sinclair
. Preliminary Recommendations on Corpus Typology. ) 2003.
G. A. Stephen
. String Searching Algorithms. World Scientific, Singapore 1994. Crossref link
K. Takeuchi, K. Kageura, B. Daille, and L. Romary
. Construction of grammar based term extraction model for Japanese. In Proceedings of the 3rd International Workshop on Computational Terminology (COMPUTERM 2004) associated to the 20th International Conference on Computational Linguistics (COLING 2004), pages 91–94 2004.
A. Tartier
. A Method for Observing Terminological Evolution. In G. Angelova, K. Bontcheva, R. Mitkov, N. Nicolov, and N. Nikolov editors Proceedings of the 4th Conference of Recent Advances in Natural Language Processing (RANLP 2003), pages 467–471 2003.
R. Temmerman
. Towards new ways of terminology description. The sociocognitive approach. John Benjamins, Amsterdam 2000. Crossref link
P. D. Turney and P. Pantel
. From frequency to meaning: Vector space models of semantics. Journal of Artificial Intelligence Research (JAIR), 37:141–188 2010.
A. Tutin
. Autour du lexique et de la phraséologie des écrits scientifiques. Revue française de linguistique appliquée, XII(2):5–14 2007.
K. Uchiyama
. A study for identifying domain-specific introductory terms in research papaers. In Proceedings of the 9th International Conference on Terminology and Artificial Intelligence (TIA 2011), pages 147–150 2011.
L. van der Plas and J. Tiedemann. Finding synonyms using automatic word alignment and measures of distributional similarity. In Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics (ACL 2006), pages 866–873 2006.
F. Ville-Ometz, J. Royauté, and A. Zasadzinski
. Enhancing in automatic recognition and extraction of term variants with linguistic features. Terminology, 13(1):61–84 2007. Crossref link
J. Vivaldi and H. Rodriguez
. Evaluation of terms and term extraction systems: a practical approach. Terminology, 13(2):225–248 2007. Crossref link
R. A. Wagner and M. J. Fischer
. The String-to-String Correction Problem. Association for Computing Machinery, 21(1):168–173, Jan. 1974. Crossref link
J.
. Measures and Applications of Lexical Distributional Similarity. PhD thesis, University of Sussex 2003.
M. Weller, A. Gojun, U. Heid, B. Daille, and R. Harastani
. Simple methods for dealing with term variation and term alignment. In Proceedings of the 9th International Conference on Terminology and Artificial Intelligence (TIA 2011), pages 87–93 2011.
G. C. Williams
. Les réseaux collocationnels dans la construction et l'exploitation d un corpus dans le cadre d'une communauté de discours scientifique. PhD thesis, Université de Nantes 1999.
M. Winston, R. Chaffin, and D. Herrmann
. A taxonomy of part-whole relations. Cognitive Science, 11(4):417–444 1987. Crossref link
E. Wüster
. The Machine Tool. An Interlingual Dictionary of Basic Conceps. Technical Press, London 1968.
F. Zanettin
. Bilingual comparable corpora and the training of translators. Meta : journal des traducteurs/Meta: Translator's Journal, 43(4):616–630 1998. Crossref link
Q. Zou, W. Chu, C. Morioka, G. Leazer, and H. Kangarloo
. IndexFinger: A method of extracting key concepts from clinical texts for indexing. In Proceedings of 27th Annual Symposium of the American Medical Informatics Association (AMIA 2003), pages 763–767 2003.
Cited by

Cited by other publications

No author info given
2017. Publications received. Terminology 23:2  pp. 293 ff. Crossref logo
No author info given
2020.  In Lexical Semantics for Terminology [Terminology and Lexicography Research and Practice, 20], Crossref logo
Budin, Gerhard
2019.  In Terminologie : Epochen – Schwerpunkte – Umsetzungen [Kommunikation und Medienmanagement, ],  pp. 7 ff. Crossref logo
Fernández-Silva, Sabela
2019. The Cognitive and Communicative Functions of Term Variation in Research Articles: A Comparative Study in Psychology and Geology. Applied Linguistics 40:4  pp. 624 ff. Crossref logo
Humbert-Droz, Julie, Aurélie Picton & Anne Condamines
2019. How to build a corpus for a tool-based approach to determinologisation in the field of particle physics. Research in Corpus Linguistics 7  pp. 1 ff. Crossref logo
Loukachevitch, Natalia & Ekaterina Parkhomenko
2018.  In Artificial Intelligence [Communications in Computer and Information Science, 934],  pp. 112 ff. Crossref logo
Loukachevitch, Natalia & Ekaterina Parkhomenko
2018.  In Text, Speech, and Dialogue [Lecture Notes in Computer Science, 11107],  pp. 126 ff. Crossref logo
Miyata, Rei & Kyo Kageura
2018. Building controlled bilingual terminologies for the municipal domain and evaluating them using a coverage estimation approach. Terminology 24:2  pp. 149 ff. Crossref logo

This list is based on CrossRef data as of 13 may 2020. Please note that it may not be complete. Sources presented here have been supplied by the respective publishers. Any errors therein should be reported to them.

Subjects

Terminology & Lexicography

Lexicography
Terminology
BIC Subject: CFM – Lexicography
BISAC Subject: LAN009000 – LANGUAGE ARTS & DISCIPLINES / Linguistics / General
U.S. Library of Congress Control Number:  2017018285