The cycle of lexicographic and linguistic work involved in compiling a computational phraseological database is divided into three phases and described in relation to the specific challenges that multi-word expressions (MWEs) pose for a lexical database. Data collection is far from complete for the MWEs of English, and the variability of some phrases makes identifying all their occurrences in large corpora a major challenge. Formalization of the form and variability of MWEs is an interrelated process that can improve tools for data collection and other applications. Increased use of the phraseological lexical database in NLP applications can ultimately lead to further insights into the nature of MWEs and to improvements in the database. Because of the volume of lexicographic data on MWEs that still needs to be collected, analysed and formalized, and the cyclical nature of the work, the resulting lexical database should be reusable in as many applications as possible. WordManager-PhraseManager, the lexical resource described in the second part of the chapter, can capture the variability of MWEs in a way that allows for maximum reusability of lexical data.