Véronique Hoste
List of John Benjamins publications for which Véronique Hoste plays a role.
Book series
Journal
ISSN 0378-4169 | E-ISSN 1569-9927
Tagging terms in text: A supervised sequential labelling approach to automatic term extraction Terminology 28:1, pp. 157–189 | Article
2022 As with many tasks in natural language processing, automatic term extraction (ATE) is increasingly approached as a machine learning problem. So far, most machine learning approaches to ATE broadly follow the traditional hybrid methodology, by first extracting a list of unique candidate terms,… read more
HAMLET: Hybrid Adaptable Machine Learning approach to Extract Terminology Terminology 27:2, pp. 254–293 | Article
2021 Automatic term extraction (ATE) is an important task within natural language processing, both separately, and as a preprocessing step for other tasks. In recent years, research has moved far beyond the traditional hybrid approach where candidate terms are extracted based on part-of-speech… read more
An automatic part-of-speech tagger for Middle Low German International Journal of Corpus Linguistics 22:1, pp. 107–140 | Article
2017 Syntactically annotated corpora are highly important for enabling large-scale diachronic and diatopic language research. Such corpora have recently been developed for a variety of historical languages, or are still under development. One of those under development is the fully tagged and parsed… read more
Parallel corpora make sense: Bypassing the knowledge acquisition bottleneck for Word Sense Disambiguation International Journal of Corpus Linguistics 19:3, pp. 333–367 | Article
2014 We present a multilingual approach to Word Sense Disambiguation (WSD), which automatically assigns the contextually appropriate sense to a given word. Instead of using a predefined monolingual sense-inventory, we use a language-independent framework by deriving the senses of a given word from word… read more
HypoTerm: Detection of hypernym relations between domain-specific terms in Dutch and English Lexical semantic approaches to terminology, Faber, Pamela and Marie-Claude L'Homme (eds.), pp. 250–278 | Article
2014 HypoTerm is a data-driven semantic relation finder that starts from a list of automatically extracted domain- and user-specific terms from technical corpora, and generates a list of relations between these terms. This research study focused on the detection of hypernym relations between relevant… read more
TExSIS: Bilingual terminology extraction from parallel corpora using chunk-based alignment Terminology 19:1, pp. 1–30 | Article
2013 We report on TExSIS, a flexible bilingual terminology extraction system that uses a sophisticated chunk-based alignment method for the generation of candidate terms, after which the specificity of the candidate terms is determined by combining several statistical filters. Although the set-up of… read more
Classification-based scientific term detection in patient information Terminology 16:1, pp. 1–29 | Article
2010 Although intended for the “average layman”, both in terms of readability and contents, the current patient information still contains many scientific terms. Different studies have concluded that the use of scientific terminology is one of the factors, which greatly influences the readability of… read more