Article published in:
Computational terminology and filtering of terminological informationEdited by Patrick Drouin, Natalia Grabar, Thierry Hamon, Kyo Kageura and Koichi Takeuchi
[Terminology 24:1] 2018
► pp. 41–65
Clinical sublanguages
Vocabulary structure and its impact on term weighting
Leonie Grön | KU Leuven
Ann Bertels | KU Leuven
Due to its specific linguistic properties, the language found in clinical records has been characterized as a distinct sublanguage. Even within the clinical domain, though, there are major differences in language use, which has led to more fine-grained distinctions based on medical fields and document types. However, previous work has mostly neglected the influence of term variation. By contrast, we propose to integrate the potential for term variation in the characterization of clinical sublanguages. By analyzing a corpus of clinical records, we show that the different sections of these records vary systematically with regard to their lexical, terminological and semantic composition, as well as their potential for term variation. These properties have implications for automatic term recognition, as they influence the performance of frequency-based term weighting.
Keywords: clinical sublanguage, term variation, electronic health records, Dutch
Article outline
- 1.Background
- 2.Related research
- 3.Sublanguages, semantic classes and variation types
- 3.1Sublanguages
- 3.2Classes of medical concepts
- 3.3Types of variation
- 4.Corpus study 1: Characterization of sublanguages across sections
- 4.1Corpus characteristics
- 4.2Preprocessing
- 4.3Annotation procedure and feature set
- 4.4Research questions of corpus study 1
- 4.5Results of corpus study 1
- 4.5.1Global lexical structure
- 4.5.2Distribution of semantic types across sections
- 4.5.3Distribution of term types across sections
- 4.6Discussion of corpus study 1
- 5.Corpus study 2: Impact of vocabulary structure on frequency-based term weighting
- 5.1Research questions of corpus study 2
- 5.2Corpus and preprocessing
- 5.3Term filtering
- 5.4Results of corpus study 2
- 5.4.1Precision
- 5.4.2Recall
- 5.5Discussion of corpus study 2
- 6.Conclusion
- Notes
-
References
Published online: 31 May 2018
https://doi.org/10.1075/term.00013.gro
https://doi.org/10.1075/term.00013.gro
References
Afzal, Zubair, Ewoud Pons, Ning Kang, Miriam Sturkenboom, Martijn J. Schuemie, and Jan A. Kors
Ahmad, Khurshid, Lee Gillam, and Lena Tostevin
1999 “University of Surrey Participation in TREC8: Weirdness Indexing for Logical Document Extrapolation and Retrieval (WILDER).” In Proceedings of the 8th Text Retrieval Conference (TREC-8), ed. by Ellen M. Voorhees, and Donna K. Harman, 717–724. Washington: National Institute of Standards and Technology.
Bansler, Jørgen P., Erling C. Havn, Kjeld Schmidt, and Troels Mønsted
Bowker, Lynne, and Shane Hawkins
Chiaramello, Emma, Francesco Pinciroli, Alberico Bonalumi, Angelo Caroli, and Gabriella Tognola
Doing-Harris, Kristina, Olga Patterson, Sean Igo, and John Hurdle
Doing-Harris, Kristina, Yarden Livnat, and Stephane Meystre
Faber, Pamela
Faber, Pamela, and Pilar León-Araúz
Feldman, Keith, and Nicholas Hazekamp
Frantzi, Katerina, Sophia Ananiadou, and Hideki Mima
Friedman, Carol
Friedman, Carol, Pauline Kra, and Andrey Rzhetsky
Grigonyte, Gintare, Maria Kvist, Mats Wirén, Sumithra Velupillai, and Aron Henriksson
Harris, Zellig Sabbettai
He, Zhe, Zhiwei Chen, Sanghee Oh, Jinghui Hou, and Jiang Bian
Jensen, Lotte G., and Claus Bossen
Kaufman, David R., Barbara Sheehan, Peter Stetson, Ashish R. Bhatt, and I. Adele
Leaman, Robert, Ritu Khare, and Zhiyong Lu
León-Araúz, Pilar, Pamela Faber, and Silvia Montero Martínez
Lossio-Ventura, Juan Antonio, Clement Jonquet, Mathieu Roche, and Maguelonne Teisseire
Biomedical Term Extraction: Overview and a New Methodology.” Information Retrieval Journal 19 (2016): 59–99.
Lövestam, Elin, Sumithra Velupillai, and Maria Kvist
Patterson, Olga O., and John F. Hurdle
Periñán-Pascual, Carlos
Riveros, Alejandro, Maria De-Arteaga, Fabio A. Gonzalez, and Sergio Jimenez
2014 “MindLab-UNAL: Comparing Metamap and T-Mapper for Medical Concept Extraction in SemEval 2014 Task 7.” In Proceedings of the 8th International Workshop on Semantic Evaluation (SemEval 2014), edited by Preslav Nakov and Torsten Zesch, 424–27. Dublin, Ireland: Association for Computational Linguistics. 

Roberts, Angus
Rosenbloom, S Trent, Joshua C. Denny, Hua Xu, Nancy Lorenzi, William W. Stead, and Kevin B. Johnson
Sager, Naomi, Margaret Lyman, Christine Bucknall, Ngo Nhan, and Leo Tick
Siklósi, Borbála, Attila Novák, and Gábor Prószéky
Stetson, Peter D., Stephen B. Johnson, Matthew Scotch, and George Hripcsak
Temmerman, Rita
Temnikova, Irina, Ivelina Nikolova, William Baumgartner, Galia Angelova, and Kevin Cohen
Topaz, Maxim, Kenneth Lai, Dawn Dowding, Victor Lei, Anna Zisberg, Kathryn H. Bowles, and Li Zhou
Cited by
Cited by 1 other publications
Vezzani, Federica & Giorgio Maria Di Nunzio
This list is based on CrossRef data as of 07 february 2022. Please note that it may not be complete. Sources presented here have been supplied by the respective publishers. Any errors therein should be reported to them.