Terminology extraction for global content management
The role of terminology in content management has often been underrated. Term extraction has been identified by the information industry as an area requiring focus. Term extraction benefits both the content authoring and the translation process. Supplying key product terms to translation services several weeks before the actual translation begins reduces translation time, improves translation quality, and saves effort (and thus money) by reducing duplication of work. Getting the key terms ready in a timely manner can be difficult without some automation. This paper describes the process of proposing, designing, developing, and deploying a terminology extraction tool. The tool extracts nouns and noun groups, excludes non-translatable terms and known product terms, and displays a context for each extracted item. This is done based on full parsing of the text with a broad-coverage parser. The tool is made available to users on a Web server.
Cited by (3)
Cited by three other publications
Zhao, Chongchong, Chao Dong & Xiaoming Zhang
2018.
ROCP: A Rapid Ontology Construction Platform from Unstructured Data.
Data Science Journal 17
Al-Thubaity, Abdul Mohsen, Marwa Khan, Saad Alotaibi & Badriyya Alonazi
2014.
2014 International Conference on Asian Language Processing (IALP),
► pp. 1 ff.
Bae, Young-Jun, Ji-Hui Im, Cheol-Young Ock, Kang-Seop Shim, Dong-Myoung Kim & Ho-Seop Choe
2007.
Sixth International Conference on Advanced Language Processing and Web Information Technology (ALPIT 2007),
► pp. 177 ff.
This list is based on CrossRef data as of 10 july 2024. Please note that it may not be complete. Sources presented here have been supplied by the respective publishers.
Any errors therein should be reported to them.