Parallel corpora make sense
Bypassing the knowledge acquisition bottleneck for Word Sense Disambiguation
We present a multilingual approach to Word Sense Disambiguation (WSD), which automatically assigns the contextually appropriate sense to a given word. Instead of using a predefined monolingual sense-inventory, we use a language-independent framework by deriving the senses of a given word from word alignments on a multilingual parallel corpus, which we made available for corpus linguistics research. We built five WSD systems with English as the input language and translations in five supported languages (viz. French, Dutch, Italian, Spanish and German) as senses. The systems incorporate both binary translation features and local context features. The experimental results are very competitive, which confirms our initial hypothesis that each language contributes to the disambiguation of polysemous words. Because our system extracts all information from the parallel corpus, it offers a flexible language-independent approach, which implicitly deals with the sense distinctness issue and allows us to bypass the knowledge acquisition bottleneck for WSD.
References
Agirre, E. & Martinez, D
2000 “
Exploring automatic word sense disambiguation with decision lists and the Web”. In
Proceedings of the COLING 2000 Workshop on Semantic Annotation and Intelligent Content
, 11-19.
Agirre, E. & Martinez, D
2004 “
The Basque Country University system: English and Basque tasks”. In
Proceedings of Senseval-3: Third International Workshop on the Evaluation of Systems for the Semantic Analysis of Text
. Barcelona, Spain, 44–48.
Agirre, E. & Edmonds, P
2006 Word Sense Disambiguation. Algorithms and applications. Seacaucus: Springer.
Boden, B
2010 “Word Sense Disambiguation based on parallel corpora: A comparison of the meanings and translations of five ambiguous words found in the Europarl corpus with those found in the dictionary”. Unpublished Master’s dissertation, Ghent University College.
Brown, P., Pietra, S., Pietra, V. & Mercer, R
1991 “
Word-sense disambiguation using statistical methods”. In
Proceedings of the 29th Annual Meeting of the Association for Computational Linguistics
. Berkeley, California, 264–270.
Chan, Y. & Ng H
2005 “
Scaling up Word Sense Disambiguation via parallel texts”. In
Proceedings of the 20th National Conference on Artificial Intelligence (AAAI 2005)
. Pittsburgh, Pennsylvania, 1037–1042.
Daelemans, W. & van den Bosch, A
2005 Memory-based Language Processing. Cambridge University Press.
Dagan, I., Itai, A. & Schwall, U
1991 “
Two languages are more informative than one”. In
Proceedings of the 29th Annual Meeting of the Association for Computational Linguistics
, 130–137.
Diab, M
2004 “Word Sense Disambiguation within a multilingual framework”. Unpublished doctoral dissertation, University of Maryland, USA.
Diab, M. & Resnik, P
2002 “
An unsupervised method for word sense tagging using parallel corpora”. In
Proceedings of the 40
th Annual Meeting of the Association for Computational Linguistics, Philadelphia, USA, 255–262.
Fellbaum, C
1998 WordNet: An Electronic Lexical Database. Cambridge: MIT Press.
Gale, W. & Church, K
1993 “A program for aligning sentences in bilingual corpora”.
Computational Linguistics, 19 (1), 75–102.
Gale, W., Church, K. & Yarowsky, D
1992a “A method for disambiguating word senses in a large corpus”.
Computers and the Humanities, 261, 415–439.
Gale, W., Church, K. & Yarowsky, D
1992b “
One sense per discourse”. In
Proceedings of the DARPA Speech and Natural Language Workshop
, New York, USA, 233–237.
Gale, W. & Church, K
1991 “
A program for aligning sentences in bilingual corpora”. In
Proceeding of the 29th Annual Meeting of the Association for Computational Linguistics
, Berkeley, California, USA, 177–184.
Gentner, D
1981 “Some interesting differences between verbs and nouns”.
Cognition and Brain Theory, 4 (2), 161-178.
van Gompel M
2010 “
UvT-WSD1: A cross-lingual word sense disambiguation system”. In
Proceedings of the 5th International Workshop on Semantic Evaluation,
Association for Computational Linguistics
, Uppsala, Sweden, 238–241.
Gries, S
2006 “Corpus-based methods and cognitive semantics: The many meanings of to run
. In
S. Th. Gries &
A. Stefanowitsch (Eds.),
Corpora in Cognitive Linguistics: Corpus-based Approaches to Syntax and Lexis, 57-99.
Guo, W. & Diab, M
2010 “
COLEPL and COLSLM: An unsupervised WSD approach to multilingual lexical substitution, Tasks 2 and 3 SemEval 2010”. In
Proceedings of the 5th International Workshop on Semantic Evaluation, Association for Computational Linguistics
, Uppsala, Sweden, 129–133.
Hoste, V., Hendrickx, I., Daelemans, W. & van den Bosch, A
2002 “Parameter optimization for machine-learning of word sense disambiguation”.
Natural Language Engineering, Special Issue on Word Sense Disambiguation Systems, 8 (4), 311-325.
Hovy, E.H., Marcus, M., Palmer, M., Pradhan, S., Ramshaw, L. & Weischedel, R
2006 “
OntoNotes: The 90% solution”. In
Proceedings of the Human Language Technology and North Americal Association of Computational Linguistics conference (HLT-NAACL 2006)
, New York, NY, 57-60.
Ide, N. & Véronis, J
1998 “Word sense disambiguation: The state of the art”.
Computational Linguistics, 24 (1), 1–40.
Ide, N., Erjavec, T. & Tufis¸ D
2002 “Sense discrimination with parallel corpora”. In
ACL-2002Workhop on Word Sense Disambiguation: Recent Successes and Future Directions
, Philadelphia, PA, 54–60.
Kaplan, A
1955 “An experimental study of ambiguity and context”.
Mechanical Translation, 2 (2), 39–46.
Kilgarriff, A
1997 “I don’t believe in word senses”.
Computers and the Humanities, 31 (2), 91–113.
Kilgarriff, A. & Palmer, M
2000 “Senseval: Special issue of the journal Computers and the Humanities”.
Computers and the Humanities, 341 (1-2).
Koehn, P
2005 “Europarl: A parallel corpus for statistical machine translation”. In
Tenth Machine Translation Summit
, Phuket, Thailand, 79–86.
Lopez de Lacalle, O
2009 “Domain-specific word sense disambiguation”. Unpublished doctoral dissertation, University of the Basque Country, Donostia, Basque Country.
Lefever, E. & Hoste, V
2010 “
SemEval-2010 Task 3: Cross-lingual word sense disambiguation”. In
Proceedings of the 5th International Workshop on Semantic Evaluation, ACL 2010
, Uppsala, Sweden, 15–20.
Lefever, E., Hoste, V. & De Cock, M
2011 “
ParaSense or how to use parallel corpora for word sense disambiguation”. In
Proceedings of the 49
th Annual Meeting of the Assocation for Computational Linguistics: Human Language Technologies, Portland, Oregon, USA, 317-322.
Lefever, E
2012 “ParaSense: Parallel corpora for word sense disambiguation”. Unpublished doctoral dissertation, Ghent University, Belgium.
Lefever, E., Hoste, V. & De Cock, M
2013 “Five languages are better than one: An attempt to bypass the data acquisition bottleneck for WSD”. In
A. Gelbukh (Ed.),
CICLing 2013, Part I, LNCS 7816,Berlin/Heidelberg: Springer-Verlag, 343-354.
Lesk, M
1986 “
Automated sense disambiguation using machine-readable dictionaries: How to tell a pine cone from an ice cream cone”. In
1986 ACM SIGDOC Conference
, Toronto, Canada, 24–26.
Macken, L., De Clercq, O. & Paulussen, H
2011,
“Dutch parallel corpus: A balanced copyright-cleared parallel corpus”.
Meta, 56 (2), 374-390.
Màrquez, L., Escudero, G., Martínez, D. & Rigau, G
2006 “Supervised corpus-based methods for WSD”. In
Agirre, E. &
Edmonds, P. (Eds.),
Word Sense Disambiguation: Algorithms and Applications. New York, NY: Springer, 167–216.
McCarthy, D
2009 “Word Sense Disambiguation: An overview”.
Language and Linguistics Compass, 3 (2), 537-558.
Mihalcea, R
2006 “Knowledge based methods for word sense disambiguation”. In
E. Agirre &
P. Edmonds (Eds.),
Word Sense Disambiguation. Algorithms and applications. Text, Speech and Language Technology. Secaucus: Springer-Verlag, 107–131.
Mihalcea, R., Sinha, R. & McCarthy, D
2010 “
SemEval-2010 Task 2: Cross-lingual lexical substitution”. In
Proceedings of SemEval-2010: 5
th International Workshop on Semantic Evaluaions, ACL 2010, Uppsala, Sweden, 9-14.
Mohler, M. & Mihalcea, R
2008 “
Babylon parallel text builder: Gathering parallel texts for low-density languages”. In
Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC’08),
Marrakech, Morocco, 28-30.
Navigli, R. & Lapata, M
2007 “
Graph connectivity measures for unsupervised word sense disambiguation”. In
Proceedings of the 20th international joint conference on Artifical intelligence, Hyderabad, India, 1683-1688.
Navigli, R
2009 “Word Sense Disambiguation: A survey”.
ACM Computing Surveys, 41 (2), 1–69.
Navigli, R. & Ponzetto, S
2010 “
BabelNet: Building a very large multilingual semantic network”. In
Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics
, Uppsala, Sweden, 216–225.
Ng, H. & Lee, H
1996 “
Integrating multiple knowledge sources to disambiguate word sense: An exemplar-based approach”. In
Proceedings of the Thirty-Fourth Annual Meeting of the Association for Computational Linguistics (ACL-96)
, 40–47.
Ng, H., Wang, B. & Chan, Y
2003 “Exploiting parallel texts for word sense disambiguation: An empirical study”. In
41st Annual Meeting of the Association for Computational Linguistics (ACL)
, Sapporo, Japan, 455–462.
Och, F. & Ney, H
2003 “A systematic comparison of various statistical alignment models”.
Computational Linguistics, 29 (1), 19–51.
Palmer, M., Dang, H. & Fellbaum, C
2007 “Making fine-grained and coarse-grained sense distinctions, both manually and automatically”.
Journal of Natural Language Engineering, 13 (2), 137-163.
Rada, R., Mili, H., Bicknell, E. & Blettner, M
1989 “Development and application of a metric on semantic nets”.
IEEE Transactions on Systems, Man, and Cybernetics, 19 (1), 17–30.
Resnik, P
1999 “Disambiguating noun groupings with respect to WordNet senses”. In
S. Armstrong,
K. Church,
P. Isabelle,
S. Manzi,
E. Tzoukermann &
D. Yarowsky (Eds.),
Natural Language Processing Using Very Large Corpora, Dordrecht: Kluwer Academic Publishers, 77–98.
Resnik, P. & Smith, N
2003 “The web as a parallel corpus”.
Computational Linguistics, 29 (3), 349-380.
Rieger, C. & Small, S
1979 “
Word expert parsing”. In
Proceedings of the 6
th International Joint Conference on Artificial Intelligence (IJCAI), Tokyo, Japan, 723–728.
Schmid, H
1994 “
Probabilistic part-of-speech tagging using decision trees”. In
Proceedings of the International Conference on new methods in Language Processing
, Manchester, UK, 44-49.
Schütze, H
1998 “Automatic word sense discrimination”.
Computational Linguistics, 24 (1), 97–123.
Shah, R., Lin, B., Gershman, A. & Frederking, R
2010 “
SYNERGY: A named entity recognition system for resource-scarce languages such as Swahili using online machine translation”. In
Proceedings of the Second Workshop on African Language Technology (AFLAT 2010)
, Valletta, Malta, 21-26.
Sinha, R. & Mihalcea, R
2007 “
Unsupervised graph-based word sense disambiguation using measures of word semantic similarity”.
Proceedings of the IEEE International Conference on Semantic Computing (ICSC2007), Irvine, CA, USA, 363-369.
Tufiş¸ D., Cristea, D. & Stamou, S
2004a “BalkaNet: Aims, methods, results, and perspectives. A general overview”.
Romanian Journal of Information Science and Technology, 7 (1–2), 9–43.
Tufiş¸ D., Ion, R. & Ide, N
2004b “
Fine-grained word sense disambiguation based on parallel corpora, word alignment, word clustering and aligned wordnets”. In
Proceedings of the 20th International Conference on Computational Linguistics
(COLING 2004), Association for Computational Linguistics, Geneva, Switzerland, 1312–1318.
Vossen, P
1998 EuroWordNet: A Multilingual Database with Lexical Semantic Networks. Norwell, MA: Kluwer Academic Publishers.
Wasow, T., Perfors, A. & Beaver, D
2005 “The puzzle of ambiguity”. In
C.O. Orgun &
P. Sells (Eds.),
Morphology and The Web of Grammar: Essays in Memory of Steven G. Lapointe. Stanford: CSLI Publications, 265-282.
Weaver, W
1949 “Translation”. In
W.N. Locke &
A.D. Booth (Eds.),
Machine Translation of Languages: Fourteen Essays. Cambridge: Technology Press of the Massachusetts Institute of Technology, 15-23.
Wilks, Y
1975 “Preference semantics”. In
Keenan, E. (Ed.),
Formal Semantics of Natural Language. Cambridge: Cambridge University Press, 329–348.
Yarowsky, D
1993 “
One sense per collocation”. In
Proceedings of the ARPA Human Language Technology Workshop
, Plainsboro, USA, 265–271.
Yarowksy, D
1995 “
Unsupervised word sense disambiguation rivaling supervised methods”. In
Proceedings of the 33
rd annual meeting on Association for Computational Linguistics, Cambridge, Massachusetts, 189-196.
Yarowsky, D
2000 “Hierarchical decision lists for word sense disambiguation”.
Computers and the Humanities, 34 (2), 179-186.
Cited by
Cited by 1 other publications
Lim, Lily
2021.
A Corpus-Based Examination of the Translation of the Suffix –ism into Chinese. In
New Perspectives on Corpus Translation Studies [
New Frontiers in Translation Studies, ],
► pp. 29 ff.
This list is based on CrossRef data as of 4 july 2024. Please note that it may not be complete. Sources presented here have been supplied by the respective publishers.
Any errors therein should be reported to them.