Parallel corpora make sense
Bypassing the knowledge acquisition bottleneck for Word Sense Disambiguation
We present a multilingual approach to Word Sense Disambiguation (WSD), which automatically assigns the contextually appropriate sense to a given word. Instead of using a predefined monolingual sense-inventory, we use a language-independent framework by deriving the senses of a given word from word alignments on a multilingual parallel corpus, which we made available for corpus linguistics research. We built five WSD systems with English as the input language and translations in five supported languages (viz. French, Dutch, Italian, Spanish and German) as senses. The systems incorporate both binary translation features and local context features. The experimental results are very competitive, which confirms our initial hypothesis that each language contributes to the disambiguation of polysemous words. Because our system extracts all information from the parallel corpus, it offers a flexible language-independent approach, which implicitly deals with the sense distinctness issue and allows us to bypass the knowledge acquisition bottleneck for WSD.
Keywords: cross-lingual Word Sense Disambiguation, parallel corpus, SemEval, polysemy, Word Sense Disambiguation
Published online: 01 September 2014
Agirre, E. & Martinez, D.
2000 “Exploring automatic word sense disambiguation with decision lists and the Web”. In Proceedings of the COLING 2000 Workshop on Semantic Annotation and Intelligent Content , 11-19.
2004 “The Basque Country University system: English and Basque tasks”. In Proceedings of Senseval-3: Third International Workshop on the Evaluation of Systems for the Semantic Analysis of Text . Barcelona, Spain, 44–48.
Agirre, E. & Edmonds, P.
Brown, P., Pietra, S., Pietra, V. & Mercer, R.
1991 “Word-sense disambiguation using statistical methods”. In Proceedings of the 29th Annual Meeting of the Association for Computational Linguistics . Berkeley, California, 264–270.
Chan, Y. & Ng H.
2005 “Scaling up Word Sense Disambiguation via parallel texts”. In Proceedings of the 20th National Conference on Artificial Intelligence (AAAI 2005) . Pittsburgh, Pennsylvania, 1037–1042.
Daelemans, W. & van den Bosch, A.
Dagan, I., Itai, A. & Schwall, U.
1991 “Two languages are more informative than one”. In Proceedings of the 29th Annual Meeting of the Association for Computational Linguistics , 130–137.
Diab, M. & Resnik, P.
2002 “An unsupervised method for word sense tagging using parallel corpora”. In Proceedings of the 40 th Annual Meeting of the Association for Computational Linguistics, Philadelphia, USA, 255–262.
Gale, W. & Church, K.
Gale, W., Church, K. & Yarowsky, D.
1992b “One sense per discourse”. In Proceedings of the DARPA Speech and Natural Language Workshop , New York, USA, 233–237.
Gale, W. & Church, K.
1991 “A program for aligning sentences in bilingual corpora”. In Proceeding of the 29th Annual Meeting of the Association for Computational Linguistics , Berkeley, California, USA, 177–184.
van Gompel M.
2010 “UvT-WSD1: A cross-lingual word sense disambiguation system”. In Proceedings of the 5th International Workshop on Semantic Evaluation, Association for Computational Linguistics , Uppsala, Sweden, 238–241.
Guo, W. & Diab, M.
2010 “COLEPL and COLSLM: An unsupervised WSD approach to multilingual lexical substitution, Tasks 2 and 3 SemEval 2010”. In Proceedings of the 5th International Workshop on Semantic Evaluation, Association for Computational Linguistics , Uppsala, Sweden, 129–133.
Hoste, V., Hendrickx, I., Daelemans, W. & van den Bosch, A.
Hovy, E.H., Marcus, M., Palmer, M., Pradhan, S., Ramshaw, L. & Weischedel, R.
2006 “OntoNotes: The 90% solution”. In Proceedings of the Human Language Technology and North Americal Association of Computational Linguistics conference (HLT-NAACL 2006) , New York, NY, 57-60.
Ide, N. & Véronis, J.
Ide, N., Erjavec, T. & Tufis¸ D.
2002 “Sense discrimination with parallel corpora”. In ACL-2002Workhop on Word Sense Disambiguation: Recent Successes and Future Directions , Philadelphia, PA, 54–60.
Kilgarriff, A. & Palmer, M.
2005 “Europarl: A parallel corpus for statistical machine translation”. In Tenth Machine Translation Summit , Phuket, Thailand, 79–86.
Lopez de Lacalle, O.
Lefever, E. & Hoste, V.
2010 “SemEval-2010 Task 3: Cross-lingual word sense disambiguation”. In Proceedings of the 5th International Workshop on Semantic Evaluation, ACL 2010 , Uppsala, Sweden, 15–20.
Lefever, E., Hoste, V. & De Cock, M.
2011 “ParaSense or how to use parallel corpora for word sense disambiguation”. In Proceedings of the 49 th Annual Meeting of the Assocation for Computational Linguistics: Human Language Technologies, Portland, Oregon, USA, 317-322.
Lefever, E., Hoste, V. & De Cock, M.
Macken, L., De Clercq, O. & Paulussen, H.
Màrquez, L., Escudero, G., Martínez, D. & Rigau, G.
Mihalcea, R., Sinha, R. & McCarthy, D.
2010 “SemEval-2010 Task 2: Cross-lingual lexical substitution”. In Proceedings of SemEval-2010: 5 th International Workshop on Semantic Evaluaions, ACL 2010, Uppsala, Sweden, 9-14.
Mohler, M. & Mihalcea, R.
2008 “Babylon parallel text builder: Gathering parallel texts for low-density languages”. In Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC’08), Marrakech, Morocco, 28-30.
Navigli, R. & Lapata, M.
2007 “Graph connectivity measures for unsupervised word sense disambiguation”. In Proceedings of the 20th international joint conference on Artifical intelligence, Hyderabad, India, 1683-1688.
Navigli, R. & Ponzetto, S.
2010 “BabelNet: Building a very large multilingual semantic network”. In Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics , Uppsala, Sweden, 216–225.
Ng, H. & Lee, H.
1996 “Integrating multiple knowledge sources to disambiguate word sense: An exemplar-based approach”. In Proceedings of the Thirty-Fourth Annual Meeting of the Association for Computational Linguistics (ACL-96) , 40–47.
Ng, H., Wang, B. & Chan, Y.
2003 “Exploiting parallel texts for word sense disambiguation: An empirical study”. In 41st Annual Meeting of the Association for Computational Linguistics (ACL) , Sapporo, Japan, 455–462.
Och, F. & Ney, H.
Palmer, M., Dang, H. & Fellbaum, C.
Rada, R., Mili, H., Bicknell, E. & Blettner, M.
Resnik, P. & Smith, N.
Rieger, C. & Small, S.
1979 “Word expert parsing”. In Proceedings of the 6 th International Joint Conference on Artificial Intelligence (IJCAI), Tokyo, Japan, 723–728.
1994 “Probabilistic part-of-speech tagging using decision trees”. In Proceedings of the International Conference on new methods in Language Processing , Manchester, UK, 44-49.
Shah, R., Lin, B., Gershman, A. & Frederking, R.
2010 “SYNERGY: A named entity recognition system for resource-scarce languages such as Swahili using online machine translation”. In Proceedings of the Second Workshop on African Language Technology (AFLAT 2010) , Valletta, Malta, 21-26.
Sinha, R. & Mihalcea, R.
2007 “Unsupervised graph-based word sense disambiguation using measures of word semantic similarity”. Proceedings of the IEEE International Conference on Semantic Computing (ICSC2007), Irvine, CA, USA, 363-369.
Tufiş¸ D., Cristea, D. & Stamou, S.
Tufiş¸ D., Ion, R. & Ide, N.
2004b “Fine-grained word sense disambiguation based on parallel corpora, word alignment, word clustering and aligned wordnets”. In Proceedings of the 20th International Conference on Computational Linguistics (COLING 2004), Association for Computational Linguistics, Geneva, Switzerland, 1312–1318.
Wasow, T., Perfors, A. & Beaver, D.
1993 “One sense per collocation”. In Proceedings of the ARPA Human Language Technology Workshop , Plainsboro, USA, 265–271.
1995 “Unsupervised word sense disambiguation rivaling supervised methods”. In Proceedings of the 33 rd annual meeting on Association for Computational Linguistics, Cambridge, Massachusetts, 189-196.