Chapter 13
Binomial adverbs in Germanic and Romance Languages
A corpus-based study
As a special type of multiword expressions, binomials are a frequent phenomenon in many languages. We focus on binomial adverbs that are coordinations of two adverbial constituents. Their syntactic contribution to a sentence is adverbial as well and their semantic contribution is idiomatic. They have many uses, such as to intensify (first and foremost), express tendency (more and more), frequency (over and over [again]), vagueness (more or less), determination (sooner or later) etc.
In this work, we describe our approaches to identify binomial adverbs in a large multiparallel corpus. Alongside the well-known measure of reversibility, we also calculate measures of statistical association and look for single-word translation equivalents in other languages. Combining these features facilitates the identification of binomial adverbs.
Article outline
- 1.Introduction
- 2.Related work
- 3.Corpus preparation
- 4.Identification of candidates
- 5.Candidate ranking
- 6.Expression boundaries
- 7.Translation variants
- 8.Conclusions
-
Acknowledgements
-
Note
-
References
-
Appendix
References (24)
References
Bendz, Gerhard. 1965. Ordpar. Stockholm: P. A. Nordstedt & Söners Förlag.
Callegaro, Elena. 2017. “Parallel Corpora for the Investigation of (Variable) Article Use in English: A Construction Grammar Approach.” PhD diss., University of Zurich.
Constant, Matthieu, and Joakim Nivre. 2016. “A Transition-Based System for Joint Lexical and Syntactic Analysis.” In Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (ACL), 161–171.
Copestake, Ann, and Aurélie Herbelot. 2011. “Exciting and interesting: issues in the generation of binomials.” In Proceedings of the UCNLG+Eval: Language Generation and Evaluation Workshop, 45–53.
Evert, Stefan. 2008. “Corpora and collocations.” In Corpus Linguistics. An International Handbook, edited by A. Lüdeling and M. Kytö, 2:1212–1248. Berlin: Walter de Gruyter.
Graën, Johannes. 2018. “Exploiting Alignment in Multiparallel Corpora for Applications in Linguistics and Language Learning.” PhD diss., University of Zurich.
Graën, Johannes, Dolores Batinic, and Martin Volk. 2014. “Cleaning the Europarl Corpus for Linguistic Applications.” In Proceedings of the Conference on Natural Language Processing (KONVENS), 222–227. Stiftung Universität Hildesheim, October.
Graën, Johannes, Mara Bertamini, and Martin Volk. 2018. “Cutter – a Universal Multilingual Tokenizer.” In Proceedings of the 3rd Swiss Text Analytics Conference, edited by Mark Cieliebak, Don Tuggener, and Fernando Benites, 75–81. CEUR Workshop Proceedings 2226. CEUR-WS, June.
Graën, Johannes, Tannon Kew, Anastassia Shaitarova, and Martin Volk. 2019. “Modelling Large Parallel Corpora: The Zurich Parallel Corpus Collection.” In Proceedings of the 7th Workshop on Challenges in the Management of Large Corpora (CMLC), edited by Piotr Bański et al.
Koehn, Philipp. 2005. “Europarl: A parallel corpus for statistical machine translation.” In Proceedings of the 10th Machine Translation Summit, 5:79–86. Asia-Pacific Association for Machine Translation.
Kopaczyk, Joanna, and Hans Sauer, eds. 2017a. Binomials in the History of English: Fixed and Flexible. Cambridge University Press.
Kopaczyk, Joanna, and Hans Sauer. 2017b. “Defining and exploring binomials.” Binomials in the History of English: Fixed and Flexible: 1–23.
Lambrecht, Knud. 1984. “Formulaicity, frame semantics, and pragmatics in German binomial expressions.” Language 60 (4): 753–796.
Malkiel, Yakov. 1959. “Studies in irreversible binomials.” Lingua 8:113–160.
Masini, Francesca. 2006. “Binomi coordinati in italiano.” In Prospettive nello studio del lessico italiano: Atti del 9. congresso SILFI, 2:563–571.
Müller, Gereon. 1997. “Beschränkungen für Binomialbildungen im Deutschen.” Zeitschrift für Sprachwissenschaft 16 (1): 25–51.
Östling, Robert. 2012. “Stagger: A modern POS tagger for Swedish.” In Proceedings of the 4th Swedish Language Technology Conference (SLTC).
Petrov, Slav, Dipanjan Das, and Ryan McDonald. 2012. “A Universal Part-of-Speech Tagset.” In Proceedings of the 8th International Conference on Language Resources and Evaluation (LREC), edited by Nicoletta Calzolari et al. Istanbul: European Language Resources Association (ELRA). isbn: 978-2-9517408-7-7.
Schmid, Helmut. 1994. “Probabilistic part-of-speech tagging using decision trees.” In Proceedings of International Conference on New Methods in Natural Language Processing, 12:44–49.
Shannon, Claude Elwood. 1948. “A Mathematical Theory of Communication.” The Bell System Technical Journal 27:379–423.
Voghera, Miriam. 2004. “Polirematiche.” In La formazione delle parole in italiano, 56–69. Max Niemeyer Verlag.
Volk, Martin, Simon Clematide, Johannes Graën, and Phillip Ströbel. 2016. “Bi-particle Adverbs, PoS-Tagging and the Recognition of German Separable Prefix Verbs.” In Proceedings of the 13th Conference on Natural Language Processing (KONVENS), 297–305. September.
Volk, Martin, and Johannes Graën. 2017. “Multi-word Adverbs – How well are they handled in Parsing and Machine Translation?” In Proceedings of The 3rd Workshop on Multi-word Units in Machine Translation and Translation Technology (MUMTTT).
Cited by (1)
Cited by one other publication
Więcławska, Edyta
2023.
Approaching Legal Multinomials from the Sociolinguistic Perspective – Insights into Authorship-Based Distinctions.
International Journal for the Semiotics of Law - Revue internationale de Sémiotique juridique 36:4
► pp. 1699 ff.
This list is based on CrossRef data as of 4 july 2024. Please note that it may not be complete. Sources presented here have been supplied by the respective publishers.
Any errors therein should be reported to them.