Article published in:
International Journal of Corpus Linguistics
Vol. 23:2 (2018) ► pp. 183215
Biber, B., Reppen, R., Schnur, E., & Ghanem, R.
(2016) On the (non)utility of Juilland’s D to measure lexical dispersion in large corpora. International Journal of Corpus Linguistics, 21(4): 439–464. CrossrefGoogle Scholar
Church, K., & Hanks, P.
(1990) Word association norms, Mutual Information, and lexicography. Computational Linguistics, 16(1): 22–29.Google Scholar
Daudaravičius, V., & Marcinkevičienė, R.
(2004) Gravity counts for the boundaries of collocations. International Journal of Corpus Linguistics, 9(2): 321–348. CrossrefGoogle Scholar
Davies, M.
(2008-) The Corpus of Contemporary American English (COCA): 520 million words, 1990-present. Available online at http://​corpus​.byu​.edu​/coca/ (last accessed June 2018).
Dunn, J.
(2017) Computational learning of construction grammars. Language and Cognition, 9(2): 254–292. CrossrefGoogle Scholar
(2018) Finding variants for construction-based dialectometry: A corpus-based approach to regional CxGs. Cognitive Linguistics, 29(2): 275–311. CrossrefGoogle Scholar
Ellis, N.
(2007) Language acquisition as rational contingency learning. Applied Linguistics, 27(1): 1–24. CrossrefGoogle Scholar
Erhan, D., Bengio, Y., Courville, A., Manzagol, P., Vincent, P., & Bengio, S.
(2010) Why does unsupervised pre-training help deep learning? Journal of Machine Learning Research, 111: 625–660.Google Scholar
Evert, S.
(2005) The Statistics of Word Co-Occurrences: Word Pairs and Collocations (Unpublished doctoral dissertation). Stuttgart, University of Stuttgart.Google Scholar
Gries, St. Th.
(2008) Dispersions and adjusted frequencies in corpora. International Journal of Corpus Linguistics, 13(4): 403–437. CrossrefGoogle Scholar
(2010) Bigrams in registers, domains, and varieties: A bigram gravity approach to the homogeneity of corpora. In Mahlberg, M., Diaz, V. & Smith, C. (Eds.) Proceedings of the 2009 Corpus Linguistics Conference. Liverpool: University of Liverpool.Google Scholar
(2012) Frequencies, probabilities, and association measures in usage- / exemplar-based linguistics. Studies in Language, 11(3): 477–510.Google Scholar
(2013) 50-something years of work on collocations: What is or should be next. International Journal of Corpus Linguistics, 18(1): 137–165. CrossrefGoogle Scholar
Gries, St. Th., & Mukherjee, J.
(2010) Lexical gravity across varieties of English: An ICE-based study of n-grams in Asian Englishes. International Journal of Corpus Linguistics, 15(4): 520–548. CrossrefGoogle Scholar
Gries, St. Th., & Stefanowitsch, A.
(2004) Extending collostructional analysis: A corpus-based perspective on ‘alternations’. International Journal of Corpus Linguistics, 9(1): 97–129. CrossrefGoogle Scholar
Jelinek, F.
(1990) Self-organizing language modeling for speech recognition. In A. Waibel & K. Lee (eds.), Readings in Speech Recognition (pp. 450–506). San Mateo, CA: Morgan Kaufmann. CrossrefGoogle Scholar
Koehn, P.
(2005) Europarl: A parallel corpus for statistical machine translation. In Proceedings of the 10th Machine Translation Summit 2005 (pp. 79–86). Tokyo: Asia-Pacific Association for Machine Translation.Google Scholar
Michelbacher, L., Evert, S., & Schutze, H.
(2007) Asymmetric association measures. In N. Nicolov, G. Angelova & R. Mitkov (Eds.), Proceedings of the 6th International Conference on Recent Advances in Natural Language Processing (RANLP) (pp. 367–372). Amsterdam/Philadelphia: John Benjamins. 367–372.Google Scholar
Nguyen, D. Q., Nguyen, D. Q., Pham, D. D., & Pham, S. B.
(2016) A robust transformation-based learning approach using ripple down rules for part-of-speech tagging. AI Communications, 29(3): 409–422. CrossrefGoogle Scholar
Pecina, P.
(2009) Lexical association measures and collocation extraction. Language Resources and Evaluation, 44(1/2): 137–158.Google Scholar
Pedersen, T.
(1998) Dependent bigram identification. In J. Mostow & C. Rich (Eds.), Proceedings of the 15th National Conference on Artificial Intelligence (AAAI-98) (p. 1197). Menlo Park, CA: The AAAI Press.Google Scholar
Pennington, J., Socher, R., & Manning, C.
(2014) GloVe: Global vectors for word representation. In B. Pang & W. Daelemans (Eds.), Proceedings of Empirical Methods in Natural Language Processing (EMNLP) (pp. 1532–1543) Stroudsburg, PA: Association for Computational Linguistics.Google Scholar
Shimohata, S., Sugio, T., & Nagata, J.
(1997) Retrieving collocations by co-occurrences and word order constraints. In P. Cohen & W. Wahlster (Eds.), Proceedings of the Association for Computational Linguistics Annual Meeting (pp. 476–481). Stroudsburg, PA: Association for Computational Linguistics.Google Scholar
Wible, D., & Tsao, N.
(2010) StringNet as a computational resource for discovering and investigating linguistic constructions. In M. Sahlgren & O. Knutsson (Eds.), Proceedings of the Workshop on Extracting and Using Constructions in Computational Linguistics (NAACL-HTL) (pp. 25–31). Stroudsburg, PA: Association for Computational Linguistics.Google Scholar
Wiechmann, D.
(2008) On the computation of collostructional strength: Testing measures of association as expressions of lexical bias. Corpus Linguistics and Linguistic Theory, 4(2): 253–290. CrossrefGoogle Scholar
Zhai, C.
(1997) Exploiting context to identify lexical atoms: A statistical view of linguistic context. In P. Brezillon (Ed.), Proceedings of the First International and Interdisciplinary Conference on Modeling and Using Contex (pp.119–129). Rio de Janeiro, Brazil.Google Scholar