References
Alfter, D., Tiedemann, T. L., & Volodina, E.
(2020) Expert judgments versus crowdsourcing in ordering multi-word expressions. Eighth Swedish Language Technology Conference (SLTC). [URL]
Arase, Y., Uchida, S., & Kajiwara, T.
(2022) CEFR-based sentence difficulty annotation and assessment. Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 6206–6219. DOI logoGoogle Scholar
Bax, S.
(2012) Text inspector: Online text analysis tool. [URL]
Capel, A.
(2015) The English vocabulary profile. In J. Harrison & F. Barker (Eds.), English profile in practice 5 1 (pp. 9–27). Cambridge University Press.Google Scholar
Chen, Y. H., & Baker, P.
(2016) Investigating criterial discourse features across second language development: Lexical bundles in rated learner essays, CEFR B1, B2 and C1. Applied Linguistics, 37 (6), 849–880. DOI logoGoogle Scholar
Chujo, K., Oghigian, K., & Akasegawa, S.
(2015) A corpus and grammatical browsing system for remedial EFL learners. In A. Leńko-Szymańska & A. Boulton (Eds.), Multiple affordances of language corpora for data-driven learning (pp. 109–130). John Benjamins. DOI logoGoogle Scholar
Collins-Thompson, K.
(2014) Computational assessment of text readability: A survey of current and future research. ITL-International Journal of Applied Linguistics, 165 (2), 97–135. DOI logoGoogle Scholar
Douglas, C. E., & Fligner, A. M.
(1991) On distribution-free multiple comparisons in the one-way analysis of variance. Communications in Statistics: Theory and Methods, 20 1, 127–139. DOI logoGoogle Scholar
Dürlich, L., & François, T.
(2018) EFLLex: A graded lexical resource for learners of English as a foreign language. Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018), 873–879.Google Scholar
Dwass, M.
(1960) Some k-sample rank-order tests. In I. Olkin, S. G. Ghurye, W. Hoeffding, W. G. Madow, & B. H. Mann (Eds.), Contributions to Probability and Statistics. Essays in Honor of H. Hotelling (pp. 198–202). Stanford University Press.Google Scholar
Ehara, Y.
(2018) Building an English vocabulary knowledge dataset of Japanese English-as-a-second-language learners using crowdsourcing. Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018), 485–488.Google Scholar
Flesch, R.
(1948) A new readability yardstick. Journal of Applied Psychology, 32 (3), 221–233. DOI logoGoogle Scholar
François, T.
(2015) When readability meets computational linguistics: A new paradigm in readability. Revue française de linguistique appliquée, 20 (2), 79–97. [URL]. DOI logo
François, T., & Fairon, C.
(2012) An “AI readability” formula for French as a foreign language. Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, 466–477.Google Scholar
Harrison, J.
(2015) The English grammar profile. In J. Harrison & F. Barker (Eds.), English profile in practice 5 1 (pp. 28–48). Cambridge University Press.Google Scholar
Hawkins, J. A., & Filipović, L.
(2012) Criterial features in L2 English: Specifying the reference levels of the Common European Framework (Vol. 11). Cambridge University Press.Google Scholar
Ishii, Y., & Tono, Y.
(2018) Investigating Japanese EFL learners’ overuse/underuse of English grammar categories and their relevance to CEFR levels. Proceedings of the 4th Asia Pacific Corpus Linguistics Conference, 160–165.Google Scholar
Jiang, C., Maddela, M., Lan, W., Zhong, Y., & Xu, W.
(2020) Neural CRF model for sentence alignment in text simplification. arXiv preprint arXiv:2005.02324. DOI logoGoogle Scholar
Khallaf, N., & Sharoff, S.
(2021) Automatic difficulty classification of Arabic sentences. Proceedings of the Sixth Arabic Natural Language Processing Workshop, 105–114.Google Scholar
Kilgarriff, A., Husák, M., McAdam, K., Rundell, M., & Rychlý, P.
(2008) GDEX: Automatically finding good dictionary examples in a corpus. Proceedings of the XIII EURALEX international congress, 11, 425–432.Google Scholar
Kim, S.
(2021) Generalizability of CEFR criterial grammatical features in a Korean EFL corpus across A1, A2, B1, and B2 levels. Language Assessment Quarterly, 18 (3), 273–295. DOI logoGoogle Scholar
Klare, G. R.
(1968) The role of word frequency in readability. Elementary English, 45 (1), 12–22.Google Scholar
Marcus, M. P., Santorini, B., & Marcinkiewicz, M. A.
(1993) Building a large annotated corpus of English: The Penn Treebank. Computational Linguistics, 19 (2), 313–330.Google Scholar
Nation, I.
(2006) How large a vocabulary is needed for reading and listening? Canadian Modern Language Review, 63 (1), 59–82. DOI logoGoogle Scholar
Nishihara, D., Kajiwara, T., & Arase, Y.
(2019) Controllable text simplification with lexical constraint loss. Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics: Student Research Workshop, 260–266. DOI logoGoogle Scholar
Ozasa, T., Weir, G., & Fukui, M.
(2007) Measuring readability for Japanese learners of English. Proceedings of the 12th Conference of Pan-Pacific Association of Applied Linguistics, 122–125.Google Scholar
Pilán, I., Vajjala, S., & Volodina, E.
(2016) A readable read: Automatic assessment of language learning materials based on linguistic complexity. arXiv preprint arXiv:1603.08868.Google Scholar
Pilán, I., Volodina, E., & Johansson, R.
(2014) Rule-based and machine learning approaches for second language sentence-level readability. Proceedings of the ninth workshop on innovative use of NLP for building educational applications, 174–184. DOI logoGoogle Scholar
Qi, P., Zhang, Y., Zhang, Y., Bolton, J., & Manning, C. D.
(2020) Stanza: A Python natural language processing toolkit for many human languages. arXiv preprint arXiv:2003.07082. DOI logoGoogle Scholar
Salamoura, A., & Saville, N.
(2010) Exemplifying the CEFR: Criterial features of written learner English from the English Profile Programme. In I. Bartning, M. Maisa, & I. Vedder (Eds.), Communicative proficiency and linguistic development: Intersections between SLA and language testing research (Vol. 11, pp. 101–132). European Second Language Association.Google Scholar
Steel, R. G. D.
(1960) A rank sum test for comparing all pairs of treatments, Technometrics, 21, 197–207. DOI logoGoogle Scholar
Uchida, S., & Negishi, M.
(2018) Assigning CEFR-J levels to English texts based on textual features. Proceedings of the 4th Asia Pacific Corpus Linguistics Conference, 463–467.Google Scholar
Vajjala, S., & Meurers, D.
(2014) Assessing the relative reading level of sentence pairs for text simplification. Proceedings of the 14th Conference of the European Chapter of the Association for Computational Linguistics, 288–297. DOI logoGoogle Scholar