Article published in:
Recent Advances in Automatic Readability Assessment and Text SimplificationEdited by Thomas François and Delphine Bernhard
[ITL - International Journal of Applied Linguistics 165:2] 2014
► pp. 136–162
Readability for foreign language learning
The importance of cognates
In this paper, we analyse the differences between L1 acquisition and L2 learning and identify four main aspects: input quality and quantity, mapping processes, cross-lingual influence, and reading experience. As a consequence of these differences, we conclude that L1 readability measures cannot be directly mapped to L2 readability. We propose to calculate L2 readability for various dimensions and for smaller units. It is particularly important to account for the cross-lingual influence from the learner’s L1 and other previously acquired languages and for the learner’s higher experience in reading.In our analysis, we focus on lexical readability as it has been found to be the most influential dimension for L2 reading comprehension. We discuss the features frequency, lexical variation, concreteness, polysemy, and context specificity and analyse their impact on L2 readability. As a new feature specific to L2 readability, we propose the cognateness of words with words in languages the learner already knows. A pilot study confirms our assumption that learners can deduce the meaning of new words by their cognateness to other languages.
Published online: 23 January 2015
https://doi.org/10.1075/itl.165.2.02bei
https://doi.org/10.1075/itl.165.2.02bei
References
Adamson, G.W., & Boreham, J
Al-Khalifa, H.S., & Al-Ajlan, A
Aluisio, S., Specia, L., Gasperin, C., & Scarton, C
(2010) Readability assessment for text simplification.
Proceedings of the NAACL HLT 2010 Fifth Workshop on Innovative Use of NLP for Building Educational Applications
(pp. 1–9). Association for Computational Linguistics.
Anagnostou, N.K., & Weir, G.R.S
(2006) From corpus-based collocation frequencies to readability measure.
ICT in the Analysis, Teaching and Learning of Languages
(pp. 33–46).
Beinborn, L., Zesch, T., & Gurevych, I
(2012) Towards fine-grained readability measures for self-directed language learning.
Proceedings of the 1st Workshop on NLP for Computer-Assisted Language Learning
(Vol. 801, pp. 11–19). Linköping University Electronic Press, Linköping Universitet.
(2013) Cognate production using character-based machine translation.
Proceedings of the Sixth International Joint Conference on Natural Language Processing
(pp. 883–891). Asian Federation of Natural Language Processing.
Benjamin, R.G
Brew, C., & McKelvie, D
(1996) Word-pair extraction for lexicography.
Proceedings of the Second International Conference on new Methods in Language Processing
(pp. 45–55).
Cenoz, J
Coleman, M., & Liau, T
Collins-Thompson, K., & Callan, J
Cook, V.J., Long, J., & McDonough, S
Curran, J
Danielsson, P., & Mühlenbock, K
De Groot, A.M.B., & Keijzer, R
DeKeyser, R.M
Dell’Orletta, F., Montemagni, S., & Venturi, G
(2011) READ-IT: Assessing readability of Italian texts with a view to text simplification.
Proceedings of the Second Workshop on Speech and Language Processing for Assistive Technologies
(pp. 73–83).
Ellis, N
(1994) Consciousness in second language learning: Psychological perspectives on the role of conscious processes in vocabulary acquisition. AILA Review, 111.
Feng, L., Elhadad, N., & Huenerfauth, M
(2009) Cognitively motivated features for readability assessment.
Proceedings of the 12th Conference of the European Chapter of the Association for Computational Linguistics
(pp. 229–237). Morristown, NJ, USA, Association for Computational Linguistics.
Ferreira de Souza, V
François, T., & Fairon, C
(2012) An “AI readability” formula for French as a foreign language.
Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning
(pp. 466–477).
Gomes, L., & Pereira Lopes, J.G
Graesser, A.C., & McNamara, D
Haastrup, K
Hall, P.A.V., & Dowling, G.R
Heilman, M.J., Collins-Thompson, K., Callan, J., & Eskenazi, M
(2007) Combining lexical and grammatical features to improve readability measures for first and second language texts.
Proceedings of NAACL-HLT
(pp. 460–467).
Hoshino, N., & Kroll, J.F
Inkpen, D., Frunza, O., & Kondrak, G
(2005) Automatic identification of cognates and false friends in French and English.
Proceedings of the International Conference Recent Advances in Natural Language Processing
(pp. 251–257).
Jiang, N
Kaushanskaya, M., & Rechtzigel, K
Kincaid, J.P., Fishburne Jr, R.P., Rogers, R.L., & Chissom, B.S
Kondrak, G
(2000) A new algorithm for the alignment of phonetic sequences.
Proceedings of the 1st NAACL
(pp. 288–295).
Kondrak, G., & Dorr, B
(2004) Identification of confusable drug names: A new approach and evaluation methodology.
Proceedings of the 20th International Conference on Computational Linguistics
(pp. 952–958).
Kroll, J.F., & Stewart, E
Larsson, P
Laufer, B., & Ravenhorst-Kalovski, G.C
(2010) Lexical threshold revisited: Lexical text coverage, learners’ vocabulary size and reading comprehension, 22(1), 15–30.
Lemhöfer, K., & Dijkstra, T
Lervåg, A., & Aukrust, V.G
Lin, D
(1998) Automatic retrieval and clustering of similar words.
Proceedings of the 17th International Conference on on Computational Linguistics
(Vol. 21, pp. 768–774). Association for Computational Linguistics.
List, J.-M
(2012) LexStat: Automatic detection of cognates in multilingual wordlists.
Proceedings of the EACL 2012 Joint Workshop of LINGVIS\UNCLH
(pp. 117–125).
Lotto, L., & De Groot, A.M.B
McDonald, S., & Ramscar, M
(2001) Testing the distributional hypothesis: The influence of context on judgements of semantic similarity.
Proceedings of the 23rd Annual Conference of the Cognitive Science Society
(pp. 611–616).
Melamed, I.D
Mitkov, R., Pekar, V., Blagoev, D., & Mulloni, A
Montalvo, S., Pardo, E.G., Martinez, R., & Fresno, V
(2012) Automatic cognate identification based on a fuzzy combination of string similarity measures.
Proceedings of the IEEE International Conference on Fuzzy Systems
(pp. 1–8).
Mulloni, A., & Pekar, V
(2006) Automatic detection of orthographic cues for cognate recognition.
Proceedings of the 5th International Conference on Language Resources and Evaluation
(pp. 2387–2390).
Nakov, S
(2009) Automatic identification of false friends in parallel corpora: Statistical and semantic approach, Serdica Journal of Computing, 3(2), 133–158.
Nation, P
Odlin, T
Paribakht, T., & Wesche, M
Pitler, E., & Nenkova, A
(2008) Revisiting readability: A unified framework for predicting text quality.
Proceedings of the Conference on Empirical Methods in Natural Language Processing
(pp. 186–195). Association for Computational Linguistics.
Potter, M.C., So, K.-F., Eckardt, B. Von, & Feldman, L.B
Ringbom, H., & Jarvis, S
Sato, S., Matsuyoshi, S., & Kondoh, Y
(2008) Automatic assessment of Japanese text readability based on a textbook corpus.
Proceedings of the 6th International Conference on Language Resources and Evaluation (LREC)
. Marrakech, Morocco.
Schmidt, R
Schwarm, S.E., & Ostendorf, M
(2005) Reading level assessment using support vector machines and statistical language models.
Proceedings of the 43rd Annual Meeting of the ACL
(pp. 523–530).
Sepúlveda Torres, L., & Aluisio, S.M
(2011) Using machine learning methods to avoid the pitfall of cognates and false friends in Spanish-Portuguese word pairs.
Proceedings of the 8th Brazilian Symposium in Information and Human Language Technology
(pp. 67–76).
Sigurd, B., Eeg-Olofsson, M., & Van Weijer, J
Simard, M., Foster, G.F., & Isabelle, P
(1992) Using cognates to align sentences in bilingual corpora (pp. 67–81).
Tharp, J.B
Tonelli, S., Manh, K.T., & Pianta, E
(2012) Making readability indices readable.
Proceedings of NAACL-HLT: Workshop on Predicting and Improving Text Readability for Target Reader Populations
(pp. 40–48).
Uitdenbogerd, S
(2005) Readability of French as a foreign language and its uses.
Proceedings of the Australian Document Computing Symposium
(pp. 19–25).
Vajjala, S., & Meurers, D
(2012) On improving the accuracy of readability classification using insights from second language acquisition.
Proceedings of the 7th Workshop on Innovative Use of NLP for Building Educational Applications (BEA7)
(pp. 163–173).
Verspoor, M., & Lowie, W
Volodina, E., & Pijetlovic, D
(2013) Towards a gold standard for Swedish CEFR-based ICALL.
Proceedings of the 2nd Workshop on NLP for Computer-Assisted Language Learning
(pp. 48–65). Linköping University Electronic Press, Linköping Universitet.
Vor der Brück, T., Hartrumpf, S., & Helbig, H
(2008) A readability checker with supervised learning using deep syntactic and semantic indicators.
Proceedings of the 11th International Multiconference: Information Society
(pp. 92–97).
Vygotsky, L
Zesch, T., & Gurevych, I
Zhang, D., & Koda, K
Zobel, J., & Dart, P
(1996) Phonetic string matching: lessons from information retrival.
Proceedings of the 19th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval
(pp. 166–172).
Cited by
Cited by 2 other publications
Zhang, Haomin, Yuting Han, Xi Cheng, Jie Sun & Shoran Ohara
Zhang, Haomin, Yuting Han, Xing Zhang & Liuran Cui
This list is based on CrossRef data as of 15 april 2022. Please note that it may not be complete. Sources presented here have been supplied by the respective publishers. Any errors therein should be reported to them.