Language independent optimization of text readability formulas with deep reinforcement learning
Readability formulas are used to assess the level of difficulty of a text. These language dependent formulas are
introduced with pre-defined parameters. Deep reinforcement learning models can be used for parameter optimization. In this article
we argue that an Actor-Critic based model can be used to optimize the parameters in the readability formulas. Furthermore, a
selection model is proposed for selecting the most suitable formula to assess the readability of the input text. English and
Persian data sets are used for both training and testing. The experimental results of the parameter optimization model show that,
on average, the F-score of the model for English increases from 24.7% in the baseline to 38.8%, and for Persian from 23.5% to
47.7%. The proposed algorithm selection model further improves the parameter optimization model to 65.5% based on F-score for both
English and Persian.
Article outline
- 1.Introduction
- 2.Related Works
- 3.Proposed method
- 3.1Reinforcement learning
- 3.1.1State space
- 3.1.2Action space
- 3.1.3Rewarding
- 3.2Parameter optimization
- 3.3Algorithm selection
- 3.4Model training
- 4.Experimental results
- 4.1Dataset
- 4.1.1English dataset
- 4.1.2Persian dataset
- 4.2Setup of experiments
- 4.3Results and discussion
- 5.Conclusion and future work
- Notes
-
References
References (70)
References
Al Qundus, J., Paschke, A., Gupta, S., Alzouby, A. M., & Yousef, M. (2020). Exploring
the impact of short-text complexity and structure on its quality in social media. Journal of
Enterprise Information Management.
Amstad, T. (1978). Wie verständlich sind unsere zeitungen? [How Readable Are Our
Newspapers?] Zurich, Switzerland: University of Zurich.
Antunes, H., & Lopes, C. T. (2019). Analyzing
the adequacy of readability indicators to a non-English language. International Conference
of the Cross-Language Evaluation Forum for European
Languages, (pp. 149–155).
Azpiazu, I. M., & Pera, M. S. (2019). Multiattentive
recurrent neural network architecture for multilingual readability assessment. Transactions of
the Association for Computational
Linguistics, 71, 421–436.
Balyan, R., McCarthy, K. S., & McNamara, D. S. (2018). Comparing
machine learning classification approaches for predicting expository text difficulty. Grantee
Submission.
Bijankhan, M. (2004). The
role of corpora in writing a grammar: Introducing a software. Journal of
Linguistics, 19(2), 48–67.
Bohnet, B. (2009). Efficient
parsing of syntactic and semantic dependency structures. Proceedings of the 13th Conference on
Computational Natural Language Learning: Shared
Task, (pp. 67–72).
Cha, M., Gwon, Y., & Kung, H. (2017). Language
modeling by clustering with word embeddings for text readability assessment. Proceedings of the
2017 ACM on Conference on Information and Knowledge
Management, (pp. 2003–2006).
Cuayáhuitl, H., Lee, D., Ryu, S., Cho, Y., Choi, S., Indurthi, S., Yu, S., Choi, H., Hwang, I., & Kim, J. (2019). Ensemble-based
deep reinforcement learning for
chatbots. Neurocomputing, 3661, 118–130.
Dale, E., & Chall, J. S. (1948). A
formula for predicting readability: Instructions. Educational research
bulletin, 37–54.
Dayani, M. (2000). A
criteria for assessing the Persian texts’ readability. Journal of Social Science and
Humanities, 101, 35–48.
DuBay, W. H. (2004). The
principles of readability. Impact Information.
Dueppen, A. J., Bellon-Harn, M. L., Radhakrishnan, N., & Manchaiah, V. (2019). Quality
and readability of English-language internet information for voice disorders. Journal of
Voice, 33(3), 290–296.
Eslami, M., SharifiAtashgah, M., Lamjiri, S. A., & Zandi, T. (2004). Persian
productive lexicon. Proceedings of the 1st Workshop on the Persian Language and
Computer.
Flesch, R. (1979). How
to Write Plain English: A Book for Lawyers and Consumers. Harper &
Row.
Flesch, R. (1948). A
new readability yardstick. Journal of Applied
Psychology, 32(3), 221.
Franccois, T., & Fairon, C. (2012). An
“AI readability” formula for French as a foreign language. Proceedings of the 2012 Joint
Conference on Empirical Methods in Natural Language Processing and Computational Natural Language
Learning, (pp. 466–477).
Ghaffari, M., MahmoodiBakhtiyari, B., & Zolfaghari, H. (2004). Let’s
learn Persian (Vol. 1–3). Madreseh Publication.
Ghayoomi, M. (2012). Bootstrapping
the development of an HPSG-based treebank for Persian. Linguistic Issues in Language
Technology, 7(1).
Ghayoomi, M. (2013). Introducing
a treebank and a statistical parser for Persian. Proceedings of the 8th Conference of Iranian
Linguistics, 21, 666–679.
Ghayoomi, M. (2019). Transition
from rule-based to statistical lemmatization in Persian. Proceedings of the 5th National
Conference on Computational
Linguistics, (pp. 57–86).
Ghayoomi, M. (2022). Application
of computational linguistics to predict language proficiency level of Persian learners’
textbooks. Journal of Language
Horizons, 6(1), 29–52.
Ghayoomi, M., & Kuhn, J. (2014). Converting
an HPSG-based treebank into its parallel dependency-based treebank. Proceedings of the 9th
International Conference on Language Resources and
Evaluation, (pp. 802–809).
Goudjil, M., Koudil, M., Bedda, M., & Ghoggali, N. (2018). A
novel active learning method using svm for text classification. International Journal of
Automation and
Computing, 15(3), 290–298.
Gunning, R. (1952). The
Technique of Clear Writing. McGraw-Hill.
Hafner, R., & Riedmiller, M. (2011). Reinforcement
learning in feedback control. Machine
Learning, 84(1–2), 137–169.
Hausknecht, M., & Stone, P. (2015). Deep
reinforcement learning in parameterized action space. arXiv preprint
arXiv:1511.04143. [URL]
Jiang, Z., Gu, Q., Yin, Y., & Chen, D. (2018). Enriching
word embeddings with domain knowledge for readability assessment. Proceedings of the 27th
International Conference on Computational
Linguistics, (pp. 366–378).
Karačić, J., Dondio, P., Buljan, I., Hren, D., & Marušić, A. (2019). Languages
for different health information readers: Multitrait-multimethod content analysis of cochrane systematic reviews textual
summary formats. BMC Medical Research
Methodology, 19(1), 75.
Keneshloo, Y., Ramakrishnan, N., & Reddy, C. K. (2019). Deep
transfer reinforcement learning for text summarization. Proceedings of the 2019 SIAM
International Conference on Data Mining, 675–683.
Kincaid, J. P., Fishburne Jr, R. P., Rogers, R. L., & Chissom, B. S. (1975). Derivation
of new readability formulas (automated readability index, Fog count and Flesch reading ease formula) for navy enlisted
personnel. Technical Report. Naval Technical Training Command Millington TN Research
Branch.
Kingma, D. P., & Ba, J. (2014). Adam:
A method for stochastic optimization. arXiv preprint arXiv:1412.6980. [URL]
Klein, D., & Manning, C. D. (2003). Accurate
unlexicalized parsing. Proceedings of the 41st Meeting of the Association for Computational
Linguistics, (pp. 423–430).
Lillicrap, T. P., Hunt, J. J., Pritzel, A., Heess, N., Erez, T., Tassa, Y., Silver, D., & Wierstra, D. (2015). Continuous
control with deep reinforcement learning. arXiv preprint
arXiv:1509.02971. [URL]
Lively, B. A., & Pressey, S. L. (1923). A
method for measuring the “vocabulary Burden” of textbooks. Educational Administration and
Supervision, 91, 389–398.
Manek, A. S., Shenoy, P. D., Mohan, M. C., & Venugopal, K. (2017). Aspect
term extraction for sentiment analysis in large movie reviews using Gini index feature selection method and SVM
classifier. World wide
web, 20(2), 135–154.
Mikolov, T., Sutskever, I., Chen, K., Corrado, G. S., & Dean, J. (2013). Distributed
representations of words and phrases and their
compositionality. In C. J. C. Burges, L. Bottou, M. Welling, Z. Ghahramani, & K. Q. Weinberger (Eds.), Advances
in neural information processing
systems 261 (pp. 3111–3119). Curran Associates, Inc.
Mnih, V., Kavukcuoglu, K., Silver, D., Graves, A., Antonoglou, I., Wierstra, D., & Riedmiller, M. (2013). Playing
Atari with deep reinforcement learning. arXiv preprint arXiv:1312.5602. [URL]
Mohammadi, H., & Khasteh, S. H. (2020). A
machine learning approach to Persian text readability assessment using a crowdsourced
dataset. 2020 28th Iranian Conference on Electrical Engineering
(ICEE), 1–7.
Mohammadi, H., & Khasteh, S. H. (2019). Text
as environment: A deep reinforcement learning text readability assessment model. arXiv
preprint arXiv:1912.05957. [URL]
Müller, T., Cotterell, R., Fraser, A., & Schütze, H. (2015). Joint
lemmatization and morphological tagging with lemming. Proceedings of the 2015 Conference on
Empirical Methods in Natural Language
Processing, (pp. 2268–2274).
Müller, T., Schmid, H., & Schütze, H. (2013). Efficient
higher-order CRFs for morphological tagging. Proceedings of the 2013 Conference on Empirical
Methods in Natural Language
Processing, (pp. 322–332).
Narayan, S., Cohen, S. B., & Lapata, M. (2018). Ranking
sentences for extractive summarization with reinforcement learning. arXiv preprint
arXiv:1802.08636. [URL].
Ngo-Ye, T. L., Sinha, A. P., & Sen, A. (2017). Predicting
the helpfulness of online reviews using a scripts-enriched text regression model. Expert
Systems with
Applications, 711, 98–110.
Nuruzzaman, M., & Hussain, O. K. (2018). A
survey on chatbot implementation in customer service industry through deep neural
networks. 2018 IEEE 15th International Conference on e-Business Engineering
(ICEBE), (pp. 54–61).
Pancer, E., Chandler, V., Poole, M., & Noseworthy, T. J. (2019). How
readability shapes social media engagement. Journal of Consumer
Psychology, 29(2), 262–270.
Poornamdariyan, T. (1994). The
Persian Lesson for Foreign Persian Learners (For Beginners). Institute for
Humanities; Cultural Studies Publications.
Rottensteiner, S. (2010). Structure,
function and readability of new textbooks in relation to comprehension. Procedia-Social and
Behavioral Sciences, 21, 3892–3898.
SaffarMoghaddam, A. (2003). General
Persian: Basic constructions. Council of Extending Persian Language; Linguistics at the
Institute for Humanities; Cultural Studies.
SaffarMoghaddam, A. (2008). The
Persian language (Vol. 1–4). Council of Extending Persian Language; Linguistics at the
Institute for Humanities; Cultural Studies.
Salton, G. M., Wong, A., & Yang, C.-S. (1975). A
vector space model for automatic indexing. Communications of the
ACM, 18(11), 613–620.
Samareh, Y. (1989). Teaching
the Persian Language (Vol. 1). Alhoda International Publications.
Samareh, Y. (2005). Teaching
the Persian Language (Vol. 2–4). Alhoda International Publications.
Senter, R., & Smith, E. A. (1967). Automated
readability index (tech. rep.). CINCINNATI UNIV OH.
Serban, I. V., Sankar, C., Germain, M., Zhang, S., Lin, Z., Subramanian, S., Kim, T., Pieper, M., Chandar, S., Ke, N. R., et al. (2017). A
deep reinforcement learning chatbot. arXiv preprint arXiv:1709.02349. [URL]
Shen, C., Gonzalez, Y., Chen, L., Jiang, S. B., & Jia, X. (2018). Intelligent
parameter tuning in optimization-based iterative ct reconstruction via deep reinforcement
learning. IEEE transactions on medical
imaging, 37(6), 1430–1439.
Sherman, L. (1893). Analytics
of Literature: A Manual for the Objective Study of English Prose and
Poetry. Ginn. [URL]
Silveira, N., Dozat, T., de Marneffe, M. C., Bowman, S., Connor, M., Bauer, J., & Manning, C. D. (2014). A
gold standard dependency corpus for English. Proceedings of the 9th International Conference on
Language Resources and
Evaluation, (pp. 2897–2904).
Song, S., Huang, H., & Ruan, T. (2019). Abstractive
text summarization using lstm-cnn based deep learning. Multimedia Tools and
Applications, 78(1), 857–875.
Sutton, R. S., & Barto, A. G. (1998). Reinforcement
Learning: An Introduction (Vol. 135). MIT press Cambridge.
Temnikova, I., Vieweg, S., & Castillo, C. (2015). The
case for readability of crisis communications in social media. Proceedings of the 24th
International Conference on World Wide
Web, (pp. 1245–1250).
Vajjala, S., & Lučić, I. (2018). OnestopEnglish
corpus: A new corpus for automatic readability assessment and text simplification. Proceedings
of the 13th Workshop on Innovative Use of NLP for Building Educational
Applications, (pp. 297–304).
Wang, Y., & Jin, H. (2019). A
deep reinforcement learning based multi-step coarse to fine question answering (MSCQA)
system. Proceedings of the AAAI Conference on Artificial
Intelligence, 331, 7224–7232.
Wasike, B. (2018). Preaching
to the choir? An analysis of newspaper readability vis-a-vis public
literacy. Journalism, 19(11), 1570–1587.
Watkins, C. J., & Dayan, P. (1992). Q-learning. Machine
Learning, 8(3–4), 279–292.
Watkins, C. J. C. H. (1989). Learning
from Delayed Rewards. Doctoral Dissertation. King’s College. Cambridge, UK.
Xia, M., Kochmar, E., & Briscoe, T. (2019). Text
readability assessment for second language learners. arXiv preprint
arXiv:1906.07580. [URL]
Zalmout, N., Saddiki, H., & Habash, N. (2016). Analysis
of foreign language teaching methods: An automatic readability approach. Proceedings of the 3rd
workshop on natural language processing techniques for educational applications
(NLPTEA2016), (pp. 122–130).
Zarghamiyan, M. (1998). Series
of Teaching the Persian Language (From Beginner to Advanced) (Vol. 1). Council of Extending
Persian Language; Linguistics.
Zarghamiyan, M. (2001). Series
of Teaching the Persian Language (From Beginner to Advanced) (Vol. 2–3). Council of Extending
Persian Language; Linguistics.
Cited by (2)
Cited by two other publications
Hadizadeh Moghaddam, Arya & Saeedeh Momtazi
2024.
A semantic modular framework for events topic modeling in social media.
Multimedia Tools and Applications 83:4
► pp. 10755 ff.
Mayahi, Ahmad Jaber & Emman Naser Alshatti
2023.
2023 Computer Applications & Technological Solutions (CATS),
► pp. 1 ff.
This list is based on CrossRef data as of 5 august 2024. Please note that it may not be complete. Sources presented here have been supplied by the respective publishers.
Any errors therein should be reported to them.