Article published in:
Recent Advances in Automatic Readability Assessment and Text SimplificationEdited by Thomas François and Delphine Bernhard
[ITL - International Journal of Applied Linguistics 165:2] 2014
► pp. 299–323
Making numerical information more accessible
The implementation of a Numerical Expression Simplification System for Spanish
Susana Bautista | Universidad Complutense de Madrid
Horacio Saggion | Universitat Pompeu Fabra
Are rounded numbers easier to understand than exact numbers? Information in newspapers often takes the form of numerical expressions which pose comprehension problems for many people, including people with disabilities, low literacy levels or lack of access to advanced technology. The purpose of this paper is to motivate and describe a rule-based lexical component that simplifies numerical expressions in Spanish texts. We propose a simplification approach that makes news articles more accessible to readers with specials needs by rewriting difficult numerical expressions in a simpler way. We carried out a study that identifies powerful simplification strategies to simplify numerical information in a text by analysing a parallel corpus of original texts and their manual simplifications. The study is complemented with an analysis of simplifications obtained in response to a questionnaire where subjects were asked to produce simplifications of numerical expressions in context. Finally, we implemented and evaluated a simplification system that mimics the simplification strategies that were found to be effective.
Keywords: Simplification Strategies, Text Accessibility, Simplification Corpus, Numerical Expressions
Published online: 23 January 2015
https://doi.org/10.1075/itl.165.2.07bau
https://doi.org/10.1075/itl.165.2.07bau
References
Agencia Servimedia
(2010) Retrieved June 22, 2013 from www.servimedia.es
Aluísio, S.M., Specia, L., Pardo, T.A., Maziero, E., & Fortes, R
(2008) Towards Brazilian Portuguese automatic text simplification systems.
ACM Symposium on Document Engineering 2008
, 240–248.
Anula, A
(2007) Tipos de Textos, Complejidad Lingüística y Facilitación Lectora.
Actas del Sexto Congreso de Hispanistas de Asia
(pp. 45–61).
Aswani, N., Tablan, V., Bontcheva, K. & Cunningham, H
(2005) Indexing and querying linguistic metadata and document content.
Proceedings of 5th International Conference on Recent Advances in Natural Language Processing
. Borovets, Bulgaria.
Bautista, S., Drndarevic, B., Hervás, R., Saggion, H., & Gervás, P
Bautista, S., Gervás, P., & Madrid, R.I
(2009) Feasibility analysis for semiautomatic conversion of text to improve readability.
Proceedings of the Second International Conference on Information and Communication Technology and Accessibility
. Hammamet, Tunisia.
Bautista, S., Hervás, R., Gervás, P., Power, R., & Williams, S
(2011) How to make numerical inormation accessible: Experimental identification of simplification strategies.
Proceedings of the 13th IFIP TC13 Conference on Human-Computer Interaction ( INTERACT)
. Lisbon, Portugal.
(2013) A system for the simplification of numerical expressions at different levels of understandability.
Workshop Natural Language Processing for Improving Textual Accessibility
. Attlanta, USA.
Bisantz, A.M., Schinzing, S., & Munch, J
Bohnet, B., & Nivre, J
(2012) A transition-based system for joint part-of-speech tagging and labeled non-projective dependency parsing.
ENMLP-CoNLL
(pp. 1455–1465). Jeju Island, Korea.
Bott, S., & Saggion, H
(2011) An unsupervised alignment algorithm for text simplification corpus construction.
Workshop on Monolingual Text-to-Text Generation
. Portland, USA.
(2012) Automatic simplification of Spanish text for e-accessibility.
Proceedings of the 13th International Conference on Computers Helping People with Special Needs
(pp. 54–56). Linz, Austria.
Bott, S., Rello, L., Drndarevic, B., & Saggion, H
(2012) Can Spanish be simpler? LexSiS: Lexical simplification for Spanish.
The 24th International Conference on Computational Linguistics
. Mumbai, India.
Canning, Y
(2000) Cohesive simplification of newspaper text aphasic readers.
Proceedings of the 3rd Annual CLUK Doctoral Research Colloquium
.
Caseli, H.M., Pereira, T.F., Specia, L., Pardo, T.A.S., Gasperin, C., & Aluisio, S.M
(2009) Building a Brazilian Portuguese parallel corpus of original and simplified texts. In
Proceedings of International Conference on Intelligent Text Processing and Computational Linguistics
. Mexico City, Mexico.
Chandrasekar, R., & Srinivas, B
Chandrasekar, R., Doran, C., & Srinivas, B
(1996) Motivations and methods for text simplifications.
Proceedings of the 16th International Conference on Computational Linguistics
(pp. 1041–1044). Copenhagen, Denmark.
Chinchor, B.M
(1993) Survey of the message understanding conferences.
Proceedings of the Workshop on Human Language Technology
(pp. 56–60).
Clark, D
Devlin, S., & Tait, J
Devlin, S., & Unthank, G
(2006) Helping aphasic people process online information.
Proceedings of the 8th international ACM SIGACCESS conference on Computers and Accessibility
(pp. 225–226). New York, USA.: ACM.
Dieckmann, N., Slovic, P., & Peters, E
Drndarevic B, Stajner S, Bott S, Bautista S., & Saggion H
(2013) Automatic text simplification in Spanish: A compartive evaluation of complementing modules.
International Conference on Intelligent Text Processing and Computational Linguistics
. Samos, Greece.
Drndarevic, B., & Saggion, H
(2012) Reducing text complexity through automatic lexical simplification: An Empirical Study for Spanish.
Sociedad Española para el Procesamiento del Lenguaje Natural
(pp. 13–20).
(2012) Towards automatic lexical simplification in Spanish: An empirical study.
Workshop Predicting and Improving Text Readability for Target Reader Populations
. Montreal, Canada.
EAGLES
(n.d.). Retrieved June 22, 2013, from EAGLES. http://nlp.lsi.upc.edu/freeling/doc/tagsets/tagset-es.html
Freyhoff, G., Hess, G., Kerr, L, Menzel, E., Tronbacke, B., & Veken, K.V.D
(1998) European guidelines for the production of easy-to-read information. Retrieved June 22, 2013, from http://www.osmhi.org/contentpics/139/EuropeanGuidelinesforETRpublications.pdf
Herrera, A., & Macizo, P
Klebanov, B.B., Knight, K., & Marcu, D
(2004) Text simplification for information-seeking applications.
On the Move to Meaningful Internet Systems, Lecture Notes in Computer Science
(pp. 735–747). Springer Verlag.
Krifka, M
Lee, C., Hwang, Y.-G., Oh, H.-J., Lim, S., Heo, J., Lee, C.-H., … Jang M.G
(2006) Fine-grained named entity recognition using conditional random fields for question answering. (pp. 581–587). Lecture notes in Computer Science volumen 4182.
Li, X. a
Max, A
(2006) Writing for language-impaired readers.
Proceeding on International Conference on Intelligent Text Processing and Computational Linguistics
(pp. 567–570). Mexico City, Mexico.
Maynard, D., Tablan, V., Cunningham, H., Ursu, C., Saggion, H., Bontcheva, K., & Wilks, Y
Mishra, H., Mirshra, A., & Shiy B
Moriceau, V
(2006) Generating intelligent numerical answers in a question-answering system.
Proceedings of the Fourth International Natural Language Generation Conference
(pp. 103–110).
Nivre, J
(2003) An efficient algorithm for projective dependency parsing.
Proceedings of the 8th International Workshop on Parsing Technologies
(pp. 149–160). Nancy, France.
OpenNLP
(n.d.). Retrieved June 22, 2013, from http://opennlp.apache.org/documentation.html
Padró, Ll., Collado, M., Reese, S., Lloberes, M., & Castelln, I
(2010) Freeling 2.1: Five years of open-source language processing tools.
Proceedings of the 7th International Conference on Language Resources and Evaluation
. Valletta, Malta.
Peters, E., Hibbard, J., Slovic, P., & Dieckmann, N
Petersen, S.E., & Ostendorf, M
(2007) Text simplification for language learners: A corpus analysis.
Proceedings of Workshop on Speech and Language Technology for Education
.
Proyecto Simplext
(2010) Retrieved June 22, 2013, from www.simplext.es
Rello, L., Bautista, S., Baeza-Yates, R., Gervás, P., Hervás, R., & Saggion, H
Saggion, H., Gómez-Martínez, E., Esteban Etayo, E., Anula, A., & Bourg, L
Salguero, M. and Alameda, J
Siddharthan, A
(2002) Resolving attachment and clause boundary amgiguities for simplifying relative clause constructs.
Proceedings of the Student Research Workshop, 40th Meeting of the Association of Computational Linguistics
.
(2003) Syntactic Simplification and Text Cohesion. Ph. D dissertation, research and language and computation.
Snow, C
Specia, L, Jahuar, S.K., & Milhacea, R
(2012) SemEval-2012 Task 1: Englosh lexical simplification.
Proceedings of the SemEval Conference
. Montréal, Canada.
Specia, L
Sundheim, R.G
(1996) Message understanding conference-6: A Brief History.
International Conference on Computational Linguistics
(pp. 466–471).
The Plain Language Action and Information Network (PLAIN)
(2005) Retrieved June 22, 2013, from http://www.plainlanguage.gov
W3C
(2008) Web Content Accessibility Guidelines. Retrieved June 22, 2013, from http://www.w3.org/TR/WCAG20/
Williams, S., & Power, R
Williams, S., Reiter, E., & Osman, L.M
(2003) Experiments with discourse-level choices and readability.
Proceedings of the European Natural Language Generation Workshop
, (pp. 127–134). Budapest, Hungary.
Zhu, Z., Bernhard, D., & Gurevych, I
(2010) A monolingual tree-based translation model for sentence simplification.
Proceedings of the 23rd International Conference on Computational Linguistics
. Beijing, China.
Cited by
Cited by 1 other publications
Saggion, Horacio
This list is based on CrossRef data as of 15 april 2022. Please note that it may not be complete. Sources presented here have been supplied by the respective publishers. Any errors therein should be reported to them.