C-ORAL-ROM

Integrated Reference Corpora for Spoken Romance Languages

Editors
| University of Florence
| University of Florence
Hardbound: Available
ISBN 9789027222862 (Eur) | EUR 120.00
ISBN 9781588115485 (USA) | USD 180.00
 
e-Book
ISBN 9789027294579 | EUR 120.00 | USD 180.00
 

The C-ORAL-ROM book and DVD provide a unique set of comparable corpora of spontaneous speech for the main Romance languages: French, Italian, Portuguese and Spanish. The corpora are accompanied by comparative linguistic studies, models and standard linguistic measures of spoken language variability. Each corpus is built to the same design using identical sampling techniques, and each is presented in multimedia format, allowing simultaneous access to aligned acoustic and textual information. Texts are headed with information about provenance, participants, etc., and the transcriptions show changes of speaker. Speech acts are tagged according to the evidence of prosodic criteria. Each corpus totals 300,000 words and presents formal and informal speech in a variety of contexts of use, dialogue structures, text genres, semantic domains and speech act typologies. The corpora have great statistical relevance for spoken language structures and can address key issues in human language technology such as speech recognition in unrestricted discourse, the suitability of speech synthesis in natural prosody, and multilingual applications of the spoken language interface. The work provides new data and innovative theoretical perspectives that are relevant for corpus linguistics, Romance linguistics, syntactic theory, speech and prosody research, and second language acquisition.

The original C-ORAL-ROM DVD was made to run under Windows XP, before Windows 7 and 8 existed. A new version of WINPITCH-C-ORAL-ROM makes it possible to run the C-ORAL-ROM DVD under Windows 7 and 8. It can be downloaded from www.winpitch.com/

[Studies in Corpus Linguistics, 15] 2005. xviii, 304 pp. (incl. DVD)
Publishing status: Available
Table of Contents
Acknowledgements
xi–xiii
Preface
Claire Blanche-Benveniste
xv–xvii
1. The C-ORAL-ROM resource
Massimo Moneglia and Philippe Martin
1–70
2. The Italian corpus
Emanuela Cresti, Alessandro Panunzi and Antonietta Scarano
71–110
3. The French corpus
Estelle Campione, Jean Véronis and José Deulofeu
111–133
4. The Spanish corpus
Antonio Moreno-Sandoval, Guillermo de la Madrid, Manuel Alcántara, Ana Gonzalez, José M. Guirao and Raúl De la Torre
135–161
5. The Portuguese corpus
Maria Fernanda Bacelar do Nascimento, José Bettencourt Gonçalves, Rita Veloso, Sandra Antunes, Florbela Barreto and Raquel Amaro
163–207
6. Notes on lexical strategy, structural strategies and surface clause indexes in the C-ORAL-ROM spoken corpora
Emanuela Cresti
209–256
7. Appendix
Massimo Moneglia, Marco Fabbri, Silvia Quazza, Andrea Panizza, Morena Danieli, Juan María Garrido and Marc Swerts
257–276
Bibliography
277–303
“This is a great resource for researchers in the areas of Romance linguistics, corpus linguistics, syntax, second language acquisition, and speech and prosody research. The operation of the DVD and the tools included in it is quite straightforward.”
“It will be clear even from this necessarily brief notice that C-ORAL-ROM is a very remarkable work, of excellent quality, at the forefront in methodological rigour as well, which enriches Romance linguistics in corpus linguistics, a field expected to keep growing, and places an important research tool at the disposal of the scientific community.”
Cited by other publications

No author info given
2014. Canonical tag questions in English, Spanish and Portuguese: A discourse-functional study. Languages in Contrast 14:1, pp. 93 ff.
No author info given
2019. In Rhapsodie [Studies in Corpus Linguistics, 89].
Aguilar, Lourdes & Yurena M. Gutiérrez-González
2018. Patterns of prominence, phrasing and tonal events in Spanish news reading: An illustrative case study. Loquens 5:1, pp. 048 ff.
Avanzi, Mathieu & Elisabeth Delais-Roussarie
2011. Introduction: Regards croisés sur la prosodie du français – des données à la modélisation. Journal of French Language Studies 21:1, pp. 1 ff.
Bambini, Valentina
2005. Corpus di italiano parlato: Vol. 1: Introduzione, Vol. 2: Corpora, CD-Rom. Journal of Pragmatics 37:6, pp. 949 ff.
Biber, Douglas & Shelley Staples
2014. In Spoken Corpora and Linguistic Studies [Studies in Corpus Linguistics, 61], pp. 271 ff.
Campillos Llanos, Leonardo
2014. A Spanish learner oral corpus for computer-aided error analysis. Corpora 9:2, pp. 207 ff.
Cardona, Ana Llopis
2016. El uso de los aproximativos más o menos y aproximadamente en el continuo variacional. Spanish in Context 13:3, pp. 371 ff.
Caron, Bernard
2018. In Information Structure in Lesser-described Languages [Studies in Language Companion Series, 199], pp. 157 ff.
Cresti, Emanuela
2014. In Spoken Corpora and Linguistic Studies [Studies in Corpus Linguistics, 61], pp. 365 ff.
Cresti, Emanuela & Massimo Moneglia
2018. The definition of the TOPIC within Language into Act Theory and its identification in spontaneous speech corpora. Revue Romane 53:1, pp. 30 ff.
De Cesare Greenwald, Anna-Maria & Margarita Borreguero Zuloaga
2014. In Discourse Segmentation in Romance Languages [Pragmatics & Beyond New Series, 250], pp. 55 ff.
Degand, Liesbeth, Anne Catherine Simon, Noalig Tanguy & Thomas Van Damme
2014. In Discourse Segmentation in Romance Languages [Pragmatics & Beyond New Series, 250], pp. 243 ff.
Garrido, Juan María, David Escudero, Lourdes Aguilar, Valentín Cardeñoso, Emma Rodero, Carme de-la-Mota, César González, Carlos Vivaracho, Sílvia Rustullet, Olatz Larrea, Yesika Laplaza, Francisco Vizcaíno, Eva Estebas, Mercedes Cabrera & Antonio Bonafonte
2013. Glissando: a corpus for multidisciplinary prosodic studies in Spanish and Catalan. Language Resources and Evaluation 47:4, pp. 945 ff.
Gaucher, Damien
2015. Sémantique temporelle et accord du participe passé en français parlé: une analyse variationniste. Journal of French Language Studies 25:1, pp. 65 ff.
Ghezzi, Chiara & Piera Molinelli
2016. Politeness markers from Latin to Italian. Journal of Historical Pragmatics 17:2, pp. 307 ff.
Gómez González, María de los Ángeles
2016. In Genre- and Register-related Discourse Features in Contrast [Benjamins Current Topics, 87], pp. 93 ff.
Izre'el, Shlomo
2018. In Afroasiatic [Current Issues in Linguistic Theory, 339], pp. 235 ff.
Izre'el, Shlomo & Amina Mettouchi
2015. In Corpus-based Studies of Lesser-described Languages [Studies in Corpus Linguistics, 68], pp. 13 ff.
Jouvet, Denis, Katarina Bartkova, Mathilde Dargnat & Lou Lee
2017. In Statistical Language and Speech Processing [Lecture Notes in Computer Science, 10583], pp. 32 ff.
Kellert, Olga
2016. In À la recherche de la prédication [Lingvisticæ Investigationes Supplementa, 32], pp. 53 ff.
Lastres-López, Cristina
2019. In Corpus-based Research on Variation in English Legal Discourse [Studies in Corpus Linguistics, 91], pp. 51 ff.
Lauwers, Peter & Claude Duée
2011. From aspect to evidentiality: The subjectification path of the French semi-copula se faire and its Spanish cognate hacerse. Journal of Pragmatics 43:4, pp. 1042 ff.
Lombardi Vallauri, Edoardo
2016. In Insubordination [Typological Studies in Language, 115], pp. 145 ff.
Malibert, Il-Il & Martine Vanhove
2015. In Corpus-based Studies of Lesser-described Languages [Studies in Corpus Linguistics, 68], pp. 117 ff.
Matos Rocha, Luiz Fernando & Pablo Arantes
2016. In The Conversation Frame [Human Cognitive Processing, 55], pp. 215 ff.
Meinschaefer, Judith, Sven Bonifer & Christine Frisch
2015. Variable and invariable liaison in a corpus of spoken French. Journal of French Language Studies 25:3, pp. 367 ff.
Mettouchi, Amina, Martine Vanhove & Dominique Caubet
2015. In Corpus-based Studies of Lesser-described Languages [Studies in Corpus Linguistics, 68].
Mettouchi, Amina, Martine Vanhove & Dominique Caubet
2015. In Corpus-based Studies of Lesser-described Languages [Studies in Corpus Linguistics, 68], pp. 1 ff.
Mihatsch, Wiltrud
2018. De la escritura científica a la conversación coloquial adolescente. Spanish in Context 15:2, pp. 281 ff.
Mittmann, Maryualê Malvessi, Tommaso Raso & Heliana Mello
2009. In 2009 Seventh Brazilian Symposium in Information and Human Language Technology, pp. 179 ff.
Moneglia, Massimo
2014. In Spoken Corpora and Linguistic Studies [Studies in Corpus Linguistics, 61], pp. 152 ff.
Moneglia, Massimo & Tommaso Raso
2014. In Spoken Corpora and Linguistic Studies [Studies in Corpus Linguistics, 61], pp. 468 ff.
Nicolás Martínez, Carlota & Marina Lombán Somacarrera
2018. MINI-CORPUS del español para IPIC. CHIMERA: Revista de Corpus de Lenguas Romances y Estudios Lingüísticos 5:2, pp. 197 ff.
Panunzi, Alessandro & Maryualê M. Mittmann
2014. In Spoken Corpora and Linguistic Studies [Studies in Corpus Linguistics, 61], pp. 129 ff.
Pons Bordería, Salvador
2014. In Discourse Segmentation in Romance Languages [Pragmatics & Beyond New Series, 250], pp. 1 ff.
Pons Bordería, Salvador
2018. The combination of discourse markers in spontaneous conversations. Revue Romane 53:1, pp. 121 ff.
Raso, Tommaso
2014. In Spoken Corpora and Linguistic Studies [Studies in Corpus Linguistics, 61], pp. 411 ff.
Raso, Tommaso & Heliana Mello
2012. In Computational Processing of the Portuguese Language [Lecture Notes in Computer Science, 7243], pp. 362 ff.
Raso, Tommaso & Heliana Ribeiro De Mello
2014. In Spoken Corpora and Linguistic Studies [Studies in Corpus Linguistics, 61], pp. 1 ff.
Ribeiro De Mello, Heliana
2014. In Spoken Corpora and Linguistic Studies [Studies in Corpus Linguistics, 61], pp. 27 ff.
Ricca, Davide & Jacqueline Visconti
2014. In Diachronic Corpus Pragmatics [Pragmatics & Beyond New Series, 243], pp. 133 ff.
Rivera, Semilla M., Elizabeth A. Bates, Araceli Orozco-Figueroa & Nicole Y. Y. Wicha
2010. Spoken verb processing in Spanish: An analysis using a new online resource. Applied Psycholinguistics 31:1, pp. 29 ff.
Roque Amaral, Eduardo Tadeu, Wiltrud Mihatsch, F. Neveu, G. Bergounioux, M.-H. Côté, J.-M. Fournier, L. Hriba & S. Prévost
2016. Le nom français personne en comparaison avec le portugais brésilien pessoa et l’allemand Person – des noms en voie de pronominalisation ? SHS Web of Conferences 27, pp. 12015 ff.
Sabio, Frédéric
2018. On the syntax of spoken French. Revue Romane 53:1, pp. 6 ff.
Silber-Varod, Vered
2014. In Dependency Linguistics [Linguistik Aktuell/Linguistics Today, 215], pp. 207 ff.
Thomas, Anita
2010. The influence of lexical aspect and input frequency in the L2 French of adult beginners. Nordic Journal of Linguistics 33:2, pp. 169 ff.
Valentini, Cristina
2013. Phrasal verbs in Italian dubbed dialogues: a multimedia corpus-based study. Perspectives 21:4, pp. 543 ff.
Viola, Lorella
2017. A corpus-based investigation of language change in Italian. Journal of Historical Linguistics 7:3, pp. 372 ff.
Viola, Lorella
2019. From linguistic innovation to language change. Revue Romane.
Ågren, Malin & Joost van de Weijer
2013. Input frequency and the acquisition of subject-verb agreement in number in spoken and written French. Journal of French Language Studies 23:3, pp. 311 ff.

This list is based on CrossRef data as of 28 November 2019. Please note that it may not be complete. Sources presented here have been supplied by the respective publishers. Any errors therein should be reported to them.

Subjects

Electronic/Multimedia Products

BIC Subject: CF – Linguistics
BISAC Subject: LAN009000 – LANGUAGE ARTS & DISCIPLINES / Linguistics / General
U.S. Library of Congress Control Number:  2005041056