C-ORAL-ROM

Integrated Reference Corpora for Spoken Romance Languages

Editors
Emanuela Cresti | University of Florence
Massimo Moneglia | University of Florence
Hardbound | Available
ISBN 9789027222862 (Eur) | EUR 120.00
ISBN 9781588115485 (USA) | USD 180.00
 
e-Book
ISBN 9789027294579 | EUR 120.00 | USD 180.00
 

The C-ORAL-ROM book and DVD provide a unique set of comparable corpora of spontaneous speech for the main Romance languages: French, Italian, Portuguese and Spanish. The corpora are accompanied by comparative linguistic studies, models and standard linguistic measures of spoken language variability. Each corpus is built to the same design using identical sampling techniques and is presented in multimedia format, allowing simultaneous access to aligned acoustic and textual information. Texts are headed with information about provenance, participants, etc., and the transcriptions show changes of speaker. Speech acts are tagged on the basis of prosodic criteria. Each corpus totals 300,000 words and presents formal and informal speech in a variety of contexts of use, dialogue structures, text genres, semantic domains and speech act typologies. The corpora have great statistical relevance for spoken language structures and can address key issues in human language technology such as speech recognition in unrestricted discourse, the suitability of speech synthesis in natural prosody, and multilingual applications of the spoken language interface. The work provides new data and innovative theoretical perspectives that are relevant for corpus linguistics, Romance linguistics, syntactic theory, speech and prosody research, and second language acquisition.

The original C-ORAL-ROM DVD was made to run under Windows XP, before Windows 7 and 8 existed. A new version of WINPITCH-C-ORAL-ROM makes it possible to run the C-ORAL-ROM DVD under Windows 7 and 8. It can be downloaded from www.winpitch.com/

[Studies in Corpus Linguistics, 15] 2005. xviii, 304 pp. (incl. DVD)
Publishing status: Available
“This is a great resource for researchers in the areas of Romance linguistics, corpus linguistics, syntax, second language acquisition, and speech and prosody research. The operation of the DVD and the tools included in [it] is quite straightforward.”
“It will be clear even from this necessarily brief notice that C-ORAL-ROM is a very remarkable work, of excellent quality, at the forefront in methodological rigour as well, which enriches Romance linguistics in corpus linguistics, a field that can be expected to keep growing, and makes an important research tool available to the scientific community.”
Cited by

Cited by 105 other publications


Subjects

Electronic/Multimedia Products

Main BIC Subject

CF: Linguistics

Main BISAC Subject

LAN009000: LANGUAGE ARTS & DISCIPLINES / Linguistics / General
U.S. Library of Congress Control Number: 2005041056