C-ORAL-ROM

Integrated Reference Corpora for Spoken Romance Languages

Editors
Emanuela Cresti | University of Florence
Massimo Moneglia | University of Florence
Hardbound: Available
ISBN 9789027222862 (Eur) | EUR 120.00
ISBN 9781588115485 (USA) | USD 180.00
 
e-Book
ISBN 9789027294579 | EUR 120.00 | USD 180.00
 

The C-ORAL-ROM book and DVD provide a unique set of comparable corpora of spontaneous speech for the main Romance languages: French, Italian, Portuguese and Spanish. The corpora are accompanied by comparative linguistic studies, models and standard linguistic measures of spoken language variability. Each corpus is built to the same design using identical sampling techniques, and each is presented in multimedia format, allowing simultaneous access to aligned acoustic and textual information. Texts are headed with information about provenance, participants, etc., and the transcriptions show changes of speaker. Speech acts are tagged according to prosodic criteria. Each corpus totals 300,000 words and presents formal and informal speech in a variety of contexts of use, dialogue structures, text genres, semantic domains and speech act typologies. The corpora have great statistical relevance for spoken language structures and can address key issues in human language technology such as speech recognition in unrestricted discourse, the suitability of speech synthesis in natural prosody, and multilingual applications of the spoken language interface. The work provides new data and innovative theoretical perspectives that are relevant for corpus linguistics, Romance linguistics, syntactic theory, speech and prosody research, and second language acquisition.

The original C-ORAL-ROM DVD was made to run under Windows XP, before Windows 7 and 8 existed. A new version of WINPITCH-C-ORAL-ROM makes it possible to run the C-ORAL-ROM DVD under Windows 7 and 8. It can be downloaded from www.winpitch.com/

[Studies in Corpus Linguistics, 15] 2005.  xviii, 304 pp. (incl. DVD)
Publishing status: Available
“This is a great resource for researchers in the areas of Romance linguistics, corpus linguistics, syntax, second language acquisition, and speech and prosody research. The operation of the DVD and the tools included in is quite straightforward.”
“It will be clear even from this necessarily brief notice that C-ORAL-ROM is a very remarkable work, of excellent quality, at the forefront also in its methodological rigor, which enriches Romance linguistics in a field expected to keep growing, namely corpus linguistics, and makes an important research tool available to the scientific community.”
Subjects

Electronic/Multimedia Products

Main BIC Subject

CF: Linguistics

Main BISAC Subject

LAN009000: LANGUAGE ARTS & DISCIPLINES / Linguistics / General
U.S. Library of Congress Control Number: 2005041056