A scientific methodology for researching CALL interaction data
Multimodal LEarning and TEaching Corpora
This chapter gives an overview of one possible staged methodology for structuring LCI data by presenting a new scientific object, LEarning and TEaching Corpora (LETEC). Firstly, the chapter clarifies the notion of corpora, used in so many different ways in language studies, and underlines how corpora differ from raw language data. Secondly, using examples taken from actual online learning situations, the chapter illustrates the methodology that is used to collect, transform and organize data from online learning situations in order to make them sharable through open-access repositories. The ethics and rights for releasing a corpus as OpenData are discussed. Thirdly, the authors suggest how the transcription of interactions may become more systematic, and what benefits may be expected from analysis tools, before opening the CALL research perspective applied to LCI towards its applications to teacher-training in Computer-Mediated Communication (CMC), and the common interests the CALL field shares with researchers in the field of Corpus Linguistics working on CMC.
References (47)
Baldry, A., & Thibault, P.-J
(
2006)
Multimodal transcription and text analysis. London: Equinox.
![Google Scholar](https://benjamins.com/logos/google-scholar.svg)
Barab, S.A., Kling, R., & Gray, J.H
(Eds.) (
2004)
Designing for virtual communities in the service of learning. Cambridge, United Kingdom: Cambridge University Press.
![DOI logo](https://benjamins.com/logos/doi-logo.svg)
![Google Scholar](https://benjamins.com/logos/google-scholar.svg)
Bellik, Y.D., & Teil, D
(
1992)
Définitions terminologiques pour la communication multimodale. In
Les actes des 4èmes Journées sur l’ingénierie des interfaces Homme-Machine,
IHM’92, Telecom Paris Publ., Paris, 30 Nov. - 2 Dec., 229-232. Retrieved from
[URL]![Google Scholar](https://benjamins.com/logos/google-scholar.svg)
Boulton, A
(
2011)
Data-driven learning: The perpetual enigma. In
S. Gozdz-Roszkowski(Ed.),
Explorations across languages and corpora (pp. 563–580). Frankfurt, Germany: Peter Lang.
![Google Scholar](https://benjamins.com/logos/google-scholar.svg)
Chanier, T., Poudat, C., Sagot, B., Antoniadis, B., Wigham, C.R., Hriba L., … Seddah, D
(
2014)
The CoMeRe corpus for French: Structuring and annotating heterogeneous CMC genres.
Journal of Language Technology and Computational Linguistics, 29(2), 1–31.
![Google Scholar](https://benjamins.com/logos/google-scholar.svg)
Chanier, T
(
2013)
EUROCALL 2013, Survey on CALL in the digital humanities: Considering CALL journals, research data
. Paper presented at
EUROCALL 2013
, Évora, Portugal.
Chanier, T., Reffay, C., Betbeder, M.-L., Ciekanski, M., & Lamy, M.-N
(
2009)
LETEC (Learning and Teaching Corpus) Copéas [corpus]. Mulce.org: Clermont Université.
![Google Scholar](https://benjamins.com/logos/google-scholar.svg)
Ciekanski, M., & Chanier, T
(
2008)
Developing online multimodal verbal communication to enhance the writing process in an audio-graphic conferencing environment.
ReCALL, 20(2), 162–182.
![DOI logo](https://benjamins.com/logos/doi-logo.svg)
![Google Scholar](https://benjamins.com/logos/google-scholar.svg)
CINES
(
2014)
Description of resource formats eligible for archiving. National Computing Center for Higher Education.
[URL]
CoMeRe
(
2015)
Repository of computer-mediated communication (CMC) corpora. [webservice] Ortolang: Nancy.
[URL]
Creative Commons
(
2015)
Licences for publishing knowledge and data in open access formats [website].
[URL]
Crystal, D
(
2004)
The language of the Internet. Cambridge, United Kingdom: Cambridge University Press.
![Google Scholar](https://benjamins.com/logos/google-scholar.svg)
DARIAH
(
2013)
Digital research infrastructure for arts and humanities [website].
[URL]
Dortmunder Chat-Korpus
(
2003–2009)
German corpus of informal text chat [website].
[URL]
DWDS
(
2013)
Das Digitale Wörterbuch der deutschen Sprache [website].
[URL]
Dyke, G., Lund, K., Jeong, H., Medina, R., Suthers, D.D., van Aalst, J., … Looi, C.-K
(
2011)
Technological affordances for productive multivocality in analysis. In
H. Spada,
G. Stahl,
N. Miyake,
N. Law, &
K.M. Cheng(Eds.),
Connecting Computer-Supported Collaborative Learning to Policy and Practice: Proceedings of the 9th International Conference on Computer-Supported Collaborative Learning (CSCL 2011
) (Vol. I, pp. 454–461). Hong Kong, China: International Society of the Learning Sciences.
Flewitt, R., Hampel, R., Hauck, M., & Lancaster, L
(
2009)
What are multimodal data and transcription?In
C. Jewitt(Ed.),
The Routledge Handbook of Multimodal Analysis (pp. 40–53). London, United Kingdom: Routledge.
![Google Scholar](https://benjamins.com/logos/google-scholar.svg)
Goodwin, C., & Duranti, A
(
1992)
Rethinking Context: An introduction. In
C. Goodwin &
A. Duranti (Eds.),
Rethinking context: Language as an interactive phenomenon (pp. 191-227). Cambridge: Cambridge University Press.
![Google Scholar](https://benjamins.com/logos/google-scholar.svg)
Granger, S
(
2004)
Computer learner corpus research: Current status and future prospects. In
U. Connor &
T. Upton(Eds.),
Applied corpus linguistics: A multidimensional perspective (pp. 123–145). Amsterdam, Netherlands: Rodopi.
![Google Scholar](https://benjamins.com/logos/google-scholar.svg)
Guichon, N., & Hauck, M
(
2011)
Teacher education research in CALL and CMC: More in demand than ever.
ReCALL, 23(3), 187–199.
![DOI logo](https://benjamins.com/logos/doi-logo.svg)
![Google Scholar](https://benjamins.com/logos/google-scholar.svg)
Halliday, M.A.K
(
1989)
“Part A†in
M.A.K. Halliday &
R. Hasan (Eds.),
Language, context and cext: Aspects of language in a social-semiotic perspective (pp. 1-49). Oxford: Oxford University Press.
![Google Scholar](https://benjamins.com/logos/google-scholar.svg)
Hatch, T., & Grossman, P
(
2009)
Learning to look beyond the boundaries of representation.
Journal of Teacher Education, 60(1), 70–85.
![DOI logo](https://benjamins.com/logos/doi-logo.svg)
![Google Scholar](https://benjamins.com/logos/google-scholar.svg)
Huma-Num
(
2014)
French national corpora consortiums in various areas of arts and humanities.
[URL]
IMS Learning
(
2004)
Description of learning scenarios (IMS-LD), and data packaging (IMS-CP). IMS Global Learning Consortium. Retrieved from
[URL]
Francis, W.N., & Kučera, H
(
1964)
A standard corpus of present-day edited American English, for use with digital computers. Providence, RI: Brown University.
![Google Scholar](https://benjamins.com/logos/google-scholar.svg)
Laks, B
(
2010)
La linguistique des usages: de l’exemplum au datum. In
P. Cappeau,
H. Choquet, &
F. Valetoupolos(Eds.),
L’exemple et le corpus quel statut? Travaux linguistique du Cerlico, 23 (pp. 13–28). Rennes, France: Presses Universitaire de Rennes.
![Google Scholar](https://benjamins.com/logos/google-scholar.svg)
Lamy, M.-N
(
2012)
Click if you want to speak: Reframing CA for research into multimodal conversations in online learning.
International Journal of Virtual and Personal Learning Environments, 3(1), 1–18.
![DOI logo](https://benjamins.com/logos/doi-logo.svg)
![Google Scholar](https://benjamins.com/logos/google-scholar.svg)
Lave, J., & Wenger, E
(
1991)
Situated learning: Legitimate peripheral participation. Cambridge, United Kingdom: Cambridge University Press.
![DOI logo](https://benjamins.com/logos/doi-logo.svg)
![Google Scholar](https://benjamins.com/logos/google-scholar.svg)
Mackey, A., & Gass, S.M
(
2005)
Second language research: Methodology and design. New York, NY: Routledge.
![Google Scholar](https://benjamins.com/logos/google-scholar.svg)
MacWhinney, B
(
2009)
Manual of the CHAT transcription format used in the CHILDES project. Retrieved from
[URL]
Mulce documentation
(
2015)
Documentation on Mulce repository and Mulce methodology [website].
[URL]
Mulce repository
(
2013)
Repository of learning and teaching (LETEC) corpora [webservice]. Clermont Université: MULCE.org.
[URL]
Open Knowledge
(
2013)
Definition of “open” with respect to knowledge and data.
[URL]
O’Keefe, A., McCarthy, M., & Carter, R
(
2007)
From corpus to classroom: Language use and language teaching. Cambridge, United Kingdom: Cambridge University Press.
![DOI logo](https://benjamins.com/logos/doi-logo.svg)
![Google Scholar](https://benjamins.com/logos/google-scholar.svg)
Reffay, C., Betbeder, M.-L., & Chanier, T
(
2012)
Multimodal learning and teaching corpora exchange: Lessons learned in 5 years by the Mulce project.
International Journal of Technology Enhanced Learning, 4(1), 1–20.
![DOI logo](https://benjamins.com/logos/doi-logo.svg)
![Google Scholar](https://benjamins.com/logos/google-scholar.svg)
Schiffrin, D
(
1994)
Approaches to discourse. Malden, MA: Blackwell.
![Google Scholar](https://benjamins.com/logos/google-scholar.svg)
Shih, Y.-C
(
2014)
Communication strategies in a multimodal virtual communication context.
System, 42, 34–47.
![DOI logo](https://benjamins.com/logos/doi-logo.svg)
![Google Scholar](https://benjamins.com/logos/google-scholar.svg)
Sindoni, M.G
(
2013)
Spoken and written discourse in online interactions: A multimodal approach. New York, NY: Routledge.
![Google Scholar](https://benjamins.com/logos/google-scholar.svg)
Sloetjes, H., & Wittenburg, P
(
2008)
Annotation by category – ELAN and ISO DCR.
Proceedings of the 6th International Conference on Language Resources and Evaluation
, 816–820.
TEI
(
2015)
Text Encoding Initiative. [website].
[URL]
TEI-CMC
(
2015)
Computer-Mediated Communication working group of the TEI consortium [website].
[URL]
The Open University
(
2001)
Research methods in education. Milton Keynes, United Kingdom: The Open University.
![Google Scholar](https://benjamins.com/logos/google-scholar.svg)
Wigham, C.R., & Chanier, T
(
2013)
Pedagogical corpus: Reflective teaching journals. [corpus]Mulce.org: Clermont Université.
[URL]
Wigham, C.R. & Chanier, T
(
2014)
Pedagogical corpora as a means to reuse research data and analyses in teacher-training. In
J. Colpaert,
A. Aerts, &
M. Oberhofer(Eds),
Proceedings of the sixteenth international CALL conference, 7–9 July 2014 (pp. 360–365). Antwerp, Belgium: University of Antwerp.
![Google Scholar](https://benjamins.com/logos/google-scholar.svg)
Wigham, C.R., & Chanier, T
(
2015)
Interactions between text chat and audio modalities for L2 communication and feedback in the synthetic world Second Life.
Computer Assisted Language Learning, 28(3), 260–283.
![DOI logo](https://benjamins.com/logos/doi-logo.svg)
![Google Scholar](https://benjamins.com/logos/google-scholar.svg)
Yun, H., & Chanier, T
(
2014)
Corpus d’apprentissage FAVI (Français académique virtuel international). Banque de corpus CoMeRe. [corpus] Ortolang.fr: Nancy.
![Google Scholar](https://benjamins.com/logos/google-scholar.svg)
Zourou, K
(
2013)
Research challenges in informal social networked language learning communities.
eLearning Papers, 34, 1–11.
![Google Scholar](https://benjamins.com/logos/google-scholar.svg)
Cited by (1)
Cited by 1 other publications
Chanier, Thierry & Marie‐Noëlle Lamy
2017.
Researching Technology‐mediated Multimodal Interaction. In
The Handbook of Technology and Second Language Teaching and Learning,
► pp. 428 ff.
![DOI logo](//benjamins.com/logos/doi-logo.svg)
This list is based on CrossRef data as of 24 june 2024. Please note that it may not be complete. Sources presented here have been supplied by the respective publishers.
Any errors therein should be reported to them.