A scientific methodology for researching CALL interaction data
Multimodal LEarning and TEaching Corpora
This chapter gives an overview of one possible staged methodology for structuring LCI data by presenting a new scientific object, LEarning and TEaching Corpora (LETEC). First, the chapter clarifies the notion of corpora, a term used in many different ways in language studies, and underlines how corpora differ from raw language data. Second, using examples taken from actual online learning situations, the chapter illustrates the methodology used to collect, transform and organize data from these situations so that they can be shared through open-access repositories. The ethics and rights involved in releasing a corpus as OpenData are also discussed. Third, the authors suggest how the transcription of interactions may become more systematic and what benefits may be expected from analysis tools. The chapter closes by extending the CALL research perspective on LCI towards its applications to teacher training in Computer-Mediated Communication (CMC), and towards the interests the CALL field shares with researchers in Corpus Linguistics working on CMC.