Test interpretation, test use, and pedagogical implications
Test score interpretation and use are the staple of construct validity. As such, in addition to the concern with measurement accuracy, it is imperative that the meaning of test scores and their intended use(s) be also documented. Along these lines, qualitative speech analyses are undertaken in the present paper to help in the interpretation of the dimensions underlying student performance on oral tasks. Results of these analyses yield rich information that explicate the meaning of the dimensions by delineating their specific features as manifested in the speech samples. Also discussed in the paper are the ramifications of these results for pedagogical use. Insights that linguistic accuracy and communicative skills in general, and their specific features specifically, provide for instructional material and activities are addressed. Furthermore, a case is made for curricular improvements to help learners develop well-rounded L2 abilities and to improve their use of the language for real-life communication. Finally, with regard to assessment, it is argued that generic assessment criteria do not reflect the critical features operating in a given context, and assessment practitioners are urged to study their contexts of use and to tailor their criteria according to the particulars of those contexts.
References (32)
References
Al-Batal, M. (1995) Issues in the teaching of the productive skills in Arabic. In M. Al-Batal (ed.) The Teaching of Arabic as a Foreign Language Provo, UT, American Association of Teachers of Arabic.
Alosh, M. (1991) Arabic diglossia and its impact on teaching Arabic as a foreign language. In G. Ervin (ed.) International Perspectives on Foreign Language Teaching. Chicago, IL, National Textbook Company.
Alderson, C. & Clapham, C. (1995) Assessing student performance in the ESL classroom. TESOL Quarterly, 291, 184–187.
American Educational Research Association, American Psychological Association, & National Council on Measurement in Education. (1985) Standards for Educational and Psychological Testing. Washington, DC, Author.
Anastasi, A. (1986) Evolving concepts of test validation. Annual Review of Psychology, 371: 1–15.
Anastasi, A. (1990) Ability testing in the 1980’s and beyond: Some major trends. Public Personnel Management, 181:471–485.
Bachman, L. (1990) Fundamental Considerations in Language Testing. Oxford, Oxford University Press.
Barnwell, D. (1989) ‘Naive’ native speakers and judgements of oral proficiency in Spanish. Language Testing, 61:152–163.
Belnap, K. (1995) The institutional setting of Arabic language teaching: A survey of program coordinators and teachers of Arabic in U.S. institutions of higher learning. In M. Al-Batal (ed.) The Teaching of Arabic as a Foreign Language. Provo, UT, American Association of Teachers of Arabic.
Brown, A. (1995) The effect of rater variables in the development of an occupation-specific language performance test. Language Testing, 121:1–15.
Chalhoub-Deville, M. (forthcoming) Theoretical models, assessment frameworks, and test construction. Language Testing.
Chalhoub-Deville, M. (1995a) Deriving oral assessment scales across different tests and rater groups. Language Testing, 121:16–33.
Chalhoub-Deville, M. (1995b) A contextualized approach to describing oral language proficiency. Language Learning, 451: 251–281.
Douglas, D. & Chapelle, C. (1993) In D. Douglas & C. Chapelle (Eds.), A New Decade of Language Testing Research. Alexandria VA, Teachers of English to Speakers of Other Languages Inc.
Elgibali, A. & Taha, Z. (1995) Teaching Arabic as a foreign language: Challenges of the nineties. In M. Al-Batal (Ed.), The Teaching of Arabic as a Foreign Language. Provo UT, American Association of Teachers of Arabic.
Ellis, R. (1995) Interpretation tasks for grammar teaching. TESOL Quarterly, 291: 87–105.
Galloway, V. (1980) Perceptions of the communicative efforts of American students of Spanish. Modern Language Journal, 641: 428–433.
Guion, R. (1980) On trinitarian doctrines of validity. Professional Psychology, 111:385–398.
Hadden, B. (1991) Teacher and non-teacher perceptions of second-language communication. Language Learning, 411, 1–24.
Kramsch, C. (1991) The order of discourse in language teaching. In B. Freed (ed.) Foreign Language Acquisition Research and the Classroom. Lexington MA, D. C. Heath and Company.
Ludwig, J. (1982) Native-speaker judgments of second language learners’ efforts at communication: A review. Modern Language Journal, 661: 274–283.
Messick, S. (1975) The standard problem: Meaning and values in measurement and evaluation. American Psychologist, 301, 955–966.
Messick, S. (1989) Validity. In R. L. Linn (ed.) Educational Measurement (3rd ed). New York, American Council on Education/Macmillan.
Moss, P.A. (1992) Shifting conceptions of validity in educational measurement: Implications for performance assessment. Review of Educational Research, 621:229–58.
National Council on Measurement in Education. (1995) Code of Professional Responsibilities in Educational Measurement. Washington, DC: Author.
Rutherford, W. (1987) Second Language Grammar: Learning and Teaching. NY: Longman.
Ryding, K. (1995) Discourse competence in TAFL: Skill levels and choice of language variety in the Arabic classroom. In M. Al-Batal (ed.) The Teaching of Arabic as a Foreign Language. Provo, UT, American Association of Teachers of Arabic.
Shepard, L. A. (1993) Evaluating test validity. In L. Darling-Hammond (ed.) Review of research in education, 191. Washington, DC, American Educational Research Association.
Taha, Z. (1995) The grammar controversy: What to teach and why. In M. Al-Batal (ed.) The Teaching of Arabic as a Foreign Language. Provo, UT, American Association of Teachers of Arabic.
Younes, M. (1995) An integrated curriculum for elementary Arabic. In M. Al-Batal (ed.) The Teaching of Arabic as a Foreign Language. Provo, UT, American Association of Teachers of Arabic.
Upshur, J. & Turner, C. (March, 1995) Task, judge and scale effects in the rating of speaking ability of primary school ESL learners. Paper presented at the Annual Meeting of Language Testing Research Colloquium, Long Beach, CA.
Wesche, M. (1992) Performance testing for work-related second language assessment. In E. Shohamy & R. Walton (eds) Language Assessment for Feedback: Testing and Other Strategies. Dubuque, IA: Kendall/Hunt Publishing Company.
Cited by (1)
Cited by one other publication
Jazi Shaydied Alotaibi, Abdullah Alotaibi, Sharifa Alasiry, Bader Alrasheadi, Wdad Alanazy, Sameer Alkubati & Llego, Jordan
2024.
Reasons for academic cheating in a cohort of nursing students in Saudi Arabia: a cross-sectional study.
Journal of Medicine and Life 17:4
► pp. 418 ff.
This list is based on CrossRef data as of 2 september 2024. Please note that it may not be complete. Sources presented here have been supplied by the respective publishers.
Any errors therein should be reported to them.