References

References

Andrich, D.
(1978) Relationships between the Thurstone and Rasch approaches to item scaling. Applied Psychological Measurement 2 (3), 451–462. CrossrefGoogle Scholar
Barik, H. C.
(1971) A description of various types of omissions, additions and errors of translation encountered in simultaneous interpretation. Meta 16 (4), 199–210. CrossrefGoogle Scholar
Bradley, R. A. & Terry, M. E.
(1952) Rank analysis of incomplete block designs: The method of paired comparisons. Biometrika 39 (3/4), 324–345. CrossrefGoogle Scholar
Bramley, T.
Bramley, T., Bell, J. & Pollitt, A.
(1998) Assessing changes in standards over time using Thurstone paired comparisons. Education Research and Perspectives 25 (2), 1–23.Google Scholar
Bühler, H.
(1986) Linguistic (semantic) and extralinguistic (pragmatic) criteria for the evaluation of conference interpretation and interpreters. Multilingua 5, 231–235.Google Scholar
CCHI
(2012) Technical report on the development and pilot testing of the Certified Healthcare Interpreter™ (CHI™) examination for Arabic and Mandarin. https://​cchicertification​.org​/uploads​/CCHI​_Technical​_Report​-CHI​-Arabic​_Mandarin​.pdf (Accessed 9 June 2021).
Chen, J., Yang, H-B. & Han, C.
(2021) Holistic versus analytic scoring of spoken-language interpreting: A multi-perspectival comparative analysis. Manuscript submitted for publication.Google Scholar
Cheung, A. K. F.
(2015) Scapegoating the interpreter for listeners’ dissatisfaction with their level of understanding: An experimental study. Interpreting 17 (1), 46–63. CrossrefGoogle Scholar
Han, C.
(2015) Investigating rater severity/leniency in interpreter performance testing: A multifaceted Rasch measurement approach. Interpreting 17 (2), 255–283. CrossrefGoogle Scholar
(2017) Using analytic rating scales to assess English–Chinese bi-directional interpreting: A longitudinal Rasch analysis of scale utility and rater behaviour. Linguistica Antverpiensia, New Series: Themes in Translation Studies 16, 196–215. https://​lans​-tts​.uantwerpen​.be​/index​.php​/LANS​-TTS​/article​/view​/429​/407
(2018) Using rating scales to assess interpretation: Practices, problems and prospects. Interpreting 20 (1), 59–95. CrossrefGoogle Scholar
(2019) A generalizability theory study of optimal measurement design for a summative assessment of English/Chinese consecutive interpreting. Language Testing 36 (3), 419–438. CrossrefGoogle Scholar
(2021) Interpreting testing and assessment: A state-of-the-art review. Language Testing. CrossrefGoogle Scholar
Hartley, A., Mason, I., Peng, G. & Perez, I.
(2003) Peer- and self-assessment in conference interpreter training. Centre for Languages, Linguistics and Area Studies. https://​researchportal​.hw​.ac​.uk​/en​/publications​/peer​-and​-self​-assessment​-in​-conference​-interpreting​-training
International School of Linguists
(2017) Diploma in Public Service Interpreting: Learner handbook. London, UK. https://​www​.islinguists​.com​/wp​-content​/uploads​/2017​/07​/ISL​-DPSI​-Handbook​-v4​.2​.pdf (Accessed 9 June 2021).
Jones, I. & Inglis, M.
(2015) The problem of assessing problem solving: Can comparative judgement help? Educational Studies in Mathematics 89 (3), 337–355. CrossrefGoogle Scholar
Jones, I. & Wheadon, C.
(2015) Peer assessment using comparative and absolute judgement. Studies in Educational Evaluation 47, 93–101. CrossrefGoogle Scholar
Jones, I., Swan, M. & Pollitt, A.
(2015) Assessing mathematical problem solving using comparative judgement. International Journal of Science and Mathematics Education 13, 151–177. CrossrefGoogle Scholar
Laming, D.
(2004) Marking university examinations: Some lessons from psychophysics. Psychology Learning and Teaching 3 (2), 89–96. CrossrefGoogle Scholar
Lee, J.
(2008) Rating scales for interpreting performance assessment. The Interpreter and Translator Trainer 2 (2), 165–184. CrossrefGoogle Scholar
Lee, S-B.
(2015) Developing an analytic scale for assessing undergraduate students’ consecutive interpreting performances. Interpreting 17 (2), 226–254. CrossrefGoogle Scholar
Linacre, J. M.
(2002) What do infit and outfit, mean-square and standardized mean? Rasch Measurement Transactions 16 (2), 878.Google Scholar
Liu, M.
(2013) Design and analysis of Taiwan’s interpretation certification examination. In D. Tsagari & R. van Deemter (Eds.), Assessment issues in language translation and interpreting. Frankfurt: Peter Lang, 163–178.Google Scholar
Luce, R.
(1959) Individual choice behavior. New York: Wiley.Google Scholar
McMahon, S. & Jones, I.
(2015) A comparative judgement approach to teacher assessment. Assessment in Education: Principles, Policy & Practice 22 (3), 368–389. CrossrefGoogle Scholar
Myford, C. M. & Wolfe, E. W.
(2003) Detecting and measuring rater effects using many-facet Rasch measurement: Part I. Journal of Applied Measurement 4 (4), 386–422.Google Scholar
National Center for State Courts
(2019) Federal Court Interpreter Certification Examination for Spanish/English: Examinee handbook. https://​www​.prometric​.com​/sites​/default​/files​/2019​-09​/fcice​_examineehandbook​.pdf (Accessed 9 June 2021).
Pöchhacker, F.
(2002) Researching interpreting quality: Models and methods. In G. Garzone & M. Viezzi (Eds.), Interpreting in the 21st century: Challenges and opportunities. Amsterdam: John Benjamins, 95–106. CrossrefGoogle Scholar
Pollitt, A.
(2012a) Comparative judgement for assessment. International Journal of Technology and Design Education 22 (2), 157–170. CrossrefGoogle Scholar
(2012b) The method of adaptive comparative judgement. Assessment in Education: Principles, Policies & Practice 19 (3), 281–300.Google Scholar
Pollitt, A. & Murray, N. L.
(1996) What raters really pay attention to? In M. Milanovic, & N. Saville (Eds.), Studies in language testing 3: Performance testing, cognition and assessment. Cambridge: Cambridge University Press, 74–91.Google Scholar
PSI Services LLC
(2013) Development and validation of oral examinations for medical interpreter certification: Mandarin, Russian, Cantonese, Korean, and Vietnamese forms. https://​nbcmi​.memberclicks​.net​/assets​/docs​/tech​-report​-development​-validation​-language​-forms​.pdf (Accessed 9 June 2021).
Riccardi, A.
(1998) Evaluation in interpretation: Macrocriteria and microcriteria. In E. Hung (Ed.), Teaching translation and interpreting 4. Building bridges. Amsterdam: John Benjamins, 115–127. CrossrefGoogle Scholar
Roberts, R. P.
(2000) Interpreter assessment tools for different settings. In R. P. Roberts, S. E. Carr, D. Abraham & A. Dufour (Eds.), Critical link 2: Interpreters in the community. Amsterdam: John Benjamins, 103–130. CrossrefGoogle Scholar
Salkind, N. J.
(2007) Encyclopedia of measurement and statistics. Thousand Oaks, CA: Sage. CrossrefGoogle Scholar
Schjoldager, A.
(1996) Assessment of simultaneous interpreting. In C. Dollerup & V. Appel (Eds.), Teaching translation and interpreting 3: New horizons. Amsterdam: John Benjamins, 187–195. CrossrefGoogle Scholar
Setton, R. & Dawrant, A.
(2016) Conference interpreting: A trainer’s guide. Amsterdam: John Benjamins. CrossrefGoogle Scholar
Setton, R. & Motta, M.
(2007) Syntacrobatics: Quality and reformulation in simultaneous-with-text. Interpreting 9 (2), 199–230. CrossrefGoogle Scholar
Thurstone, L. L.
(1927) A law of comparative judgment. Psychological Review 34 (4), 273–286. CrossrefGoogle Scholar
(1954) The measurement of values. Psychological Review 61 (1), 47–58. CrossrefGoogle Scholar
Tiselius, E.
(2009) Revisiting Carroll’s scales. In C. V. Angelelli & H. E. Jacobson (Eds.), Testing and assessment in translation and interpreting studies. Amsterdam: John Benjamins, 95–121. CrossrefGoogle Scholar
Verhavert, S., Bouwer, R., Donche, V. & De Maeyer, S.
(2019) A meta-analysis on the reliability of comparative judgement. Assessment in Education: Principles, Policy & Practice 26 (5), 541–562. CrossrefGoogle Scholar
Wu, S.
(2010) Assessing simultaneous interpreting: A study on test reliability and examiners’ assessment behavior. PhD thesis, Newcastle University.
(2013) How do we assess students in the interpreting examinations? In D. Tsagari & R. van Deemter (Eds.), Assessment issues in language translation and interpreting. Frankfurt am Main: Peter Lang, 15–33.Google Scholar