Brown, A.
(1995) The effect of rater variables in the development of an occupation-specific language performance test. Language Testing, 12 1, 1–15. DOI logoGoogle Scholar
Carey, M. D., Mannell, R. H., & Dunn, P. K.
(2011) Does a rater’s familiarity with a candidate’s pronunciation affect the rating in oral proficiency interviews? Language Testing, 28 1, 201–219. DOI logoGoogle Scholar
Chen, J.
(2009) Authenticity in accreditation tests for interpreters in China. The Interpreter and Translator Trainer, 3 1, 257–273. DOI logoGoogle Scholar
Fayer, J. M., & Krasinski, E.
(1987) Native and nonnative judgments of intelligibility and irritation. Language Learning, 37 1, 313–326. DOI logoGoogle Scholar
Gile, D.
(2009) Interpreting studies: A critical review from within. Monografías De Traducción E Interpretación, 1 1, 135–155. DOI logoGoogle Scholar
Gui, M.
(2012) Exploring differences between Chinese and American EFL teachers’ evaluation of speech performance. Language Assessment Quarterly, 9 1, 186–203. DOI logoGoogle Scholar
Han, C.
(2017) Using analytic rating scales to assess English-Chinese bi-directional interpreting: A longitudinal Rasch analysis of scale utility and rater behaviour. Linguistica Antverpiensia, New Series: Themes in Translation Studies, 16 1, 196–215. DOI logoGoogle Scholar
(2018) A longitudinal quantitative investigation into the concurrent validity of self and peer assessment applied to English-Chinese bi-directional interpretation in an undergraduate interpreting course. Studies in Educational Evaluation, 58 1, 187–196. DOI logoGoogle Scholar
(2019) A generalizability theory study of optimal measurement design for a summative assessment of English/Chinese consecutive interpreting. Language Testing, 361, 419–438. DOI logoGoogle Scholar
(2022) Interpreting testing and assessment: A state-of-the-art review. Language Testing, 39 1, 30–55. DOI logoGoogle Scholar
Han, C., & Riazi, M.
(2018) The accuracy of student self-assessments of English-Chinese bidirectional interpretation: A longitudinal quantitative study. Assessment and Evaluation in Higher Education, 43 1, 386–398. DOI logoGoogle Scholar
Hill, K.
(1996) Who should be the judge? The use of non-native speakers as raters on a test of English as an international language. Melbourne Papers in Language Testing, 5 1, 29–49.Google Scholar
Huang, B. H.
(2013) The effects of accent familiarity and language teaching experience on raters’ judgments of non-native speech. System, 41 1, 770–785. DOI logoGoogle Scholar
Huang, B., Alegre, A., & Eisenberg, A.
(2016) A cross-linguistic investigation of the effect of raters’ accent familiarity on speaking assessment. Language Assessment Quarterly, 13 1, 25–41. DOI logoGoogle Scholar
Kim, Y-H.
(2009a) A G-theory analysis of rater effect in ESL speaking assessment. Applied Linguistics, 30 1, 435–40. DOI logoGoogle Scholar
(2009b) An investigation into native and non-native teachers’ judgments of oral English performance: a mixed methods approach. Language Testing, 26 1, 187–217. DOI logoGoogle Scholar
Linacre, J. M.
(2002) What do infit and outfit, mean-square and standardized mean? Rasch Measurement Transactions, 16 1, 878. [URL]
Liu, M-H.
(2011) Methodology in interpreting studies: A methodological review of evidence-based research. In B. Nicodemus & L. Swabey (Eds.), Advances in interpreting research: Inquiry in action (pp. 85–120). John Benjamins. DOI logoGoogle Scholar
(2013) Design and analysis of Taiwan’s interpretation certification examination. In D. Tsagari & R. van Deemter (Eds.), Assessment issues in language translation and interpreting (pp. 163–178). Peter Lang.Google Scholar
Liu, M-H., Chang, C-C., & Wu, S -C.
(2008) Interpretation evaluation practices: Comparison of eleven schools in Taiwan, China, Britain, and the USA. Compilation and Translation Review, 1 1, 1–42.Google Scholar
Mellinger, C. D., & Hanson, T. A.
(2017) Quantitative research methods in translation and interpreting studies. Routledge.Google Scholar
Seleskovitch, D., & Lederer, M.
(1989) Pédagogie raisonnée de l’interprétation. Didier Érudition.Google Scholar
Setton, R., & Dawrant, A.
Skaaden, H.
(2013) Assessing interpreter aptitude in a variety of languages. In D. Tsagari & R. van Deemter (Eds.), Assessment issues in language translation and interpreting (pp. 35–50). Peter Lang.Google Scholar
Su, W.
(2019) Exploring native English teachers’ and native Chinese teachers’ assessment of interpreting. Language and Education, 33 1, 577–594. DOI logoGoogle Scholar
Wei, J., & Llosa, L.
(2015) Investigating differences between American and Indian raters in assessing TOEFL iBT speaking tasks. Language Assessment Quarterly, 12 1, 283–304. DOI logoGoogle Scholar
Wink, P., Gass, S., & Myford, C.
(2012) Raters’ L2 background as a potential source of bias in rating oral performance. Language Testing, 30 1, 231–252. DOI logoGoogle Scholar
Xi, X., & Mollaun, P.
(2011) Using raters from India to score a large-scale speaking test. Language Learning, 61 1, 1222–1255. DOI logoGoogle Scholar
Yeh, S-P., & Liu, M.
(2006) A more objective approach to interpretation evaluation: Exploring the use of scoring rubrics. Journal of the National Institute for Compilation and Translation, 34 1, 57–78.Google Scholar
Zhang, Y., & Elder, C.
(2011) Judgments of oral proficiency by non-native and native English speaking teacher raters: Competing or complementary constructs? Language Testing, 28 1, 31–50. DOI logoGoogle Scholar
(2014) Investigating native and non-native English-speaking teacher raters’ judgements of oral proficiency in the College English Test-Spoken English Test (CETSET). Assessment in Education: Principles, Policy & Practice, 21 1, 306–325. DOI logoGoogle Scholar