Article published in:
Australian Review of Applied Linguistics
Vol. 17:2 (1994) ► pp. 77103


Brown, A.
(1993) The effect of rater variables in the development of an occupation-specific language performance test. Paper presented at the annual Language Testing Research Colloquium, Cambridge, U.K., August 2-5
Elder, C.
(1993) Are rater judgements of teacher effectiveness wholly language based. Paper presented at the annual Language Testing Research Colloquium, Cambridge, U.K., August 2-5
Fayer, J.M. and E. Krasinski
(1987) Native and nonnative judgements of intelligibility and irritation. Language Learning 37,3:313–326 CrossrefGoogle Scholar
Linacre, J A M.
(1992a) FACETS computer program for many faceted Rasch measurement (version 2.62). Chicago IL: Mesa PressGoogle Scholar
Linacre, J.M.
(1992b) A User’s Guide to Facets. Chicago IL: Mesa PressGoogle Scholar
McNamara, T.
(1990) Item Response Theory and the validation of an ESP test for health professionals, Language Testing 7:52–75 CrossrefGoogle Scholar
in preparation) Second Language Performance Assessment. Unpublished manuscript.
Shohamy, E., C.M. Gordon and R. Kraemer
(1992) The effect of rater’ background and training on the reliability of direct writing test. The Modern Language Journal 76,1:27–33 CrossrefGoogle Scholar
Stahl, J. and Lunz, M.
(1992) Judge Performance Reports. Paper presented at AERA, San Francisco, April
Wigglesworth, G.
(1993) Exploring bias analysis as a tool for improving rater consistency in assessing oral interaction. Language Testing, 10,3:305–335 CrossrefGoogle Scholar
Cited by

Cited by 16 other publications

Ang-Aw, Hui Teng & Christine Chuen Meng Goh
2011. Understanding Discrepancies in Rater Judgement on National-Level Oral Examination Tasks. RELC Journal 42:1  pp. 31 ff. Crossref logo
Brown, James Dean & Russell Changseob Ahn
2011. Variables that affect the dependability of L2 pragmatics tests. Journal of Pragmatics 43:1  pp. 198 ff. Crossref logo
Congdon, Peter J. & Joy MeQueen
2000. The Stability of Rater Severity in Large-Scale Assessment Programs. Journal of Educational Measurement 37:2  pp. 163 ff. Crossref logo
Finn, Bridgid, Burcu Arslan & Matthew Walsh
2020. Applying Cognitive Theory to the Human Essay Rating Process. Applied Measurement in Education 33:3  pp. 223 ff. Crossref logo
Finn, Bridgid, Cathy Wendler, Kathryn L. Ricker‐Pedley & Burcu Arslan
2018.  Does the Time Between Scoring Sessions Impact Scoring Accuracy? An Evaluation of Constructed‐Response Essay Responses on the GRE ® General Test . ETS Research Report Series 2018:1  pp. 1 ff. Crossref logo
Goh, Christine C. M. & Hui Teng Ang-Aw
2018.  In Teacher Involvement in High-Stakes Language Testing,  pp. 197 ff. Crossref logo
Kondo-Brown, Kimi
2002. A FACETS analysis of rater bias in measuring Japanese second language writing performance. Language Testing 19:1  pp. 3 ff. Crossref logo
Schaefer, Edward
2008. Rater bias patterns in an EFL writing assessment. Language Testing 25:4  pp. 465 ff. Crossref logo
Tajeddin, Zia & Minoo Alemi
2014. Criteria and Bias in Native English Teachers’ Assessment of L2 Pragmatic Appropriacy: Content and FACETS Analyses. The Asia-Pacific Education Researcher 23:3  pp. 425 ff. Crossref logo
Walsh, Matthew M., Burcu Arslan & Bridgid Finn
2021. Computational Cognitive Modeling of Human Calibration and Validity Response Scoring for the Graduate Record Examinations (GRE). Journal of Applied Research in Memory and Cognition 10:1  pp. 143 ff. Crossref logo
Wesolowski, Brian C., Stefanie A. Wind & George Engelhard
2015. Rater fairness in music performance assessment: Evaluating model-data fit and differential rater functioning. Musicae Scientiae 19:2  pp. 147 ff. Crossref logo
Winke, Paula
2012.  In The Encyclopedia of Applied Linguistics, Crossref logo
Winke, Paula
2014.  In Technology-mediated TBLT [Task-Based Language Teaching, 6],  pp. 263 ff. Crossref logo
Winke, Paula & Susan Gass
2013. The Influence of Second Language Experience and Accent Familiarity on Oral Proficiency Rating: A Qualitative Investigation. TESOL Quarterly 47:4  pp. 762 ff. Crossref logo
Winke, Paula, Susan Gass & Carol Myford
Winke, Paula, Susan Gass & Carol Myford
2013. Raters’ L2 background as a potential source of bias in rating oral performance. Language Testing 30:2  pp. 231 ff. Crossref logo

This list is based on CrossRef data as of 27 november 2021. Please note that it may not be complete. Sources presented here have been supplied by the respective publishers. Any errors therein should be reported to them.