This study aims to explore four dimensions of test validity in the context of the Duolingo English Test: face validity, construct validity, consequential validity, and criterion validity. Through descriptive research, it offers test takers some points to consider when choosing an appropriate test.