Empirical Research on Improving of Validity and Reliability in Educational Assessment Using Cognitive Intelligence Models
Main Article Content
Keywords
cognitive intelligence models, educational assessment, personalized feedback, AI in education
Abstract
Purpose: The primary objective of this study was to explore the application of cognitive intelligence models in educational assessment, focusing on how these models can improve assessment validity, reliability, and feedback effectiveness. The research aimed to evaluate whether AI-powered assessments could better capture student performance, provide personalized feedback, and contribute to more accurate educational evaluations. Methodology: An empirical research design was employed to investigate the impact of cognitive intelligence models on educational assessments. Data were collected from a sample of students using both AI-powered assessments and traditional paper-based exams. The study involved comparing key performance indicators such as assessment validity, reliability, and feedback effectiveness. Statistical analyses, including Cronbach’s alpha for reliability and t-tests for performance comparison, were used to assess the significance of the results. Results: The findings indicated a significant improvement in the validity and reliability of AI-powered assessments. The experimental group using AI assessments scored 85.6% on average, while the control group using traditional exams scored 78.2%, with the difference being statistically significant (p < 0.01). AI assessments demonstrated higher reliability (Cronbach’s alpha = 0.92) compared to traditional exams (Cronbach’s alpha = 0.81). Personalized feedback from the AI system was highly rated by students, with 85% reporting it to be actionable and beneficial for improving learning outcomes. Additionally, AI assessments proved effective in predicting future student performance with an accuracy of 87%. Conclusion: The results demonstrate that cognitive intelligence models significantly enhance the quality of educational assessments, providing more valid, reliable, and personalized evaluations. These models offer advantages over traditional assessment methods by adapting to individual student needs and providing real-time, actionable feedback. Despite the potential benefits, challenges related to data privacy, algorithmic bias, and the role of human educators must be addressed to ensure the ethical and effective implementation of AI in educational settings.
References
- [1] Gitomer, D. H., Martínez, J. F., & Battey, D., et al. (2021) Assessing the assessment: Evidence of reliability and validity in the edTPA. American Educational Research Journal, 58(1), 3–31.
- [2] Tuah, N. A. A., & Naing, L. (2021) Is online assessment in higher education institutions during COVID-19 pandemic reliable?. Siriraj Medical Journal, 73(1), 61–68.
- [3] Hickman, L., Bosch, N., & Ng, V., et al. (2022) Automated video interview personality assessments: Reliability, validity, and generalizability investigations. Journal of Applied Psychology, 107(8), 1323.
- [4] Sillat, L. H., Tammets, K., & Laanpere, M. (2021) Digital competence assessment methods in higher education: A systematic literature review. Education Sciences, 11(8), 402.
- [5] Berlian, M., Vebrianto, R., & Thahir, M. (2021) Development of Webtoon Non-Test Instrument as Education Media. International Journal of Evaluation and Research in Education, 10(1), 185–192.
- [6] Radeswandri, R., Budiawan, A., & Vebrianto, R., et al. (2021) Developing instrument to measure the use of online comic as educational media. Journal of Education and Learning (EduLearn), 15(1), 119–126.
- [7] Hooda, M., Rana, C., & Dahiya, O., et al. (2022) Artificial intelligence for assessment and feedback to enhance student success in higher education. Mathematical Problems in Engineering, 2022(1), 5215722.
- [8] French, S., Dickerson, A., & Mulder, R. A. (2024) A review of the benefits and drawbacks of high-stakes final examinations in higher education. Higher Education, 88(3), 893–918.
- [9] Wang, B., Rau, P. L. P., & Yuan, T. (2023) Measuring user competence in using artificial intelligence: validity and reliability of artificial intelligence literacy scale. Behaviour & information technology, 42(9), 1324–1337.
- [10] Khan, M. A., Vivek, V., & Khojah, M., et al. (2021) Learners’ perspective towards e-exams during COVID-19 outbreak: Evidence from higher educational institutions of India and Saudi Arabia. International Journal of Environmental Research and Public Health, 18(12), 6534.
- [11] Iglesias Pérez, M. C., Vidal-Puga, J., & Pino Juste, M. R. (2022) The role of self and peer assessment in Higher Education. Studies in Higher Education, 47(3), 683–692.
- [12] Gašević, D., Greiff, S., & Shaffer, D. W. (2022) Towards strengthening links between learning analytics and assessment: Challenges and potentials of a promising new bond. Computers in Human Behavior, 134, 107304.
- [13] Akhmedov, B. A. (2022) Analysis of the Reliability of the Test form of Knowledge Control in Cluster Education. Psychology and Education, 59(2), 403–418.
