Kung TH, Cheatham M, Medenilla A, Sillos C, De Leon L, Elepano C, et al. Performance of ChatGPT on USMLE: potential for AI-assisted medical education using large language models. PLoS Digit Health. 2023;2(2):e0000198.
Chen TC, Multala E, Kearns P, Delashaw J, Dumont A, Maraganore D, et al. Assessment of ChatGPT’s performance on neurology written board examination questions. BMJ Neurol Open. 2023;5(2):e000530.
Bhayana R, Krishna S, Bleakney RR. Performance of ChatGPT on a radiology board-style examination: insights into current strengths and limitations. Radiology. 2023;307(5):e230582.
Dashti M, Khosraviani F, Azimi T, Hefzi D, Ghasemi S, Fahimipour A, et al. Assessing ChatGPT-4’s performance on the US prosthodontic exam: impact of fine-tuning and contextual prompting vs. base knowledge, a cross-sectional study. BMC Med Educ. 2025;25(1):761.
Jin HK, Lee HE, Kim E. Performance of ChatGPT-3.5 and GPT-4 in national licensing examinations for medicine, pharmacy, dentistry, and nursing: a systematic review and meta-analysis. BMC Med Educ. 2024;24(1):1013.
Qingquan T, Feng R, Bin Z, Jingyu Z, Ganglei L, Yanwen Z, et al. Iteratively refined ChatGPT outperforms clinical mentors in generating high-quality interprofessional education clinical scenarios: a comparative study. BMC Med Educ. 2025;25(1):845.
Jin Z, Abola R, Bargnes V 3rd, Tsivitis A, Rahman S, Schwartz J, et al. The utility of generative artificial intelligence chatbot (ChatGPT) in generating teaching and learning material for anesthesiology residents. Front Artif Intell. 2025;8:1582096.
Thesen T, Tuan RL, Blumer J, Lee MW. LLM-based generation of USMLE-style questions with ASPET/AMSPC knowledge objectives: all RAGs and no riches. Br J Clin Pharmacol. 2025. https://doi.org/10.1002/bcp.70119.
Lin Y, Li C, Li H. Comparing AI-generated and traditional textbook multiple-choice questions in nursing education: a prompt engineering-based Delphi study. Med Teach. 2025:1–12. https://doi.org/10.1080/0142159X.2025.2533401.
Mondal H, Dhanvijay AD. AI versus human-generated MCQs: the need for psychometric analysis. Med Teach. 2025:1. https://doi.org/10.1080/0142159X.2025.2532776.
Elnaem MH, Okuyan B, Mubarak N, Thabit AK, AbouKhatwa MM, Ramatillah DL, et al. Students’ acceptance and use of generative AI in pharmacy education: international cross-sectional survey based on the extended unified theory of acceptance and use of technology. Int J Clin Pharm. 2025. https://doi.org/10.1007/s11096-025-01936-w.
Law AK, So J, Lui CT, Choi YF, Cheung KH, Kei-Ching Hung K, et al. AI versus human-generated multiple-choice questions for medical education: a cohort study in a high-stakes examination. BMC Med Educ. 2025;25(1):208.
Kiyak YS, Emekli E. ChatGPT prompts for generating multiple-choice questions in medical education and evidence on their validity: a literature review. Postgrad Med J. 2024;100(1189):858–65.
Tekin M, Yurdal MO, Toraman C, Korkmaz G, Uysal I. Is AI the future of evaluation in medical education? AI vs. human evaluation in objective structured clinical examination. BMC Med Educ. 2025;25(1):641.
Rathje S, Mirea DM, Sucholutsky I, Marjieh R, Robertson CE, Van Bavel JJ. GPT is an effective tool for multilingual psychological text analysis. Proc Natl Acad Sci U S A. 2024;121(34):e2308950121.
Grevisse C. LLM-based automatic short answer grading in undergraduate medical education. BMC Med Educ. 2024;24(1):1060.
Quah B, Zheng L, Sng TJH, Yong CW, Islam I. Reliability of ChatGPT in automated essay scoring for dental undergraduate examinations. BMC Med Educ. 2024;24(1):962.
Marz M, Himmelbauer M, Boldt K, Oksche A. Legal aspects of generative artificial intelligence and large language models in examinations and theses. GMS J Med Educ. 2024;41(4):Doc47.
Alanazi K, Curle S. Challenges experienced by students studying medicine through English medium instruction. Front Educ. 2024;9. https://doi.org/10.3389/feduc.2024.1364860.
Reynolds BL, Zhang XF, Ding C. A mixed-methods study of English vocabulary for medical purposes: medical students’ needs, difficulties, and strategies. Appl Linguist Rev. 2023;14(3):643–78.
R Development Core Team. R: a language and environment for statistical computing. Vienna, Austria: R Foundation for Statistical Computing; 2023. https://www.R-project.org/.
Cohen J. Statistical power analysis for the behavioral sciences. New York: Routledge; 1988.
Dupont WD, Plummer WD Jr. Power and sample size calculations. A review and computer program. Control Clin Trials. 1990;11(2):116–28.
Reis M, Reis F, Kunde W. Influence of believed AI involvement on the perception of digital medical advice. Nat Med. 2024;30(11):3098–100.
Reif JA, Larrick RP, Soll JB. Evidence of a social evaluation penalty for using AI. Proc Natl Acad Sci U S A. 2025;122(19):e2426766122.
Reis M, Reis F, Kunde W. Public perception of physicians who use artificial intelligence. JAMA Netw Open. 2025;8(7):e2521643.
Buabbas AJ, Miskin B, Alnaqi AA, Ayed AK, Shehab AA, Syed-Abdul S, et al. Investigating students’ perceptions towards artificial intelligence in medical education. Healthcare. 2023;11(9). https://doi.org/10.3390/healthcare11091298.
Nagi F, Salih R, Alzubaidi M, Shah H, Alam T, Shah Z, et al. Applications of artificial intelligence (AI) in medical education: a scoping review. Stud Health Technol Inform. 2023;305:648–51.
Abdekhoda M, Dehnad A. Adopting artificial intelligence driven technology in medical education. Interact Technol Smart Educ. 2024;21. https://doi.org/10.1108/ITSE-12-2023-0240.
Kimmerle J, Timm J, Festl-Wietek T, Cress U, Herrmann-Werner A. Medical students’ attitudes toward AI in medicine and their expectations for medical education. J Med Educ Curric Dev. 2023;10:23821205231219344.
Leon-Dominguez U. Potential cognitive risks of generative transformer-based AI chatbots on higher order executive functions. Neuropsychology. 2024;38(4):293–308.
Cambra-Fierro JJ, Blasco MF, López-Pérez MEE, Trifu A. ChatGPT adoption and its influence on faculty well-being: an empirical research in higher education. Educ Inf Technol. 2025;30(2):1517–38.
Crawford J, Allen KA, Pani B, Cowling M. When artificial intelligence substitutes humans in higher education: the cost of loneliness, student success, and retention. Stud High Educ. 2024;49(5):883–97.
Kalam KT, Rahman JM, Islam MR, Dewan SMR. ChatGPT and mental health: friends or foes? Health Sci Rep. 2024;7(2):e1912.
Wu R, Yu ZG. Do AI chatbots improve students learning outcomes? Evidence from a meta-analysis. Br J Educ Technol. 2024;55(1):10.
Al-Roomi K, Alzayani S, Almarabheh A, Alqahtani M, Aldosari F, Aladwani M, et al. Familiarity and applications of artificial intelligence in health professions education: perspectives of students in a community-oriented medical school. Cureus. 2024;16(11):e73425.
Xu YQ, Zhu JD, Wang MK, Qian F, Yang YL, Zhang J. The impact of a digital game-based AI chatbot on students’ academic performance, higher-order thinking, and behavioral patterns in an information technology curriculum. Appl Sci. 2024;14(15). https://doi.org/10.3390/app14156418.
Khan RA, Jawaid M, Khan AR, Sajjad M. ChatGPT – reshaping medical education and clinical management. Pak J Med Sci. 2023;39(2):605–7.
Karatas F, Eriçok B, Tanrikulu L. Reshaping curriculum adaptation in the age of artificial intelligence: mapping teachers’ AI-driven curriculum adaptation patterns. Brit Educ Res J. 2025;51(1):154–80.
Raskob K, Duman H, Kinder J, Lee J, Wilson J, Segerson K. Twelve tips to harness the power of AI for curriculum mapping. Med Teach. 2025:1–10. https://doi.org/10.1080/0142159X.2025.2513427.