Exploring the performance of ChatGPT-4 in the Taiwan audiologist qualification examination: Preliminary observational study highlighting the potential of AI chatbots in hearing care

Shangqiguo WANG, Changgeng MO, Yuan CHEN, Xiaolu DAI, Huiyi WANG, Xiaoli SHEN

Research output: Contribution to journalArticlespeer-review

Abstract

Background: Artificial intelligence (AI) chatbots, such as ChatGPT-4, have shown immense potential for application across various aspects of medicine, including medical education, clinical practice, and research.

Objective:
This study aimed to evaluate the performance of ChatGPT-4 in the 2023 Taiwan Audiologist Qualification Examination, thereby preliminarily exploring the potential utility of AI chatbots in the fields of audiology and hearing care services.

Methods:
ChatGPT-4 was tasked to provide answers and reasoning for the 2023 Taiwan Audiologist Qualification Examination. The examination encompassed six subjects: (1) basic auditory science, (2) behavioral audiology, (3) electrophysiological audiology, (4) principles and practice of hearing devices, (5) health and rehabilitation of the auditory and balance systems, and (6) auditory and speech communication disorders (including professional ethics). Each subject included 50 multiple-choice questions, with the exception of behavioral audiology, which had 49 questions, amounting to a total of 299 questions.

Results:
The correct answer rates across the 6 subjects were as follows: 88% for basic auditory science, 63% for behavioral audiology, 58% for electrophysiological audiology, 72% for principles and practice of hearing devices, 80% for health and rehabilitation of the auditory and balance systems, and 86% for auditory and speech communication disorders (including professional ethics). The overall accuracy rate for the 299 questions was 75%, which surpasses the examination’s passing criteria of an average 60% accuracy rate across all subjects. A comprehensive review of ChatGPT-4’s responses indicated that incorrect answers were predominantly due to information errors.

Conclusions:
ChatGPT-4 demonstrated a robust performance in the Taiwan Audiologist Qualification Examination, showcasing effective logical reasoning skills. Our results suggest that with enhanced information accuracy, ChatGPT-4’s performance could be further improved. This study indicates significant potential for the application of AI chatbots in audiology and hearing care services. Copyright © 2024 Shangqiguo Wang, Changgeng Mo, Yuan Chen, Xiaolu Dai, Huiyi Wang, Xiaoli Shen.
Original languageEnglish
Article number e55595
JournalJMIR Medical Education
Volume10
Early online dateApr 2024
DOIs
Publication statusPublished - 2024

Citation

Wang, S., Mo, C., Chen, Y., Dai, X., Wang, H., & Shen, X. (2024). Exploring the performance of ChatGPT-4 in the Taiwan audiologist qualification examination: Preliminary observational study highlighting the potential of AI chatbots in hearing care. JMIR Medical Education, 10, Article e55595. https://doi.org/10.2196/55595

Keywords

  • ChatGPT
  • Medical education
  • Artificial intelligence
  • AI
  • Audiology
  • Hearing care
  • Natural language processing
  • Large language model
  • Taiwan
  • Hearing
  • Hearing specialist
  • Audiologist
  • Examination
  • Information accuracy
  • Educational technology
  • Healthcare services
  • Chatbot
  • Health care services

Fingerprint

Dive into the research topics of 'Exploring the performance of ChatGPT-4 in the Taiwan audiologist qualification examination: Preliminary observational study highlighting the potential of AI chatbots in hearing care'. Together they form a unique fingerprint.