ChatGPT-3 and ChatGPT-4, OpenAI’s language processing models, flunked the 2021 and 2022 American College of Gastroenterology Self-Assessment Tests, according to a study published earlier this week in The American Journal of Gastroenterology.
ChatGPT is a large language model that generates human-like text in response to users’ questions or statements.
Researchers at The Feinstein Institutes for Medical Research asked the two versions of ChatGPT to answer questions on the tests to evaluate their abilities and accuracy.
Each test includes 300 multiple-choice questions. Researchers copied and pasted each question and its answer choices into the AI-powered platform, excluding questions that required images.