A study comparing the performance of OpenAI's ChatGPT (GPT-3.5), Google Bard, and Microsoft Bing (Precision mode) in answering 77 physiology case vignettes showed that ChatGPT significantly outperformed the others (ChatGPT 3.19±0.3, Bard 2.91±0.5, Bing Chat 2.15±0.6, on a scale of 0 to 4). Two physiologists independently scored the responses of the LLMs for accuracy.

While the results highlight the potential for incorporating AI systems into medical education, the study acknowledges the need for further research to determine the effectiveness of these models in different medical fields. It's also possible that specific AI models fine-tuned for medical tasks will win the race, such as Google's recently unveiled Med-PaLM M, which also incorporates vision.

Ad
Join our community
Join the DECODER community on Discord, Reddit or Twitter - we can't wait to meet you.
Ad
Join our community
Join the DECODER community on Discord, Reddit or Twitter - we can't wait to meet you.
Support our independent, free-access reporting. Any contribution helps and secures our future. Support now:
Bank transfer
Sources
Online journalist Matthias is the co-founder and publisher of THE DECODER. He believes that artificial intelligence will fundamentally change the relationship between humans and computers.
Join our community
Join the DECODER community on Discord, Reddit or Twitter - we can't wait to meet you.