Content
summary Summary

Researchers have created an AI model that can understand and explain medical images in two languages, with particularly strong results for Arabic content.

Ad

An international team led by researchers from Mohamed Bin Zayed University has developed BiMediX2, the first AI system of its kind that can analyze and describe medical images in both English and Arabic.

The system works with a wide range of medical imagery, from X-rays and MRI scans to microscopic images, providing detailed descriptions and answering questions about what it sees in either language.

In testing, BiMediX2 showed significant improvements over existing technology, performing 9 percent better with English text and an impressive 20 percent better with Arabic content, according to the technical report.

Ad
Ad

Trained on 1.6 million medical records

The system's impressive performance comes from training on a massive dataset of 1.6 million medical texts and images. To ensure accuracy in both languages, the team used GPT-4o to create initial Arabic translations, which medical experts then reviewed for quality.

Under the hood, BiMediX2 runs on the Llama 3.1 architecture, specially tuned for medical applications. When put to the test, it proved better than GPT-4o at spotting incorrect medical information.

Flowchart: BiMediX2 architecture for medical image analysis with Vision Encoder, Llama 3.1 and English-Arabic bilingual translation.
BiMediX2 combines Vision Encoder, Meta Llama 3.1 and GPT-4o to provide seamless bilingual analysis, automatically translating findings between English and Arabic with expert validation. | Image: Mullappilly, Kurpath et al.

While the results are promising, the researchers stress that BiMediX2 is currently meant for research only, not clinical use. Like all AI systems, it can still make mistakes or generate incorrect information.

The team has made the BiMediX2 models available on Hugging Face and introduced BiMed-MBench, a new bilingual benchmark for testing similar systems.

Ad
Ad
Join our community
Join the DECODER community on Discord, Reddit or Twitter - we can't wait to meet you.
Support our independent, free-access reporting. Any contribution helps and secures our future. Support now:
Bank transfer
Summary
  • Researchers at Mohamed Bin Zayed University have developed BiMediX2, an AI system that can analyze medical images in both English and Arabic.
  • BiMediX2 can process various types of medical images such as X-rays, MRI images and microscopic images. It generates precise descriptions and answers questions about the images in both languages.
  • The system has been trained with 1.6 million medical texts and images, with particular emphasis on the quality of Arabic translations. It is based on the Llama 3.1 architecture and is intended for research purposes, not for clinical use.
Max is managing editor at THE DECODER. As a trained philosopher, he deals with consciousness, AI, and the question of whether machines can really think or just pretend to.
Join our community
Join the DECODER community on Discord, Reddit or Twitter - we can't wait to meet you.