Google's MatCha is a foundation model trained jointly on two tasks: chart de-rendering and mathematical reasoning. Chart de-rendering reverse-engineers a chart, plot, or graphic to recover its underlying data table or the code that produced it, while math reasoning involves solving question-based problems from textual math datasets. According to the researchers, combining these tasks lets MatCha significantly outperform existing models at visual language understanding of charts. The researchers also proposed DePlot, a model built on top of MatCha that improves reasoning about charts by first translating them into tables.
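
To make the chart-to-table idea more concrete, here is a minimal Python sketch of the second half of such a pipeline: once a chart has been translated into a text table, a question about it can be answered with ordinary table operations. The linearized string, the `<0x0A>` row separator, the `|` cell separator, and the helper name `parse_linearized_table` are all assumptions made for illustration. The article does not specify DePlot's output format, and no model is actually run here.

```python
def parse_linearized_table(text, row_sep="<0x0A>", cell_sep="|"):
    """Turn a linearized table string into a list of row dicts keyed by header.

    The separators are assumed for illustration, not taken from the article.
    """
    rows = [
        [cell.strip() for cell in line.split(cell_sep)]
        for line in text.split(row_sep)
        if line.strip()
    ]
    header, body = rows[0], rows[1:]
    return [dict(zip(header, row)) for row in body]


# Hypothetical output of a chart-to-table model for a small bar chart.
linearized = "Year | Sales <0x0A> 2020 | 12 <0x0A> 2021 | 18 <0x0A> 2022 | 15"

table = parse_linearized_table(linearized)

# A simple reasoning step over the recovered table:
# which year had the highest sales?
best = max(table, key=lambda row: float(row["Sales"]))
print(best["Year"])
```

The point of the design is the separation of concerns: the visual model only has to recover the data, and the reasoning step works on plain text, where it could just as well be handed to a language model instead of hand-written code.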

Image: ChartQA
Max is managing editor at THE DECODER. As a trained philosopher, he deals with consciousness, AI, and the question of whether machines can really think or just pretend to.