Google's MatCha is a foundation model for understanding charts
Google's MatCha is a foundation model trained for both chart de-rendering and mathematical reasoning. Chart de-rendering explores the reverse engineering of charts, plots, or graphics to reveal their underlying data table or code, while math reasoning seeks to solve question-based problems on textual mathematical datasets. By combining these tasks, MatCha significantly outperforms existing models for visual language understanding of charts. The researchers also proposed DePlot, a model built on top of MatCha for improved reasoning on charts through translation to tables.

AI News Without the Hype – Curated by Humans
As a THE DECODER subscriber, you get ad-free reading, our weekly AI newsletter, the exclusive "AI Radar" Frontier Report 6× per year, access to comments, and our complete archive.
Subscribe now