AI in practice

Jun 19, 2024Jun 19, 2024

Microsoft releases Florence 2 Vision models that can outperform larger specialist models

Matthias is the co-founder and publisher of THE DECODER, exploring how AI is fundamentally changing the relationship between humans and computers.

Profile

E-Mail

Microsoft has released a set of vision models called Florence 2. Florence 2 is a prompt-based vision model designed for computer vision and image processing tasks such as image description, object recognition, localization, and segmentation. According to Microsoft, Florence 2 can outperform other specialized and larger vision models in some tasks. To train Florence, Microsoft created the FLD-5B dataset, which contains 5.4 billion annotations for 126 million images. The models come in two sizes, with 0.23B and 0.77B parameters, and are available on Hugging Face for commercial use under the MIT license.

Support our independent, free-access reporting. Any contribution helps and secures our future. Support now:

Bank transfer

Sources

Hugging Face Paper

Matthias Bastian

Matthias is the co-founder and publisher of THE DECODER, exploring how AI is fundamentally changing the relationship between humans and computers.

Profile

E-Mail

AI research

Jun 30, 2025Jun 30, 2025

Microsoft’s MAI-DxO boosts AI diagnostic accuracy and cuts costs by nearly 70 percent

News, tests and reports about VR, AR and MIXED Reality.

What happens next with MIXED My personal farewell to MIXED Meta and Anduril are now jointly developing XR headsets for the US military MIXED-NEWS.com

AI in practice

Jun 28, 2025Jun 28, 2025

OpenAI renting Google TPUs sends a strong warning shot to Microsoft

AI in practice

Jun 27, 2025Jun 27, 2025

Microsoft’s Braga AI chip faces six-month delay, trails Nvidia’s Blackwell

Google News

Join our community

Join the DECODER community on Discord, Reddit or Twitter - we can't wait to meet you.

Microsoft releases Florence 2 Vision models that can outperform larger specialist models

Microsoft’s MAI-DxO boosts AI diagnostic accuracy and cuts costs by nearly 70 percent

OpenAI renting Google TPUs sends a strong warning shot to Microsoft

Microsoft’s Braga AI chip faces six-month delay, trails Nvidia’s Blackwell

Cloudflare CEO Matthew Prince sees trouble ahead for the open web

New Othello experiment supports the world model hypothesis for large language models

ChatGPT might be draining your brain, MIT warns - what ‘cognitive debt’ means for you

Microsoft releases Florence 2 Vision models that can outperform larger specialist models

Microsoft’s MAI-DxO boosts AI diagnostic accuracy and cuts costs by nearly 70 percent

OpenAI renting Google TPUs sends a strong warning shot to Microsoft

Microsoft’s Braga AI chip faces six-month delay, trails Nvidia’s Blackwell