Microsoft has released a set of vision models called Florence 2. Florence 2 is a prompt-based vision model designed for computer vision and image processing tasks such as image description, object recognition, localization, and segmentation. According to Microsoft, Florence 2 can outperform other specialized and larger vision models in some tasks. To train Florence, Microsoft created the FLD-5B dataset, which contains 5.4 billion annotations for 126 million images. The models come in two sizes, with 0.23B and 0.77B parameters, and are available on Hugging Face for commercial use under the MIT license.
Support our independent, free-access reporting. Any contribution helps and secures our future. Support now:
Sources
News, tests and reports about VR, AR and MIXED Reality.
Dead Second is the perfect after-work VR shooter for Meta Quest 3
Meli turns you into the conductor of a stunning mixed reality world on Meta Quest 3
Steam Sale: Big discounts on many popular VR games — Half-Life: Alyx, Ghosts of Tabor & more
MIXED-NEWS.com
Join our community
Join the DECODER community on Discord, Reddit or Twitter - we can't wait to meet you.