Million-image insect dataset promises advances in biodiversity
Midjourney prompted by THE DECODER
The BIOSCAN-1M Insect Dataset aims to expand the cataloging of insect biodiversity through a large dataset of one million hand-labeled insect images. This curated image dataset is primarily intended for training computer vision models to provide image-based taxonomic assessments. Each record is taxonomically classified by an expert and includes associated genetic information, such as raw nucleotide barcode sequences and assigned barcode index numbers. The ultimate goal is to create a comprehensive survey of global biodiversity.

AI News Without the Hype – Curated by Humans
As a THE DECODER subscriber, you get ad-free reading, our weekly AI newsletter, the exclusive "AI Radar" Frontier Report 6× per year, access to comments, and our complete archive.
Subscribe now
Source: 1M Insects