No foundational LLM currently complies with EU AI Act

Researchers at Stanford University have studied which foundational AI language models can be used under the EU AI Act. The results speak for or against the EU AI Act, depending on your point of view.

None of the ten language models examined achieved full compliance with the EU AI Act, which would be equivalent to 48 points. The researchers evaluated the language models in a total of twelve categories, each worth a maximum of four points.

The categories include transparency of data sources, handling of proprietary data, risk mitigation, and computational and energy requirements. Two lead authors performed the initial scoring according to a predetermined methodology. Their scores were then discussed and voted on by all authors.

Eine Tabelle mit den Kategorien der Evaluation. — The categories of the evaluation. | Image: Stanford University

Open Source at the top of the list

The two most EU-compliant models are Bloom (36 points) from Big Science and GPT-NeoX (29 points) from EleutherAI. Both models are open source and therefore more transparently documented than those of commercial providers, which take competition into account. But while open-source models are generally getting better, they still lack performance and are likely to continue to do so, according to OpenAI CEO Sam Altman.

Persistent challenges where most model providers perform poorly include the use of copyrighted data in training data, unclear information about computational and energy requirements, unclear information about risk mitigation measures, and a lack of standards for evaluating model performance, particularly regarding adverse effects.

Die Sprachmodelle im Vergleich. — A comparison of the language models. | Image: Stanford University

Because of the Brussels effect, the Stanford researchers believe the EU AI law is the most important regulatory AI initiative currently, as lawmakers around the world will look to it for guidance and multinational companies will seek consistent AI development processes. This, in turn, will shape the digital supply chain and the societal impact of AI, the researchers said.

AI Act compliance "within reach"

Despite the low compliance of many providers, the authors of the study believe that an overall score in the 30 to 40 range would be achievable for many through "meaningful, but plausible changes." Incentives such as fines for non-compliance may be sufficient here, without much regulatory pressure.

We believe sufficient transparency to satisfy the Act’s requirements related to data, compute and other factors should be commercially feasible if foundation model providers collectively take action as the result of industry standards or regulation.

From the study

Implementing the Act's 12 requirements would lead to "significant positive change in the foundation model ecosystem" and is within reach for most providers, despite poor results at first glance. However, the current trend is toward less transparency.

"Overall, our analysis speaks to a broader trend of waning transparency: providers should take action to collectively set industry standards that improve transparency, and policymakers should take action to ensure adequate transparency underlies this general-purpose technology."

Join our community

Join the DECODER community on Discord, Reddit or Twitter - we can't wait to meet you.

Recommendation

AI research

No foundational LLM currently complies with EU AI Act

Open Source at the top of the list

AI Act compliance "within reach"

Apple's claims about large reasoning models face fresh scrutiny from a new study

AI system StreamDiT generates livestream videos from text at 16 fps 512p

Researchers used 1,600 YouTube fail videos to show AI models struggle with surprises

AI coding can make developers slower even if they feel faster

OpenAI launches new ChatGPT agent that automates complex tasks for Pro, Plus, and Team

Kimi-K2 is the next open-weight AI milestone from China after Deepseek

New Energy-Based Transformer architecture aims to bring better "System 2 thinking" to AI models

No foundational LLM currently complies with EU AI Act

Open Source at the top of the list

AI Act compliance "within reach"

Share

Bank details