Content
summary Summary

Elon Musk's AI company xAI has unveiled its new Grok 2 language model and a smaller Grok 2 Mini variant.

Ad

Initial tests indicate the models are competitive with other top AI systems in various benchmarks. Grok 2 also appears to have fewer restrictions on image generation compared to other providers. Both models are now available for X users.

According to xAI, both Grok 2 and Grok 2 Mini show significant improvements over Grok 1.5 on academic benchmarks in reasoning, reading comprehension, math, science, and programming. They perform on par with SOTA models in graduate level science (GPQA), general knowledge (MMLU, MMLU-Pro), and math competition tasks (MATH).

Image: xAI

On the LMSYS ranking, a popular comparison benchmark for language models, an early version of Grok 2 called "sus-column-r" ranked third overall, matching GPT-4o's score. The model excels in programming (2nd place), difficult prompts (4th place), and mathematics (2nd place).

Ad
Ad

Early user reports suggest that Grok 2's image generation has fewer limitations than other providers. For example, it can generate political illustrations and images of real people, while ChatGPT and other image generators mostly refuse such requests, or at least try to.

Image: via De Kraker

One user noted Grok 2 seems unconcerned with copyrights. Another user said Grok 2's image generation outperforms Meta Llama's without "dumb guardrails."

Image: via Brendon

The ability to generate images of real people, especially politicians, and AI propaganda raises concerns about misinformation, especially in the run-up to the US elections. Most major model providers block such content to prevent fake political images.

Another risk of AI-generated fake images is that people will claim that real images are AI-generated to discredit them. Essentially, this leads people to stop trusting everything they hear, read, and see unless it comes from a source they trust, making them easy prey for those in authority. Donald Trump has claimed at least twice that real images are AI-generated, fueling conspiracy theories.

Recently, U.S. secretaries of state sent Musk an open letter asking about AI-generated fake news on X. Musk himself shared a video with an AI-cloned voice of Vice President Kamala Harris calling Joe Biden "senile."

Recommendation

xAI works with Black Forest Labs to expand Grok's image capabilities on X by testing the FLUX.1 model. Premium subscribers can try Grok 2 by updating the X app. A new Enterprise API for developers is coming later this month.

Musk claims Grok 3, expected late this year, will be the most powerful AI system ever created.

Ad
Ad
Join our community
Join the DECODER community on Discord, Reddit or Twitter - we can't wait to meet you.
Support our independent, free-access reporting. Any contribution helps and secures our future. Support now:
Bank transfer
Summary
  • Elon Musk's AI startup xAI has introduced its new language model Grok 2 and its smaller variant Grok 2 Mini. Initial tests show that the models can keep up with the competition in benchmarks.
  • In various academic benchmarks, Grok 2 and Grok 2 Mini achieve performance levels that can compete with other top models in areas such as scientific knowledge, general knowledge, and competitive math tasks.
  • Grok 2's image generation seems to have fewer limitations than other providers. The model can represent political illustrations and real people, which is a critical issue in the context of fake news and especially the US elections this fall.
Sources
Online journalist Matthias is the co-founder and publisher of THE DECODER. He believes that artificial intelligence will fundamentally change the relationship between humans and computers.
Join our community
Join the DECODER community on Discord, Reddit or Twitter - we can't wait to meet you.