Ad
Skip to content
Read full article about: Ukraine opens its battlefield data to allies to train AI models for autonomous drones

Ukraine opens its battlefield data to allies to train AI models for autonomous drones.

"Today, Ukraine has a unique array of battlefield data that is unmatched anywhere else in the world," Defense Minister Mykhailo Fedorov wrote on Telegram. "This includes millions of annotated images collected during tens of thousands of combat flights."

Fedorov had first announced the plan in January, shortly after taking office. Now, he says a platform has been created that provides allies and companies with constantly updating datasets and large quantities of photos and video footage. The goal is to accelerate the development of AI models that can guide drones to their targets without a pilot or quickly analyze vast pools of data.

Ukraine wants to increase the role played by autonomous systems in the war. Top commander Oleksandr Syrskyi said the war had "entered a new phase" with platoons of drone interceptors are now being created inside the Ukrainian armed forces.

Ad
Read full article about: Perplexity's "Personal Computer" promises a tireless AI agent for $200 a month

Perplexity AI's "Personal Computer" is an AI assistant that works around the clock - handling emails, presentations, and app control. It runs on a dedicated Mac Mini connected to the user's local apps and Perplexity's servers, controllable from any device. CEO Aravind Srinivas called it a "digital proxy" that never sleeps on X. The service builds on Perplexity Computer, which launched in February and bundles multiple AI models.

Security features include a kill switch and an activity log. Access requires the Max subscription at 200 dollars per month, with only a waiting list available for now. Perplexity is also launching an enterprise version that connects to over 400 tools like Salesforce and Snowflake - the company claims it completed 3.25 years' worth of work internally in four weeks. The concept draws comparisons to the controversial OpenClaw, whose developer now works at OpenAI. Agent-based AI systems dominate the current landscape but face sharp criticism around resource demands and security vulnerabilities.

Read full article about: Meta delays its next AI model Avocado after internal tests show it can't keep up with Google and OpenAI

Meta has reportedly delayed its next AI model, codenamed "Avocado." Originally set for mid-March 2026, it won't ship until May at the earliest, reports the New York Times, citing three people familiar with the matter.

In internal tests, Avocado fell short of leading models from Google, OpenAI, and Anthropic in logical reasoning, programming, and writing. It beat Meta's previous model and Google's Gemini 2.5 but couldn't match Gemini 3.0. Meta's leadership even discussed temporarily licensing Gemini, though no decision was made. A next-gen model codenamed "Watermelon" is already planned. Meta is also building an image and video generator codenamed "Mango."

Meta says updates are coming "very soon," with more models planned this year. The company found early success with its open Llama models but lost momentum with Llama 4. CEO Mark Zuckerberg has since poured billions into AI, including $14.3 billion in Scale AI. Scale AI's CEO Alexandr Wang now runs Meta's frontier AI division, "TBD Lab," tasked with building superintelligent AI systems. Reports also suggest Meta may be moving away from its open-source strategy.

Read full article about: Grok 4.20 trails Gemini and GPT-5.4 by a wide margin but sets a new record for not hallucinating

xAI's Grok 4.20 can't keep up with the top AI models in benchmarks but hallucinates less than any other model tested. According to Artificial Analysis, Grok 4.20 Beta scores 48 on the Intelligence Index with reasoning enabled, well behind Gemini 3.1 Pro Preview and GPT-5.4 at 57, but still a 6-point improvement over Grok 4.

Grok hängt den neuesten Modellen der großen KI-Labore hinterher. | Bild: Artificial Analysis
Grok trails the latest models from major AI labs in overall benchmark performance. | Image: Artificial Analysis

xAI shipped three API variants: with reasoning, without reasoning, and a multi-agent mode. The model supports a 2-million-token context window and costs 2 or 6 dollars per million tokens; cheaper than Grok 4 and competitively priced among Western models.

Where Grok 4.20 stands out, of all things, is factual reliability. On the AA Omniscience test, it hit a 78 percent non-hallucination rate, a record, according to Artificial Analysis. The test measures how often a model fabricates an answer instead of admitting it doesn't know, alongside factual recall. Grok 4.20 only got it wrong about one in five times when it didn't have the answer.

Ad
Read full article about: US War Department CTO says Anthropic's AI models "pollute" the supply chain with built-in ethics

Emil Michael, the US Department of War's chief technology officer, made clear that classifying Anthropic as a supply chain risk is an ideologically motivated move. Claude models "pollute" the supply chain because they have a "different policy preference" baked into them, Michael told CNBC. He pointed to Anthropic's "constitution," a ruleset emphasizing ethics and safety, which he said could result in soldiers receiving "ineffective weapons, ineffective body armor, ineffective protection." The measure was "not meant to be punitive," he added.

Anthropic is the first US company to receive this classification, which is normally reserved for foreign adversaries. The AI company is suing over the designation and has drawn support from Microsoft, OpenAI, and Google employees, as well as former US military personnel. Anthropic has previously pushed back against its own AI models being used for US mass surveillance and autonomous weapons.

The administration has already signaled its intent to control AI along ideological lines by enacting regulations targeting so-called "woke AI," framed as a commitment to political neutrality. The approach echoes the Chinese government's own efforts to exert political control over AI models.

Comment Source: CNBC

Copilot Health marks Microsoft's entry into the AI health race alongside OpenAI and Anthropic

Microsoft is launching Copilot Health, an AI health assistant that pulls data from wearables, medical records, and lab results to deliver personalized health advice. Long term, the company says it’s working toward “medical superintelligence.”

Read full article about: Claude can now create interactive charts and visualizations directly in chat

Anthropic has launched a new beta feature for its AI chatbot Claude: the ability to generate interactive diagrams, charts, and visualizations directly within the conversation. The feature builds on a preview called "Imagine with Claude" from last fall, combining it with the existing "Artifacts" functionality - but embedded right in the chat flow instead of in a side panel, and labeled as "temporary," according to Anthropic.

Claude decides on its own when a visualization would be helpful, though users can also request one directly. Examples include interactive compound interest curves, an interactive decision tree, and a clickable periodic table. The feature is available across all pricing tiers.

Ad
Read full article about: ChatGPT still leads the chatbot market but its dominance is slipping as Google's Gemini gains ground

ChatGPT still dominates the chatbot market, but its lead is shrinking. New data from Similarweb shows OpenAI's chatbot accounted for just 61.7 percent of global AI web traffic in February 2026, down from 75.7 percent twelve months earlier. The biggest winner is Google Gemini, which more than quadrupled its share from 5.7 percent to 24.4 percent over the same period. Grok (3.4 percent) and Claude (3.3 percent) have overtaken DeepSeek (3.2 percent) for the first time, claiming third and fourth place. Claude crossed the three percent mark for the first time in February, though it's much stronger in the B2B market, according to a separate study.

ChatGPT still leads overall, but Google Gemini has closed the gap significantly. These figures only cover web traffic. | Image: Similarweb

In absolute numbers, ChatGPT recorded 5.35 billion visits in February, while Gemini pulled in 2.11 billion. Grok came in at 298.5 million visits, Claude at 290.3 million, Deepseek at 246.4 million, and Perplexity at 153.8 million. Microsoft's Copilot stagnated at 1.1 percent market share, though that only reflects the web version. Microsoft's actual share of the enterprise market is likely much higher.