Perplexity AI's "Personal Computer" is an AI assistant that works around the clock - handling emails, presentations, and app control. It runs on a dedicated Mac Mini connected to the user's local apps and Perplexity's servers, controllable from any device. CEO Aravind Srinivas called it a "digital proxy" that never sleeps on X. The service builds on Perplexity Computer, which launched in February and bundles multiple AI models.
Security features include a kill switch and an activity log. Access requires the Max subscription at 200 dollars per month, with only a waiting list available for now. Perplexity is also launching an enterprise version that connects to over 400 tools like Salesforce and Snowflake - the company claims it completed 3.25 years' worth of work internally in four weeks. The concept draws comparisons to the controversial OpenClaw, whose developer now works at OpenAI. Agent-based AI systems dominate the current landscape but face sharp criticism around resource demands and security vulnerabilities.
Meta has reportedly delayed its next AI model, codenamed "Avocado." Originally set for mid-March 2026, it won't ship until May at the earliest, reports the New York Times, citing three people familiar with the matter.
In internal tests, Avocado fell short of leading models from Google, OpenAI, and Anthropic in logical reasoning, programming, and writing. It beat Meta's previous model and Google's Gemini 2.5 but couldn't match Gemini 3.0. Meta's leadership even discussed temporarily licensing Gemini, though no decision was made. A next-gen model codenamed "Watermelon" is already planned. Meta is also building an image and video generator codenamed "Mango."
xAI's Grok 4.20 can't keep up with the top AI models in benchmarks but hallucinates less than any other model tested. According to Artificial Analysis, Grok 4.20 Beta scores 48 on the Intelligence Index with reasoning enabled, well behind Gemini 3.1 Pro Preview and GPT-5.4 at 57, but still a 6-point improvement over Grok 4.
Grok trails the latest models from major AI labs in overall benchmark performance. | Image: Artificial Analysis
xAI shipped three API variants: with reasoning, without reasoning, and a multi-agent mode. The model supports a 2-million-token context window and costs 2 or 6 dollars per million tokens; cheaper than Grok 4 and competitively priced among Western models.
Where Grok 4.20 stands out, of all things, is factual reliability. On the AA Omniscience test, it hit a 78 percent non-hallucination rate, a record, according to Artificial Analysis. The test measures how often a model fabricates an answer instead of admitting it doesn't know, alongside factual recall. Grok 4.20 only got it wrong about one in five times when it didn't have the answer.
Emil Michael, the US Department of War's chief technology officer, made clear that classifying Anthropic as a supply chain risk is an ideologically motivated move. Claude models "pollute" the supply chain because they have a "different policy preference" baked into them, Michael told CNBC. He pointed to Anthropic's "constitution," a ruleset emphasizing ethics and safety, which he said could result in soldiers receiving "ineffective weapons, ineffective body armor, ineffective protection." The measure was "not meant to be punitive," he added.
Copilot Health marks Microsoft's entry into the AI health race alongside OpenAI and Anthropic
Microsoft is launching Copilot Health, an AI health assistant that pulls data from wearables, medical records, and lab results to deliver personalized health advice. Long term, the company says it’s working toward “medical superintelligence.”
Anthropic has launched a new beta feature for its AI chatbot Claude: the ability to generate interactive diagrams, charts, and visualizations directly within the conversation. The feature builds on a preview called "Imagine with Claude" from last fall, combining it with the existing "Artifacts" functionality - but embedded right in the chat flow instead of in a side panel, and labeled as "temporary," according to Anthropic.
Claude decides on its own when a visualization would be helpful, though users can also request one directly. Examples include interactive compound interest curves, an interactive decision tree, and a clickable periodic table. The feature is available across all pricing tiers.
ChatGPT still dominates the chatbot market, but its lead is shrinking. New data from Similarweb shows OpenAI's chatbot accounted for just 61.7 percent of global AI web traffic in February 2026, down from 75.7 percent twelve months earlier. The biggest winner is Google Gemini, which more than quadrupled its share from 5.7 percent to 24.4 percent over the same period. Grok (3.4 percent) and Claude (3.3 percent) have overtaken DeepSeek (3.2 percent) for the first time, claiming third and fourth place. Claude crossed the three percent mark for the first time in February, though it's much stronger in the B2B market, according to a separate study.
ChatGPT still leads overall, but Google Gemini has closed the gap significantly. These figures only cover web traffic. | Image: Similarweb
In absolute numbers, ChatGPT recorded 5.35 billion visits in February, while Gemini pulled in 2.11 billion. Grok came in at 298.5 million visits, Claude at 290.3 million, Deepseek at 246.4 million, and Perplexity at 153.8 million. Microsoft's Copilot stagnated at 1.1 percent market share, though that only reflects the web version. Microsoft's actual share of the enterprise market is likely much higher.
Meta's JEPA architecture outperforms standard AI methods in noisy medical imaging
Researchers have presented an AI model for cardiac ultrasound based on Meta’s JEPA architecture that outperforms common methods such as masked autoencoder or contrastive learning, according to their benchmarks.