Matthias Bastian

Feb 4, 2024

Adept recently introduced Fuyu-Heavy, a new multimodal AI model for digital agents. According to the company, Fuyu-Heavy is the third most capable multimodal model, behind GPT-4V and Gemini Ultra, and excels at multimodal reasoning and UI understanding. It performs well on traditional multimodal benchmarks and matches or exceeds models in its performance class on standard text-based benchmarks. The model scores similarly to Claude 2.0 on chat evaluations and slightly better than Gemini Pro on the MMMU benchmark. Fuyu-Heavy will soon power Adept's enterprise product, and lessons learned from its development have already been applied to the model's successor. The following video demonstrates the model's ability to understand a user interface.

Matthias Bastian

Feb 4, 2024

AI in practice

Google gives Google Maps LLM upgrade for better AI search

Google is introducing a new AI-powered way to discover places in Maps, initially available for select local guides in the US.

Matthias Bastian

Feb 4, 2024

AI research

AI agents can increase military escalation and nuclear risks, study says

Matthias Bastian

Feb 3, 2024

AI and society

GenAI could disrupt over 200,000 entertainment industry jobs by 2026, says study

Matthias Bastian

Feb 3, 2024

AI research

NYU researchers develop AI that mimics a toddler's language learning journey

Matthias Bastian

Feb 2, 2024

AI in practice

Ambassadors from the EU's 27 member states have unanimously approved the world's first comprehensive set of rules for artificial intelligence, confirming a political agreement reached in December. The law regulates AI based on its potential for harm. Despite reservations from France, Germany, and Italy, which called for less stringent rules for high-performance AI models such as OpenAI's GPT-4, the final version of the law includes transparency requirements for all models and additional obligations for high-risk models. The internal market and civil liberties committees will adopt the AI legislation on February 13, followed by a plenary vote on April 10 and 11.

Matthias Bastian

Feb 2, 2024

AI in practice

How Meta CEO Mark Zuckerberg plans to make money from open-source AI

Matthias Bastian

Feb 2, 2024

AI in practice

Ioannis Antonoglou, a former AI researcher at Google DeepMind, has left the company to launch an AI agent startup with two former colleagues, Sherjil Ozair and Misha Laskin, The Information reports. The trio has begun fundraising for the venture, which could compete with startups like Adept and Imbue in the AI agent field. AI agents use technology similar to conversational AI chatbots, such as ChatGPT, to perform complex tasks like booking flights or researching business competitors. According to Alphabet CEO Sundar Pichai, this is a direction Google is also exploring with its Bard chatbot. For Google DeepMind, it's another high-profile AI departure. Recently, three researchers left the company to start a generative AI lab for images and music. Other AI startups led by Google alumni include Character AI, Mistral, Sakana AI, and Reka AI.

Matthias Bastian

Feb 1, 2024

AI in practice

Meta plans to deploy its Artemis AI chip this year to reduce reliance on Nvidia GPUs

Matthias Bastian

Feb 1, 2024

AI in practice

Nomic AI has released an open-source embedding model called Nomic Embed that outperforms OpenAI's Ada-002 and text-embedding-3-small models on both short and long-context tasks. The model is fully reproducible, auditable, and supports a context length of 8192 tokens. Nomic Embed outperformed its competitors on the Massive Text Embedding Benchmark (MTEB) and the LoCo Benchmark, but fell short on the Jina Long Context Benchmark. Model weights and full training data are published for "complete model auditability". Nomic Embed is also available via the Nomic Atlas Embedding API, with one million free tokens for production workloads, and via the Nomic Atlas Enterprise offering.
