AI voice synthesis startup ElevenLabs has launched a tool to detect and prevent generative audio fraud. Users can upload audio samples, and the AI speech classifier will determine whether the content was generated by its platform, with a claimed accuracy rate of 99% for unmodified input and 90% for modified input.
Concurrent with the launch, ElevenLabs raised $19 million in a Series A funding round co-led by Nat Friedman, Daniel Gross, and Andreessen Horowitz. The company plans to use the investment to build a voice AI research center and launch additional products targeting market verticals such as publishing, gaming, and entertainment.
Large Language Models (LLMs) are transforming software development, but their newness and complexity can be daunting for developers. In a comprehensive blog post, Matt Bornstein and Rajko Radovanovic provide a reference architecture for the emerging LLM application stack that captures the most common tools and design patterns used in the field. The reference architecture showcases in-context learning, a design pattern that allows developers to work with out-of-the-box LLMs and control their behavior with smart prompts and private contextual data.
"Pre-trained AI models represent the most significant architectural change in software since the internet."
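The in-context learning pattern described above can be sketched in a few lines: rather than fine-tuning a model, the application retrieves relevant private data and injects it into the prompt. This is an illustrative sketch, not code from the blog post; the function names are invented, and the toy keyword-overlap retrieval stands in for the embedding-based vector search a real stack would use.

```python
def retrieve(query: str, documents: list[str], k: int = 2) -> list[str]:
    """Rank documents by word overlap with the query (stand-in for vector search)."""
    q_words = set(query.lower().split())
    ranked = sorted(
        documents,
        key=lambda d: len(q_words & set(d.lower().split())),
        reverse=True,
    )
    return ranked[:k]

def build_prompt(query: str, documents: list[str]) -> str:
    """Assemble a prompt that grounds an off-the-shelf LLM in retrieved context."""
    context = "\n".join(f"- {d}" for d in retrieve(query, documents))
    return (
        "Answer the question using only the context below.\n\n"
        f"Context:\n{context}\n\n"
        f"Question: {query}\nAnswer:"
    )

# Private contextual data the base model has never seen.
docs = [
    "Acme's refund window is 30 days from purchase.",
    "Acme ships to the US and Canada only.",
    "Support hours are 9am-5pm ET on weekdays.",
]
prompt = build_prompt("What is the refund window?", docs)
print(prompt)
```

The assembled prompt would then be sent to any off-the-shelf LLM API; the model's behavior is steered entirely by the prompt and the retrieved data, with no changes to the model's weights.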
Meta's Voicebox is like Stable Diffusion for voices: the generative AI model synthesizes speech from text and can be used for various speech tasks. Voicebox generates realistic and expressive voices and allows attributes such as tone, style, or accent to be transferred from audio samples.
According to Meta, Voicebox outperforms existing speech synthesis models such as Microsoft's VALL-E in terms of speech quality and naturalness. "As the first versatile, efficient model that successfully performs task generalization, we believe Voicebox could usher in a new era of generative AI for speech," Meta said. Due to the risk of misuse, the team has also developed a system for detecting synthesized speech and has no plans to release Voicebox for the time being.