Ad
Short

Elevenlabs has released ElevenLabs UI, an open-source library featuring 22 components designed for speech and audio applications. According to the company, the toolkit makes it easy to build user interfaces for chatbots, transcription tools, music projects, or voice agents. All components are fully customizable and distributed under the MIT license, based on the shadcn/ui framework.

Examples include "transcriber-01," a dictation module for web apps, and "voice-chat-03," a chat interface with built-in state management. Additional modules like audio players, conversation bars, and interactive visualizations are also available on the project website.

Developers can freely use, modify, and integrate the source code into their own projects.

Ad
Ad
Short

Microsoft unveiled several new multimodal AI models for Azure AI Foundry at OpenAI DevDay in October 2025. The update includes GPT-image-1-mini, GPT-realtime-mini, and GPT-audio-mini, along with security improvements for GPT-5-chat-latest and the analytics model GPT-5-pro. The new models are designed to help developers build AI applications for text, image, audio, and video faster and at lower cost.

The Microsoft Agent Framework, an open-source SDK for coordinating multiple AI agents, is now available, as is OpenAI's new Agent SDK.

Ad
Ad
Short

OpenAI is adding new controls to its Sora video app. According to Sora head Bill Peebles, users can now decide where AI-generated versions of themselves can appear - for example, blocking political content or banning certain words. Users can also set style guidelines for their digital likeness. These updates come in response to criticism over abusive deepfakes on the platform. Peebles also announced that Sora will soon officially support cameos featuring copyrighted characters. Recently, CEO Sam Altman said rights holders should have "more control" and will soon receive a share of Sora's revenue.

Ad
Ad
Google News