Ad
Ad
Ad
Short

Zyphra has released Zonos-v0.1, an open source model that turns text into natural-sounding speech and can clone voices using just seconds of audio data. The new model supports five languages - English, Japanese, Chinese, French, and German - and gives users control over speaking speed, pitch, audio quality, and emotional tone. According to Zyphra, the model processes audio faster than real-time when running on an RTX 4090 GPU. Zyphra has made Zonos available in two versions: a pure transformer model and a hybrid model that combines state-space models with transformers. Both versions were trained on approximately 200,000 hours of audio data, primarily in English. Users can try out Zonos through a user-friendly Gradio interface, with easy Docker installation for local use. The model is also accessible through the Zyphra Playground or via API for those who prefer cloud-based solutions.

Ad
Ad
Ad
Ad
Short

OpenAI has added sharing capabilities to Canvas, its built-in editor for ChatGPT. The new feature lets users share their Canvas projects with others, enabling real-time viewing, interaction, and editing. Canvas serves as a workspace where users can develop text and code alongside ChatGPT. The editor comes with specialized tools for writing and coding projects, including a Python emulator that runs code directly in the browser. The sharing update follows OpenAI's recent moves to make Canvas more widely available. The company has rolled out Canvas access to all web users and built it into the ChatGPT desktop app for macOS.

Google News