AI Tool Tips: Prompt Engineering and first Whisper software

DALL-E 2 prompted by THE DECODER

Phraser is supposed to help with prompt generation for DALL-E 2 and co., while OpenAI's Whisper enables free audio transcriptions.

Image AIs let even people who can barely hold a pen generate creative art, provided they master so-called "prompt engineering": the art of giving the AI the right image command.

This is not as trivial as it sounds. For one thing, you have to be able to translate an image idea into language that is as pictorial as possible. For another, generative image AIs such as DALL-E 2, Midjourney, or Stable Diffusion have countless parameters and styles that strongly influence image generation.

The Phraser web software is designed to facilitate prompt engineering. As usual, you have to develop the image idea yourself, but when it comes to finding the style, Phraser guides you through the various parameters of the individual systems.

Through a step-by-step menu, you can

choose the medium (e.g., photo, template, movie poster),
write a text description with the most important components,
select color, texture, and resolution,
and set the camera settings, the mood, and the era.

After logging in, you get the appropriate prompt for the image AI you selected at the start. In addition, the software offers inspiration by showing similar, previously generated images that roughly match your prompt.
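To make the idea concrete, the menu steps described above can be thought of as fields that are joined into one prompt string. The following is a minimal sketch under that assumption, not Phraser's actual implementation: the names PromptSpec and build_prompt, the field names, and the comma-separated output format are all hypothetical.

```python
from dataclasses import dataclass


@dataclass
class PromptSpec:
    """Hypothetical container for the choices made in a step-by-step menu."""
    description: str      # the image idea itself, written by the user
    medium: str = ""      # e.g. "photo", "movie poster"
    color: str = ""
    texture: str = ""
    resolution: str = ""
    camera: str = ""      # e.g. lens or exposure hints
    mood: str = ""
    era: str = ""


def build_prompt(spec: PromptSpec) -> str:
    """Join all non-empty fields into a single comma-separated prompt."""
    # Put the medium in front of the description if one was chosen.
    if spec.medium.strip():
        subject = f"{spec.medium.strip()} of {spec.description.strip()}"
    else:
        subject = spec.description.strip()
    parts = [subject, spec.color, spec.texture, spec.resolution,
             spec.camera, spec.mood, spec.era]
    # Skip every field the user left empty.
    return ", ".join(p.strip() for p in parts if p and p.strip())


if __name__ == "__main__":
    spec = PromptSpec(
        description="a lighthouse on a cliff",
        medium="photo",
        color="muted blue tones",
        camera="35mm lens",
        mood="melancholic",
        era="1970s",
    )
    print(build_prompt(spec))
```

Which keywords a given image AI actually responds to differs between systems, so the field values here are only illustrative.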

OpenAI Whisper arrives in first tools

With Whisper, OpenAI recently released an open-source model for speech recognition and transcription in various languages. The model is available free of charge, and the first developers are already downloading it and integrating it into tools.
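For readers who want to try the model directly, here is a minimal sketch of a local transcription script. It assumes the open-source whisper Python package (installed with pip install openai-whisper) and ffmpeg are available. The file name interview.mp3, the model size "base", and the helper names format_segments and transcribe_file are assumptions chosen for illustration.

```python
def format_segments(segments):
    """Render transcript segments as '[MM:SS] text' lines.

    Expects dicts with a 'start' time in seconds and a 'text' string,
    which is the shape of the 'segments' list Whisper returns.
    """
    lines = []
    for seg in segments:
        minutes, seconds = divmod(int(seg["start"]), 60)
        lines.append(f"[{minutes:02d}:{seconds:02d}] {seg['text'].strip()}")
    return "\n".join(lines)


def transcribe_file(path, model_name="base"):
    """Transcribe an audio file with Whisper and return timestamped text."""
    # Imported here so the formatting helper works without the package.
    import whisper

    model = whisper.load_model(model_name)  # downloads weights on first use
    result = model.transcribe(path)         # dict with 'text' and 'segments'
    return format_segments(result["segments"])


if __name__ == "__main__":
    # Hypothetical input file; replace with a real recording.
    print(transcribe_file("interview.mp3"))
```

Larger model sizes are generally more accurate but slower, so a small model is a reasonable starting point for a first test.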

With YouTube Whisperer, the cloud platform Hugging Face already hosts an implementation of the model with a simple user interface for transcribing YouTube videos.

Whisper by OpenAI, also on Hugging Face, turns words spoken into a microphone into text within a few seconds. However, the software is only available as a demo that stops after 30 seconds, although you can record several clips in a row.


Probably the most interesting project at the moment is Stage Whisper. A team of volunteers is developing a simple, free transcription app based on Whisper that can also be used by people who are less familiar with the technology. A first version is expected to be released in just a few weeks. Anyone who wants to get involved can sign up on Stage Whisper's Discord channel.

Another project on GitHub, "Whispering," aims to use Whisper for real-time transcription.

