MiniMax has introduced Hailuo 02, the second generation of its video AI model, with major upgrades in both performance and price.
The new model uses an architecture called Noise-aware Compute Redistribution (NCR), which MiniMax says improves training and inference efficiency by a factor of 2.5. The NCR architecture handles long video sequences differently depending on the stage of training. Early in training, when artificial noise is heavily introduced into the data, videos are compressed as much as possible. Later, when the training videos are clearer, the model processes them at full resolution.

Compared to the previous version, Hailuo 02 has three times more parameters and four times more training data, with MiniMax also noting improvements in data quality and diversity. The company hasn't disclosed exact parameter counts or dataset sizes.
According to MiniMax, Hailuo 02 shows clear gains in handling complex prompts and simulating physical processes. The company claims it's currently the only model able to accurately generate intricate scenes like gymnastics routines.
Video: MiniMax
Hailuo 02 is available in three variants: 768p for six seconds, 768p for ten seconds, and 1080p for six seconds. The previous model was limited to 720p, six-second videos at 25 fps.
In the Artificial Analysis Video Arena benchmark, where users rate videos from competing AI models, Hailuo 02 finished second in the image-to-video category. It placed just behind Bytedance's Seedance and ahead of Google's much-hyped Veo 3. However, this version of Veo 3 does not support audio, which is a major part of its appeal.

Since its demo launch in August last year, people have created over 3.7 billion videos using the Hailuo platform, according to MiniMax. The company describes its initial rollout as very random, but says it quickly attracted widespread attention from creators worldwide.
The model can be accessed via web interface, mobile app, or API. For API users, generating a six-second 768p video costs $0.28, while a 1080p version costs $0.49. By comparison, producing an eight-second 1080p video with Google Veo 3 can cost around $3, depending on the plan.
MiniMax says it's working to improve generation speed, stability, and add new features beyond the current text-to-video and image-to-video options. Competing platforms like Runway already offer more advanced capabilities, such as tracking shots.
The Hailuo 02 release is part of "MiniMax Week," a five-day event during which the Chinese startup also unveiled an open-source language model, MiniMax-M1, complete with parameter counts and a technical paper. In contrast, technical details about Hailuo 02's training architecture remain undisclosed.