xAI · 2026-06-16 · major
Grok Imagine Video 1.5 — xAI's image-to-video model goes GA at $0.14/sec 720p
Grok Imagine Video 1.5 is generally available on the xAI Imagine API, grok.com/imagine, and the Grok iOS and Android apps. xAI prices 720p output at $0.14 per second and says a 6-second 720p clip renders in about 25 seconds, down from more than 40 seconds on the prior model.

xAI's image-to-video model, the engine behind Grok Imagine's video clips, is now generally available as a pay-per-second API.
Key specs
| Spec | Value |
|---|---|
| 720p price | $0.14 / sec |
| 480p price | $0.08 / sec |
| Image input | $0.01 / image |
| 720p render time | ~25 sec for 6-sec clip |
| Clip length | 1–15 sec, 24 fps |
Quick facts
| Field | Value |
|---|---|
| Maker | xAI |
| Model name | grok-imagine-video-1.5 |
| Modes | text-to-video, image-to-video, reference-to-video |
| Resolutions | 480p or 720p, 24 fps |
| Clip length | 1–15 seconds |
| Price (720p) | $0.14 / sec ($8.40 / min) |
| Availability | Imagine API + grok.com/imagine + iOS/Android |
Pricing
| Item | Price |
|---|---|
| Image input | $0.01 / image |
| 480p video | $0.08 / sec |
| 720p video | $0.14 / sec ($8.40 / min) |
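
For budgeting, the per-second and per-image rates above can be turned into a rough per-clip estimate. The sketch below is illustrative only: `estimate_cost` is a hypothetical helper, and it assumes video seconds and input images are billed additively, which the listing does not spell out.

```python
from decimal import Decimal

# Rates copied from the pricing table (USD).
PRICE_PER_SECOND = {"480p": Decimal("0.08"), "720p": Decimal("0.14")}
PRICE_PER_INPUT_IMAGE = Decimal("0.01")


def estimate_cost(seconds: int, resolution: str = "720p", input_images: int = 0) -> Decimal:
    """Estimate the USD charge for one clip.

    Assumption: the video charge and the image-input charge simply add up.
    """
    if resolution not in PRICE_PER_SECOND:
        raise ValueError(f"unsupported resolution: {resolution!r}")
    if not 1 <= seconds <= 15:
        raise ValueError("clip length must be between 1 and 15 seconds")
    if input_images < 0:
        raise ValueError("input_images cannot be negative")
    video_charge = PRICE_PER_SECOND[resolution] * seconds
    image_charge = PRICE_PER_INPUT_IMAGE * input_images
    return video_charge + image_charge


if __name__ == "__main__":
    print(estimate_cost(6))              # 6-second 720p clip from a text prompt: 0.84
    print(estimate_cost(6, "720p", 1))   # same clip animated from one still: 0.85
    print(estimate_cost(15, "480p", 7))  # longest 480p clip, seven references: 1.27
```

`Decimal` is used instead of floats so the cent-level figures stay exact when many clips are summed.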
What is it?
Grok Imagine Video 1.5 is xAI's image-to-video model, generally available on the Imagine API plus grok.com/imagine and the Grok iOS and Android apps. It accepts a text prompt, an image, or up to seven reference images and returns short clips with sound effects, ambience, and speech synthesized in the same pass.
How does it work?
Grok Imagine Video 1.5 generates 1–15 second clips at 480p or 720p in seven aspect ratios. xAI exposes three modes: text-to-video from a prompt alone, image-to-video that animates a still, and reference-to-video that grounds the output in up to seven reference images for consistent characters, styles, or settings. xAI says a Video 1.5 Fast pipeline renders a 6-second 720p clip in about 25 seconds, down from more than 40 seconds on the prior model.
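
The three modes and the limits described above can be checked client-side before a request is sent. The following is a minimal sketch, not xAI's request schema: `ClipRequest` and all of its field names are hypothetical, and the rules that a still and reference images are mutually exclusive, and that text-to-video needs a non-empty prompt, are assumptions.

```python
from dataclasses import dataclass, field
from typing import List, Optional

ALLOWED_RESOLUTIONS = ("480p", "720p")
MAX_REFERENCE_IMAGES = 7


@dataclass
class ClipRequest:
    """Hypothetical request shape; field names are illustrative only."""
    prompt: str = ""
    source_image: Optional[str] = None  # still to animate (image-to-video)
    reference_images: List[str] = field(default_factory=list)
    seconds: int = 6
    resolution: str = "720p"

    def mode(self) -> str:
        """Map the supplied inputs to one of the three described modes."""
        if self.reference_images:
            return "reference-to-video"
        if self.source_image is not None:
            return "image-to-video"
        return "text-to-video"

    def validate(self) -> None:
        """Raise ValueError if the request falls outside the stated limits."""
        if not 1 <= self.seconds <= 15:
            raise ValueError("clip length must be 1-15 seconds")
        if self.resolution not in ALLOWED_RESOLUTIONS:
            raise ValueError("resolution must be 480p or 720p")
        if len(self.reference_images) > MAX_REFERENCE_IMAGES:
            raise ValueError("at most seven reference images are allowed")
        # Assumption: a source still and reference images cannot be combined.
        if self.source_image is not None and self.reference_images:
            raise ValueError("pass a source image or reference images, not both")
        # Assumption: text-to-video requires a non-empty prompt.
        if self.mode() == "text-to-video" and not self.prompt.strip():
            raise ValueError("text-to-video needs a prompt")


if __name__ == "__main__":
    request = ClipRequest(prompt="a fox crossing a river",
                          reference_images=["fox.png", "river.png"])
    request.validate()
    print(request.mode())  # reference-to-video
```

Catching out-of-range requests locally avoids paying round-trip latency for calls that would likely be rejected.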
Why does it matter?
A documented, pay-per-second image-to-video API, priced at $0.14 per second at 720p, makes Grok Imagine Video 1.5 a third serious, generally available option for developers alongside OpenAI Sora and Google Veo. xAI's main pitch is synchronized audio generated in the same pass, in contrast to rivals that require a separate audio step.
Who is it for?
App and creative-tool developers, marketing teams, social-media producers, and teams running animation pipelines.
Try it
grok-imagine-video-1.5 on the xAI Imagine API