AI/TLDR

xAI · 2026-06-16 · major

Grok Imagine Video 1.5 — xAI's image-to-video model goes GA at $0.14/sec 720p

Grok Imagine Video 1.5 is generally available on the xAI Imagine API, grok.com/imagine, and the Grok iOS and Android apps. xAI prices 720p output at $0.14 per second and says a 6-second 720p clip renders in about 25 seconds, down from 40+ in the prior model.

Grok Imagine Video 1.5 announcement card showing xAI's image-to-video model

xAI's image-to-video model — the engine behind Grok Imagine's video clips — is now generally available as a pay-per-second API.

Key specs

720p price$0.14 / sec
480p price$0.08 / sec
Image input$0.01 / image
720p render time~25 sec for 6-sec clip
Clip length1–15 sec, 24 fps

Quick facts

MakerxAI
Model namegrok-imagine-video-1.5
Modestext-to-video, image-to-video, reference-to-video
Resolutions480p or 720p, 24 fps
Clip length1–15 seconds
Price (720p)$0.14 / sec ($4.20 / min)
AvailabilityImagine API + grok.com/imagine + iOS/Android

Pricing

Image input$0.01 / image
480p video$0.08 / sec
720p video · $4.20 / min$0.14 / sec
source ↗

What is it?

Grok Imagine Video 1.5 is xAI's image-to-video model, generally available on the Imagine API plus grok.com/imagine and the Grok iOS and Android apps. It accepts a text prompt, an image, or up to seven reference images and returns short clips with sound effects, ambience, and speech synthesized in the same pass.

How does it work?

Grok Imagine Video 1.5 generates 1–15 second clips at 480p or 720p in seven aspect ratios. xAI exposes three modes: text-to-video from a prompt alone, image-to-video that animates a still, and reference-to-video that grounds the output in up to seven reference images for consistent characters, styles, or settings. xAI says a Video 1.5 Fast pipeline renders a 6-second 720p clip in about 25 seconds, down from 40+ in the prior model.

Why does it matter?

Putting a documented, pay-per-second image-to-video API in front of developers — $0.14 per second at 720p — gives Grok Imagine Video 1.5 a third serious generally-available slot alongside OpenAI Sora and Google Veo. Synchronized audio generated in the same pass is the feature xAI is pitching against rivals that require a separate audio step.

Who is it for?

App and creative-tool developers, marketing teams, social-media producers, animation pipelines.

Try it

grok-imagine-video-1.5 on the xAI Imagine API

Sources · 2 outlets

Tags

  • image-to-video
  • text-to-video
  • video-generation
  • generative-video
  • xai
  • grok
  • grok-imagine
  • api
  • multimodal
  • synchronized-audio

← All releases · Learn AI