Grok Imagine is live

Create Videos With Grok Imagine

Using the Grok Imagine model, experience three distinct effects—fun, normal, and spicy—under the same prompt.

Click to upload

JPEG, PNG, WEBP up to 10MB

Note: Spicy mode is not currently supported in Tusheng videos, but support for Spicy mode will be added in the future.

Cost: --Sign in to view credits
Preview

Ready to Generate

Your generated video will appear here

What is the
Grok Imagine

Grok Imagine is xAI’s multimodal image and video generation model that converts text or images into short visual outputs with coherent motion and synchronized audio.

Grok Imagine offers three distinct effect modes: fun, normal, and spicy. Each mode has its own unique characteristics, and of course, the most popular mode right now is the spicy mode.

How It Works

Create Grok Imagine videos in three steps, for both text and image inputs.

1

Select Mode

Choose Text to Video or Image to Video, then enter your prompt (required in Text mode).

2

Upload & Tune

Upload one image for Image to Video, then select aspect ratio (text mode) and motion mode (fun/normal/spicy).

3

Generate

Generate, preview, and download your video once the task completes.

Core Highlights of Grok Imagine Features

Three breakthroughs that set Grok Imagine apart for cinematic, controllable video generation.

Hybrid Mode

Grok Imagine is not just a video generation model; it also supports image generation as a hybrid model. We will later add image generation capabilities, and the generated images can be extended to the image-to-video generation mode.

Try Now

Audio and motion synchronization

Grok Imagine-generated videos come with background sound effects, seamlessly integrating visual effects with synchronized audio. Each video features background music that matches the rhythm and tone of the visuals, significantly streamlining the workflow.

Try Now

Multi-mode hybrid

Grok Imagine offers three generation modes—fun, normal, and spicy—each delivering distinct effects. Try Grok Imagine's Spicy mode for more expressive or artistic video results. Each mode alters how the model interprets prompts, granting you greater creative freedom.

Try Now

Fast and efficient

Grok Imagine generates videos at lightning speed while maintaining exceptional quality, producing results in mere seconds—taking half the time or less compared to similar models.

Try Now

What are the use cases for Grok Imagine?

From social content to ecommerce, Grok Imagine unlocks fast, cinematic video creation.

Social media videos

Grok Imagine excels at creating social media shareable videos, especially those generated in fun mode, making it incredibly easy to craft short, impactful, and shareable videos for platforms like Instagram, TikTok, and X.

Creative Exploration

With Grok Imagine's diverse modes, you can effortlessly explore different visual styles and effects—especially the “spicy” mode. Designers can leverage it to experiment with varied narrative approaches, color palettes, and dynamic effects, creating expressive or abstract works for visual design research.

Entertainment value and playability

The Grok Imagine video model boasts exceptionally strong entertainment value and playability, particularly in Spicy Mode. With its more relaxed constraints, many users leverage it to create and share outstanding effect examples, delivering a more engaging experience to a broader audience.

Grok Imagine Gallery

Shot with Grok Imagine

Play
Play
Play
Play
Play
Play

FAQ about Grok Imagine

Some Information You Might Want to Know About Grok Imagine

Absolutely. All three modes of Grok Imagine are available in VideosGo. The spicy mode functions normally for text-to-video generation, while the spicy mode for image-to-video generation is currently being integrated. The other two modes operate without issue.
VideosGo

Experience
Grok Imagine

Bring cinematic pipelines, curated workflows, and pro showcases into your studio.