New Release

Generate Photorealistic Images in Sub-Seconds

Z-Image-Turbo redefines speed in AI image generation. Create stunning visuals up to 1536x1536 with unparalleled efficiency and quality.

Runs on 16GB VRAM
8-Step Generation
Bilingual Text Rendering
Z-Image-Turbo generating a photorealistic image

Key Features of Z-Image-Turbo

Experience the future of AI image generation with Tongyi-MAI's groundbreaking technology.

Ultra-Fast Generation

Ultra-Fast Generation

Achieve sub-second inference latency with just 8 NFEs (Number of Function Evaluations) on enterprise H800 GPUs, thanks to our efficient Scalable Single-Stream DiT architecture (S3-DiT).

Consumer Hardware Compatible

Consumer Hardware Compatible

Run Z-Image-Turbo on consumer devices with 16GB VRAM, democratizing access to professional-grade image generation without data-center infrastructure.

Bilingual Text Rendering

Bilingual Text Rendering

Seamlessly integrate text in both English and Chinese directly within your generated images, making Z-Image-Turbo ideal for cross-market campaigns and multilingual content creation.

How Z-Image-Turbo Works

Unlock the power of ultra-fast image generation in three simple steps.

1

Input Your Prompt

Enter a detailed text description of the image you want to generate. Leverage prompt enhancement with reasoning for optimal results.

2

Generate in 8 Steps

Z-Image-Turbo uses just 8 NFEs to create a photorealistic image with exceptional aesthetic quality, powered by our Decoupled-DMD distillation.

3

Download & Use

Download your high-resolution image and use it for any purpose – from marketing materials to personal projects. Guidance scale should be set to 0.0.

Frequently Asked Questions

Learn more about the capabilities and technical specifications of Z-Image-Turbo.

Z-Image-Turbo utilizes a Scalable Single-Stream Diffusion Transformer (S3-DiT) architecture and a novel Decoupled-DMD distillation algorithm, enabling photorealistic image generation in just 8 steps (NFEs).
Z-Image-Turbo comfortably runs on consumer devices with 16GB VRAM, making it accessible to a wide range of users without the need for expensive enterprise GPUs.
Compared to models like FLUX.2 and Qwen-Image which require 20B+ parameters, Z-Image-Turbo achieves competitive performance with only 6B parameters, offering a balance of quality and accessibility.
The guidance scale should be set to 0.0 for Z-Image-Turbo models, as they are specifically distilled and optimized for this configuration. Varying from this may produce unpredictable results.

Ready to experience the speed of Z-Image-Turbo?

Transform your creative workflow with sub-second photorealistic image generation.