The text prompt describing the desired video motion
Click to upload or drag and drop
Supported formats: JPEG, PNG, WEBP Maximum file size: 10MB
URL of the image to use as the first frame. Must be publicly accessible
The duration of the generated video in seconds
Video resolution. Valid values: 720p, 1080p
Negative prompt to describe content to avoid
Whether to enable prompt rewriting using LLM
Random seed for reproducibility. If None, a random seed is chosen
Explore different use cases and parameter configurations
The text prompt for video generation. Supports Chinese and English, max 800 characters.
The duration of the generated video in seconds
The aspect ratio of the generated video
Video resolution tier
Negative prompt to describe content to avoid. Max 500 characters.
Whether to enable prompt rewriting using LLM. Improves results for short prompts but increases processing time.
Random seed for reproducibility. If None, a random seed is chosen.
Explore different use cases and parameter configurations
Complete guide to using
Alibaba Wan 2.5 API – AI Video Generation with Audio Sync
From text to video or image to video, Wan 2.5 API on Kie.ai delivers cinematic visuals, synchronized audio, and flexible outputs — all at a fraction of the cost.

Introducing the Alibaba Wan 2.5 API for AI Video Creation
Alibaba Wan 2.5 is a state-of-the-art AI video generation model, designed to transform text prompts and reference images into cinematic video outputs. Originally released on Alibaba Cloud’s DashScope platform, it demonstrates advanced capabilities in visual realism, motion dynamics, and audio synchronization. To make these features easier to integrate, Alibaba offers the Wan 2.5 API, which includes both text-to-video (T2V) and image-to-video (I2V) preview endpoints. With the wan2.5-t2v-preview api and wan2.5-i2v-preview api, developers can generate short videos enhanced by lip-sync and audio alignment. Beyond DashScope, Kie.ai now provides direct access to the Wan 2.5 API, giving creators and developers a more flexible, cost-effective way to bring Alibaba’s cutting-edge video technology into apps, workflows, and creative projects—making it a strong alternative to Google’s Veo 3.
Generation Methods Supported by Wan 2.5 API
Text-to-Video(wan2.5-t2v-preview api )
The wan2.5-t2v-preview api enables developers to generate videos directly from text prompts. By describing scenes, actions, and environments, it produces cinematic video clips with smooth motion and synchronized audio—perfect for storyboards, marketing campaigns, and social media content.
Image-to-Video(wan2.5-i2v-preview api )
The wan2.5-i2v-preview api transforms static images into dynamic short videos. It preserves the original identity and style of the image while adding lifelike animations and perspective changes, making it ideal for portraits, product showcases, and creative storytelling.
Key Features That Make Wan2.5 API Stand Out
Native Audio & Seamless A/V Sync with Wan 2.5 API
The Wan 2.5 API makes it possible to generate video and audio together in a single request. Dialogues, ambient sounds, and background music are automatically synchronized with visuals, delivering immersive outputs without extra editing.
Accurate Prompt Adherence with Wan 2.5 Preview API
With the Wan 2.5 text-to-video API, complex prompts are followed more faithfully. Camera angles, lighting setups, and scene dynamics are captured with higher precision, giving developers confidence that each API call will translate creative instructions into consistent video results.
Flexible Style Adaptation through Wan-2.5 API
The Wan 2.5 Preview API supports a wide range of visual styles—from cinematic realism to anime or illustration. It preserves character identity and scene coherence, allowing developers to integrate versatile aesthetics into their applications through a single API.
Multi-Mode API with Flexible Video Generation Options
Wan2.5 API provides both wan2.5-t2v-preview api (text-to-video) and wan2.5-i2v-preview api (image-to-video) endpoints. All modes support multiple resolutions (720p, 1080p), while aspect ratio choices (16:9, 9:16, 1:1) are available for text-to-video generation.
Wan 2.5 API vs. Veo 3: Which Fits Your Needs?
Both Alibaba Wan 2.5 API and Google Veo 3 represent the latest in AI video generation, offering text-to-video and image-to-video capabilities with audio. But their strengths are not the same. Veo 3 is built for cinematic realism, while Wan 2.5 API focuses on native audio-video sync, flexible output options, and stronger multilingual performance.
| Feature | Wan 2.5 API (Alibaba) | Veo 3 (Google) |
|---|---|---|
| Generation Modes | Text-to-Video (wan2.5-t2v-preview api) & Image-to-Video (wan2.5-i2v-preview api) | Text-to-Video & Image-to-Video |
| Audio & A/V Sync | Native audio-video generation with dialogue, ambient sound, and BGM | Audio available but less integrated; focus remains on visuals |
| Prompt Adherence | Strong fidelity to complex instructions, including camera, lighting, and motion | Excellent realism, but may struggle with highly detailed or abstract prompts |
| Style Adaptation | Cinematic realism, anime, illustration; strong stylization support | Focus on cinematic realism, less flexible for stylized outputs |
| Multilingual Support | Reliable with Chinese & minor languages | Limited; often defaults to “unknown language” in non-English prompts |
| Video Duration | Up to 10 seconds | Up to ~8 seconds |
| Aspect Ratio Options | 16:9, 9:16, 1:1 (T2V) | Primarily cinematic formats; fewer ratio options |
How to Get Started with Wan 2.5 API Free on Kie.ai
Step 1: Sign Up / Log In & Get Your Wan 2.5 API Key
Create a account on Kie.ai or log in if you already have one. Once inside the dashboard, generate your Wan 2.5 API key. This secure key will authenticate your requests and connect your app to Alibaba Wan 2.5 endpoints.
Step 2: Test for Wan 2.5 API Free in the Playground
Before integrating, try the Kie.ai API Playground. Here you can run wan2.5-t2v-preview api (text-to-video) and wan2.5-i2v-preview api (image-to-video) with sample prompts. This free testing environment helps you experiment with resolutions, aspect ratios, and audio sync outputs before deployment.
Step 3: Deploy Wan 2.5 API in Your Workflow
Once you’re satisfied with the results, integrate the Wan 2.5 API into your application or workflow. Use the API key to call endpoints directly, customize outputs with prompts, and scale video generation for your project — whether it’s short-form content, marketing campaigns, or creative storytelling apps.
Tips for Getting the Best Results with Alibaba Wan 2.5 API
To make the most of Wan 2.5 API, it’s important to craft clear, detailed, and structured prompts. The model responds best when both the visual and audio instructions are spelled out. Here are practical recommendations:
Write Dialogue with Precision
When adding speech, don’t just request “dialogue.” Instead, provide the exact words to be spoken and specify who says them. This is especially important in multi-character scenes where order and clarity matter. For example: Character A: “We have to keep moving.” Character B: “Not until we find shelter.” By writing dialogue this way, you ensure the API assigns the right lines to the right characters.
Control Silence Explicitly
In some videos, the atmosphere should be driven by visuals or sound effects alone. If you don’t want dialogue, make that clear in your prompt. Adding phrases such as “no dialogue” or “no actors speaking” prevents unintended voices from appearing. This small detail keeps your output aligned with the creative vision.
Define Background Audio and Atmosphere
Beyond dialogue, ambient sound and music set the emotional tone. Be specific about the kind of environment or soundtrack you want, whether it’s natural or dramatic. Examples include: “soft rain tapping on windows with distant thunder” or “fast-paced action music with heavy percussion.” The clearer you are, the better the model can synchronize visuals with sound to create an immersive result.
Enrich Scene Descriptions with Detail
Wan 2.5 excels when prompts include setting, lighting, camera perspective, and mood. Instead of writing “a person walking on a road,” expand the description to capture cinematic elements. For example: A wide shot of a mountain road at sunset, golden light flooding the sky, a cyclist racing downhill, with energetic background music in the background. This depth of description allows the API to produce more natural, dynamic, and visually coherent videos.
Why Choose Kie.ai for AI Video Generation with Wan 2.5 Preview API
Affordable Wan 2.5 API Pricing
Get budget-friendly access to the Alibaba Wan 2.5 API through Kie.ai. Whether you use the text-to-video or image-to-video endpoint, our pricing makes large-scale AI video generation cost-effective.
Free Wan 2.5 API Trial in Playground
Test wan2.5-t2v-preview and wan2.5-i2v-preview instantly with no upfront cost. The Kie.ai Playground lets you experiment with prompts, aspect ratios, and resolutions, and preview synchronized audio-video outputs before deploying.
Complete Wan 2.5 API Documentation
Kie.ai provides full documentation for Wan 2.5 text-to-video API and image-to-video API. From generating your API key to deployment, our guides include clear examples and best practices to help developers integrate quickly and confidently.