text-to-video

Pixverse

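Pixverse V6 Text to Video

Generates video from a text prompt using the Pixverse V6 model, with configurable resolution, duration, and aspect ratio, plus optional audio generation.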

Alibaba

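Wanxiang Wan 2.7 Reference to Video

The Wanxiang Wan 2.7 reference-to-video model accepts multimodal input (text, image, or video) and can cast a person or object as the main subject, generating single-character performances or multi-character interaction videos. It supports intelligent storyboarding to produce multi-shot videos. It supports 720P and 1080P resolutions and durations of 2 to 10 seconds, billed per second. Output includes audio by default.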
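Because Wan 2.7 is billed per second within a fixed duration and resolution range, a rough cost estimate can be computed before submitting a job. The following is a minimal sketch only: the function name `estimate_cost` and the `price_per_second` argument are hypothetical, the listing does not state a price or whether it varies by resolution, and whole-second durations are an assumption.

```python
from decimal import Decimal

# Limits taken from the listing above (720P/1080P, 2 to 10 seconds).
WAN27_RESOLUTIONS = {"720P", "1080P"}
WAN27_MIN_SECONDS = 2
WAN27_MAX_SECONDS = 10


def estimate_cost(duration_s: int, resolution: str, price_per_second: Decimal) -> Decimal:
    """Estimate the charge for one clip under per-second billing.

    price_per_second is supplied by the caller; no real price is assumed here.
    """
    if resolution not in WAN27_RESOLUTIONS:
        raise ValueError(f"resolution must be one of {sorted(WAN27_RESOLUTIONS)}")
    if not WAN27_MIN_SECONDS <= duration_s <= WAN27_MAX_SECONDS:
        raise ValueError(
            f"duration_s must be between {WAN27_MIN_SECONDS} and {WAN27_MAX_SECONDS}"
        )
    # Decimal avoids binary floating-point rounding in currency arithmetic.
    return price_per_second * duration_s
```

A caller would pass the rate from the current price sheet, for example `estimate_cost(5, "720P", Decimal("0.10"))` with a placeholder rate.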

Pixverse

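PixVerse C1 Text to Video

The PixVerse C1 text-to-video model generates high-quality video from text descriptions. It supports multiple resolutions and aspect ratios, optional synchronized audio generation, and video durations of 1 to 15 seconds.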

Google

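Veo 3.1 Lite Text to Video

Generates video from a text prompt using the Google Veo 3.1 Lite model. Supports 4s/6s/8s durations, 720p/1080p resolutions, 16:9 and 9:16 aspect ratios, and optional audio generation.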
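The option set listed for Veo 3.1 Lite is small enough to check on the client before sending a request. The sketch below illustrates that check under stated assumptions: the class `VeoLiteRequest`, its field names, and its default values are hypothetical and do not reflect any real API schema; only the allowed values come from the listing.

```python
from dataclasses import dataclass

# Allowed values as stated in the listing above.
ALLOWED_DURATIONS_S = {4, 6, 8}
ALLOWED_RESOLUTIONS = {"720p", "1080p"}
ALLOWED_ASPECT_RATIOS = {"16:9", "9:16"}


@dataclass(frozen=True)
class VeoLiteRequest:
    """Hypothetical request shape; field names and defaults are illustrative."""
    prompt: str
    duration_s: int = 8
    resolution: str = "720p"
    aspect_ratio: str = "16:9"
    generate_audio: bool = False


def validate(req: VeoLiteRequest) -> list[str]:
    """Return a list of problems; an empty list means the options are acceptable."""
    problems = []
    if not req.prompt.strip():
        problems.append("prompt must not be empty")
    if req.duration_s not in ALLOWED_DURATIONS_S:
        problems.append(f"duration_s must be one of {sorted(ALLOWED_DURATIONS_S)}")
    if req.resolution not in ALLOWED_RESOLUTIONS:
        problems.append(f"resolution must be one of {sorted(ALLOWED_RESOLUTIONS)}")
    if req.aspect_ratio not in ALLOWED_ASPECT_RATIOS:
        problems.append(f"aspect_ratio must be one of {sorted(ALLOWED_ASPECT_RATIOS)}")
    return problems
```

Returning every problem at once, rather than raising on the first, lets a form or CLI report all invalid options in a single pass.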

Kling

Kling v3.0 Pro: Text to Video

Kling 3.0 is a high-quality video generation model. Its strengths are smooth motion and cinematography that closely resembles real footage, with fine control over the rhythm of character movement, camera moves (zooms, pans, and tilts), and spatial relationships within a scene. It renders material texture, lighting changes, and details such as character clothing, props, and backgrounds consistently. It is well suited to short films, commercial storyboards, and dynamic proofs of concept, and clear shot-script prompts further improve controllability. It supports an ultra-fast inference API, offers stable performance with no waiting time, and delivers exceptional value for money.

Kling

Kling v3.0: Standard Text-to-Video

Alibaba

Wan 2.1 Text to Video

Alibaba Tongyi Wan is renowned for its high image quality, strong temporal consistency, and ability to handle complex prompts, making it ideal for large-scale commercial video generation. Wan 2.1 enhances motion stability and texture detail, making it suitable for bulk production in e-commerce and advertising. Text-to-video capabilities allow users to generate storyboards and cinematographic language directly from prompts, enabling rapid prototyping from script to finished video. The real-time inference API offers stable performance, zero wait time, and affordable pricing.

Alibaba

Wan 2.2 Text to Video

Alibaba Tongyi Wan is renowned for its high image quality, strong temporal consistency, and ability to handle complex prompts, making it ideal for large-scale commercial video generation. Wan 2.2 enhances shot continuity and the naturalness of character movements, delivering more stable results in complex scenes. Text-to-video capabilities allow users to generate storyboards and cinematic language directly from prompts, enabling rapid prototyping from script to finished video. The real-time inference API offers stable performance with no waiting time and is affordably priced.

Alibaba

Wan 2.5 Text-to-Video Preview

Alibaba Tongyi Wan is renowned for its high image quality, strong temporal consistency, and sophisticated prompt adherence, making it ideal for large-scale commercial video generation. Wan 2.5 delivers further improvements in image clarity and prompt adherence, while the preview version facilitates rapid trial-and-error testing. Text-to-video capabilities allow users to generate storyboards and cinematographic styles directly from prompts, enabling quick prototyping from script to finished video. The real-time inference API offers stable performance with no waiting time and is affordably priced.

ByteDance

Seedance 1.5 Pro: Text to Video

The Seedance series is designed for production use and prioritizes stability and controllable output. Text-to-video capabilities allow users to generate storyboards and cinematographic styles directly from prompts, enabling rapid prototyping from script to finished video. The real-time inference API delivers stable performance with no waiting time and is affordably priced.

Alibaba

Wan 2.6 Text to Video

The Wan 2.6 series is designed for production use and prioritizes stability and predictable output. Its text-to-video capabilities allow users to generate storyboards and cinematographic language directly from prompts, enabling rapid prototyping from script to finished video. The real-time inference API delivers stable performance with no waiting time and is affordably priced.

OpenAI

Sora 2 Text to Video

The Sora 2 series is designed for production use and prioritizes stability and controllable output. Its text-to-video capabilities allow users to generate storyboards and cinematographic styles directly from prompts, enabling rapid prototyping from script to finished video. The real-time inference API delivers stable performance with no waiting time and is affordably priced.