Generor 登录注册

生成器公开

视频使用最新的 AI 模型生成和扩展视频——文本生成视频和图像生成视频

AI 模型

每个模型各有所长。所示费用为每秒视频的价格。

Hailuo 2 3-16

Advanced physics simulation for realistic complex movements. Supports end frame for loop creation.

Seedance 1.0 Pro Fast 3-12

Cheaper, faster variant of Seedance 1.0 Pro via BytePlus. ~50% cost savings vs the standard 1.0 Pro at the same resolutions.

Seedance 1.5 Pro 3-12

ByteDance Seedance 1.5 Pro via BytePlus. High-precision audio-visual sync, cinematic motion, and emotional expression. Supports first/last frames.

Seedance 1.0 Lite I2V 4-14

ByteDance Seedance 1.0 Lite Image-to-Video — efficient, cost-effective animation of source images. Direct via BytePlus.

Dreamina Seedance 2.0 Fast 5-20

Faster, cheaper sibling of Seedance 2.0 via BytePlus. Same multimodal capability set (text + image + video + audio references, native sync audio, multi-shot narration) at roughly 40% lower per-second cost. Trades a bit of fidelity for speed — ideal for iteration loops.

Seedance 1.0 Pro 6-30

ByteDance Seedance 1.0 Pro via BytePlus. Comprehensive and powerful video generation with strong motion control.

Dreamina Seedance 2.0 8-30

ByteDance flagship multimodal video model via BytePlus — accepts text + reference images + reference video + reference audio. Native synchronized audio, pro camera controls, multi-shot narration. Cheaper than the Replicate route and exposes capabilities Replicate hides.

P-Video 8

Pruna P-Video — fast, affordable text-to-video at $0.02/sec. Standard aspect ratios, 3-15 second clips. Strong bang-for-buck for prototypes and short-form content.

Grok Imagine Video 10

xAI Imagine API video — fast text-to-video and image-to-video at a flat $0.05/sec. Async polling pattern; supports 720p, 1-15 second durations. Native audio is included on every generation (cannot be turned off).

Wan 2.6 I2V Flash 10-15

Fast image-to-video with optional audio sync. Faster inference than standard Wan 2.6 I2V. Up to 15 seconds.

Happy Horse 1.0 I2V 14

Direct-to-DashScope Happy Horse 1.0 image-to-video. Strict consistency with the source image, fluent natural motion, native audio. 3-15 seconds.

Happy Horse 1.0 T2V 14

Direct-to-DashScope Happy Horse 1.0 text-to-video. Cheaper than the Replicate path. 3-15 second durations, five aspect ratios, native audio always on.

Happy Horse 1.0 R2V 20

Reference-to-video — combines up to 9 reference images for strong subject + scene consistency. Direct-to-DashScope only (Replicate proxy hides this).

Veo 3.1 Fast 20

Google's Veo 3.1 Fast with native audio and frame-to-frame generation. Supports start and end frames for seamless transitions.

Wan 2.6 I2V 20-30

Alibaba Wan 2.6 image-to-video with multi-shot storytelling, native audio, and precise lip-sync. Up to 15 seconds.

Wan 2.6 T2V 20-30

Alibaba's latest text-to-video model with multi-shot storytelling, native audio, and precise lip-sync. Up to 15 seconds.

Happy Horse 1.0 Video Edit 24

Local or global edits to an existing video using natural-language instructions and up to 5 reference images. Preserves original motion. Direct-to-DashScope only.

Kling v3 34-45

Kuaishou Kling v3 — cinematic text-to-video and image-to-video up to 15 seconds with native audio and lip-synced dialogue. Supports start and end frames. Standard mode = 720p, Pro mode = 1080p.

Veo 3.1 40

Google's flagship video model with strongest prompt adherence and cinematic motion. Synchronized native audio, reference images, and start+end frame control.

Kling v2.1 Master 56

Premium Kling model with enhanced quality and longer durations.

管理 Video 模型

质量

480P 5

720P 9

1080P 20

模式

文本转视频

图像转视频

视频转视频

宽高比

unspecified Unspecified

16:9 Wide

9:16 Tall

时长

5 seconds 25

6 seconds 30

7 seconds 35

8 seconds 40

10 seconds 50

12 seconds 60

15 seconds 75

起始图片

选择一张图片以制作成视频动画。AI 将根据你的提示词让它动起来。

上传起始图片

选择文件未选择文件

— OR —

输入网址

源视频

上传现有视频进行编辑。描述你想要的更改——替换、场景调换、重新着色。原有的运动会被保留。

上传源视频

选择文件未选择视频

— OR —

输入网址

或从您的素材库中选择：访问任意一个你的视频作品并点击“编辑视频”——您将进入此页面，并已预加载源文件。

结束帧（可选）

设置结束帧以引导视频过渡的方向。非常适合创建循环或可控动画。

将起始图片用作结束帧（平滑循环）

上传结束帧

选择文件未选择文件

— OR —

输入网址

参考输入 (可选)

在多次生成中固定角色的外观或动作风格。在提示中引用它们，格式为 [Image1], [Image2], [Video1].

[Image1] 参考图片

选择文件未选择文件

— OR —

输入网址

[Image2] 参考图片

选择文件未选择文件

— OR —

输入网址

[Image3] 参考图片

选择文件未选择文件

— OR —

输入网址

[Video1] 参考视频

选择文件未选择文件

— OR —

输入网址

视频提示词 *

计算中…

提示：描述你想要的场景、动作和风格。上传图片以进行图片转视频转换。

生成音频

否

是

图像来源

上传图片

从文本生成

图像模型

P-Image 1

Pruna P-Image（直连）— 为生产打造的亚秒级文生图。在标准宽高比下具备强大的提示词遵循度和清晰的文字渲染。阅读更多 →

Z-Image Turbo 1

使用此模型需要登录。 Alibaba Z-Image Turbo — 轻量级文生图，支持中英双语文字渲染。快速、低成本，分辨率从 512×512 到 2048×2048 灵活可选。阅读更多 →

Flux Schnell 2

超快图像生成，成本仅为零头。非常适合快速迭代和验证创意。阅读更多 →

P-Image Edit 2

Pruna P-Image Edit — 专注于图像编辑，支持 1-5 张参考图、亚秒级推理和精确的提示词遵循。严格仅限图生图（至少需要一张源图）。在标准宽高比下输出 256-1440 px。阅读更多 →

Wan 2.6 Image 2

使用此模型需要登录。 Alibaba Wan 2.6 文生图 — 出色的电影级摄影和艺术风格。渲染分辨率为 1280×1280 至 1440×1440，支持五种宽高比。阅读更多 →

Grok Imagine 4

使用此模型需要登录。 xAI 的 Imagine API 图像模型 — 快速生成和编辑，提示词还原度高。阅读更多 →

Qwen Image 4

使用此模型需要登录。 Alibaba Qwen Image — 旗舰文生图，具备强大的文字渲染、多语言提示词支持和清晰逼真的画质。阅读更多 →

Wan 2.7 Image 4

使用此模型需要登录。 Alibaba Wan 2.7 image — 文生图、至多 9 张参考图的图像编辑，以及多图参考生成。最高 2K 分辨率。此前通过 Replicate 提供，成本约为现在的 30 倍。阅读更多 →

SeedEdit 3.0 6

使用此模型需要登录。 BytePlus SeedEdit 3.0 — 专注于图像编辑的模型。接受一张源图加一条指令，并就地应用编辑（如"把气泡变成心形"、"把天空改成日落"）。自适应输出尺寸跟随输入图像的尺寸。严格仅限图生图（不支持纯文生图）。阅读更多 →

Seedream 4.0 6

使用此模型需要登录。 BytePlus Seedream 4.0 — 多模态图像创作，支持文本 + 单图 + 多图输入。多图融合至多 10 张参考图，连续图集生成至多 15 张输出，支持 4K 超高清。经 BytePlus 直连。阅读更多 →

Seedream 5.0 Lite 7

使用此模型需要登录。 BytePlus Dola-Seedream 5.0 Lite — 旗舰图像模型，支持联网检索、多图融合、图集生成和强一致性保持。支持文生图和图生图（至多 4 张参考图）。最高 2K 输出。经 BytePlus 直连。阅读更多 →

Qwen Image Edit 8

使用此模型需要登录。 Alibaba Qwen Image Edit — 图生图，支持多参考编辑、图中文字渲染和基于范例的构图。阅读更多 →

Seedream 4.5 8

使用此模型需要登录。 BytePlus Seedream 4.5 — 精进的图像模型，具备强大的编辑一致性、多图融合、更精细的细节控制、自然的小字和人脸渲染，以及 2560×1440 至 4096×4096 输出。支持文生图和图生图，至多 4 张参考图。阅读更多 →

Wan 2.7 Image Pro 8

使用此模型需要登录。 Alibaba Wan 2.7 image Pro — 高端变体，支持 4K 文生图、提升质量的思考模式、图像编辑和多图参考。阅读更多 →

Seedream 4.0 9

使用此模型需要登录。新一代图像模型，统一生成与编辑。支持连续图像生成以保持连贯性，并支持多参考工作流。阅读更多 →

Grok Imagine Quality 10

使用此模型需要登录。 xAI 更高质量的 Imagine 图像模型 — 相比标准版 Grok Imagine，细节更锐利、构图更强，但每张图成本更高。阅读更多 →

Gemini 2.5 Flash Image 12

使用此模型需要登录。 Google 的快速图像模型，质量与速度俱佳。出色地遵循自然语言提示词。⚠️ 该模型仅支持正方形（1:1）和匹配输入的宽高比。阅读更多 →

Seedream 4.5 12

使用此模型需要登录。升级版 Bytedance 图像模型，具备更强的空间理解和世界知识。阅读更多 →

Nano Banana 2 14

使用此模型需要登录。 Google 最新的快速图像模型，具备高效率的生产级视觉创作能力。支持最高 4K 的多种分辨率。阅读更多 →

Nano Banana Pro 27

使用此模型需要登录。 Google 的专业设计引擎，配备推理内核，可生成录音棚级 4K 视觉、复杂版面和精准文字渲染。阅读更多 →

管理 Image 模型

隐私模式

公开

您的作品将对所有人可见，并可能出现在公开图库和搜索结果中。

私密

只有您能看到此作品。它不会出现在任何公开信息流中，其他人也无法访问。

团队

仅与您的团队成员分享此创作。团队以外的其他人无法访问。

许可证

Generor Open

所有人均可免费使用、修改和商用，无需署名。了解更多 →

Generor Exclusive

只有您能使用此作品。其他人不能复制、修改或分发它。了解更多 →

输出语言

自动（输入语言）

English (United States)

Spanish (Spain)

French (France)

German (Germany)

Italian (Italy)

Portuguese (Brazil)

Arabic (Generic)

Bengali (India)

Bulgarian (Bulgaria)

Croatian (Croatia)

Czech (Czech Republic)

Danish (Denmark)

Dutch (Belgium)

Dutch (Netherlands)

Estonian (Estonia)

Finnish (Finland)

Greek (Greece)

Gujarati (India)

Hebrew (Israel)

Hindi (India)

Hungarian (Hungary)

Indonesian (Indonesia)

Japanese (Japan)

Kannada (India)

Korean (South Korea)

Latvian (Latvia)

Lithuanian (Lithuania)

Malayalam (India)

Mandarin Chinese (China)

Marathi (India)

Norwegian Bokmål (Norway)

Polish (Poland)

Romanian (Romania)

Russian (Russia)

Serbian (Cyrillic)

Slovak (Slovakia)

Slovenian (Slovenia)

Swahili (Kenya)

Swedish (Sweden)

Tamil (India)

Telugu (India)

Thai (Thailand)

Turkish (Turkey)

Ukrainian (Ukraine)

Urdu (India)

Vietnamese (Vietnam)

温度 ?

1.0

控制 LLM 模型的创意性与一致性。默认值：1.0。越低 = 越聚焦/确定，越高 = 越有创意/随机。

思考中

推理模型在回答前会先思考。关闭 = 1×，开启 = 2×，高 = 模型每次调用基础费用的 3×。

关闭 1

开启 2

高 3

免费 AI Video Generator - Text to Video

AI text to video generator — describe the scene you want to see and the AI creates a short video clip. Also supports image to video: upload a picture and bring it to life. Choose video length, aspect ratio and style across advanced video generation models.

适合谁

为 TikTok、Instagram Reels 和 YouTube Shorts 制作吸睛视频的社媒创作者、为产品和服务生成推广视频的营销团队、可视化概念与讲解的教育者、制作概念视频与预告片的游戏开发者，以及尝试视觉叙事的创意专业人士。

核心功能

可使用多种视频生成模型，可调视频长度选项，风格与运动控制，高质量输出，以及文生视频与图生视频的转换能力。

变体

Image to Video TikTok Video Reel YouTube Video Short

类似的生成器

图片使用顶尖的 AI 模型生成惊艳的图像——文生图与图生图

文本转语音（TTS）让 AI 将你的文本朗读为音频文件，具有自然的语调和情感控制

音乐生成任意风格的录音室级音乐

视频使用最新的 AI 模型生成和扩展视频——文本生成视频和图像生成视频

起始图片

源视频

结束帧（可选）

参考输入 (可选)

免费 AI Video Generator - Text to Video

适合谁

核心功能

变体

类似的生成器

1,000

100

可用生成器

视频 使用最新的 AI 模型生成和扩展视频——文本生成视频和图像生成视频

AI 模型

质量

模式

宽高比

时长

起始图片

源视频

结束帧（可选）

参考输入 (可选)

生成音频

图像来源

图像模型

隐私模式

许可证

输出语言

思考中

免费 AI Video Generator - Text to Video

适合谁

核心功能

变体

类似的生成器

1,000

100

可用生成器

视频使用最新的 AI 模型生成和扩展视频——文本生成视频和图像生成视频