Skip to main content
MindStudio
Pricing
Blog About
My Workspace
Video Generation Model

HappyHorse 1.0

Alibaba's unified video generation model supporting text-to-video, image-to-video, reference-to-video with up to 9 reference images, and natural-language video editing with realistic dynamic rendering.

Publisher Alibaba
Type Video
Context Window 2,000 tokens
Price $0.14-$0.24/second
Provider Alibaba Cloud
Source ImageReference ImagesVideo Editing

HappyHorse 1.0

**HappyHorse 1.0** is Alibaba Cloud's video generation model family, available through the DashScope API. It unifies four powerful video creation workflows into a single system: text-to-video, image-to-video, reference-to-video, and natural-language video editing — all featuring highly realistic dynamic rendering and fluid, detail-rich output. ### Generation Modes - **Text to Video**: Generates high-quality videos directly from text prompts, accurately comprehending text semantics to produce results that are fluid, natural, and rich in detail. - **Image to Video**: Animates a provided first-frame image, guided by a text prompt, while accurately interpreting both image and text semantics. - **Reference to Video**: Supports up to **9 reference images** of subjects, objects, and scenes with enhanced stability in subject and scene referencing, precisely preserving creative intent. Reference each image in the prompt (e.g., "Image 1", "Image 2") to direct how subjects appear in the final video. - **Video Edit**: Performs local or global edits to existing videos using natural-language instructions and up to **5 reference images**, precisely preserving the original motion dynamics while transforming visual elements. ### Output & Performance - Generates video at **720p or 1080p** resolution - Supports multiple aspect ratios, including 16:9 and 9:16 - Per-second pricing makes costs predictable across all modes HappyHorse 1.0 is well suited for content creators, marketers, and developers who need a single flexible model for generating new videos from scratch, animating still images, maintaining consistent characters across shots with reference images, or making precise edits to existing footage.

Ready to build with HappyHorse 1.0?

Get Started Free

Parameters & options

Mode Select
Default: text-to-video
Text to VideoImage to VideoReference to VideoVideo Edit
First Frame Image Image URL

Image used as the first frame of the generated video.

Source Video Video URL

The video to edit. Describe your edits in the prompt using natural language instructions.

Reference Images Image URL Array

Provide up to 9 reference images of subjects, objects, or scenes. Reference them as "Image 1", "Image 2", etc. in the prompt.

Reference Images Image URL Array

Optionally provide up to 5 reference images to use when editing elements of the video.

Resolution Toggle Group
Default: 720P
Aspect Ratio Toggle Group
Default: 16:9
Duration Select
Default: 5
5s

Start building with HappyHorse 1.0

No API keys required. Create AI-powered workflows with HappyHorse 1.0 in minutes — free.