Skip to content
English

HappyHorse 1.1 T2V API

POST /v1/tasks

All models are called through the Unified Async API POST /v1/tasks endpoint; only the input fields differ (see input parameters below).

Model summary

Model name happyhorse-1.1/text-to-video
Type Video generation (text-to-video)
Endpoint POST /v1/tasks
Pricing See HiAPI Pricing

HappyHorse 1.1 text-to-video API by Alibaba. Generate short videos with native audio from a text prompt, with natural motion and strong prompt adherence, at 720p / 1080p and 3-15 seconds.

Production guidance

Production guidance
  • For production, pass callback.url at the top level of the request body so HiAPI can notify your service when the task reaches a terminal state.
  • GET /v1/tasks/:id is better for local debugging, low-volume jobs, or fallback reconciliation if a callback is missed.
  • Use callback.when=final. Both success and fail are terminal states, so your service should deduplicate by taskId.

Best suited for

Sound-on short videos

Generate visuals and native audio in one call, skipping post-production scoring. Good for social shorts and ad intros.

prompt
Text-to-clip creative drafts

Produce coherent clips from a text prompt alone, ideal for storyboard checks, batch test shots, and creative drafts.

promptduration
Landscape and portrait ratios

Nine aspect ratios let one prompt adapt to 16:9 feeds and 9:16 vertical shorts.

aspect_ratio
Pick resolution and length

720p / 1080p tiers — choose resolution and duration to match budget and placement (billed per second).

resolutionduration

Request parameters

model string required

Fixed value happyhorse-1.1/text-to-video.

example happyhorse-1.1/text-to-video
input object required

Business parameters. Put HappyHorse 1.1 T2V-specific configuration here.

prompt string required

Text prompt describing the video to generate.

resolution enum optional

Output video resolution. Higher resolution costs more.

default 1080p enum: 720p1080p
aspect_ratio enum optional

Output video aspect ratio.

default 16:9 enum: 16:99:163:44:34:55:41:19:2121:9
duration integer optional

Clip length in seconds (3-15). Cost scales with duration.

default 5
callback object optional

Optional callback configuration. When set, HiAPI notifies your service when the task reaches a terminal state.

url string required

Required when callback is set; HTTPS URL that receives terminal task notifications.

example https://your-domain.com/hiapi/callback
when enum optional

Callback trigger timing. Use final.

default final enum: final

Example requests

Cinematic landscape clip

1080p / 16:9 / 5s with native audio, for narrative shots.

Request body
{
  "model": "happyhorse-1.1/text-to-video",
  "input": {
    "prompt": "A snow mountain lake in morning mist, slow camera push-in, calm and vast, warm cinematic light",
    "aspect_ratio": "16:9",
    "resolution": "1080p",
    "duration": 5
  }
}
Vertical cyberpunk

720p / 9:16 / 5s vertical short — lower cost.

Request body
{
  "model": "happyhorse-1.1/text-to-video",
  "input": {
    "prompt": "A neon street at night after rain, reflections on the ground, cyberpunk mood, slow pan",
    "aspect_ratio": "9:16",
    "resolution": "720p",
    "duration": 5
  }
}
Warm-toned slow motion

1080p / 16:9 / 8s, longer duration for slow-motion narrative.

Request body
{
  "model": "happyhorse-1.1/text-to-video",
  "input": {
    "prompt": "A golden retriever running on a beach, slow motion, gentle waves, warm sunset tones",
    "aspect_ratio": "16:9",
    "resolution": "1080p",
    "duration": 8
  }
}

Getting the result

  1. The response returns a taskId immediately without waiting for generation to finish.
  2. In production, prefer waiting for callback.url to receive the terminal notification. For local debugging, poll GET /v1/tasks/:id.
  3. When status=success, download the generated video from output[].url.
  4. When status=fail, fix the request based on the returned error instead of retrying the same invalid payload.

FAQ

Which resolutions and durations does HappyHorse 1.1 text-to-video support?

Resolutions 720p / 1080p and durations 3-15 seconds (default 5). Pricing is billed by resolution and duration (per second); 1080p costs more than 720p. See the live pricing page for current rates. View pricing

Does the generated video include audio?

HappyHorse 1.1 produces native audio with no extra configuration, generating synchronized sound alongside the visuals.

How do I get the generated video?

The response returns a taskId immediately. When the task reaches a terminal state, download the video from output[].url. In production, pass callback.url at the top level to receive terminal notifications and avoid polling.

Does it support image-to-video or reference images?

This model is text-to-video only. Use happyhorse-1.1/image-to-video to drive from a first-frame image, or happyhorse-1.1/reference-to-video for multiple reference images.

Next steps