Models Overview

All models (image, video, and audio) are called through the unified POST /v1/tasks endpoint. Select the model with the model field and pass its parameters in input. Each model page lists the input fields it accepts.
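As a rough illustration of the request shape described above, the sketch below builds (but does not send) a POST /v1/tasks request using only the Python standard library. Only the endpoint path and the model and input fields come from this page. The base URL, the Bearer-token header, the build_task_request helper, and the prompt field inside input are assumptions made for the example; check the model page and your account settings for the real values.

```python
import json
import urllib.request

# Placeholder host: an assumption, not the real API base URL.
BASE_URL = "https://api.example.com"


def build_task_request(model: str, task_input: dict, api_key: str) -> urllib.request.Request:
    """Build a POST /v1/tasks request with the model ID and its input parameters."""
    body = json.dumps({"model": model, "input": task_input}).encode("utf-8")
    return urllib.request.Request(
        url=f"{BASE_URL}/v1/tasks",
        data=body,
        method="POST",
        headers={
            "Content-Type": "application/json",
            # Bearer authentication is assumed here; the page does not state the scheme.
            "Authorization": f"Bearer {api_key}",
        },
    )


# "prompt" is a hypothetical input field; each model page defines the real ones.
req = build_task_request(
    model="gpt-image-2",
    task_input={"prompt": "A lighthouse at dusk"},
    api_key="YOUR_API_KEY",
)

# Sending is left out on purpose; urllib.request.urlopen(req) would submit the task.
```

Switching to another model should only require changing the model string and the contents of input, since the endpoint stays the same for every model listed here.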

Current pricing is maintained on the main site: HiAPI Pricing.

Image Models

Text-to-image, image-to-image, reference editing, and photorealistic generation.

10 models

GPT Image 2

Online

Text-to-image with accurate text rendering and layout-heavy compositions.

gpt-image-2

GPT Image 2 Pro

Online

Higher-fidelity text-to-image for brand visuals, English prompts, and 2K output.

gpt-image-2-pro

GPT Image 2 Image-to-Image

Online

Image-to-image generation from one or more reference images.

gpt-image-2-image-to-image

GPT Image 2 Pro Image-to-Image

Online

Pro reference-image editing and redraws from 1-5 images, up to 2K output.

gpt-image-2-image-to-image-pro

GPT Image 2 Beta

Beta

Preview image generation model for early testing.

gpt-image-2-beta

Nano Banana

Online

Fast, prompt-tolerant text-to-image for quick iteration.

Nano-Banana

Nano Banana 2

Online

Balanced Nano Banana tier with optional references and 1K / 2K / 4K output.

Nano-Banana-2

Nano Banana Pro

Online

Premium brand visuals, reference-based editing, and high-resolution output.

Nano-Banana-Pro

FLUX 1.1 Pro

Online

Photorealistic image generation with optional reference-image guidance.

flux-1.1-pro

Qwen Image 2.0

Online

Low-cost image model that handles Chinese prompts and in-image text rendering well.

qwen-image-2.0

Video Models

Text-to-video, image-to-video, reference media, and short-form generation.

4 models

Seedance 2.0

Online

Video generation guided by reference images, video, and audio, with optional synchronized audio.

seedance-2-0

Wan 2.7 Text-to-Video

Online

Text-to-video with size, duration, prompt expansion, and shot controls.

wan2.7-t2v

Wan 2.7 Image-to-Video

Online

Animate a first frame, with optional last frame, audio, or clip inputs.

wan2.7-i2v

HappyHorse 1.0

Online

Short text-to-video clips, 3-15 seconds, at 720p or 1080p.

happyhorse-1-0

Audio

Audio generation models will be documented once available.

Audio models

Coming soon

No public audio model ID is available yet. The docs page will be updated after launch.