Skip to content
English

Models Overview

All models — image, video, and audio — are called through the unified POST /v1/tasks endpoint, with the model selected via the model field and parameters in input. See each model page for its input fields.

Current pricing is maintained on the main site: HiAPI Pricing.

Image Models

Text-to-image, image-to-image, reference editing, and photorealistic generation.

10 models

GPT Image 2

Online

Text-to-image with accurate text rendering and layout-heavy compositions.

gpt-image-2

GPT Image 2 Pro

Online

Higher-fidelity text-to-image for brand visuals, English prompts, and 2K output.

gpt-image-2-pro

GPT Image 2 Image-to-Image

Online

Image-to-image generation from one or more reference images.

gpt-image-2-image-to-image

GPT Image 2 Pro Image-to-Image

Online

Pro reference-image editing and redraws from 1-5 images, up to 2K output.

gpt-image-2-image-to-image-pro

GPT Image 2 Beta

Beta

Preview image generation model for early testing.

gpt-image-2-beta

Nano Banana

Online

Fast, prompt-tolerant text-to-image for quick iteration.

Nano-Banana

Nano Banana 2

Online

Balanced Nano Banana tier with optional references and 1K / 2K / 4K output.

Nano-Banana-2

Nano Banana Pro

Online

Premium brand visuals, reference-based editing, and high-resolution output.

Nano-Banana-Pro

FLUX 1.1 Pro

Online

Photorealistic image generation with optional reference-image guidance.

flux-1.1-pro

Qwen Image 2.0

Online

Low-cost image model with strong Chinese prompt and text rendering behavior.

qwen-image-2.0

Video Models

Text-to-video, image-to-video, reference media, and short-form generation.

4 models

Seedance 2.0

Online

Reference images, video, and audio, with optional synchronized audio.

seedance-2-0

Wan 2.7 Text-to-Video

Online

Text-to-video with size, duration, prompt expansion, and shot controls.

wan2.7-t2v

Wan 2.7 Image-to-Video

Online

Animate a first frame, with optional last frame, audio, or clip inputs.

wan2.7-i2v

HappyHorse 1.0

Online

Short text-to-video clips, 3-15 seconds, at 720p or 1080p.

happyhorse-1-0

Audio

Audio generation models will be documented once available.

Coming soon

Audio models

Coming soon

No public audio model ID is available yet. The docs page will be updated after launch.

No model ID yet