Ovi

ovi-text-to-video

post

A unified paradigm for audio-video generation

Header parameters
AuthorizationstringRequiredExample: sk-yOm1Vc3raDjQluBttyAwSdbVxtWFEAPCbtAZ2uF3yST1xNCY
Body
modelstringRequired

model is the model name

Example: ovi-text-to-video
promptstringRequired

prompt is the prompt.

Example: A cute baby sea otter.
resolutionstring · enumOptional

Resolution of the generated video in W:H format. One of (512x992, 992x512, 960x512, 512x960, 720x720, or 448x1120). Default value: "992x512"

Possible enum values: 512x992, 992x512, 960x512, 512x960, 720x720, 448x1120, 1120x448

Default: 992x512Example: 992x512Possible values:
negative_promptstringOptional

Negative prompt for video generation. Default value: "jitter, bad hands, blur, distortion"

Default: jitter, bad hands, blur, distortionExample: jitter, bad hands, blur, distortion
num_inference_stepsintegerOptional

The number of inference steps. Default value: 30

Default: 30Example: 30
audio_negative_promptstringOptional

Negative prompt for audio generation. Default value: "robotic, muffled, echo, distorted"

Default: robotic, muffled, echo, distortedExample: robotic, muffled, echo, distorted
seedintegerOptional

Random seed for reproducibility. If None, a random seed is chosen.

output_formatstring · enumOptional

output_format indicate the output format, default value is url.

Default: urlExample: urlPossible values:
Responses
200Success
application/json
post
/v1/videos/generations
200Success

Last updated