WAN 2.7 - Video Generation, Editing, and Reference API
Generate AI videos with WAN 2.7, Alibaba’s latest video model. Four endpoint categories: text-to-video, image-to-video (with video continuation), reference-to-video for character-consistent generation, and video editing with style transfer. All categories support 720P and 1080P resolution, audio-guided generation, and automatic prompt expansion.- POST /v1/ai/text-to-video/wan-2-7: Generate video from a text prompt (2-15 seconds)
- POST /v1/ai/image-to-video/wan-2-7: Generate video from image, first+last frame, or extend a video (2-15 seconds)
- POST /v1/ai/reference-to-video/wan-2-7: Generate video featuring characters from reference images/videos (2-10 seconds)
Veed Fabric 1.0 and Veed Fabric 1.0 Fast - Lip Sync API
Generate realistic talking videos from a portrait image and audio file with Veed Fabric 1.0. Two variants available: standard for highest quality and Fast for reduced generation time. Output at 720p or 480p resolution in MP4 format.- POST /v1/ai/lip-sync/veed-fabric-1-0: Generate a lip-synced talking video
- POST /v1/ai/lip-sync/veed-fabric-1-0-fast: Generate a lip-synced talking video (faster processing)
- GET /v1/ai/lip-sync/veed-fabric-1-0: List all Veed Fabric 1.0 tasks
- GET /v1/ai/lip-sync/veed-fabric-1-0-fast: List all Veed Fabric 1.0 Fast tasks
Video Upscaler Precision - AI Video Upscaling API
AI diffusion-based precision video upscaling with faithful detail recovery. Supports output resolutions of 1K, 2K, and 4K with adjustable sharpening, smart grain, and upscaling strength controls. FPS boost available for smoother motion.- POST /v1/ai/video-upscaler-precision: Create a precision video upscaling task
- GET /v1/ai/video-upscaler-precision: List all precision upscaler tasks
- GET /v1/ai/video-upscaler-precision/: Get task status and results
Kling 3 Motion Control - Video Generation API
Transfer motion from reference videos to character images with Kling 3 Motion Control. Preserves character appearance while applying motion patterns from 3-30 second reference videos. Available in Pro and Standard tiers.- POST /v1/ai/video/kling-v3-motion-control-pro: Generate motion-controlled video with Kling 3 Pro
- POST /v1/ai/video/kling-v3-motion-control-std: Generate motion-controlled video with Kling 3 Standard
Sound Effects API
Search, filter, and download royalty-free sound effects from the Freepik catalog. Browse 42 categories including ambience, foley, and transitions with full-text search, category filtering, and flexible sorting.- GET /v1/sfx: Search and filter sound effects
- GET /v1/sfx/download: Download a sound effect audio file
Music API
Search, filter, and download royalty-free music from the Freepik Music catalog. Filter by genre, mood, and premium status with sorting by relevance, popularity, duration, or tempo.- GET /v1/music: Search and filter music
- GET /v1/music/download: Download a music audio file
Nano Banana Pro Flash - Text To Image API
Generate images from text with Nano Banana Pro Flash, powered by Google’s Gemini 3.1 Flash model. Faster generation with Google Search grounding for real-world accuracy, reference image support (up to 3 images), 10 aspect ratios, and resolutions up to 4K.- POST /v1/ai/text-to-image/nano-banana-pro-flash: Create a new image generation task
- GET /v1/ai/text-to-image/nano-banana-pro-flash: List all Nano Banana Pro Flash tasks
- GET /v1/ai/text-to-image/nano-banana-pro-flash/task-id: Get task status and results by ID
Video Upscaler Turbo Endpoint and Frame-Based Pricing
New dedicated Turbo endpoint for Video Upscaler with faster processing and premium quality automatically applied. Pricing model updated to frame-based billing that varies by output resolution. Theturbo and premium_quality parameters have been removed from the standard endpoint in favor of the separate Turbo path. Turbo tasks use the same list and get-task endpoints as standard tasks.- POST /v1/ai/video-upscaler/turbo: Create a turbo video upscaling task
Runway Gen 4.5 - Video Generation API
Generate high-quality AI videos from text prompts or images with Runway Gen 4.5. Supports both text-to-video and image-to-video workflows with async task processing, polling, and webhook notifications.- POST /v1/ai/text-to-video/runway-4-5: Generate video from a text prompt
- POST /v1/ai/image-to-video/runway-4-5: Generate video from an image
Change Camera - Image Perspective API
Transform the camera angle and perspective of any image with AI. Control horizontal rotation (0-360 degrees), vertical tilt (-30 to 90 degrees), and zoom level (0-10) to generate multi-angle views from a single photo.- POST /v1/ai/image-change-camera: Create a new camera angle transformation task
Seedream V4.5 – Image Expand API
Outpaint and expand images with Seedream V4.5 by setting per-edge pixel growth. Optional prompt guidance with async tasks, polling, and webhooks support.- POST /v1/ai/image-expand/seedream-v4-5: Expand an image beyond its boundaries using Seedream V4.5 outpainting
Ideogram Image Edit – Inpainting API
Edit images with Ideogram inpainting using masks and prompts. Choose TURBO/DEFAULT/QUALITY modes, MagicPrompt, and async tasks with webhooks and polling.- POST /v1/ai/ideogram-image-edit: Inpaint and edit an image using a mask plus a prompt with Ideogram Image Edit
Ideogram Image Expand API
Expand images beyond their original boundaries with AI-powered outpainting using the Ideogram model. Control expansion independently on each edge (left, right, top, bottom) up to 2048 pixels, with optional prompt guidance and auto-prompt generation.- POST /v1/ai/image-expand/ideogram: Create a new image expansion task
Kling 3 - Video Generation API
Generate AI videos with Kling 3, the latest video generation model from Kuaishou.- POST /v1/ai/video/kling-v3-pro: Generate video with Kling 3 Pro
- POST /v1/ai/video/kling-v3-std: Generate video with Kling 3 Standard
Google Veo 3.1 Reference-to-Video API
Generate videos with consistent characters and objects using reference images. Maintain visual identity across scenes for storytelling and multi-scene projects.- POST /v1/ai/reference-to-video/veo-3-1: Create video with reference images
- GET /v1/ai/reference-to-video/veo-3-1: List all reference-to-video tasks
- GET /v1/ai/reference-to-video/veo-3-1/task-id: Get task status and results
- Character/object consistency using 1-3 reference images
- Multi-resolution output: 720p, 1080p, or 4K
- Native audio generation with dialogue and sound effects
- Fixed 8-second duration at 24 FPS
- Aspect ratios: 16:9 (landscape) or 9:16 (portrait)
- Up to 20,000 character prompts
Nano Banana Pro (Text-to-Image)
Generate high-quality images with Google’s Nano Banana Pro (Gemini 3) model. Supports reference images for guided generation, multiple aspect ratios, and resolution options.- POST /v1/ai/text-to-image/nano-banana-pro: Create image from text with optional reference images
- Up to 3 reference images for guided generation
- Multiple aspect ratios (1:1, 16:9, 9:16, 4:3, 3:4, etc.)
- Resolution options: low, medium (default), high (4K)
- Webhook notifications