Skip to main content

Kling O1 integration

Generate smooth video sequences using first frame and last frame interpolation, with optional reference images for character and style consistency.
Kling O1 is an image-to-video API that creates cinematic video sequences by interpolating between a first frame and last frame. It delivers smooth transitions with controlled start and end points, suitable for creative professionals and automated workflows. Choose between Pro quality for premium results or Standard for faster, cost-effective generation.

Key capabilities

  • Frame interpolation: Generate smooth transitions between first and last frame images
  • Reference image support: Maintain character and style consistency with up to 7 reference images
  • Dual quality modes: Pro for professional-grade output, Standard for efficient generation
  • Flexible durations: 5-second or 10-second video outputs
  • Multiple aspect ratios: 16:9 (widescreen), 9:16 (vertical), 1:1 (square)
  • Prompt guidance: Text descriptions up to 2500 characters to guide motion and style
  • Async processing: Webhook notifications or polling for task completion

Use cases

  • Film and animation: Create smooth scene transitions and animated sequences
  • Marketing content: Generate cinematic product reveals and brand videos
  • Social media: Produce vertical and square videos optimized for Instagram, TikTok, and YouTube Shorts
  • E-commerce: Animate static product images with controlled motion
  • Storytelling: Build narrative sequences with consistent character appearance
  • Motion design: Create abstract transitions and visual effects

API endpoints

Generate videos by submitting a request to one of the Kling O1 endpoints. The service returns a task ID for async polling or webhook notification.

Parameters

Image-to-Video (first/last frame)

ParameterTypeRequiredDefaultDescription
first_framestringNo*-Starting image (URL or Base64). Min 300x300px, max 10MB
last_framestringNo*-Ending image (URL or Base64). Same requirements as first_frame
promptstringNo-Motion description up to 2500 characters
aspect_ratiostringNo16:9Video ratio: 16:9, 9:16, or 1:1
durationintegerNo5Video length: 5 or 10 seconds
webhook_urlstringNo-URL for completion notification
*At least one frame (first_frame or last_frame) is required.

Video Reference

ParameterTypeRequiredDefaultDescription
promptstringYes-Video description up to 2500 characters
reference_imagesarrayNo-Up to 7 reference images for consistency
aspect_ratiostringNo16:9Video ratio: 16:9, 9:16, or 1:1
durationintegerNo5Video length: 5 or 10 seconds
webhook_urlstringNo-URL for completion notification

Frequently Asked Questions

Kling O1 is an image-to-video model that generates smooth video sequences through frame interpolation. You provide a first frame, last frame, or both, and the model creates a cinematic transition between them. Optionally, add a text prompt to guide the motion style. Processing is asynchronous: submit a request, receive a task ID, then poll for results or receive a webhook notification.
Pro mode generates higher-fidelity videos with better motion quality, detail retention, and temporal consistency. Standard mode offers faster processing at lower cost, suitable for prototyping or high-volume use cases. Both support the same parameters and output formats.
Kling O1 accepts JPEG and PNG images via URL (publicly accessible) or Base64 encoding. Requirements: minimum resolution 300x300px, maximum file size 10MB, aspect ratio between 1:2.5 and 2.5:1.
The video reference endpoints accept up to 7 reference images that help maintain character and style consistency. Use these when you need the same subject to appear consistently across multiple video generations or shots.
Processing time varies based on duration and mode. Typical generation: 30-120 seconds for 5-second videos, longer for 10-second outputs. Pro mode takes longer than Standard. Use webhooks for production workflows instead of polling.
Rate limits depend on your subscription tier. See the Rate Limits page for current limits by plan.
Pricing varies by mode (Pro vs Standard) and duration. See the Pricing page for current rates and subscription options.

Best practices

  • Image quality: Use high-resolution images (1080p or higher) with clear subjects and balanced lighting for best results
  • Frame similarity: When using both first and last frames, ensure similar compositions for smoother interpolation
  • Prompt specificity: Describe the desired motion clearly (camera movement, subject action, mood) rather than just the scene
  • Reference consistency: For video reference mode, use images with similar lighting and style for better character consistency
  • Production integration: Implement webhooks instead of polling for scalable applications
  • Error handling: Implement retry logic with exponential backoff for 503 (service unavailable) responses
  • Kling 2.6: Latest Kling model with motion control capabilities
  • Kling 2.5 Turbo Pro: Fast cinematic generation with 5-10 second outputs
  • Kling 2.1 Pro: High-fidelity motion with strong temporal consistency
  • Kling Elements: Element-based video generation for specific use cases