Business

Kling O1 Launches as the World's First Unified Multimodal Video Model

Kuaishou Technology ("Kuaishou" or the "Company"; HKD Counter Stock Code: 01024 / RMB Counter Stock Code: 81024), a leading content community and social platform, announced that on December 1, 2025, Kling AI officially unveiled Kling O1, positioned as the industry's first unified multimodal creation tool. Powered by next-generation video and imaging architectures, Kling O1 integrates text, video, image, and subject inputs, consolidating all generation and editing tasks into a single, all-encompa

articleKuaishou Technology Class BDecember 2, 20257/company/kuaishou-technology/news/kling-o1-launches-as-the-worlds-first-unified-multimodal-video-model

Kling O1 Launches as the World's First Unified Multimodal Video Model

About this update from Kuaishou Technology Class B

[{"type":"text","content":"HONG KONG, Dec. 2, 2025 /PRNewswire/ -- Kuaishou Technology ("Kuaishou" or the "Company"; HKD Counter Stock Code: 01024 / RMB Counter Stock Code: 81024), a leading content community and social platform, announced that on December 1, 2025, Kling AI officially unveiled Kling O1, positioned as the industry's first unified multimodal creation tool. Powered by next-generation video and imaging architectures, Kling O1 integrates text, video, image, and subject inputs, consolidating all generation and editing tasks into a single, all-encompassing engine. This launch definitively resolves the "consistency challenge" regarding characters and scenes in AI video generation, providing a deeply integrated, one-stop solution tailored for film, television, social media, advertising, and e-commerce.","length":833,"tagName":"p"},{"type":"text","content":"The Unified Model: A Paradigm Shift in Video Creation","length":53,"tagName":"p"},{"type":"text","content":"As the pioneer of unified multimodal video models, Kling O1 is engineered on a Multimodal Visual Language (MVL) framework. It transcends the boundaries of traditional single-task video generation models by fusing a comprehensive spectrum of capabilities – including reference-based video generation, text-to-video generation, start and end frame generation, video in-painting (content insertion and removal), video modification and transformation, style re-rendering, and shot extension – into one versatile engine. This eliminates the need for creators to toggle between disparate models and tools; the entire creative lifecycle, from inception to refinement, is now a seamless, single-stream workflow.","length":703,"tagName":"p"},{"type":"text","content":"Leveraging deep semantic reasoning, Kling O1 is able to interpret all user inputs – whether images, video clips, specific subjects, or text – as executable prompts. By removing modality constraints, Kling O1 achieves a holistic understanding of elements from multiple perspectives, generating output with pixel-perfect precision.","length":329,"tagName":"p"},{"type":"text","content":"With its user-friendly multimodal prompt input interface, Kling O1 transforms complex post-production editing into a simple, conversational experience. Bypassing the need for manual masking or keyframing, users can simply input p...

More updates from Kuaishou Technology Class B