ShengShu Technology: Multimodal Video Model Developer Raises RMB 600M+ Series A+

By Amit Chowdhry • Yesterday at 7:00 AM

ShengShu Technology said it has completed a Series A+ funding round of more than RMB 600 million as it scales development and commercialization of its multimodal foundation models, including its Vidu video generation platform. The round was co-led by Zhongguancun Science City and LINK-X CAPITAL, and included strategic investments from Wondershare, Visual China Group Co., Ltd., and TRS. ShengShu added that existing backers, including Qiming Venture Partners, the Beijing Artificial Intelligence Industry Investment Fund, G&O, C&D Emerging Industry Equity Investment, and Guowen Hechuang, also increased their investments.

ShengShu positioned the financing as support for continued model innovation and broader product adoption across creative and enterprise use cases. The company highlighted its prior research on multimodal generative algorithms, including the U-ViT architecture introduced in September 2022, and said it launched Vidu globally in July 2024 with a “Reference-to-Video” capability to improve multi-entity consistency in commercial video generation.

The company said it has released multiple iterations of Vidu, including Vidu Q1, Q2, and Q3, with improvements across consistency, semantic understanding, motion dynamics, stability, and inference speed. It described Vidu Q3 as designed for storytelling, supporting 16-second synchronized audio-video generation, native 1080p output, cinematic language and shot transitions, multilingual text rendering, and multi-language output.

ShengShu cited third-party benchmarks, saying Artificial Analysis rankings placed Vidu Q3 No. 1 in China and No. 2 globally, and that Vidu Q2 maintained the fastest generation speed globally among commercial-grade content generation models. The company also said it open-sourced its TurboDiffusion framework in December 2025, claiming a five-second video can be generated in 1.9 seconds on a single RTX 5090 GPU, which it characterized as a major efficiency improvement.

On commercialization, ShengShu said it has built a product ecosystem around Vidu, including Vidu MaaS, Vidu SaaS, Vidu App, and Vidu Agent, serving creators and enterprise clients globally. The company said it achieved more than 10× growth in both users and revenue in 2025, and that Vidu is used in more than 200 countries and regions for content production.

ShengShu also detailed adoption across film and entertainment, internet and smart hardware, advertising, and gaming, as well as among international creators and enterprise clients. It listed a range of partners and customers that it said use Vidu for content production, marketing asset creation, and interactive experiences. Looking forward, the company framed multimodal video models as an emerging production paradigm, with the potential to extend beyond digital content workflows into deeper integration with the physical world.

KEY QUOTES

“ShengShu Technology began with a strong foundation in algorithm research and continues to push the boundaries of core model innovation. Among leading international multimodal foundation models, Vidu has established clear differentiation and strong competitive advantages. From research breakthroughs to large-scale commercialization, we believe multimodal foundation models will become a next-generation production paradigm and a transformative force in productivity, reshaping global content workflows and industry structures. ShengShu will remain technology-driven and value-oriented, advancing product and commercial strategies to unlock the full potential of multimodal AI for the global content ecosystem,”

Yihang Luo, CEO, ShengShu Technology

“The ceiling for multimodal video models is exceptionally high. Beyond powering digital content creation and interaction, they have the potential to evolve into true world models that understand the underlying structures of reality and support end-to-end machine decision-making. As AI continues to mature in the digital world, ShengShu aims to push its boundaries further, expanding from digital deployment toward deeper integration with the physical world,”

Jun Zhu, Founder and Chief Scientist, ShengShu Technology