
Chinese artificial intelligence startup ShengShu Technology has secured 2 billion yuan (about $293 million) in fresh funding, marking one of the biggest AI financing rounds in China this year and highlighting how Chinese firms are moving beyond chatbot-style systems toward broader artificial intelligence models designed to understand real-world environments.
The funding round was led by Alibaba Cloud, with participation from both state-linked and private investors. The capital will support ShengShu’s long-term effort to develop what the company describes as a “general world model,” an AI framework aimed at processing video, sound, movement, and physical interaction more like human perception rather than focusing only on text generation.
This signals a wider shift inside China’s technology sector. While earlier AI investment focused heavily on language models and conversational systems, investors are now backing companies that can build systems capable of handling multiple forms of data at the same time. These models are increasingly viewed as necessary for robotics, industrial automation, autonomous systems, and advanced machine decision-making.
ShengShu was founded in 2023 by researcher Zhu Jun, who previously worked in advanced artificial intelligence development at Tsinghua University. The company gained early attention after launching Vidu, one of China’s first major text-to-video generation platforms, at a time when global competition in AI-generated video was rapidly intensifying.
Its Vidu platform later received multiple upgrades focused on better motion quality, stronger visual consistency, and improved scene generation. These upgrades helped ShengShu position itself in a competitive space where video-generation systems are becoming an important testing ground for multimodal AI performance.
The company has also moved into robotics-related artificial intelligence. In late 2025, ShengShu released an open-source multimodal model called Motus, designed to support robotic movement and physical interaction tasks. That product suggested the company is not limiting itself to media generation but is attempting to build AI systems that can eventually support machines operating in dynamic physical environments.
The new investment comes at a time when Chinese artificial intelligence firms are under pressure to prove technical depth rather than simply replicate existing chatbot products. Investors in China are increasingly prioritizing companies that offer original infrastructure or unique model capability.
Competition remains intense. Large Chinese technology groups including Alibaba, ByteDance, and Kuaishou are all investing heavily in video-generation and multimodal systems. At the same time, international players continue advancing similar technologies, making product quality and deployment speed equally important.
Industry analysts say the importance of ShengShu’s funding is less about the amount itself and more about what it reveals: capital is now moving toward startups that can connect large models with industrial use cases rather than purely consumer-facing AI tools.
If ShengShu succeeds in building a scalable world-model architecture, it could place the company in a stronger position within the next phase of artificial intelligence development, where understanding movement, physical space, and real-world interaction may become more valuable than text generation alone.