Monday, 24 February 2025
25.5 C
Singapore
28.3 C
Thailand
19.8 C
Indonesia
25.6 C
Philippines

The founder says Chinese AI can thrive with bigger models and more data

Stepfun's founder champions scaling laws and multimodality in AI development, predicting a trillion-parameter model revolution in China's AI industry.

If you follow the latest developments in artificial intelligence, youโ€™ll find that bigger models and more data are the keys to success. Jiang Daxin, the founder of Stepfun, a Shanghai-based AI start-up, believes in the power of scaling laws in large language model (LLM) development. Despite challenges like lower investment and a lack of advanced chips in China, Jiang remains optimistic.

Jiang, who used to work at Microsoft, shared his thoughts at the World Artificial Intelligence Conference (WAIC) in Shanghai. He predicts that LLMs will eventually reach hundreds of trillions of parameters, greatly enhancing their capabilities.

The promise of scaling laws

Scaling laws are all about the relationship between an AI modelโ€™s performance and its number of parameters. Generally, larger models perform better, especially with more data and excellent computational resources, although the improvements can slow down after a certain point. Big tech companies invest heavily in advanced technology, particularly Nvidiaโ€™s H100 chips, to maximize performance.

Jiang highlighted this trend in his talk. โ€œThe advancements in OpenAIโ€™s GPT series, which powers ChatGPT, and the massive investments in supercomputing centers by companies like Amazon, Microsoft, and Meta show that scaling laws work,โ€ he said on Saturday. However, he cautioned that the availability of data, skilled personnel, and concerns about return on investment could affect the pace of these advancements.

Since OpenAI launched ChatGPT in late 2022, Chinese tech giants and start-ups have been eager to develop their LLMs. China has over 200 AI models, including Alibabaโ€™s Tongyi Qianwen and Baiduโ€™s Ernie. Alibaba owns the South China Morning Post, which reported this news. Yet, many Chinese AI firms struggle to match the spending power of their US counterparts and focus instead on revenue-generating applications.

Stepfunโ€™s innovative models

Founded in April 2023, Stepfun has been dedicated to developing fundamental models. At WAIC, the company launched Step-2, a trillion-parameter LLM, along with the Step-1.5V multimodal model and the Step-1X image generation model.

Jiang also emphasized the importance of multimodality in creating a comprehensive AI. Multimodal models can process visual and other data types to develop internal representations of the external world. He explained that Stepfun aims to combine generative and comprehension abilities in a single model.

Stepfun also offers consumer-facing products, such as Yuewen, a ChatGPT-like personal assistant, and Maopaoya, an AI companion that can take on various character personalities.

The future of AI investment

โ€œLast year, global AI investments reached US$22.4 billion, with 70 to 80 percent going to companies developing large models,โ€ said Alex Zhou Zhifeng, managing partner at Qiming Venture Partners, at another WAIC side event. Qiming was an early investor in Stepfun.

Zhou noted that more investments in AI applications are expected soon, partly due to decreasing token costs. In AI, a token is a basic data unit processed by algorithms.

Peng Wensheng, an economist at China International Capital, added that Chinaโ€™s AI model market is projected to reach about 5.2 trillion yuan (US$715.1 billion) by 2030. The size of the size of the industrial AI market is expected to be around 9.4 trillion yuan.

This optimistic outlook suggests a bright future for AI development in China, driven by the potential of scaling laws and innovative models like those from Stepfun.

Hot this week

OpenAI moves to loosen ChatGPT restrictions

OpenAI updates ChatGPTโ€™s policies to promote intellectual freedom, allowing for more perspectives on controversial topics while maintaining neutrality.

ASUS launches ZenScreen Duo OLED MQ149CD, a portable monitor with dual OLED displays

ASUS unveils the ZenScreen Duo OLED MQ149CD, a portable dual-screen monitor with OLED technology, delivering stunning visuals and flexible work setups.

Internal chats expose Metaโ€™s approach to AI training data

Court filings reveal Meta staff debated using copyrighted materials for AI training, discussing legal risks and alternative data sources like Libgen.

SBF supports Budget 2025’s focus on long-term growth and cost relief

SBF welcomes Budget 2025โ€™s focus on business transformation, tax relief, and workforce support, reinforcing Singaporeโ€™s long-term economic strategy.

Duolingoโ€™s Cybertruck stunt โ€˜killsโ€™ mascot Duo, and users canโ€™t get enough

Duolingoโ€™s marketing stunt claims its mascot, Duo the Owl, was hit by a Cybertruckโ€”boosting app engagement and sparking a viral campaign.

Did xAI mislead the public about Grok 3โ€™s benchmarks?

xAI is under scrutiny for allegedly misleading AI benchmark results, with OpenAI employees questioning its claims about Grok 3โ€™s performance.

BT and Equinix expand partnership to enhance global interconnectivity

BT and Equinix expand their partnership to boost interconnectivity for multinational businesses, deploying BTโ€™s Global Fabric NaaS in 40+ Equinix data centres worldwide.

LG unveils new SKS branding for luxury kitchen suite at KBIS 2025

LG rebrands Signature Kitchen Suite to SKS at KBIS 2025, introducing new luxury appliances like a free-zone induction range and an advanced island system.

LG unveils advanced laundry solutions at KBIS 2025

LG unveils its latest heat pump washer and dryer lineup at KBIS 2025, featuring AI-driven efficiency, ventless design, and smart connectivity.

Related Articles