Large Model Price Wars Drive Focus to Vertical Markets
Advertisements
The Chinese market for large-scale AI models is currently in a heated state as major players such as Alibaba Cloud, Baidu Intelligent Cloud, Tencent, and iFlytek have all decided to significantly reduce their pricingThis price cut frenzy comes after extensive interviews with industry experts, who suggest that the main driving force behind this trend is the fierce competition for market share among these companies.
One of the key concerns raised by analysts is that this price war further highlights the alarming degree of homogenization present among the large AI models currently availableMany industry professionals have pointed out that most of these models share similar technological architectures and algorithms, often built and trained on public datasetsThis rush to market has driven companies to resort to proven business models, ultimately resulting in an oversaturated market with little differentiation among products.
Amidst this backdrop, the competition for market dominance is a significant topic of discussion
Many experts believe that the models targeting specific industries, known as vertical models, are likely to demonstrate greater potential for growth compared to their general-purpose counterparts.
Interestingly, although Dunbao's model launched later than others in the market, it gained immediate traction by offering an enterprise-level pricing of only 0.0008 yuan per 1000 tokensThis aggressive pricing strategy prompted multiple firms to follow suit swiftly.
Within a short period, Alibaba Cloud announced a price reduction for its Tongyi Qianwen model series, announcing that the primary model’s API input price would drop by up to 97%, falling from 0.02 yuan to just 0.0005 yuan per 1000 tokensFollowing suit, Baidu Intelligent Cloud also declared its ERNIE models would be offered free of charge to enterprise users on the same day.
In the subsequent days, Tencent Cloud made headlines by unveiling a new big model upgrade plan
- Chinese Space Industry: Comprehensive Commercialization Opportunities
- The Fed Strikes Again! Buffett Sells Off
- Bitcoin vs. Stocks: A Divergent Investment Journey
- India GDP Up 5.4%, Record High — Eyes Japan in 2024?
- Fed to Keep Rates Steady in December?
One of its leading models, the Hunyuan-lite, would see an upgrade in input output limits from 4k to a remarkable 256k tokens with the price cut to freeFurthermore, numerous other models from Tencent also experienced price reductions ranging between 50% and 87.5%. Not to be outdone, iFlytek announced that their Xinghuo API offerings would also be operating at zero cost, while their premium service would see a reduction to as low as 0.21 yuan per 10,000 tokens.
This intriguing price reduction trend appears to be tied to the larger logic governing AI model developmentTechnological advancements continually refine these models, leading to lower inference and operational costs, which can contribute to price reductionsDraw parallels with international players such as OpenAI, which has followed a consistent downward price trajectory, with four major price cuts throughout 2023, including a notable 50% drop in price for their GPT-4o model.
According to Alibaba Cloud representatives, the current price cuts can be credited to the technological advantages presented by public cloud infrastructure, as well as the economies of scale achieved through digital innovation
They utilize proprietary technologies such as heterogeneous chip interconnects and high-performance storage systems to enhance AI computational efficiency, subsequently driving down modeling costs.
In recent statements, Baidu also highlighted that its algorithms have considerably improved in training efficiency compared to the previous year, claiming a 5.1 times increase in operational performance, with inference costs decreasing to just 1% of previous levels.
However, it's crucial to note that a reduction in inference costs does not equate to lower research and development (R&D) expenses, particularly when considering the performance dynamics of overseas chip technologiesThe R&D costs associated with large models still present considerable challenges for Chinese companies, as highlighted by Baidu's 2023 fiscal report indicating a 242 billion yuan spending on R&D, reflecting a 4% increase year-on-year, largely driven by expenses related to AI infrastructure.
Similarly, during its recent quarterly earnings report, iFlytek reported a revenue increase of 26.27% year-on-year; however, their profit margins took a hit, with net profit down by 2.42 billion yuan from the previous year, reflecting the heavy investments in developing general AI and large models
The report also indicated an additional investment of around 300 million yuan dedicated to core technology autonomy and expanding the CAD model domain.
As the industry grapples with the intricacies of these price cuts, some insiders have candidly remarked that the primary aim is to capture market shareGiven the aggressive pricing tactics deployed by certain firms, others feel compelled to respond similarlyThis leaves companies struggling to differentiate themselves in a market traditionally dominated by heavyweight players.
Amidst this competitive landscape, there remains a question regarding the market positioning of specialized vertical models amidst the general model price declinesExperts argue that while lower prices might attract users toward general models, they also create uncertainty for clients who may find themselves overwhelmed by a sea of indistinguishable options.
Currently, the consensus among these industry leaders is that the varied applications of vertical models, particularly those targeting niche sectors like finance, maintain a distinct value proposition that cannot be easily replicated by general models
For instance, banks seeking AI-assisted solutions may test both general and financial vertical models to ascertain their respective performance and efficacy, particularly in areas such as marketing or enhancing business operations.
To move forward, the imperative to transcend the current stage of homogenization is paramount for both general and specialized modelsExperts suggest that general models must focus on improving versatility and usability to accommodate a wide array of applications, continuously integrating new functionalities such as multi-modal support or cross-language processing to maintain user engagementIn contrast, vertical models should prioritize custom development tailored to specific industry demands, leveraging collaborative efforts with leading enterprises to curate solutions enriched by specialized knowledge and data resources.
Looking ahead, the competitive dynamics in the large-scale AI model arena are expected to intensify
Write a Comment