Stock Markets April 27, 2026 01:39 AM

DeepSeek Discounts Access to New V4 AI Model, Cuts API Cache Pricing

Chinese startup offers steep short-term discounts as it rolls out V4, tuned for Huawei chips and aimed at agent-style AI workloads

By Marcus Reed
DeepSeek Discounts Access to New V4 AI Model, Cuts API Cache Pricing

DeepSeek has launched a preview of its V4 family of AI models and is offering developers a 75% discount on the Pro variant through May 5. At the same time, the company reduced input cache hit charges across its entire API suite to one-tenth of prior levels. V4 is available in a higher-performance Pro edition and a lighter Flash edition, and the company says the Pro model rivals other open-source models in knowledge benchmarks, trailing only Google’s closed-source Gemini-Pro-3.1.

Key Points

  • DeepSeek is offering a 75% discount on the DeepSeek-V4-Pro model for developers until May 5.
  • The company reduced input cache hit pricing across its entire DeepSeek API suite to one-tenth of the previous price, effective immediately.
  • V4 is available in two versions - Pro (higher performance, higher price) and Flash (lighter, lower cost) - and the Pro edition is said to outperform other open-source models on world-knowledge benchmarks, trailing only Google’s closed-source Gemini-Pro-3.1; these developments are particularly relevant to AI development, cloud compute usage, and firms building agent-style applications.

China-based AI developer DeepSeek announced a limited-time 75% price cut for developers subscribing to its newly revealed DeepSeek-V4-Pro model, with the offer running through May 5, the company said. In the same announcement, DeepSeek said it has lowered prices for input cache hits across its full DeepSeek API range to one-tenth of the previous rate.

The company released a preview of the V4 model on Friday, noting that the architecture has been adapted for compatibility with Huawei chip technology. DeepSeek positioned V4 as a two-tier product: a more capable - and higher-priced - Pro edition and a lighter, lower-cost Flash edition.

According to DeepSeek, tests place the Pro edition ahead of other open-source models on world-knowledge benchmarks, with the Pro model trailing only Google’s closed-source Gemini-Pro-3.1. The startup also highlighted that the V4 models are especially well suited for AI agent work - applications that can perform more complex, multi-step tasks than standard chatbots but which also demand greater computing resources.

The simultaneous pricing moves - a temporary steep discount on the Pro model and a broad reduction in input cache hit charges for the API lineup - indicate a notable short-term cost change for developers using DeepSeek’s services. The company did not provide additional details in the announcement about pricing beyond the discount period or long-term rate plans.


Context and implications

DeepSeek’s preview emphasizes performance comparisons and hardware adaptation without offering further product roadmaps or permanent pricing commitments. The V4 family’s dual-tier approach creates distinct choices for developers between higher-capacity and lower-cost options, while the cache hit price cut affects the billing of API-based usage across its product set.

Readers should note the company’s specific claims on benchmark placement and the stated suitability of V4 for agent-style AI work were presented by DeepSeek in its announcement.

Risks

  • The 75% discount is explicitly time-limited through May 5, creating uncertainty for developers about pricing after the offer ends - this affects budgeting for AI development and cloud expenses.
  • The Pro edition’s better performance is associated with higher computing demands, which may constrain adoption for teams or projects with limited processing capacity or cost sensitivity - this impacts cloud infrastructure and compute providers.
  • The reduction of input cache hit costs to one-tenth of prior levels changes the cost dynamics for API usage; without information on long-term pricing plans, developers face uncertainty in forecasting operating expenses tied to DeepSeek’s APIs.

More from Stock Markets

BofA: A Bigger Cash-Return Program Could Reprice Nvidia Apr 27, 2026 Orders for LNG Carriers Climb as Fuel Efficiency and New Output Support Demand Apr 27, 2026 Moody’s Raises Leonardo’s Rating to Baa2, Shares Edge Higher Apr 27, 2026 UK Stocks Open Mixed as Iran Submits Proposal to Reopen Strait of Hormuz Apr 27, 2026 European Shares Tepid as U.S.-Iran Talks Stall, Shipping Disruption Persists Apr 27, 2026