DeepSeek Discounts Access to New V4 AI Model, Cuts API Cache Pricing

DeepSeek has launched a preview of its V4 family of AI models and is offering developers a 75% discount on the Pro variant through May 5. At the same time, the company reduced input cache hit charges across its entire API suite to one-tenth of prior levels. V4 is available in a higher-performance Pro edition and a lighter Flash edition, and the company says the Pro model rivals other open-source models in knowledge benchmarks, trailing only Google’s closed-source Gemini-Pro-3.1.

Key Points

DeepSeek is offering a 75% discount on the DeepSeek-V4-Pro model for developers until May 5.
The company reduced input cache hit pricing across its entire DeepSeek API suite to one-tenth of the previous price, effective immediately.
V4 is available in two versions - Pro (higher performance, higher price) and Flash (lighter, lower cost) - and the Pro edition is said to outperform other open-source models on world-knowledge benchmarks, trailing only Google’s closed-source Gemini-Pro-3.1; these developments are particularly relevant to AI development, cloud compute usage, and firms building agent-style applications.

China-based AI developer DeepSeek announced a limited-time 75% price cut for developers subscribing to its newly revealed DeepSeek-V4-Pro model, with the offer running through May 5, the company said. In the same announcement, DeepSeek said it has lowered prices for input cache hits across its full DeepSeek API range to one-tenth of the previous rate.

The company released a preview of the V4 model on Friday, noting that the architecture has been adapted for compatibility with Huawei chip technology. DeepSeek positioned V4 as a two-tier product: a more capable - and higher-priced - Pro edition and a lighter, lower-cost Flash edition.

According to DeepSeek, tests place the Pro edition ahead of other open-source models on world-knowledge benchmarks, with the Pro model trailing only Google’s closed-source Gemini-Pro-3.1. The startup also highlighted that the V4 models are especially well suited for AI agent work - applications that can perform more complex, multi-step tasks than standard chatbots but which also demand greater computing resources.

The simultaneous pricing moves - a temporary steep discount on the Pro model and a broad reduction in input cache hit charges for the API lineup - indicate a notable short-term cost change for developers using DeepSeek’s services. The company did not provide additional details in the announcement about pricing beyond the discount period or long-term rate plans.

Context and implications

DeepSeek’s preview emphasizes performance comparisons and hardware adaptation without offering further product roadmaps or permanent pricing commitments. The V4 family’s dual-tier approach creates distinct choices for developers between higher-capacity and lower-cost options, while the cache hit price cut affects the billing of API-based usage across its product set.

Readers should note the company’s specific claims on benchmark placement and the stated suitability of V4 for agent-style AI work were presented by DeepSeek in its announcement.

Risks

The 75% discount is explicitly time-limited through May 5, creating uncertainty for developers about pricing after the offer ends - this affects budgeting for AI development and cloud expenses.
The Pro edition’s better performance is associated with higher computing demands, which may constrain adoption for teams or projects with limited processing capacity or cost sensitivity - this impacts cloud infrastructure and compute providers.
The reduction of input cache hit costs to one-tenth of prior levels changes the cost dynamics for API usage; without information on long-term pricing plans, developers face uncertainty in forecasting operating expenses tied to DeepSeek’s APIs.

Menu

Key Points

Risks

More from Stock Markets