I've been using it as part of a complex DOS game decompilation project[0]. I'm working on refactoring the software rendering pipeline so that we can add GPU rendering. The hardest part so far has been converting the '90s-era polygon rendering from screen space back to world space.
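For context on what that conversion involves: a '90s software renderer typically keeps vertices after the perspective divide (screen x/y plus a depth term), so getting back to world or view space means inverting the projection. A rough sketch of the idea in Python, assuming a simple pinhole projection with focal length f and screen center (cx, cy); the actual game's transform (fixed-point math, aspect scaling, axis conventions) will differ:

    # Hypothetical unprojection sketch. Assumes the original renderer did
    #   sx = cx + f * x / z,  sy = cy - f * y / z
    # so it can be inverted when the per-vertex depth z is known. The real
    # game's projection will differ; this only shows the shape of the inverse.
    def unproject(sx: float, sy: float, z: float,
                  f: float = 256.0, cx: float = 160.0, cy: float = 100.0):
        x = (sx - cx) * z / f
        y = (cy - sy) * z / f
        return (x, y, z)

    # Example: a vertex drawn at screen (200, 80) with depth 512 in a 320x200 mode
    print(unproject(200.0, 80.0, 512.0))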
It spun its wheels a few times doing a large mostly mechanical change. After resetting and improving my prompts it was able to get through it. I'm using Matt Pocock's skills[1] for this work, which has been quite nice.
If you have actually used DeepSeek, you would notice that the cache-hit rate is extremely high, and the cache invalidation window is much longer than every other provider's. That suggests DeepSeek is simply much better at utilizing its infrastructure than other vendors.
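If you want to verify the hit rate yourself, DeepSeek's OpenAI-compatible API reports cache usage per response, so you can send requests that share a long, stable prefix and read the counters back. A minimal Python sketch; the prompt_cache_hit_tokens / prompt_cache_miss_tokens field names are how I recall the docs describing them, so check the current API reference:

    # Sketch: measure the prompt-cache hit rate by reusing a stable prefix.
    # Assumes DeepSeek's OpenAI-compatible endpoint and that the usage block
    # exposes prompt_cache_hit_tokens / prompt_cache_miss_tokens (verify the
    # field names against the current API docs).
    from openai import OpenAI

    client = OpenAI(base_url="https://api.deepseek.com", api_key="sk-...")

    stable_prefix = "You are a code-review assistant. " + ("Project context... " * 200)

    for question in ["Summarize module A.", "Summarize module B."]:
        resp = client.chat.completions.create(
            model="deepseek-chat",
            messages=[
                {"role": "system", "content": stable_prefix},  # identical every call -> cacheable
                {"role": "user", "content": question},         # only this part changes
            ],
        )
        usage = resp.usage
        hit = getattr(usage, "prompt_cache_hit_tokens", None)
        miss = getattr(usage, "prompt_cache_miss_tokens", None)
        print(f"cache hit tokens: {hit}, miss tokens: {miss}")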
I am also highly skeptical that the average user's input is worth more than the API cost of processing it. Do people really think DeepSeek researchers enjoy panning for gold in a river of boilerplate and half-baked code?
But, overall, current AI pricing is completely unsustainable across all AI companies, except via the exponential growth they are relying on. Dylan Patel did the most insightful analysis of this I've come across: https://youtu.be/mDG_Hx3BSUE?si=nyJu4adwYCH1igbJ
https://www.reuters.com/world/asia-pacific/deepseek-nears-45...
Sincerely: I see you, AI companies, harvesting our data and giving us discounted subscriptions so we don't realize we're paying you to take our own data!
Anthropic is learning that lesson now. Doesn't help that their CEO goes around antagonizing everyone by claiming jobs are over, and annoying Boris does like 500 podcasts per week repeating "coding is solved".
For anyone doing the "should I self-host on rented GPUs?" math: at this rate you'd need to push roughly 1B output tokens/day to break even against an 8xH100 fleet on Vast/Lambda (assuming 3-5k tokens/sec aggregate throughput). The vast majority of "I should run my own LLM" use cases don't come close to that volume.
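To make that math reproducible, here's a rough break-even sketch with the figures above plugged in as placeholders (I'm assuming roughly $2.25/GPU-hour for rented H100s and the discounted $0.435/M output rate; swap in your own numbers):

    # Rough self-hosting break-even sketch. Every constant is a placeholder:
    # adjust rental price, fleet size, API price, and throughput to your case.
    GPU_HOURLY_USD = 2.25        # assumed rented H100 price per GPU-hour
    NUM_GPUS = 8                 # one 8xH100 node
    API_PRICE_PER_M_OUT = 0.435  # assumed discounted output price, $ per 1M tokens
    THROUGHPUT_TOK_S = 4000      # assumed aggregate output tokens/sec for the node

    daily_fleet_cost = GPU_HOURLY_USD * NUM_GPUS * 24
    breakeven_tokens_per_day = daily_fleet_cost / (API_PRICE_PER_M_OUT / 1e6)
    ceiling_tokens_per_day = THROUGHPUT_TOK_S * 86_400

    print(f"fleet cost per day:      ${daily_fleet_cost:,.0f}")
    print(f"break-even tokens/day:   {breakeven_tokens_per_day:,.0f}")
    print(f"throughput ceiling/day:  {ceiling_tokens_per_day:,.0f}")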
Every API price drop kills another tranche of "self-host the open model" use cases. The implied bet: even if regular pricing ($1.74/M output) is also subsidized, exponential demand growth eventually makes the unit economics work. We'll see.
v4-pro (75% off): $0.003625 / $0.435 / $0.87
v4-pro (regular): $0.0145 / $1.74 / $3.48
v4-flash: $0.0028 / $0.14 / $0.28
that is damn cheap.
Nothing specific to DeepSeek.
I also struggle to find a provider that can credibly convince me I wouldn't be the product when using it. Have you found one?
It's the same reason I prefer VPNs that are based in countries other than my own.
This is true of Anthropic or OpenAI too - but for some reason I think the US government (or anyone else) will have a harder time getting my data from them than the CCP will from any Chinese company.
From this line of reasoning, my guess is that the huge discount is not so much intended to sell the data collection system as much as it is intended to sell the model. If you had to wring a geopolitical consequence from this, it would be that the US labs producing models would be impacted by a vastly less expensive competitor.
I'd like to point out that the Soviet RDS-3 was an air-dropped A-bomb.
I get that you mean 'in anger', but I don't feel that bad being a pedant against a propagandist statement that's also pedantically wrong.
Occasionally I go and try different agents with openrouter models, but nothing seems to really get close to the proprietary ones like claude-code.
By the way, the OpenRouter version is very slow for some reason. The DeepSeek platform is faster (and cheaper with the discount) if you don't mind handing your credit card number / email to this company.
Sidenote, I've been trying deepseek-v4-flash and I'm blown away. It's no Opus, but it's as cheap as tap water and punches far above its weight as a Flash model. I keep throwing tasks at it out of curiosity and it keeps solving them.