DeepClaude – Claude Code agent loop with DeepSeek V4 Pro (opens in new tab)

(github.com)

669 pointsalattaran7d ago279 comments

279 comments

    #!/bin/sh
    export ANTHROPIC_BASE_URL=https://api.deepseek.com/anthropic
    export ANTHROPIC_AUTH_TOKEN=sk-secret
    export ANTHROPIC_MODEL=deepseek-v4-flash
    export CLAUDE_CODE_DISABLE_NONESSENTIAL_TRAFFIC=1
    exec claude $@

aaurelions6d ago

It seems like any project that makes fun of Claude is bound to reach the top spot on Hacker News. Even if it’s just a project consisting of four lines of code.

2 more replies

nadermx6d ago

The AI wars have begun

vitaflo7d ago

I'm not exactly sure what the point of this is. Deepseek already has instructions to use its API with many CLI's including Claude Code directly:

https://api-docs.deepseek.com/quick_start/agent_integrations...

2ndorderthought7d ago

There probably isn't a point. Someone didn't understand something, didn't research it, so they 1 shotted their first thought and sent it to the front page of HN and all of their socials. It's the future bruh

ttoinou6d ago

I thought the tool format wasnt exactly the same ? So plugging any IA into claude code requires a conversion of format

ricardobeat6d ago

Many of them expose “anthropic-compatible” APIs for this very purpose.

justech6d ago

If you're looking for Claude Code alternatives, I would first suggest looking into pi.dev or opencode for your harness. And then for models, you can choose from OpenCode Go (IMO most cost effect at this moment), OpenRouter, or direct from DeepSeek. Better if you go the Kimi route IMO and just buy a subscription from kimi.com

wolttam6d ago

I’m going to throw my harness in the ring: https://codeberg.org/mlow/lmcli

Aeroi6d ago

agreed. OpenCode is a strong base, and with a couple modifications it can become a very effective harness. my sideproject mouse.dev I’ve been combining parts from OpenCode, Claude Code, and Hermes to build a cloud agent architecture that works well from mobile.

CharlesW6d ago

> OpenCode is a strong base, and with a couple modifications it can become a very effective harness.

I personally didn't find it to be competitve with Claude Code as a harness. Can I ask how you modified it to perform better?

Aeroi6d ago

I haven’t run formal evals but i improved the experience for my own needs and it feels noticeably better with these modifications.

-Claude-style subagents -an MCP layer for higher-level tools -Cursor-style control plane modes like Ask, Plan, Debug, and Build.

The MCP layer lets the harness use things like GitHub file/code read, PR creation, web search/fetch, structured user questions, plan-mode switching, user skills, and subagents.

So the improvement is mostly from better ui/ux orchestration and tool access. There's some things from hermes that are interesting as well.

Most of my focus has been on applying this stack to sandboxed cloud agents so you can properly code and work from mobile devices.

I can't definitively say that the stack is better or worse than Claude code, more just tuned for my use case I guess.

aaurelions6d ago

Another very cost-effective option is Ollama Cloud. In a month of use, I only hit the 5-hour limit once, when I ran 8 agents simultaneously for 2 hours.

postatic6d ago

definitely worth it - have both ollama cloud, opencode and hermes running to test them all out, working great so far.

bakugo6d ago

> I would first suggest looking into pi.dev

Looked into this one. Thought it was suspicious that it only had 7 open issues on github. Turns out they have a bot that auto-closes every single issue just because.

I honestly have no words.

rsanek6d ago

>DeepSeek V4 Pro scores 96.4% on LiveCodeBench and costs $0.87/M output tokens

This is a heavily subsidized price and will only last until the end of the month: "The deepseek-v4-pro model is currently offered at a 75% discount, extended until 2026/05/31 15:59 UTC." [0]

The "supported backends" table is also deceiving -- while OpenRouter's server's may be in the US, the only way to get the $0.44/$0.87 pricing is to pass through to the DeepSeek API, which of course is China-based. [1]

I do think the model is quite good, I myself use it through Ollama Cloud for simple tasks. But I think some folks have bought in a little too much to the marketing hype around it.

[0] https://api-docs.deepseek.com/quick_start/pricing [1] https://openrouter.ai/deepseek/deepseek-v4-pro/providers

1 more reply

syntex6d ago

Not sure you can replace Claude with DeepSeek V4 that easily and have same results.

From what I see while building my own agentic system in Elixir, the problem is in training for your specific harness/contracts. Claude/GPT-style models seem to be trained around very specific contracts used by the harness like tool call formats, planning structure, patching, reading files, recovering from errors, and knowing when to stop.

In practice, you either need a very strong general model that can infer and follow those contracts (expensive), or a weaker model that has been fine-tuned / trained specifically on your own agent contracts. Otherwise, the whole thing becomes flaky very quickly. And I suspect with Deepseek V4 you may get last options.

6 more replies

isege6d ago

> Claude Code is the best autonomous coding agent.

If you look at the terminal-bench@2.0 leaderboard, you'll quickly see it's actually one of the weakest agentic harnesses. Anthropic's own models score lower with Claude Code than with virtually any other harness.

So it's quite the opposite. Claude Code is arguably the worst harness to run models with.

3 more replies

l5870uoo9y6d ago

> DeepSeek V4 Pro scores 96.4% on LiveCodeBench and costs $0.87/M output tokens.

Yes and this is a temporary discount which increases to 3.48 USD on 2026/05/31 15:59 UTC.

Source: https://api-docs.deepseek.com/quick_start/pricing

_3457d ago

If you're okay with sonnet level performance, this sounds like a straight upgrade. But I find that sonnet messes up too much, that it ends up not being worth cost optimizing down to using it or another sonnet-level model. Glad to have this as an option though

2ndorderthought7d ago

A lot of people are having good experiences doing things like using opus for designing and using locally hosted qwen3.6 for implementation.

I could see a serious cost reduction story by using opus for design and deepseek for implementation.

Personally I would avoid anthropic entirely. But I get why people don't.

girvo7d ago

Like me: that’s what I do. Either Opus 4.7 or GLM 5.1 for planning, write it out to a markdown file, then farm it out to Qwen 3.6 27B on my DGX Spark-alike using Pi. Works amusingly well all things considered.

2ndorderthought7d ago

How is glm 5.1? I have t tried it yet but have been meaning too

aftbit6d ago

What hardware are you using to power this?

chrsw6d ago

I keep re-learning this lesson: I chug along with a lesser model then throw a problem at it that's too complex. Then I try different models until I give up and bring in Opus 4.6 to clean up.

brianwawok6d ago

And I keep using Opus to like, make git commits. Really just need a smart router that is actually smart, vs having to micromanage model

TheServitor6d ago

It's surprisingly easy to hit $200 worth of tokens even at ~$1/M token though. No matter how many times I do the math the coding plans are the better value.

1 more reply

iosjunkie6d ago

Get comfortable with Deepseek's privacy policy for using this for anything serious.

"To improve and develop the Services and to train and improve our technology, such as our machine learning models and algorithms. Including by monitoring interactions and usage across your devices, analyzing how people are using it, and training and improving our technology."

https://cdn.deepseek.com/policies/en-US/deepseek-privacy-pol...

dopeepsreaddocs6d ago

Did... Did you just ask an AI to one-shot something that normally amounts to no more than setting two env variables?

sbinnee6d ago

After some time replacing gemini 3 flash preview with deepseek v4 flash for a chat model, the biggest difference is the auto reasoning effort. Gemini flash is super fast and perfect for a chat model. But when I need some thought experiments with a handful of constraints, it struggles a bit and I switch to sonnet. But with deepseek v4 flash, it can do long complex reasoning and it gets things often right. Generating a lot of reasoning tokens means that it takes a lot of time of course. But I am happy to find a cheaper model and excited to try something other than gemini flash. Gemini flash has been so good that I was locked on it for a while.

izietto6d ago

Just want to say that I faced this very problem the last week, I discovered OpenCode agent and it works great, with DeepSeek and other models. Try it out guys.

2 more replies

alexdns7d ago

obviously vibe coded ( co authored ) + the prices dont even match

2ndorderthought7d ago

It's going to be real hard to find headlines that weren't vibe coded from here on out unfortunately.

cyanydeez7d ago

welp, pack it it in boys, it was nice conceptualizing all you as real humans on the internet. I guess I'll just have to go touch grass if I want to feel parasocial.

jason1cho5d ago

By dog fooding itself, the tool doesn't fail to live up to the expectations of its users.

nclin_6d ago

Is claude code the best coding harness? Anyone running evals on that?

ahmadyan6d ago

In my anecdotal experience, it is not. Same model, opus, works better in 3P harnesses such as Factory Droid or Amp.

Claude code, on the other hand, is the most subsidized one, both for consumers (through max subscription) and for enterprises (token discounts). It is also heavily optimized for cost, specially token caching and reduced thinking, at the expense of quality.

orliesaurus7d ago

Is there a way to do this directly by using claudecode CLI (which I already have installed) and openrouter??

vitaflo7d ago

Yes, Deepseek even documents how:

https://api-docs.deepseek.com/quick_start/agent_integrations...

theanonymousone7d ago

Yes, from Claude Code themselves: https://code.claude.com/docs/en/llm-gateway

gnat7d ago

This repo's README explains how it works and you can do it yourself. claude looks for environment variables that say which API endpoint to talk to, which key to pass, which model name to use for haiku/sonnet/opus-level workloads, etc.

lukaslalinsky6d ago

I've been using DeepSeek v4 pro as an alternative to Claude models and for the first time I can see it as a real replacement. With the other Chinese models, I was missing something, but DeepSeek seems good enough for the kind of development I want to do.

dzink6d ago

Tried DeepSeek V4 Pro and Flash on Open Router and they worked fine - flash might have actually produced a better result, but also the same prompt across different inference providers produced the same result. Then tried DS4 Pro again via tinfoil.sh and got the same design but littered with random Chinese characters in the code. Tinfoil pegs prompt data as private / not trained on. Do know know DS4 providers that are verifiably private and not training on your prompts and outputs?

1 more reply

jay19965236d ago

Claude code can already use the DeepSeek API, so what are the advantages of this tool?

connorwhitlock6d ago

96.4% on LiveCodeBench is impressive but LiveCodeBench is single-shot. The interesting test is multi-turn agentic — has anyone benchmarked DeepSeek V4 Pro vs Opus on SWE-bench Verified or similar where the cheaper model has to be more decisive about tool use over 30+ turns? Curious if there's a cliff at higher tool-call depths.

ultrasandwich6d ago

Using this "out-of-the-box" with an OpenRouter subscription using DeepSeekv4, I just blew through 15 dollars in 45 minutes on a moderate sized code base, just making a plan and executing a refactoring of an upload pipeline to use a state machine. Not really seeing the cost savings for real-world work tbh.

xbmcuser6d ago

With how cheap and fast DeepSeek is and now using its free chat over the last few days I just cancelled my claude subscription so far I was using the chat interface only but I might just have to learn how to use the api.

9999000009996d ago

I just spent half my day getting CUDA and LLAMA to work with my 5070TI.

I was able to use it in agent mode with Roo, I stopped after having it write out a plan, but I'll continue when I have more time.

Deepseek feels less likely to do a straight up rug pull since you can self host with enough money, but I'm still more excited about local solutions.

Usually I just need grunt work done. I'm not solving difficult problems.

sowild_fun6d ago

Using a bunch of CLIs to work with DeepSeek V4, I've found that Langcli is the best fit for DeepSeek V4. For programming tasks, the cache hit rate is above 95%.

Not only can it seamlessly and dynamically switch between DeepSeek V4 Flash, V4 Pro, and other mainstream models within the same context, but it is also 100% compatible with Claude Code.

sfewfweg6d ago

Langcli + deepseek v4 is very good

1 more reply

vagab0nd6d ago

This has become a problem for me. I like trying new things. But I also know that in about a week, there's going to be a better/cheaper setup. And a week after that. And ideally I'd like to get some coding done when I'm not tinkering with the tools.

So I think I'll stay with CC for now.

kordlessagain6d ago

CC has the ability to use Ollama as well, which includes the ability for Ollama to proxy to Ollama's cloud models. It's brilliant, and works with a single Ollama command that doesn't mess with CC at all (so you can run them at the same time).

If you are interested, I've built an agentic terminal that helps manage these types of things better: https://deepbluedynamics.com/hyperia

zkmon6d ago

Next claude news (trump style): Recent versions of Claude code no longer allow talking to other models, or helping with any code that has the goal of moving away from anthropic models.

langitbiru6d ago

I'm wondering why DeepSeek didn't create an AI coding agent like Kimi Code.

1 more reply

shay1607m6d ago

Interesting setup

do you have any benchmarks on: - token usage over time - failures/retry rates

would be great to see how it behaves in production

rib3ye6d ago

How is this different than using ollama to launch Claude with

ollama launch claude --model deepseek-v4-pro:cloud

1 more reply

diamondosas6d ago

I have a question. does anyone have a problem with switihng context between AI and your terminal

esafak7d ago

Why wouldn't you use something open source like OpenCode, which already support DSv4 and has more features than CC?

dlx6d ago

As someone who does use other models with CC, I am curious about opencode, what extra features does it have that you find essential?

esafak6d ago

I like being able to add a wide array of models, define perms for agents and subagents, turn MCPs on and off at will, and be able to fix bugs I find in it.

dlx6d ago

fair enough...any drawbacks that you've found?

ttoinou7d ago

More features than CC ?

Also opencode tracks you by default. Its not safe. Every first prompt you send is routed through their servers, logged and they can use your data however they want

sedawkgrep6d ago

I thought this was debunked awhile ago. ?

esafak6d ago

I could not find any evidence of prompt logging. The code is open; can you point me to it?

dbeley6d ago

Honestly with the likes of Opencode / pi / hermes I don't really find the "Claude Code agent loop" part particularly interesting.

The edge Anthropic has on others lies on its models performance. CLI tooling (and obviously pricing) is definitely not better than others.

danny_codes6d ago

Except the model isn't particularly better anymore, as compared to the newest wave of FOSS models

Lihh276d ago

the wrapper is basically env var glue. You’re still betting the whole loop on Anthropic's closed client.

Copenjin6d ago

I wonder if openrouter will replicate that 120x caching, I suppose they will?

game_the0ry6d ago

Cost engineering [1] will be the next hot topic for AI.

[1] A fancier way of saying "reducing cost."

DeathArrow6d ago

You don't need Deep Claude. Claude Code is working with any model that exposes an endpoint for an Anthropic compatible API.

I am using Claude Code with GLM 5.1, MiniMax M2.7, Kimi K2.6 and Xiaomi MiMo V2.5 Pro.

tgautot6d ago

Nice, it's quite usefull to have a project like this which streamlines the setup necessary to use other "brains" in claude code "body". I personally will give this a try, but Ijust find the message on pricing a bit disingenuous, the deepseek price of "$0.87/M output tokens" is a discount, and this setup anyways needs a calude.ai subscription offering claude code, which now is 100$/month min.

triyambakam6d ago

And if I don't care about cost, what about actual performance?

itrunsdoomguy6d ago

Does it play Doom?

Tanxsinxlnx6d ago

does it support aws bedrock provider

akartit6d ago

why not opencode with deepseek?

karel-3d6d ago

Can I... somehow run this locally? DeepSeek is opensource? Do I even need their API key?

(I have no experience with running anything locally, maybe it's a stupid question)

1 more reply

dukeofdoom6d ago

Is there some way to make claude/codex beep when it finishes a task.

1 more reply

portsentinel6d ago

I am now thinking how far can agentic AI can go how far we can achieve

fHr6d ago

layer on layer on layer to refactor bunch of lines xD

2ndorderthought7d ago

Oh shoot now the next CC upgrade will blow your subscription for doing this

morpheos1376d ago

anthropic messed up big time harness works with any muh commodity LLM, meanwhile VCs were duped on the myth of FOOM AGI, probably not a cooincidence Anthropic is enmeshed with the scifi fan fic forum known as lesswrong. The world wants useful tools. The bay area bubble in contrast thrives on Mythos.

hgyyy6d ago

I think OAI and Anthropic will be ok for a year or two. But after that If they still continue to earn revenues from selling tokens to firms/software engineers they will be in serious trouble.

The American firms are not demonstrating escape velocity and as long as china offers something somewhat comparable and offers it at a very low price to compensate for any difference in quality, they will not be generating enough in cash flows to finance reinvestment. I highly doubt they’ll be able to continue raising external financing for numerous periods from here on out - they gotta start showing strong financials and that they are running away from the open source models.

LeFantome6d ago

The performance gap will likely close as Chinese hardware improves. This is happening very rapidly.

Already DeepSeek v4 is being hosted on Huawei Ascend 950. What do you think those cost relative to NVIDIA gear?

morpheos1376d ago

I wouldnt put it past the US gov to ban foreign models. they tried to ban tiktok. what is being demosrrated here is silicon valley can not withstand a competitive market.

LeFantome6d ago

Good luck banning Open Source models.

Not only that but other countries are very unlikely to follow suit, so it is just a straight-up productivity tax on the US.

1 more reply

bwfan1236d ago

> anthropic messed up big time harness works with any muh commodity LLM

that surprised me too. The intelligence is at the client, and by making that open, anthropic has commoditized the coding agent.

deadbabe6d ago

I had a call with our CTO and we are pivoting away from Claude Code to DeepClaude because the cost savings are too substantial to ignore.

j / k navigate · click thread line to collapse

279 comments

aftbit7d ago

    #!/bin/sh
    export ANTHROPIC_BASE_URL=https://api.deepseek.com/anthropic
    export ANTHROPIC_AUTH_TOKEN=sk-secret
    export ANTHROPIC_MODEL=deepseek-v4-flash
    export CLAUDE_CODE_DISABLE_NONESSENTIAL_TRAFFIC=1
    exec claude $@

aaurelions6d ago

It seems like any project that makes fun of Claude is bound to reach the top spot on Hacker News. Even if it’s just a project consisting of four lines of code.

2 more replies

nadermx6d ago

The AI wars have begun

vitaflo7d ago

I'm not exactly sure what the point of this is. Deepseek already has instructions to use its API with many CLI's including Claude Code directly:

https://api-docs.deepseek.com/quick_start/agent_integrations...

2ndorderthought7d ago

ttoinou6d ago

I thought the tool format wasnt exactly the same ? So plugging any IA into claude code requires a conversion of format

ricardobeat6d ago

Many of them expose “anthropic-compatible” APIs for this very purpose.

justech6d ago

wolttam6d ago

I’m going to throw my harness in the ring: https://codeberg.org/mlow/lmcli

Aeroi6d ago

CharlesW6d ago

> OpenCode is a strong base, and with a couple modifications it can become a very effective harness.

I personally didn't find it to be competitve with Claude Code as a harness. Can I ask how you modified it to perform better?

Aeroi6d ago

I haven’t run formal evals but i improved the experience for my own needs and it feels noticeably better with these modifications.

-Claude-style subagents -an MCP layer for higher-level tools -Cursor-style control plane modes like Ask, Plan, Debug, and Build.

The MCP layer lets the harness use things like GitHub file/code read, PR creation, web search/fetch, structured user questions, plan-mode switching, user skills, and subagents.

So the improvement is mostly from better ui/ux orchestration and tool access. There's some things from hermes that are interesting as well.

Most of my focus has been on applying this stack to sandboxed cloud agents so you can properly code and work from mobile devices.

I can't definitively say that the stack is better or worse than Claude code, more just tuned for my use case I guess.

aaurelions6d ago

Another very cost-effective option is Ollama Cloud. In a month of use, I only hit the 5-hour limit once, when I ran 8 agents simultaneously for 2 hours.

postatic6d ago

definitely worth it - have both ollama cloud, opencode and hermes running to test them all out, working great so far.

bakugo6d ago

> I would first suggest looking into pi.dev

Looked into this one. Thought it was suspicious that it only had 7 open issues on github. Turns out they have a bot that auto-closes every single issue just because.

I honestly have no words.

rsanek6d ago

>DeepSeek V4 Pro scores 96.4% on LiveCodeBench and costs $0.87/M output tokens

This is a heavily subsidized price and will only last until the end of the month: "The deepseek-v4-pro model is currently offered at a 75% discount, extended until 2026/05/31 15:59 UTC." [0]

I do think the model is quite good, I myself use it through Ollama Cloud for simple tasks. But I think some folks have bought in a little too much to the marketing hype around it.

[0] https://api-docs.deepseek.com/quick_start/pricing [1] https://openrouter.ai/deepseek/deepseek-v4-pro/providers

1 more reply

syntex6d ago

Not sure you can replace Claude with DeepSeek V4 that easily and have same results.

6 more replies

isege6d ago

> Claude Code is the best autonomous coding agent.

So it's quite the opposite. Claude Code is arguably the worst harness to run models with.

3 more replies

l5870uoo9y6d ago

> DeepSeek V4 Pro scores 96.4% on LiveCodeBench and costs $0.87/M output tokens.

Yes and this is a temporary discount which increases to 3.48 USD on 2026/05/31 15:59 UTC.

Source: https://api-docs.deepseek.com/quick_start/pricing

_3457d ago

2ndorderthought7d ago

A lot of people are having good experiences doing things like using opus for designing and using locally hosted qwen3.6 for implementation.

I could see a serious cost reduction story by using opus for design and deepseek for implementation.

Personally I would avoid anthropic entirely. But I get why people don't.

girvo7d ago

2ndorderthought7d ago

How is glm 5.1? I have t tried it yet but have been meaning too

aftbit6d ago

What hardware are you using to power this?

chrsw6d ago

I keep re-learning this lesson: I chug along with a lesser model then throw a problem at it that's too complex. Then I try different models until I give up and bring in Opus 4.6 to clean up.

brianwawok6d ago

And I keep using Opus to like, make git commits. Really just need a smart router that is actually smart, vs having to micromanage model

TheServitor6d ago

It's surprisingly easy to hit $200 worth of tokens even at ~$1/M token though. No matter how many times I do the math the coding plans are the better value.

1 more reply

iosjunkie6d ago

Get comfortable with Deepseek's privacy policy for using this for anything serious.

https://cdn.deepseek.com/policies/en-US/deepseek-privacy-pol...

dopeepsreaddocs6d ago

Did... Did you just ask an AI to one-shot something that normally amounts to no more than setting two env variables?

sbinnee6d ago

izietto6d ago

Just want to say that I faced this very problem the last week, I discovered OpenCode agent and it works great, with DeepSeek and other models. Try it out guys.

2 more replies

alexdns7d ago

obviously vibe coded ( co authored ) + the prices dont even match

2ndorderthought7d ago

It's going to be real hard to find headlines that weren't vibe coded from here on out unfortunately.

cyanydeez7d ago

welp, pack it it in boys, it was nice conceptualizing all you as real humans on the internet. I guess I'll just have to go touch grass if I want to feel parasocial.

jason1cho5d ago

By dog fooding itself, the tool doesn't fail to live up to the expectations of its users.

nclin_6d ago

Is claude code the best coding harness? Anyone running evals on that?

ahmadyan6d ago

In my anecdotal experience, it is not. Same model, opus, works better in 3P harnesses such as Factory Droid or Amp.

orliesaurus7d ago

Is there a way to do this directly by using claudecode CLI (which I already have installed) and openrouter??

vitaflo7d ago

Yes, Deepseek even documents how:

https://api-docs.deepseek.com/quick_start/agent_integrations...

theanonymousone7d ago

Yes, from Claude Code themselves: https://code.claude.com/docs/en/llm-gateway

gnat7d ago

lukaslalinsky6d ago

dzink6d ago

1 more reply

jay19965236d ago

Claude code can already use the DeepSeek API, so what are the advantages of this tool?

connorwhitlock6d ago

ultrasandwich6d ago

xbmcuser6d ago

9999000009996d ago

I just spent half my day getting CUDA and LLAMA to work with my 5070TI.

I was able to use it in agent mode with Roo, I stopped after having it write out a plan, but I'll continue when I have more time.

Deepseek feels less likely to do a straight up rug pull since you can self host with enough money, but I'm still more excited about local solutions.

Usually I just need grunt work done. I'm not solving difficult problems.

sowild_fun6d ago

Using a bunch of CLIs to work with DeepSeek V4, I've found that Langcli is the best fit for DeepSeek V4. For programming tasks, the cache hit rate is above 95%.

Not only can it seamlessly and dynamically switch between DeepSeek V4 Flash, V4 Pro, and other mainstream models within the same context, but it is also 100% compatible with Claude Code.

sfewfweg6d ago

Langcli + deepseek v4 is very good

1 more reply

vagab0nd6d ago

So I think I'll stay with CC for now.

kordlessagain6d ago

If you are interested, I've built an agentic terminal that helps manage these types of things better: https://deepbluedynamics.com/hyperia

zkmon6d ago

Next claude news (trump style): Recent versions of Claude code no longer allow talking to other models, or helping with any code that has the goal of moving away from anthropic models.

langitbiru6d ago

I'm wondering why DeepSeek didn't create an AI coding agent like Kimi Code.

1 more reply

shay1607m6d ago

Interesting setup

do you have any benchmarks on: - token usage over time - failures/retry rates

would be great to see how it behaves in production

rib3ye6d ago

How is this different than using ollama to launch Claude with

ollama launch claude --model deepseek-v4-pro:cloud

1 more reply

diamondosas6d ago

I have a question. does anyone have a problem with switihng context between AI and your terminal

esafak7d ago

Why wouldn't you use something open source like OpenCode, which already support DSv4 and has more features than CC?

dlx6d ago

As someone who does use other models with CC, I am curious about opencode, what extra features does it have that you find essential?

esafak6d ago

I like being able to add a wide array of models, define perms for agents and subagents, turn MCPs on and off at will, and be able to fix bugs I find in it.

dlx6d ago

fair enough...any drawbacks that you've found?

ttoinou7d ago

More features than CC ?

Also opencode tracks you by default. Its not safe. Every first prompt you send is routed through their servers, logged and they can use your data however they want

sedawkgrep6d ago

I thought this was debunked awhile ago. ?

esafak6d ago

I could not find any evidence of prompt logging. The code is open; can you point me to it?

dbeley6d ago

Honestly with the likes of Opencode / pi / hermes I don't really find the "Claude Code agent loop" part particularly interesting.

The edge Anthropic has on others lies on its models performance. CLI tooling (and obviously pricing) is definitely not better than others.

danny_codes6d ago

Except the model isn't particularly better anymore, as compared to the newest wave of FOSS models

Lihh276d ago

the wrapper is basically env var glue. You’re still betting the whole loop on Anthropic's closed client.

Copenjin6d ago

I wonder if openrouter will replicate that 120x caching, I suppose they will?

game_the0ry6d ago

Cost engineering [1] will be the next hot topic for AI.

[1] A fancier way of saying "reducing cost."

DeathArrow6d ago

You don't need Deep Claude. Claude Code is working with any model that exposes an endpoint for an Anthropic compatible API.

I am using Claude Code with GLM 5.1, MiniMax M2.7, Kimi K2.6 and Xiaomi MiMo V2.5 Pro.

tgautot6d ago

triyambakam6d ago

And if I don't care about cost, what about actual performance?

itrunsdoomguy6d ago

Does it play Doom?

Tanxsinxlnx6d ago

does it support aws bedrock provider

akartit6d ago

why not opencode with deepseek?

karel-3d6d ago

Can I... somehow run this locally? DeepSeek is opensource? Do I even need their API key?

(I have no experience with running anything locally, maybe it's a stupid question)

1 more reply

dukeofdoom6d ago

Is there some way to make claude/codex beep when it finishes a task.

1 more reply

portsentinel6d ago

I am now thinking how far can agentic AI can go how far we can achieve

fHr6d ago

layer on layer on layer to refactor bunch of lines xD

2ndorderthought7d ago

Oh shoot now the next CC upgrade will blow your subscription for doing this

morpheos1376d ago

hgyyy6d ago

I think OAI and Anthropic will be ok for a year or two. But after that If they still continue to earn revenues from selling tokens to firms/software engineers they will be in serious trouble.

LeFantome6d ago

The performance gap will likely close as Chinese hardware improves. This is happening very rapidly.

Already DeepSeek v4 is being hosted on Huawei Ascend 950. What do you think those cost relative to NVIDIA gear?

morpheos1376d ago

I wouldnt put it past the US gov to ban foreign models. they tried to ban tiktok. what is being demosrrated here is silicon valley can not withstand a competitive market.

LeFantome6d ago

Good luck banning Open Source models.

Not only that but other countries are very unlikely to follow suit, so it is just a straight-up productivity tax on the US.

1 more reply

bwfan1236d ago

> anthropic messed up big time harness works with any muh commodity LLM

that surprised me too. The intelligence is at the client, and by making that open, anthropic has commoditized the coding agent.

deadbabe6d ago

I had a call with our CTO and we are pivoting away from Claude Code to DeepClaude because the cost savings are too substantial to ignore.

j / k navigate · click thread line to collapse