Cursor Composer 2 is just Kimi K2.5 with RL (opens in new tab)

(twitter.com)

276 pointsmirzap1mo ago168 comments

168 comments

Cursor Composer 1 was Qwen and this is Kimi. IDE is based on VSCode. The entire company is build on packaging open source and reselling it.

Ollama is also doing this.

There is so much money to be made repackaging open source these days.

So funny to see Twitter go wild saying "a 50 person team just beat Anthropic" blah blah.

miroljub1mo ago

> Cursor Composer 1 was Qwen and this is Kimi. IDE is based on VSCode. The entire company is build on packaging open source and reselling it.

The question is, where's the outrage? Why are there no headlines "USA steals Chinese tech?" "All USA can do is make a cheap copy of Chinese SOTA models".

> So funny to see Twitter go wild saying "a 50 person team just beat Anthropic" blah blah.

Well, if it's an American company, then it's a noble underdog story. When Chinese do it, they are thieves leeching on the US tech investment.

It's all so predictable, even the comments here.

hakunin1mo ago

Do you think Chinese LLMs acquired training data legitimately? I think the whole situation is a bit funny, but I don't think the US "started it" to be fair.

geocar1mo ago

> Do you think Chinese LLMs acquired training data legitimately?

I think they probably acquire it in accordance with Chinese law.

> but I don't think the US "started it" to be fair.

Who are you quoting with those marks? Started what? To be fair to whom?

1 more reply

fooster1mo ago

I mean as if anthropic and openai did.

muzani1mo ago

If American policies stay this way, we'll see "Made in USA. Designed in Beijing."

Tostino1mo ago

I mean, I (and a ton of others) were pretty outspoken about ollama being a pack of grifters. The thing they are good at is marketing though, so it drowns out other projects in the area.

MangoCoffee1mo ago

yup. fully agree. American cry and bitch about Chinese copy and steal their tech then an American company (Cursor) use/steal open source tech from China and everyone is silence.

chzblck1mo ago

because its open source.

miroljub1mo ago

A license doesn't matter if the perpetrator doesn't comply with it.

elashri1mo ago

Open source licence requires attribution which obviously it is not done in this case.

1 more reply

thefounder1mo ago

Like licenses are worth anything in the AI world…

NitpickLawyer1mo ago

> packaging open source and reselling it.

It's a bit more than that. They have plenty of data to inform any finetunes they make. I don't know how much of a moat it will turn out to be in practice, but it's something. There's a reason every big provider made their own coding harness.

pbowyer1mo ago

Can anyone enlighten me how having a coding harness when for most customers you say "we won't train on your code" helps you do RL? What's the data that they rely on? Is it the prompts and their responses?

rubymamis1mo ago

I guess they rely on many people not toggling privacy-mode on?

doctorpangloss1mo ago

It doesn't matter what your privacy setting is, with any savvy vendor. Your data is used to train by paraphrasing it, and the paraphrasing makes it impossible to prove it was your data (it is stored at rest paraphrased). Of course the paraphrasing stores all the salient information, like your goals and guidance to the bot to the answer, even if it has no PII.

1 more reply

josho1mo ago

The meta data is useful.

Eg, When a prompt had a bad result and was edited, or had lots of back and forth to correct tool usage that information can be distilled and used to improve models.

And now imagine if you are focused on this for weeks you can likely come up with other ideas to leverage the metadata to improve model performance.

victorbjorklund1mo ago

I doubt the majority does that. I bet the majority is using the defaults.

__mharrison__1mo ago

Does "code" include the prompt? Seems like the prompts would be the goldmines. Hook those up to rl an open weight model...

dmix1mo ago

Cursor’s integration is much deeper than just plugging an LLM into VSCode

That said I have a feeling both VSCode and Claude code will catch up to their integration. But neither comes close yet (I say that as someone who mainly uses Claude Code).

bearjaws1mo ago

As a command line junkie, what is the main thing Claude Code needs to catch up with cursor?

I haven't dove into using a LLM in my editor, so I am less familiar with workflows there.

lubujackson1mo ago

I use both pretty heavily. Cursor has an "Ask" mode that is useful when I don't want it to touch files or ask a non-sequitur. Claude may have an easy way to do this, but I haven't seeked it.

Cursor also has an interesting Debug mode that actively adds specific debug logging logic to your code, runs through several hypotheses in a loop to narrow down the cause, then cleans up the logging. It can be super useful.

Finally, when making peecise changes I can select a function, hit cmd-L and add certain ljnes of code to the context. Hard to do that in Claude. Cursor tends to be much faster for quicker, more precise work in general, and rarely goes "searching through the codebase" for things.

Most importantly, I'm cheap. a If I leave Cursor on Auto I can use it full time, 8 hours a day, and never go past the $20 monthly charge. Yes, it is probably just using free models but they are quite decent now, quick and great for inline work.

1 more reply

dmix1mo ago

Mostly saying "include this line from x file and this block from y file" which keyboard shortcuts. Claude's VSCode plugin only does one selection. Claude Code requires explicitly telling it what to reference.

That plus Cursor's integration into VSCode feels very deep and part of the IDE, including how it indexes file efficiently and links to changed files, opens plans. Using Claude Code's VScode extension loads into a panel like a file which feels like a hack, not a dedicated sidebar. The output doesn't always properly link to files you can click on. Lots of small stuff like that which significantly improves the DX without swapping tabs or loading a terminal.

I also use Code from terminal sometimes but it feels very isolated unless you're vibecoding something new. I also tried others: Zed is only like 50% of the way there (or less). I also tried to use (Neo)Vim again and it's also nowhere close, probably 25% of the UX of Cursor even with experimental plugins/terminal setups.

physicles1mo ago

You’re not missing much.

I used Cursor for the second half of last year. If you’re hand-editing code, its autocomplete is super nice, basically like reading your mind.

But it turns out the people who say we’re moving to a world where programming is automated are pretty much right.

I switched to Claude Code about three weeks ago and haven’t looked back. Being CLI-first is just so much more powerful than IDE-first, because tons of work that isn’t just coding happens there. I use the VSCode extension in maybe 10% of my sessions when I want targeted edits.

So having a good autocomplete story like Cursor is either not useful, or anti-useful because it keeps you from getting your hands off the code.

MintPaw1mo ago

In cursor:

You can copy/paste or drag code snippets the chat window and they automatically become context like. (@myFile.cpp:300-310)

You can click any of the generated diffs in the assistant chat window to instantly jump to the code.

Generated code just appears as diffs till you manually approve each snippet or file. (which is fairly easy to do with "jump to next snippet/file" buttons)

These are all features I use constantly as someone who doesn't vibe but wants to just say "pack/unpack this struct into json", "add this new property to the struct, add it to the serialization, and the UI", and other true busywork tasks.

1 more reply

rvz1mo ago

> Cursor Composer 1 was Qwen...

We know Composer 2 is Kimi K2.5 from that tweet. Where is the evidence for Composer 1 being based on Qwen?

> So funny to see Twitter go wild saying "a 50 person team just beat Anthropic" blah blah.

In this case, it will be the other way round: Anthropic will see Cursor as a competitor AI lab using open weight models for Composor 2 (actually Kimi K2.5) which was allegedly distilled from Opus 4.6, and would be enough for Anthropic to cut off Cursor from using any of models.

That's where it is going.

PUSH_AX1mo ago

> There is so much money to be made repackaging open source these days

These days? Almost every tech offering in existence is 1000+ OSS dependencies gaffer taped together with a sprinkling of business logic.

Cursor isn't a shocking bit of software to pay for, its investment however...

faangguyindia1mo ago

It just means Kursor is sharing data with Chinese llm which enables them to improve their LLM by training on outputs and input of all data which cursor collects.

It's a two way street.

satvikpendem1mo ago

No, they self host the Chinese open source LLMs, not use their APIs.

rubymamis1mo ago

Do you know what Qwen model Composer 1.5 used?

simplyluke1mo ago

> a 50 person team just beat Anthropic

How does this blow that narrative up? A 50 person team likely broke a license to have a product that's competitive on output at a fraction of the costs of one of the most well capitalized companies on the planet. Claude code and anthropic are certainly the darlings of the space today, but to me this just reinforces the idea that their moat is razor thin on the model front, even compared to OSS that can be run on independent hardware.

The application layer play is also suspect to me. In the medium to long term I _want_ tools that'll let me run whatever models I want vs being tied to an expensive, proprietary, and singular provider. For personal work I care about costs, and eventually my employer will care both about costs _and_ enterprise features/governance that a company like Anysphere is extremely well positioned to provide.

More and more, I see the future of the application layer being model agnostic, most enterprises hosting models on their own cloud for data security concerns, and the models being fully commoditized.

torginus1mo ago

Considering how AI companies incestously RL on each other's models, I would not be surprised if any number of behavioral patterns and (claims to be ChatGPT/Claude/Deepseek or whatever) just popped up on new models constantly.

I would also not rule out that since K2 is an 1T model, this is a distill, as I don't think they're serving expensive models just like that, which would not be a licensing violation?.

simplyluke1mo ago

There's a now-deleted tweet from a Kimi dev claiming that they verified the tokenizier was the same, which would imply it going at least beyond RL. Could still be a distill I think.

deaux1mo ago

Looks like two Moonshot employees confirmed that it's not licensed before Moonshot made the decision to get out of the debate and delete their posts [0][1].

[0] https://chainthink.cn/zh-CN/news/113784276696010804 - may have originally been https://x.com/apples_jimmy/status/2034920082602864990

[1] https://pbs.twimg.com/media/HD2Ky9jW4AAAe0Y?format=jpg&name=...

lmc1mo ago

This is on their website...

"Is Kimi K2.5 open source?"

"Yes, Kimi K2.5 is an open source AI model. Developers and researchers can explore its architecture, build new solutions, and experiment openly. Model weights and code are publicly available on Hugging Face and the official GitHub repository."

https://www.kimi.com/ai-models/kimi-k2-5

saidmukhamad1mo ago

4th paragraph in license block

Our only modification part is that, if the Software (or any derivative works thereof) is used for any of your commercial products or services that have more than 100 million monthly active users, or more than 20 million US dollars (or equivalent in other currencies) in monthly revenue, you shall prominently display "Kimi K2.5" on the user interface of such product or service.

ffsm81mo ago

My first reaction was "well, who knows how much revenue they're actually doing"

But at least the rumor mill has them significantly above that line:

> Revenue: As of March 2026, reports suggest Cursor has surpassed $2 billion in annualized revenue (ARR).

lmc1mo ago

That's not an open source license, then.

2 more replies

Eridrus1mo ago

Cursor have said they are using Composer through their inference provider (Fireworks). Presumably the MIT is not viral like the GPL, so Cursor, and companies that use Cursor do not need to display Kimi attribution on their products.

It's definitely not what Kimi wanted, but it sounds like this is what is written.

827a1mo ago

Another one, I believe this one was also deleted: https://x.com/HarveenChadha/status/2034933979720425611/photo...

rfoo1mo ago

TBH they really shouldn't have posted such a tweet in the first place, just sit back and watch their license enforced by the Internet.

I had the question "how do you even enforce this weird license term" back then, I guess I know the answer now.

gillesjacobs1mo ago

Cursor is mostly an IDE / coding-agent harness company. So it probably makes sense for them not to train their own base model, but instead license something like Kimi and fine-tune it for their own harness and workflows.

Their moat looks pretty thin. A VSCode fork with an open-source LLM fork on top. In the fast-moving coding-agent market, it’s not obvious they keep their massive valuation forever.

jstummbillig1mo ago

There is a plausible scenario in which software engineering requires a very finite amount of intelligence, in which sota models will be used mainly for other things and where for coding the harness will become increasingly more important than the model.

merlindru1mo ago

i've kinda had this thought before but never could express it ("you only need up to a certain level of smartness to express most coding concepts correctly")

but it never occurred to me that, if true, of course the harness becomes increasingly more important. which feels absolutely correct of course.

not sure if the hypothesis is even true though.

maronato1mo ago

The problem is that it seems they didn't license it: https://pbs.twimg.com/media/HD2Ky9jW4AAAe0Y?format=jpg&name=...

NitpickLawyer1mo ago

> Their moat looks pretty thin.

Their value is in the data they've collected and are collecting. Usage, acceptance rate, and all the connected signals. Plus having a large userbase where they can A / B test any finetune they create.

CharlieDigital1mo ago

That's every harness including VC Code Copilot.

People home about Teams sucking, but its market share is several times that of Slack because of distribution.

I guarantee that Microsoft has even more data.

_puk1mo ago

There were conversations in the team yesterday about how Cursor's cloud agents are still ahead of Claude from a UX perspective.

Obviously we're running both, using the right tool for the job.

There is stickiness there from being early. That will be hard to replicate.

genthree1mo ago

I hope for their sake they're using real metrics internally, and not whatever nonsense they're using to calculate stuff like "% written by LLM" in their dashboard, because that's... very wrong.

granitepail1mo ago

"Just" Kimi K2.5 with RL—people really misunderstand how difficult it is to achieve these reults with RL. Cursor's research team is highly respected within the industry, and what they've done is quite impressive.

Before people go jumping to conclusions about model theft, it's worth considering the possibility that they did reach an agreement with Moonshot which their researchers were not aware of. That would certainly explain the deleted tweets. Until Moonshot makes an official statement, I'm not particularly concerned.

halJordan1mo ago

The amount of angst people feel the need to have against ai is incredible. We all seemed to want open weights, but it's time to take offense when open weights are used as intended?

granitepail1mo ago

Hey, what do you know—Moonshot came out and explained that Cursor was operating within the licensing agreement. https://x.com/kimi_moonshot/status/2035074972943831491

Aurornis1mo ago

Earlier their employees were Tweeting that Cursor did not have a license ( https://pbs.twimg.com/media/HD2Ky9jW4AAAe0Y?format=jpg&name=... ) so Moonshot employees are partially to blame for the outrage.

ajitid1mo ago

Could you explain how much improvement RL+fine tuning has given to Composer 2.0 over Kimi K2.5? I don't fully grasp the work Cursor model has done here and why it is difficult to achieve these results with RL.

jeany2s1mo ago

You should realize the number/comparison they released is based on CursorBench... I don't have to emphasize more how sus this kind of self-defined closed-source benchmark can be

nreece1mo ago

Partnership confirmed by Moonshot:

Cursor accesses Kimi-k2.5 via FireworksAI_HQ hosted RL and inference platform as part of an authorized commercial partnership.

https://x.com/Kimi_Moonshot/status/2035074972943831491

Cursor teams take:

Only ~1/4 of the compute spent on the final model came from the base, the rest is from our training. This is why evals are very different.

https://x.com/leerob/status/2035035355364081694

prodigycorp1mo ago

There are many reasons to make fun of Cursor. However, one of the things get right is their autocomplete model.

Are there any open models that come close? Why doesnt OAI or Anthropic dedicate some resources to blowing Cursor's model out of the water? Cursor's completion model is a sticking point for a lot of users.

druskacik1mo ago

I agree, their autocomplete (tab) model is the best, but recently I realised I am using it less and less - the new models are so good that I mostly just do agentic coding, and I do very little changes in the codebase by myself. This is probably a general trend and if the usage of autocomplete models is dying out, it's understandable the companies are not investing resources into it.

seunosewa1mo ago

Antigravity has an autocomplete model too. Based on Windsurf's, I guess.

harmonic183741mo ago

Absolutely not. Windsurf also just stole an open source model, there’s almost zero chance Google is using that under the hood.

olejorgenb1mo ago

The model is great. The UX is ~~horrible~~ annoying...

Tadpole91811mo ago

Don't get me started. For every half-decent choice, there's a multitude of insane choices. After all this time they still don't have side-by-side review.

Equally as annoying, the break from VSCode is horrible. Having to use a separate registry, not having basic settings sync, the delay behind mainline VSCode updates.

Then, it's just plain buggier than others. The agent terminal just doesn't work semi-regularly, it doesn't like listing directories in the @, the SSH plugin crashes every other time it tries to connect, undoing agent work undoes edits I made in unrelated files sometimes. Sometimes updates just regress performance hard for seemingly no reason.

I also noticed the token use is wildly less efficient than CC or Codex these days. After almost no time at all it's up to 100,000 tokens and they're charging $1 per request for Sonnet. Side-by-side, Cursor spent $17 in the same time CC spent $4. Which is bizarre to me, since they advertise how their indexing and semantic search is more token efficient?

The autocomplete model was the only reason I stayed as long as I did. I wish there was a VSCode equivalent.

olejorgenb1mo ago

Well, the UI as a whole is ok to me (except the parts which is way too volatile). I was talking about the UX of the autocomplete model. The model are very often spot on and fast, but it's impossible to properly configure it to be less in your face. Making it basically useless for day-to-day development.

g947o1mo ago

Most companies don't do auto competition these days, including some that just recently stopped offering completion.

Which I find very unfortunate. There are so many cases, especially in proprietary codebases with non standard infrastructure, where good autocomplete is much better than "agentic" edits that produce nothing but slop which takes longer to clean up.

granzymes1mo ago

They’re pretty upfront in their release post that they took an open source model and improved it with their own coding data. They mention “continued pretraining” (on top of the base model) and RL. Cursor never claimed to have done a full pretraining run.

More to the point, beating Opus 4.6 at coding and coming within striking distance of gpt-5.4 is impressive! The benchmarks outperform raw Kimi K2.5.

It’s particularly impressive given larger labs like Meta are struggling to catch up to OpenAI/Anthropic.

1 more reply

827a1mo ago

This is exactly what Cursor should be doing, within the obvious bounds of the law and such. Not everyone needs a pristine foundation model. What a waste of compute. Anthropic & OpenAI need product-level competition to knock them off their $25/Mtok horse.

HeavyStorm1mo ago

There's no "just" in RL. Fine tuning is very important and could make a lot of difference.

lukaslalinsky1mo ago

Indeed, this is quite obvious on Claude models vs Gemini. I fully believe Gemini is more powerful model, but the post training process is nowhere near what Anthropic does, which results in Gemini being horrible at coding sessions, while Claude is excellent.

merlindru1mo ago

apparently GPT-5 uses the same pretrain as 4o did, hah

justindotdev1mo ago

im pretty sure this is in violation of moonshot's ToS. this is going to be fun to watch unfold

kgeist1mo ago

At the same time, Moonshot violated Anthropic's ToS by training on their models' outputs :) [0]. And Anthropic violated copyright law by training on copyrighted material. It's violations all the way down.

https://news.ycombinator.com/item?id=47126614

_aavaa_1mo ago

I thought Anthopoc’s training was deemed fair used. It was the downloading that was illegal

Aurornis1mo ago

> And Anthropic violated copyright law by training on copyrighted material

Training on copyrighted material does not violate copyright laws (despite what many will assume)

NitpickLawyer1mo ago

There is no ToS at play here. There's only the license[1], which is MIT modified like so:

> Our only modification part is that, if the Software (or any derivative works thereof) is used for any of your commercial products or services that have more than 100 million monthly active users, or more than 20 million US dollars (or equivalent in other currencies) in monthly revenue, you shall prominently display "Kimi K2.5" on the user interface of such product or service.

[1] - https://huggingface.co/moonshotai/Kimi-K2.5/blob/main/LICENS...

zozbot2341mo ago

Yes, this is pretty clear-cut. There's even a great alternative, namely GLM-5, that does not have such a clause (and other alternatives besides) so it feels a bit problematic that they would use Kimi 2.5 and then disregard that advertisement clause.

NitpickLawyer1mo ago

I've replied down the thread, but there are ways to go around that clause entirely, even if it would be enforceable. The obvious way is to have another company do the modification.

1 more reply

gillesjacobs1mo ago

They probably licensed it. Still a bit deceptive not to mention it on the model card/blog post, but companies whitelabel all the time without mentioning.

It goes against the ML community ethos to obscure it, but is common branding practice.

deaux1mo ago

No they didn't [0][1]. With this leak they're probably negotiating as we speak, which could be why they've deleted the posts.

[0] https://chainthink.cn/zh-CN/news/113784276696010804

[1] https://pbs.twimg.com/media/HD2Ky9jW4AAAe0Y?format=jpg&name=...

gillesjacobs1mo ago

I stand corrected, that is pretty scummy.

I bet Moonshot is going to make them open their wallets to avoid legal trouble.

charcircuit1mo ago

Kimi K2.5 was released under a modified MIT license (100M+ MAU or $20M+ MRR has to prominently display Kimi K2.5). It will be fine.

antirez1mo ago

Basically this is true for most startups in the world BUT Cursor, so here you are kinda inverting the logic of the matter. Cursor is at a size that, if they wanted to use K2.5, they could clearly state that it was K2.5 or get a license to avoid saying it.

NitpickLawyer1mo ago

IF we assume that the modified MIT clause is enforceable. And if we assume Cursor Inc. is running the modification. It could very well be the case that Cursor Research LTD is doing the modifications and re-licensing it to Cursor Inc. That would make any clause in the modified MIT moot.

1 more reply

charcircuit1mo ago

In practice nothing happens after violating an open source licenses, especially if you are willing to follow the terms after being notified.

htrp1mo ago

The cursor investor pitch was we're training our own models to do coding. If your amazing model is just an RL repack, you need a new pitch to justify your 50bn valuation

https://www.bloomberg.com/news/articles/2026-03-12/ai-coding...

simplyluke1mo ago

Any investor who believed a team their size and with their capital was training a SOTA base model doesn't understand the space. I fully believe that was some of their investors, but people acting like RL + fine tuning based on their massive user base that's producing qualitatively better outputs than the base model is meaningless aren't understanding what the company is doing.

ajitid1mo ago

Could you explain how much improvement RL+fine tuning can give with respect to Composer 2.0 model over Kimi K2.5? I don't fully grasp the work Cursor model has done here.

olejorgenb1mo ago

To be fair, is "with RL", "just"?

They should have disclosed it though. If they didn't it's a bad look for sure.

ajitid1mo ago

Could you explain how much improvement RL+fine tuning has given Composer 2.0 over Kimi K2.5? I don't fully grasp the work Cursor model has done here.

rockmeamedee1mo ago

What does this mean, that you can take Kimi and RL finetune it a little more and blow the big AI shops out of the water?

Would this have been extensively fine tuned, beyond what Anthropic/OAI would do themselves?

I guess this is Cursor's own benchmark, so you can finetune on your own dataset and get better results on your own specific tasks I guess.

odst1mo ago

What do people like about cursor? I've been using it for the past couple days, and I just don't see many positive things about it. It seems people like the autocomplete so I'll have to give that a try.

There's just too many "features" the ux ends up being all over the place. I thought having the browser inside of the editor would be great for design, but it's not that much better than just having your browser open along with your editor.

__alexs1mo ago

Advertising your model with some obviously home grown benchmark is a bold play. It doesn't matter how good your model is, I immediately trust it less.

lossolo1mo ago

Their first model was also based on an open source Chinese base model. They never fully trained their own model.

varispeed1mo ago

I noticed something strange with Cursor lately. When I am using Opus 4.6, sometimes it is giving ridiculously dumb answers as if they were actually using something like Qwen with a prompt to present itself as Opus. I have to close the session and start again hoping I'll get actual Opus.

merlindru1mo ago

There's no shot they're doing that. Would be suicide as soon as anyone notices, and by the looks of it, they didn't even clean up the URL here to "hide" the fact that this is Kimi K2.5 so i doubt there's any grand conspiracy here.

What's way more likely is that Opus has been quantized by anthropic or something similar. Or that Opus was updated and didn't work well with Cursor's harness after. Or a token caching issue. Etc.

varispeed1mo ago

Well, I noticed. Though I am too busy to make a fuss about it.

chvid1mo ago

Moonshot is raising money at a 10B usd valuation, cursor/anysphere is at a 30B usd valuation.

chaosprint1mo ago

This is actually becoming a path dependency, a dependence on the supply chain.

EugeneOZ1mo ago

I don't know - it works okay (yet to be tested whether it is actually smarter than Opus 4.6), but it is not bad at all. So far, it works quite fine (I'm not testing the "fast" version).

1 more reply

vachina1mo ago

A question. I’m due for a yearly Cursor subscription renewal, how does the credit limit look like?

Currently I’ve not hit any of the limits despite using it quite rigorously, I wonder if this will change with a renewal?

thewhitetulip1mo ago

But you have to buy into it right? If you don't have a limit then what did your contract look like?

vachina1mo ago

I signed up last year when the limits were a lot more generous.

thewhitetulip1mo ago

Interesting. I wasn't aware that cursor also has a free tier!

simonw1mo ago

I'm annoyed that we still don't know for certain which base model they used for Cursor 1.

This feels really rude to me. I have no problem with them fine-tuning open weight models to create their own - they are getting great results, and Cursor's research term should be respected for that. But deliberately hiding the base model they use is disrespectful of the researchers who created that model.

leerob1mo ago

We used a Kimi base, with midtraining and RL on top. Going forward, we'll include the base used in our blog posts, that was a miss. Also, the license is through Fireworks: https://x.com/Kimi_Moonshot/status/2035074972943831491

simonw1mo ago

Thanks for sharing this, I saw Cursor are committed to publishing this now which is great to see.

enraged_camel1mo ago

Simon, sorry to hijack the thread, but what is a good way of contacting you? I'd love to pick your brain on an AI talk I'm supposed to be giving soon.

cbg01mo ago

Scores higher than Opus 4.6 on their in-house benchmark? Sounds legit.

827a1mo ago

FYI: Someone from Cursor has responded to this https://x.com/leerob/status/2035035355364081694

I think there's a reason why the people from Moonshot deleted their tweets; they're probably just researchers who got yelled at by the people who actually knew what was going on at Moonshot.

manojlds1mo ago

But that makes no sense - what are inference partner terms?

827a1mo ago

Here you go: https://x.com/Kimi_Moonshot/status/2035074972943831491

People need to seriously stop it with the whole reddit-esque Boston Marathon Bomber investigation-style low-info crusades. Its extremely unhealthy for both your own mental state and the state of discourse on the internet. Even if Cursor misbehaved (they did not): Your life is not materially changed whether they did or did not. Use it, or don't use it; these things are a matter that lies exclusively between Cursor and Moonshot.

manojlds1mo ago

Again, I was not questioning Cursor here, but inference partner terms makes no real sense and that is what I wanted explained.

Sammi1mo ago

As a paying customer, it just doesn't feel good that they are trying to pass off someone else's model as their own.

I mean I guess this is what businesses do all the time. There's a term for it even, it's called white-labeling.

But is this all that Cursor have? They pass of VS Code as their own, they pass off Kimi as their own... What do Cursor even do? What do I need them for?

jstanley1mo ago

As a paying customer, I don't care where the model comes from, I only care how good it is.

Sammi1mo ago

Sure, and also at what price point.

But can I rely on Cursor to be able to keep delivering, when they aren't the one's doing the work themselves?

827a1mo ago

Can you ensure that Notion is able to keep delivering given they don't develop their own models? Lovable? OpenCode? Should we be worried that Discord might disappear because they don't run their own data centers? Personally, I'm very concerned that one day Google might just have to close up shop, because while they do design their own chips, they don't fabricate them in-house; and don't get me started on TSMC and their critical dependency on ASML, they might as well just lock the doors.

acmj1mo ago

Well, they can keep stealing as long as someone open weight their models.

manojlds1mo ago

And how cheap it is

khuey1mo ago

White-labeling may be slightly dishonest to the consumer but the manufacturer and distributor are honest with each other. That doesn't appear to be the case here (Kimi's license requires publicly acknowledging Kimi is used for anyone operating at Cursor's scale).

mono4421mo ago

This whole ai stuff feels like a big bubble especially with the oil price soon at $200 and guaranteed recession.

rvz1mo ago

Honestly I don't think this leak is any good for Cursor. Not only this appears as a violation to Moonshot's ToS, this may also be in fact enough evidence for Anthropic to ban Cursor from using their models, just like they are doing to OpenCode.

Why? As I said before, Anthropic mentions Moonshot AI (Maker of the Kimi models) as one of the AI labs that were part of this alleged "distillation attack" [0] campaign and will use that reason to cut off Cursor, Just like they did to OpenAI, xAI and OpenCode.

Let's see if the market thinks Composor 2 is really that good without the Claude models helping Cursor. (If Anthropic cuts them off).

[0] https://www.anthropic.com/news/detecting-and-preventing-dist...

Majromax1mo ago

> this may also be in fact enough evidence for Anthropic to ban Cursor from using their models, just like they are doing to OpenCode.

The Anthropic ban on OpenCode isn't an Anthropic ban on OpenCode, it's a ban on using a Calude Code subscription with OpenCode. That's justified (or not) under various ToS arguments, but one can still use OpenCode with the more expensive API access.

Anthropic's complaint about distillation attacks is a distinct prong, one not levied against OpenCode. Additionally, the distillation activities described in your link don't describe Cursor's routine use of Anthropic's models. There, the model outputs are a primary product (e.g. the autocompleted code), and any learning signals provided are incidental.

zozbot2341mo ago

Anthropic's complaint about "distillation" attacks (obligatory scare quotes because training on glorified chat logs is a far cry from actually distilling from model weights you have real access to) is also about ToS violations. Anthropic's ToS, like OpenAI's, forbids you from exploiting interactions with their model for the purpose of building a competitor, even though rumor has it that the AI industry has been doing exactly this for a long time anyway.

charcircuit1mo ago

Kimi K2.5 is an open source model. It is intended for people to make derivative models.

MangoCoffee1mo ago

cursor copy open source software repack it as closed source and made massive money. i don't want to hear anything on how Chinese is stealing and copying when the west is doing it themselves.

taytus1mo ago

YC is back at it again.

DeathArrow1mo ago

I whish it was GLM 5.0.

coreyburnsdev1mo ago

is this the model used on free mode?

heliumtera1mo ago

For all the muh productivity guys that like to claim they can turn invisible when no one is looking, an produce 600k lock over 6 weeks, well...cursor is useless now. We know kimi K2.5 won't make you 100 trillion times faster.

Cursor is killed for this market.

agluszak1mo ago

A hyped startup providing zero added value, burning investor money only to repackage somebody else's work? That's new... /s

DeathArrow1mo ago

It depends on what you consider value. People are using it so they find some value.

genthree1mo ago

Incompetently repackaging. They started with VSCode so nearly all the work was already done, but still managed to make it leak memory like it's infinite. The power of AI slop! Their product is an anti-advertisement for the core concept of itself, which is kind of impressive.

merlindru1mo ago

I like Cursor's AI projects a lot. Cursor Tab is truly impressive. But you couldn't be more right.

I just downloaded VSCode again today after Cursor's latest update dropped my editor to 5 FPS or so (legitimately unusable. not hyperbole.) and holy shit it feels snappy. Completely forgot what it's like.

QubridAI1mo ago

Honestly, this is pretty much how most of the new models operate nowadays: a base model combined with RL and some product-layer magic.

koakuma-chan1mo ago

Cursor can't compete with Claude Code's subsidized pricing, so they are trying to gaslight people that their cheap model is good enough.

j / k navigate · click thread line to collapse

168 comments

mohsen11mo ago

Cursor Composer 1 was Qwen and this is Kimi. IDE is based on VSCode. The entire company is build on packaging open source and reselling it.

Ollama is also doing this.

There is so much money to be made repackaging open source these days.

So funny to see Twitter go wild saying "a 50 person team just beat Anthropic" blah blah.

miroljub1mo ago

> Cursor Composer 1 was Qwen and this is Kimi. IDE is based on VSCode. The entire company is build on packaging open source and reselling it.

The question is, where's the outrage? Why are there no headlines "USA steals Chinese tech?" "All USA can do is make a cheap copy of Chinese SOTA models".

> So funny to see Twitter go wild saying "a 50 person team just beat Anthropic" blah blah.

Well, if it's an American company, then it's a noble underdog story. When Chinese do it, they are thieves leeching on the US tech investment.

It's all so predictable, even the comments here.

hakunin1mo ago

Do you think Chinese LLMs acquired training data legitimately? I think the whole situation is a bit funny, but I don't think the US "started it" to be fair.

geocar1mo ago

> Do you think Chinese LLMs acquired training data legitimately?

I think they probably acquire it in accordance with Chinese law.

> but I don't think the US "started it" to be fair.

Who are you quoting with those marks? Started what? To be fair to whom?

1 more reply

fooster1mo ago

I mean as if anthropic and openai did.

muzani1mo ago

If American policies stay this way, we'll see "Made in USA. Designed in Beijing."

Tostino1mo ago

I mean, I (and a ton of others) were pretty outspoken about ollama being a pack of grifters. The thing they are good at is marketing though, so it drowns out other projects in the area.

MangoCoffee1mo ago

yup. fully agree. American cry and bitch about Chinese copy and steal their tech then an American company (Cursor) use/steal open source tech from China and everyone is silence.

chzblck1mo ago

because its open source.

miroljub1mo ago

A license doesn't matter if the perpetrator doesn't comply with it.

elashri1mo ago

Open source licence requires attribution which obviously it is not done in this case.

1 more reply

thefounder1mo ago

Like licenses are worth anything in the AI world…

NitpickLawyer1mo ago

> packaging open source and reselling it.

pbowyer1mo ago

rubymamis1mo ago

I guess they rely on many people not toggling privacy-mode on?

doctorpangloss1mo ago

1 more reply

josho1mo ago

The meta data is useful.

Eg, When a prompt had a bad result and was edited, or had lots of back and forth to correct tool usage that information can be distilled and used to improve models.

And now imagine if you are focused on this for weeks you can likely come up with other ideas to leverage the metadata to improve model performance.

victorbjorklund1mo ago

I doubt the majority does that. I bet the majority is using the defaults.

__mharrison__1mo ago

Does "code" include the prompt? Seems like the prompts would be the goldmines. Hook those up to rl an open weight model...

dmix1mo ago

Cursor’s integration is much deeper than just plugging an LLM into VSCode

That said I have a feeling both VSCode and Claude code will catch up to their integration. But neither comes close yet (I say that as someone who mainly uses Claude Code).

bearjaws1mo ago

As a command line junkie, what is the main thing Claude Code needs to catch up with cursor?

I haven't dove into using a LLM in my editor, so I am less familiar with workflows there.

lubujackson1mo ago

I use both pretty heavily. Cursor has an "Ask" mode that is useful when I don't want it to touch files or ask a non-sequitur. Claude may have an easy way to do this, but I haven't seeked it.

1 more reply

dmix1mo ago

physicles1mo ago

You’re not missing much.

I used Cursor for the second half of last year. If you’re hand-editing code, its autocomplete is super nice, basically like reading your mind.

But it turns out the people who say we’re moving to a world where programming is automated are pretty much right.

So having a good autocomplete story like Cursor is either not useful, or anti-useful because it keeps you from getting your hands off the code.

MintPaw1mo ago

In cursor:

You can copy/paste or drag code snippets the chat window and they automatically become context like. (@myFile.cpp:300-310)

You can click any of the generated diffs in the assistant chat window to instantly jump to the code.

Generated code just appears as diffs till you manually approve each snippet or file. (which is fairly easy to do with "jump to next snippet/file" buttons)

1 more reply

rvz1mo ago

> Cursor Composer 1 was Qwen...

We know Composer 2 is Kimi K2.5 from that tweet. Where is the evidence for Composer 1 being based on Qwen?

> So funny to see Twitter go wild saying "a 50 person team just beat Anthropic" blah blah.

That's where it is going.

PUSH_AX1mo ago

> There is so much money to be made repackaging open source these days

These days? Almost every tech offering in existence is 1000+ OSS dependencies gaffer taped together with a sprinkling of business logic.

Cursor isn't a shocking bit of software to pay for, its investment however...

faangguyindia1mo ago

It just means Kursor is sharing data with Chinese llm which enables them to improve their LLM by training on outputs and input of all data which cursor collects.

It's a two way street.

satvikpendem1mo ago

No, they self host the Chinese open source LLMs, not use their APIs.

rubymamis1mo ago

Do you know what Qwen model Composer 1.5 used?

simplyluke1mo ago

> a 50 person team just beat Anthropic

More and more, I see the future of the application layer being model agnostic, most enterprises hosting models on their own cloud for data security concerns, and the models being fully commoditized.

torginus1mo ago

I would also not rule out that since K2 is an 1T model, this is a distill, as I don't think they're serving expensive models just like that, which would not be a licensing violation?.

simplyluke1mo ago

There's a now-deleted tweet from a Kimi dev claiming that they verified the tokenizier was the same, which would imply it going at least beyond RL. Could still be a distill I think.

deaux1mo ago

Looks like two Moonshot employees confirmed that it's not licensed before Moonshot made the decision to get out of the debate and delete their posts [0][1].

[0] https://chainthink.cn/zh-CN/news/113784276696010804 - may have originally been https://x.com/apples_jimmy/status/2034920082602864990

[1] https://pbs.twimg.com/media/HD2Ky9jW4AAAe0Y?format=jpg&name=...

lmc1mo ago

This is on their website...

"Is Kimi K2.5 open source?"

https://www.kimi.com/ai-models/kimi-k2-5

saidmukhamad1mo ago

4th paragraph in license block

ffsm81mo ago

My first reaction was "well, who knows how much revenue they're actually doing"

But at least the rumor mill has them significantly above that line:

> Revenue: As of March 2026, reports suggest Cursor has surpassed $2 billion in annualized revenue (ARR).

lmc1mo ago

That's not an open source license, then.

2 more replies

Eridrus1mo ago

It's definitely not what Kimi wanted, but it sounds like this is what is written.

827a1mo ago

Another one, I believe this one was also deleted: https://x.com/HarveenChadha/status/2034933979720425611/photo...

rfoo1mo ago

TBH they really shouldn't have posted such a tweet in the first place, just sit back and watch their license enforced by the Internet.

I had the question "how do you even enforce this weird license term" back then, I guess I know the answer now.

gillesjacobs1mo ago

Their moat looks pretty thin. A VSCode fork with an open-source LLM fork on top. In the fast-moving coding-agent market, it’s not obvious they keep their massive valuation forever.

jstummbillig1mo ago

merlindru1mo ago

i've kinda had this thought before but never could express it ("you only need up to a certain level of smartness to express most coding concepts correctly")

but it never occurred to me that, if true, of course the harness becomes increasingly more important. which feels absolutely correct of course.

not sure if the hypothesis is even true though.

maronato1mo ago

The problem is that it seems they didn't license it: https://pbs.twimg.com/media/HD2Ky9jW4AAAe0Y?format=jpg&name=...

NitpickLawyer1mo ago

> Their moat looks pretty thin.

CharlieDigital1mo ago

That's every harness including VC Code Copilot.

People home about Teams sucking, but its market share is several times that of Slack because of distribution.

I guarantee that Microsoft has even more data.

_puk1mo ago

There were conversations in the team yesterday about how Cursor's cloud agents are still ahead of Claude from a UX perspective.

Obviously we're running both, using the right tool for the job.

There is stickiness there from being early. That will be hard to replicate.

genthree1mo ago

I hope for their sake they're using real metrics internally, and not whatever nonsense they're using to calculate stuff like "% written by LLM" in their dashboard, because that's... very wrong.

granitepail1mo ago

halJordan1mo ago

The amount of angst people feel the need to have against ai is incredible. We all seemed to want open weights, but it's time to take offense when open weights are used as intended?

granitepail1mo ago

Hey, what do you know—Moonshot came out and explained that Cursor was operating within the licensing agreement. https://x.com/kimi_moonshot/status/2035074972943831491

Aurornis1mo ago

ajitid1mo ago

jeany2s1mo ago

You should realize the number/comparison they released is based on CursorBench... I don't have to emphasize more how sus this kind of self-defined closed-source benchmark can be

nreece1mo ago

Partnership confirmed by Moonshot:

Cursor accesses Kimi-k2.5 via FireworksAI_HQ hosted RL and inference platform as part of an authorized commercial partnership.

https://x.com/Kimi_Moonshot/status/2035074972943831491

Cursor teams take:

Only ~1/4 of the compute spent on the final model came from the base, the rest is from our training. This is why evals are very different.

https://x.com/leerob/status/2035035355364081694

prodigycorp1mo ago

There are many reasons to make fun of Cursor. However, one of the things get right is their autocomplete model.

druskacik1mo ago

seunosewa1mo ago

Antigravity has an autocomplete model too. Based on Windsurf's, I guess.

harmonic183741mo ago

Absolutely not. Windsurf also just stole an open source model, there’s almost zero chance Google is using that under the hood.

olejorgenb1mo ago

The model is great. The UX is ~~horrible~~ annoying...

Tadpole91811mo ago

Don't get me started. For every half-decent choice, there's a multitude of insane choices. After all this time they still don't have side-by-side review.

Equally as annoying, the break from VSCode is horrible. Having to use a separate registry, not having basic settings sync, the delay behind mainline VSCode updates.

The autocomplete model was the only reason I stayed as long as I did. I wish there was a VSCode equivalent.

olejorgenb1mo ago

g947o1mo ago

Most companies don't do auto competition these days, including some that just recently stopped offering completion.

granzymes1mo ago

More to the point, beating Opus 4.6 at coding and coming within striking distance of gpt-5.4 is impressive! The benchmarks outperform raw Kimi K2.5.

It’s particularly impressive given larger labs like Meta are struggling to catch up to OpenAI/Anthropic.

1 more reply

827a1mo ago

HeavyStorm1mo ago

There's no "just" in RL. Fine tuning is very important and could make a lot of difference.

lukaslalinsky1mo ago

merlindru1mo ago

apparently GPT-5 uses the same pretrain as 4o did, hah

justindotdev1mo ago

im pretty sure this is in violation of moonshot's ToS. this is going to be fun to watch unfold

kgeist1mo ago

https://news.ycombinator.com/item?id=47126614

_aavaa_1mo ago

I thought Anthopoc’s training was deemed fair used. It was the downloading that was illegal

Aurornis1mo ago

> And Anthropic violated copyright law by training on copyrighted material

Training on copyrighted material does not violate copyright laws (despite what many will assume)

NitpickLawyer1mo ago

There is no ToS at play here. There's only the license[1], which is MIT modified like so:

[1] - https://huggingface.co/moonshotai/Kimi-K2.5/blob/main/LICENS...

zozbot2341mo ago

NitpickLawyer1mo ago

I've replied down the thread, but there are ways to go around that clause entirely, even if it would be enforceable. The obvious way is to have another company do the modification.

1 more reply

gillesjacobs1mo ago

They probably licensed it. Still a bit deceptive not to mention it on the model card/blog post, but companies whitelabel all the time without mentioning.

It goes against the ML community ethos to obscure it, but is common branding practice.

deaux1mo ago

No they didn't [0][1]. With this leak they're probably negotiating as we speak, which could be why they've deleted the posts.

[0] https://chainthink.cn/zh-CN/news/113784276696010804

[1] https://pbs.twimg.com/media/HD2Ky9jW4AAAe0Y?format=jpg&name=...

gillesjacobs1mo ago

I stand corrected, that is pretty scummy.

I bet Moonshot is going to make them open their wallets to avoid legal trouble.

charcircuit1mo ago

Kimi K2.5 was released under a modified MIT license (100M+ MAU or $20M+ MRR has to prominently display Kimi K2.5). It will be fine.

antirez1mo ago

NitpickLawyer1mo ago

1 more reply

charcircuit1mo ago

In practice nothing happens after violating an open source licenses, especially if you are willing to follow the terms after being notified.

htrp1mo ago

The cursor investor pitch was we're training our own models to do coding. If your amazing model is just an RL repack, you need a new pitch to justify your 50bn valuation

https://www.bloomberg.com/news/articles/2026-03-12/ai-coding...

simplyluke1mo ago

ajitid1mo ago

Could you explain how much improvement RL+fine tuning can give with respect to Composer 2.0 model over Kimi K2.5? I don't fully grasp the work Cursor model has done here.

olejorgenb1mo ago

To be fair, is "with RL", "just"?

They should have disclosed it though. If they didn't it's a bad look for sure.

ajitid1mo ago

Could you explain how much improvement RL+fine tuning has given Composer 2.0 over Kimi K2.5? I don't fully grasp the work Cursor model has done here.

rockmeamedee1mo ago

What does this mean, that you can take Kimi and RL finetune it a little more and blow the big AI shops out of the water?

Would this have been extensively fine tuned, beyond what Anthropic/OAI would do themselves?

I guess this is Cursor's own benchmark, so you can finetune on your own dataset and get better results on your own specific tasks I guess.

odst1mo ago

__alexs1mo ago

Advertising your model with some obviously home grown benchmark is a bold play. It doesn't matter how good your model is, I immediately trust it less.

lossolo1mo ago

Their first model was also based on an open source Chinese base model. They never fully trained their own model.

varispeed1mo ago

merlindru1mo ago

What's way more likely is that Opus has been quantized by anthropic or something similar. Or that Opus was updated and didn't work well with Cursor's harness after. Or a token caching issue. Etc.

varispeed1mo ago

Well, I noticed. Though I am too busy to make a fuss about it.

chvid1mo ago

Moonshot is raising money at a 10B usd valuation, cursor/anysphere is at a 30B usd valuation.

chaosprint1mo ago

This is actually becoming a path dependency, a dependence on the supply chain.

EugeneOZ1mo ago

I don't know - it works okay (yet to be tested whether it is actually smarter than Opus 4.6), but it is not bad at all. So far, it works quite fine (I'm not testing the "fast" version).

1 more reply

vachina1mo ago

A question. I’m due for a yearly Cursor subscription renewal, how does the credit limit look like?

Currently I’ve not hit any of the limits despite using it quite rigorously, I wonder if this will change with a renewal?

thewhitetulip1mo ago

But you have to buy into it right? If you don't have a limit then what did your contract look like?

vachina1mo ago

I signed up last year when the limits were a lot more generous.

thewhitetulip1mo ago

Interesting. I wasn't aware that cursor also has a free tier!

simonw1mo ago

I'm annoyed that we still don't know for certain which base model they used for Cursor 1.

leerob1mo ago

simonw1mo ago

Thanks for sharing this, I saw Cursor are committed to publishing this now which is great to see.

enraged_camel1mo ago

Simon, sorry to hijack the thread, but what is a good way of contacting you? I'd love to pick your brain on an AI talk I'm supposed to be giving soon.

cbg01mo ago

Scores higher than Opus 4.6 on their in-house benchmark? Sounds legit.

827a1mo ago

FYI: Someone from Cursor has responded to this https://x.com/leerob/status/2035035355364081694

I think there's a reason why the people from Moonshot deleted their tweets; they're probably just researchers who got yelled at by the people who actually knew what was going on at Moonshot.

manojlds1mo ago

But that makes no sense - what are inference partner terms?

827a1mo ago

Here you go: https://x.com/Kimi_Moonshot/status/2035074972943831491

manojlds1mo ago

Again, I was not questioning Cursor here, but inference partner terms makes no real sense and that is what I wanted explained.

Sammi1mo ago

As a paying customer, it just doesn't feel good that they are trying to pass off someone else's model as their own.

I mean I guess this is what businesses do all the time. There's a term for it even, it's called white-labeling.

But is this all that Cursor have? They pass of VS Code as their own, they pass off Kimi as their own... What do Cursor even do? What do I need them for?

jstanley1mo ago

As a paying customer, I don't care where the model comes from, I only care how good it is.

Sammi1mo ago

Sure, and also at what price point.

But can I rely on Cursor to be able to keep delivering, when they aren't the one's doing the work themselves?

827a1mo ago

acmj1mo ago

Well, they can keep stealing as long as someone open weight their models.

manojlds1mo ago

And how cheap it is

khuey1mo ago

mono4421mo ago

This whole ai stuff feels like a big bubble especially with the oil price soon at $200 and guaranteed recession.

rvz1mo ago

Let's see if the market thinks Composor 2 is really that good without the Claude models helping Cursor. (If Anthropic cuts them off).

[0] https://www.anthropic.com/news/detecting-and-preventing-dist...

Majromax1mo ago

> this may also be in fact enough evidence for Anthropic to ban Cursor from using their models, just like they are doing to OpenCode.

zozbot2341mo ago

charcircuit1mo ago

Kimi K2.5 is an open source model. It is intended for people to make derivative models.

MangoCoffee1mo ago

cursor copy open source software repack it as closed source and made massive money. i don't want to hear anything on how Chinese is stealing and copying when the west is doing it themselves.

taytus1mo ago

YC is back at it again.

DeathArrow1mo ago

I whish it was GLM 5.0.

coreyburnsdev1mo ago

is this the model used on free mode?

heliumtera1mo ago

Cursor is killed for this market.

agluszak1mo ago

A hyped startup providing zero added value, burning investor money only to repackage somebody else's work? That's new... /s

DeathArrow1mo ago

It depends on what you consider value. People are using it so they find some value.

genthree1mo ago

merlindru1mo ago

I like Cursor's AI projects a lot. Cursor Tab is truly impressive. But you couldn't be more right.

QubridAI1mo ago

Honestly, this is pretty much how most of the new models operate nowadays: a base model combined with RL and some product-layer magic.

koakuma-chan1mo ago

Cursor can't compete with Claude Code's subsidized pricing, so they are trying to gaslight people that their cheap model is good enough.

j / k navigate · click thread line to collapse