my name is Devin and I don't like sharing a name with a product. Will you please consider changing the name?
There is always the chance that someone named Devin will do something that gives your product a bad name. Perhaps some new scandal will involve someone named Devin or something.
I'd also like you to imagine that a hot new erotic AI was named "Walden", and people said things like "I was talking with Walden last night" as a euphemism. How would that make you feel?
Even if that version is limited to only editing public Github repos. $500 to see how well it works is too much.
I'd like to see that $50/month tier reopened to subscribers, and a $0/month+credits tier added (1 concurrent active session only, constrained to small VM spec with immutable rootfs (regular devin VMs have writable rootfs), no automatic knowledge generation, no snapshots, though playbooks allowed).
> Even if that version is limited to only editing public Github repos
Not possible to constrain like that with the current Devin architecture.
* will you store my code + train on workflows that Devin does for me? * are you piping data to other third party providers (i.e. anthropic, openAI)?
That seems to be the challenge with Cursor Agent in it's current form, it generates a bunch of code that has bugs and requires a lot of iteration.
Also to mention, "suggest knowledge" modal is broken (it silently ignores changes made if you edit the suggested knowledge).
Another issue, sleep&snapshot system is still prone to race conditions in certain cases.
How much context window does it load when it is solving tasks?
How does it determine which files to load into context?
The word "try" is VERY different than the actual case, which is "pay for use".
If the answer to the first line is yes, how do I request my email be deleted? I started to sign up but I am not a use case for $500 a month at the moment.
What is the pricing story?
Can I use it as side project dev or is the target enterprise customers only / mainly?
https://www.washingtonpost.com/technology/interactive/2021/p...
At least our names got attached to an upstanding product, and one that is likely to languish and fail. We're not the next "Alexa", I hope.
Had the nickname fart fart until my sister moved out of the house.
Maybe you could confirm, but ChatGPT tells me in Japanese Debu colloquially and offensively means "fat" or "chubby", and Bu is an onomotapoeia for a fart noise, like "prrt" in English.
Bold move, but imagine the patch notes:
• Fixed bug where assistant attempted to unmake the fabric of reality
• Resolved issue where “Set alarm for 7 AM” triggered a rampancy cascade
• Improved pronunciation of “Lh’owon” for calendar appointments
Probably still a better bet than Durandal, definitely an improvement over Tycho.
And then there was Leela…
If it had to be a name for a product, it seems like to give me some sort of cheap male grooming or AXE body spray product vibes.
I don't like first name product names for other reasons but not because they share a name with humans named the same
Just having fun. I see what you mean and vaguely support it... I just won't lose anything over it
edit:
> Eschew flamebait. Avoid generic tangents. Omit internet tropes.
> Please don't post shallow dismissals, especially of other people's work. A good critical comment teaches us something.
I've been frankly terrified of the pace of LLM development since 2022.
Can you share concrete examples?
https://docs.anthropic.com/en/docs/build-with-claude/compute...
Sometimes even when you're making calls to dozens of different endpoints they're easy, but other times, you end up guessing at how to access undocumented functionality within a GraphQL API that has introspection turned off, or working around entity modeling that's completely different from your system and requires a lot of translation. Or you work with an API whose indexes variably start from 1, 0, -1, and -2 in different endpoints. These generally aren't hard technical challenges to solve, and something like Devin that could take care of most surface-level problems you see while integrating with some XML API from 2007 would be welcome.
There are companies like https://www.tryfinch.com and https://www.merge.dev that try to solve these issues, but their abstractions also reduce flexibility and aren't a perfect for all HRIS integration use cases right now.
There are also more flexible solutions like https://www.nango.dev
It handles the API-specific complexities (auth, retries, webhooks, per-customer config, pre-built templates) but allows you to implement the exact use case + data model you need.
It's open source/source available.
(disclaimer: I am a founder)
So if there are 10 users, the free tier lets me give them the ability to add up to 3 integrations? Is it 3 per user?
Thanks
this is definitely slower than if i were doing it full time, but i run a company. i go from customer meeting to customer meeting and spend 5-10 min a day taking whatever is blocking devin and pasting it into an email to the partner to get a response for devin.
It's a pretty typical clause you'll see in most SaaS policies.
Source: I work for a SaaS, but I am not a lawyer, caveat emptor.
https://web.archive.org/web/20111103081406/http://consumeris...
Original article that caused the outrage. In particular, the TOS did not say they owned your pictures, but it did give them a license that was quite broad, which included using your likeness in advertisements. However, the change that caused the outrage was that the license no longer expired on account deletion nor content removal.
https://www.npr.org/2009/02/17/100783689/facebook-users-angr...
News article about the outrage.
https://www.nytimes.com/2009/02/19/technology/internet/19fac...
News article about the walkback.
I could not find anything about it being challenged in court.
And other points where it should shine. How does it compare to using Cursor? Is it the slack integration?
There are many niches to be captured.
In my own experience using Cursor with Claude 3.5 Sonnet (new) and o1-preview, Claude is sufficient for most things, but there are times when Claude gets stumped. Invariably that means I asked it to do too much. But sometimes, maybe 10-20% of the time, o1-preview is able to do what Claude couldn’t.
I haven’t signed up for o1 Pro because going from Cursor to copy/pasting from ChatGPT is a big DevX downgrade. But from what I’ve heard o1 Pro can solve harder coding problems that would stump Claude or o1-preview.
My solution is just to split the problem into smaller chunks that make it tractable for Claude. I assume this is what Devin’s doing. Or is Devin using custom models or an early version of the o1 (full or pro) API?
https://x.com/cognition_labs/status/1834292718174077014
I'd expect a very different experience with Devin vs the IDE-forks -- it provides status updates in Slack, runs CI, and when it's done it puts up a pull request in GitHub.
Slack integration, automatically pushing to CI, etc., are relatively low-value compared to the questions of “does it write better code than alternatives?”, “can I depend on it to solve hard problems?”, “will I still need a Cursor and/or ChatGPT Pro subscription to debug Devin’s mistakes?”
( removed pricing q, as I missed it is $500 / month for whole teams. I get why that is the pricing, but doesn't work for me to try it in side projects sadly )
But these are the kinds of problems that help shape the product. The software archictecture should be a compression of a deep and intuitive understanding of the problem space. How can you develop that knowledge if you're just delegating it to a black box that can't operate at a near-human level?
I've used ai based tools to great success, but on an ad-hoc basis, for specific and small functions or modules. To do the integration part requires an understanding of what abstraction is appropriate where. I don't think these tools are good that.
What we do get is a price of $ 500,- per month from a company that has been caught lying about this very product [0] and has never allowed independent testing.
Cognition, I am sorry to tell you, but there is no reason to trust you. In fact, there are multiple good reasons no to, even if you offered Devin at a fraction.
If this were e.g. Anthropic launching a new beyond Opus size model that was still performant and came with "chain-of-thought" capabilities, a far more extensive context window that still fully passes needle in haystack and is absolutely solid in sourcing from provided files, keeps on track even when provided with large documents, has few or no restrictions on usage and comes with extensive, verifiable benchmarks that showcase this offering being a significant upgrade over other models, maybe such a price could be justified.
You know why Cognition? Because they haven’t actively lied. What they did instead was let people use their models and actually test the advantages. Even Claude Instant way back when had certain use cases that made them have their own niche and showed they could execute before expanding with 2 and the larger context, then 3 with more applications. You never did any of that, you never gave anyone reason to believe what you claim, you didn’t even release benchmarks. See the difference?
Seems more like a simple cash grab, attempting to ride the O1 wave. OpenAI has a hard time justifying their Pro pricing, you doubling that makes this an out of season April fools joke. Waiting for the inevitable reporting that this is just another API wrapper for Claude or ChatGPT with our old faithful RAG.
[0] https://www.youtube.com/watch?v=tNmgmwEtoWE&pp=ygUJZGV2aW4gY...
That said, I'm super excited about this space and love seeing smart folks putting energy into this. Even if it's still a bit aspirational, I think the idea of cutting down time spent debugging and refactoring and putting more power in the hands of less technical folks is awesome.