Redis array: short story of a long development process (opens in new tab)

(antirez.com)

316 pointsantirez6d ago109 comments

109 comments

Thanks for adding this. Excited about array/regex, also very interested in your experience using LLMs to stretch your abilities. There are many of us laboring quietly on various projects attempting the same. "Vibe coding" (and the backlash) doesn't really capture how we work.

2 more replies

wood_spirit6d ago

Sharing my current MO:

I start with a high level design md doc which an AI helps write. Then I ask another AI - whether the same model without the context, or another model - to critique it and spot bugs, gaps and omissions. It always finds obvious in hindsight stuff. So I ask it to summarize its findings and I paste that into the first AI and ask its opinions. We form an agreed change and make it and carry on this adversarial round robin until no model can suggest anything that seems weighty.

I then ask the AI to make a plan. And I round robin that through a bunch of AIs adversarially as well. In the end, the plan looks solid.

Then the end to end test cases plan and so on.

By the end of the first day or week or month - depending on the scale of the system - we are ready to code.

And as code gets made I paste that into other AIs with the spec and plan and ask them to spot bugs, omissions and gaps too and so on. Continually using other AI to check on the main one implementing.

And of course you have to go read the code because I have found it that AI misses polishes.

5 more replies

sylvinus6d ago

Thanks for the write up. Always interesting to see how very senior developers interact with AI these days.

@antirez: Introducing a regex feature that late into the project for a seemingly unrelated feature feels a bit weird? Can you explain more your rationale on that? thanks!

1 more reply

jdw646d ago

It feels like Redis is becoming a small database, which seems to make it more convenient to use. Could you add more examples that clarify where the boundary should be?

antirezOP6d ago

Well, Redis is a data structures server, and has very complicated and edgy data structures like the HyperLogLog, so I have very little doubts that a fundamental data type like the Array will fit :) Also the actual complexity added is mostly two C files that are quite commented and understandable.

    wc -l t_array.c sparsearray.c
        2012 t_array.c
        2063 sparsearray.c
        4075 total (including comments)

Sure there are also the AOF / RDB glues, the tests, the vendored TRE library for ARGREP. But all in all it's self contained complexity with little interactions with the rest of the server.

A quick note: if we focus only on that part of the implementation, skipping tests and persistence code which is not huge, 4075 lines in 4 months are an average of 33 lines per day, which is quite low.

jdw646d ago

I’m a big fan of your work, and I honestly didn’t expect to receive a reply from you. Thank you. Also, thank you for pointing out exactly where I was misunderstanding the issue. In the past, I used Redis for temperature measurements in a smart farm project. I used Hashes back then, but it seems like Array would fit that use case much better.

This looks like a very useful feature. Thank you again for the reply.

antirezOP6d ago

I appreciate your kind reply as well :)

SuperV12346d ago

Closely matches my own experiences with current SOTA AI. Extremely useful collaborator, far from being a replacement for human intelligence and creativity.

antirezOP6d ago

There are projects that I develop mostly not looking at the code, but owning the concepts, algorithms and ideas asking questions and giving hints, and owning especially the product. But, not for Redis, not yet at least. When in the future this will be possible, server software, the way it is developed today, will be over. I bet there will be still projects and repositories, as accumulation of features, fixes and experiences will still be worth it, but the role of programmers will be very similar to what Linus did so far for the kernel. And for certain projects I'm developing, like the DeepSeek v4 inference engine, I'l already working like that.

foobarian6d ago

I like to say, AI is the duck programming duck I always wanted

bonesss6d ago

LLMs are the insensitive Asmovian robots I’ve always wanted, who translate and do the hardest part of my job: ensuring my emails are polite and none of my true thoughts or feelings are revealed…

Now I just need a way to protect my chats from any potential discovery, and <pew pew> business’ll be easy.

ericpauley5d ago

Couldn't some of the use cases presented for this be accomplished with ZSETs? I get the performance angle, but it seems that this could have been accomplished without the new API surface by selectively optimizing ZSET storage for dense values (in the same way that Arrays selectively use sparse representations).

The RE component is interesting, but as commentary here has noted it seems orthogonal to the array data structure (i.e., usable on others as well). Does this not make more sense to accomplish with Lua scripting? Or if performance of Lua is an issue perhaps abstracting OP to be composable on top of any command that returns a range of values.

I say this with reverence for Antirez as the expert in this space, but some of this new feature set feels like the sort of solution that I tend to see arise from LLM-driven development; namely creation of new functionality instead of enhancement of existing, plus overcomplicating features when composition with others might be more effective.

1 more reply

localhoster6d ago

Let's make it very clear - this is the original creator of redis, or one of them.

He is not "your avg dev" and it took him 4 months with llm.

This is not a seal of approval for you to go and command all your developers to move to Claude code/codex/any other ai coding tool fully.

I'm looking at you - any avg CEO of a startup.

simonw6d ago

It's a pretty strong endorsement for the idea that coding agents, used skillfully by experienced developers, can further amplify their expertise.

zozbot2345d ago

Sure but the OP suggests that these were minor gains, and that this limited scope for gains was necessary in order to preserve the quality standard that's long been expected in that FLOSS community. We aren't talking about either a 10x productivity gain or one-shotting entire new features from scratch.

This is arguably a key quote: "Then, it was time to read all the code, line by line. ... I found many small inefficiencies or design errors ... so I started a process of manual and AI-assisted rewrite of many modules." We should not underestimate that step: reading code line by line might easily require more time than writing it from scratch.

simonw5d ago

Right, and those of us who advocate for a sensible approach to agentic engineering don't talk about 10x productivity gains or one-shotting entire new (production-ready) features from scratch either.

I remain unconvinced by the "faster to write it by hand than read it" arguments though. My experience throughout my career is that most people, myself included, top out at a couple of hundred lines of tested, production-ready code per day. I can productively review a couple of thousand.

DrammBA6d ago

> He is not "your avg dev" and it took him 4 months with llm.

To clarify, from TFA:

> even before LLMs the implementation was likely something I could do in four months. What changed is that in the same time span, I was able to do a lot more

The initial timeframe was 4 months, he was able to do more work within the same timeframe with LLMs.

tracker15d ago

I would add that the output was likely more as well.. ex: more thorough tests, documentation, etc.

I've been working on a Database adapter for a couple months using an LLM... I've got a couple minor refactors to do still, then getting the "publish" to jsr/npm working... I've mostly held off as I haven't actually done a full review of the code... I've reviewed the tests, and confirmed they're working though. The hard part is there's some features I really want when in Windows to a Windows SQL Server instance that isn't available in linux/containers. I don't think I'll ever choose SQL again, but at least I can use/access a good API with windows direct auth and FILESTREAM access in Deno/Bun/Node.

FWIW: My final implementation landed on ODBC via rust+ffi so after I get the mssql driver out, I'll strip a few bits in a fork and publish a more generic odbc client adapter. using/dispose and async iterators as first class features in the driver.

slig5d ago

>He is not "your avg dev" and it took him 4 months with llm.

He's not, but his work is obviously not average.

Average dev work is plumbing and CRUDs.

thallavajhula5d ago

Salvatore really wants to popularize the term Automatic Programming/Coding it seems. (https://antirez.com/news/159)

2 more replies

simonw6d ago

I vibe coded up an interactive playground against a WebAssembly build of the new array features: https://tools.simonwillison.net/redis-array

1 more reply

nitwit0055d ago

The use of C stdlib localization functions (toupper, mbrtowc, etc), makes me suspect if there will be some regex behavior differences between systems or locales.

1 more reply

ardline5d ago

Solid work. The devil's in the operational complexity, but this looks manageable.

tibbar5d ago

Reviewing 22,000 lines of code, even from antirez, with this complex of a feature set and minimal PR description sounds like a nightmare. One starts to see why major open-source software like Postgres tends to be developed on a mailing list, with intermediate design decisions discussed by the community, separate patches for different related features, incremental review, and then a spaced release cadence.

2 more replies

shay_ker5d ago

antirez: i'm curious, with the final code, have you experimented with effectively one-shotting the final result? i wonder if we can get there with GEPA, and maybe there's something we can learn in how to elicit/prompt these models to get what we want.

or maybe the conclusion is that model providers need to clean up their training data!

gbalduzzi6d ago

Is it possible to see the specification file you created and used for AI assisted development?

Very cool anyway! Can I expect a youtube video about this soon?

antirezOP6d ago

Yep I will release it, it is a bit out of sync at this point, but will do a pass of updating and will release it.

ok1234566d ago

Is this an apologia since the PR is +22,212 -34?

1 more reply

srinikhilr5d ago

Anyone know how to get the specification mentioned in the blog post? Don't see one in the linked PR.

dsecurity496d ago

AI is a fantastic co-pilot, but you still need to know how to fly the plane when the edge cases start hitting the fan.

leetrout6d ago

On safari mobile it's a page with the title header and a footer. Theres no content rendering.

antirezOP6d ago

Checking, thanks. EDIT: works very well on my iPhone, so without being able to reproduce is not easy to fix.

tobr6d ago

Same here, I need to turn off content blockers for the article content to load.

epolanski6d ago

Got few questions:

- the project essentially spans almost 3 different (albeit minor) generations of LLMs. Have you noticed major differences in their personas, behavior, output for that specific use case?

- when using AI for feedback, have you ever considered giving it different "personalities"? I have few skills that role play as very different reviewers with their own different (by design conflicting) personalities. I found this to improve the output, but also to be extremely tiring and to often have high noise ratio.

- when did you, if ever, felt that AI was slowing you down massively compared to just doing it yourself (e.g. some specific bug or performance or design fix)? Are there recurring patterns?

- conversely, how often did AI had moments where it genuinely gave you feedback or ideas that would've not come to you?

- last: do you have specific prompts, skills, setups, etc to work on specific repositories?

1 more reply

ozozozd5d ago

That was too short a story @antirez!

jaunt75d ago

In short, Redis can't be trusted any more.

Who is going to do an LLM free fork?

2 more replies

asG12985d ago

Redis wants to get in on the vector database market that is popular in AI. That is all there is to it.

That is the reason why the Redis author keeps boosting AI. To the point where he even uses Redis to demonstrate how many bugs AI has found. Not every software is as buggy as Redis.

It is all an advertisement by someone extremely adept at manipulating techies.

2 more replies

j / k navigate · click thread line to collapse

109 comments

gurgeous6d ago

2 more replies

wood_spirit6d ago

Sharing my current MO:

I then ask the AI to make a plan. And I round robin that through a bunch of AIs adversarially as well. In the end, the plan looks solid.

Then the end to end test cases plan and so on.

By the end of the first day or week or month - depending on the scale of the system - we are ready to code.

And as code gets made I paste that into other AIs with the spec and plan and ask them to spot bugs, omissions and gaps too and so on. Continually using other AI to check on the main one implementing.

And of course you have to go read the code because I have found it that AI misses polishes.

5 more replies

sylvinus6d ago

Thanks for the write up. Always interesting to see how very senior developers interact with AI these days.

@antirez: Introducing a regex feature that late into the project for a seemingly unrelated feature feels a bit weird? Can you explain more your rationale on that? thanks!

1 more reply

jdw646d ago

It feels like Redis is becoming a small database, which seems to make it more convenient to use. Could you add more examples that clarify where the boundary should be?

antirezOP6d ago

    wc -l t_array.c sparsearray.c
        2012 t_array.c
        2063 sparsearray.c
        4075 total (including comments)

Sure there are also the AOF / RDB glues, the tests, the vendored TRE library for ARGREP. But all in all it's self contained complexity with little interactions with the rest of the server.

jdw646d ago

This looks like a very useful feature. Thank you again for the reply.

antirezOP6d ago

I appreciate your kind reply as well :)

SuperV12346d ago

Closely matches my own experiences with current SOTA AI. Extremely useful collaborator, far from being a replacement for human intelligence and creativity.

antirezOP6d ago

foobarian6d ago

I like to say, AI is the duck programming duck I always wanted

bonesss6d ago

LLMs are the insensitive Asmovian robots I’ve always wanted, who translate and do the hardest part of my job: ensuring my emails are polite and none of my true thoughts or feelings are revealed…

Now I just need a way to protect my chats from any potential discovery, and <pew pew> business’ll be easy.

ericpauley5d ago

1 more reply

localhoster6d ago

Let's make it very clear - this is the original creator of redis, or one of them.

He is not "your avg dev" and it took him 4 months with llm.

This is not a seal of approval for you to go and command all your developers to move to Claude code/codex/any other ai coding tool fully.

I'm looking at you - any avg CEO of a startup.

simonw6d ago

It's a pretty strong endorsement for the idea that coding agents, used skillfully by experienced developers, can further amplify their expertise.

zozbot2345d ago

simonw5d ago

Right, and those of us who advocate for a sensible approach to agentic engineering don't talk about 10x productivity gains or one-shotting entire new (production-ready) features from scratch either.

DrammBA6d ago

> He is not "your avg dev" and it took him 4 months with llm.

To clarify, from TFA:

> even before LLMs the implementation was likely something I could do in four months. What changed is that in the same time span, I was able to do a lot more

The initial timeframe was 4 months, he was able to do more work within the same timeframe with LLMs.

tracker15d ago

I would add that the output was likely more as well.. ex: more thorough tests, documentation, etc.

slig5d ago

>He is not "your avg dev" and it took him 4 months with llm.

He's not, but his work is obviously not average.

Average dev work is plumbing and CRUDs.

thallavajhula5d ago

Salvatore really wants to popularize the term Automatic Programming/Coding it seems. (https://antirez.com/news/159)

2 more replies

simonw6d ago

I vibe coded up an interactive playground against a WebAssembly build of the new array features: https://tools.simonwillison.net/redis-array

1 more reply

nitwit0055d ago

The use of C stdlib localization functions (toupper, mbrtowc, etc), makes me suspect if there will be some regex behavior differences between systems or locales.

1 more reply

ardline5d ago

Solid work. The devil's in the operational complexity, but this looks manageable.

tibbar5d ago

2 more replies

shay_ker5d ago

or maybe the conclusion is that model providers need to clean up their training data!

gbalduzzi6d ago

Is it possible to see the specification file you created and used for AI assisted development?

Very cool anyway! Can I expect a youtube video about this soon?

antirezOP6d ago

Yep I will release it, it is a bit out of sync at this point, but will do a pass of updating and will release it.

ok1234566d ago

Is this an apologia since the PR is +22,212 -34?

1 more reply

srinikhilr5d ago

Anyone know how to get the specification mentioned in the blog post? Don't see one in the linked PR.

dsecurity496d ago

AI is a fantastic co-pilot, but you still need to know how to fly the plane when the edge cases start hitting the fan.

leetrout6d ago

On safari mobile it's a page with the title header and a footer. Theres no content rendering.

antirezOP6d ago

Checking, thanks. EDIT: works very well on my iPhone, so without being able to reproduce is not easy to fix.

tobr6d ago

Same here, I need to turn off content blockers for the article content to load.

epolanski6d ago

Got few questions:

- the project essentially spans almost 3 different (albeit minor) generations of LLMs. Have you noticed major differences in their personas, behavior, output for that specific use case?

- when did you, if ever, felt that AI was slowing you down massively compared to just doing it yourself (e.g. some specific bug or performance or design fix)? Are there recurring patterns?

- conversely, how often did AI had moments where it genuinely gave you feedback or ideas that would've not come to you?

- last: do you have specific prompts, skills, setups, etc to work on specific repositories?

1 more reply

ozozozd5d ago

That was too short a story @antirez!

jaunt75d ago

In short, Redis can't be trusted any more.

Who is going to do an LLM free fork?

2 more replies

asG12985d ago

Redis wants to get in on the vector database market that is popular in AI. That is all there is to it.

That is the reason why the Redis author keeps boosting AI. To the point where he even uses Redis to demonstrate how many bugs AI has found. Not every software is as buggy as Redis.

It is all an advertisement by someone extremely adept at manipulating techies.

2 more replies

j / k navigate · click thread line to collapse