Gas Town Decoded (opens in new tab)

(alilleybrinker.com)

219 pointsalilleybrinker3mo ago234 comments

234 comments

I’m very bought in to the idea that raw coding is now a solved problem with the current models and agentic harnesses. Let alone what’s coming in the near term.

That being said, I think we’re in a weird phase right now where people’s obvious mental health issues are appearing as “hyper productivity” due to the use of these tools to absolutely spam out code that isn’t necessarily broadly coherent but is locally impressive. I’m watching multiple people both publicly and privately clearly breaking down mentally because of the “power” AI is bestowing on them. Their wires are completely crossed when it comes to the value of outputs vs outcomes and they’re espousing generated nonsense as it’s thoughtful insight.

It’s an interesting thing to watch play out.

ben_w3mo ago

Mm.

I'd agree, the code "isn’t necessarily broadly coherent but is locally impressive".

However, I've seen some totally successful, even award-winning, human-written projects where I could say the same.

Ages back, I heard a woodworking analogy:

  LLM code is like MDF. Really useful for cheap furniture, massively cheaper than solid wood, but it would be a mistake to use it as a structural element in a house.

Now, I've never made anything more complex than furniture, so I don't know how well that fit the previous models let alone the current ones… but I've absolutely seen success coming out of bigger balls of mud than the balls of mud I got from letting Claude loose for a bit without oversight.

Still, just because you can get success even with sloppy code, doesn't mean I think this is true everywhere. It's not like the award was for industrial equipment or anything, the closest I've come to life-critical code is helping to find and schedule video calls with GPs.

theshrike793mo ago

"Without oversight" is the key here.

You need to define the problem space so that the agent knows what to do. Basically give it the tools to determine when it's "done" as defined by you.

spmurrayzzz3mo ago

This has also been an interesting social experiment in that we get to see what work people think is actually impressive vs trivial.

Folks who have spent years effectively snapping together other people’s APIs like LEGOs (and being well-compensated for it) are understandably blown away by the current state of AI. Compare that to someone writing embedded firmware for device microcontrollers, who would understandably be underwhelmed by the same.

The gap in reactions says more about the nature of the work than it does about the tools themselves.

aaronblohowiak3mo ago

>Compare that to someone writing embedded firmware for device microcontrollers, who would understandably be underwhelmed by the same.

One datum for you: I recently asked Claude to make a jerk-limited and jerk-derivative-limited motion planner and to use the existing trapezoidal planner as reference for fuzzy-testing various moves (to ensure total pulses sent was correct) and it totally worked. Only a few rounds of guidance to get it to where I wanted to commit it.

spmurrayzzz3mo ago

My comment above I hope wasn't read to mean "LLMs are only good at web dev." Only that there are different capability magnitudes.

I often do experiments where I will clone one of our private repos, revert a commit, trash the .git path, and then see if any of the models/agents can re-apply the commit after N iterations. I record the pass@k score and compare between model generations over time.

In one of those recent experiments, I saw gpt-oss-120b add API support to swap tx and rx IQ for digital spectral inversion at higher frequencies on our wireless devices. This is for a proprietary IC running a quantenna radio, the SDK of which is very likely not in-distribution. It was moderately impressive to me in part because just writing the IQ swap registers had a negative effect on performance, but the model found that swapping the order of the IQ imbalance coefficients fixed the performance degradation.

I wouldn't say this was the same level of "impressive" as what the hype demands, but I remain an enthusiastic user of AI tooling due to somewhat regular moments like that. Especially when it involves open weight models of a low-to-moderate param count. My original point though is that those moments are far more common in web dev than they are elsewhere currently.

EDIT: Forgot to add that the model also did some work that the original commit did not. It removed code paths that were clobbering the rx IQ swap register and instead changed it to explicitly initialize during baseband init so it would come up correct on boot.

1 more reply

aprdm3mo ago

This is not true. You can see people who are much older and built a lot of the "internet scale" equally excited about it, e.g: freebsd OG developers, Steve himself (who wrote gas town) etc.

In fact, I would say I've seen more people who are "OG Coders" excited (and in their >50s) then mid generation

spmurrayzzz3mo ago

I think you're shadow-boxing with a point I never made. I never said experienced devs are not or can not be excited about current AI capabilities.

Lots of experienced devs who work in more difficult domains are excited about AI. In fact, I am one of them (see one of my responses in this thread about gpt-oss being able to work on proprietary RF firmware in my company [1]).

But that in no way suggests that there isn't a gap in what impresses or surprises engineers across any set of domains. Antirez is probably one of the better, more reasoned examples of this.

[1] https://news.ycombinator.com/item?id=46682604

phist_mcgee3mo ago

I think this says a lot about yourself and where your prejudices and preferences lie.

spmurrayzzz3mo ago

Preferences I think I get, but prejudices?

The OED defines prejudice as a "preconceived opinion that is not based on reason or actual experience."

My day to day work involves: full stack web dev, distributed systems, embedded systems, and machine learning. In addition to using AI tooling for dev tasks, we also use agents in production for various workflows and we also train/finetune models (some LLMs, but also other types of neural networks for anomaly detection, fault localization, time series forecasting, etc). I am basing my original commentary in this thread on all of that cumulative experience.

It has been my observation over the last almost 30 years of being a professional SWE that full stack web dev has been much easier and simpler than the other domains I work in. And even further, I find that models are much better at that domain on average than the other domains, measured by pass@k scores on private evals representing each domain. Anecdotal experience also tends to match the evals.

This tracks with all the other information we have pertaining to benchmark saturation, the "we need harder evals" crowd has been ringing this bell for the last 8-12 months. Models are getting very good at the less complex tasks.

I don't believe it will remain that way forever, but at present its far more common to see someone one shot a full stack web app from a single prompt than something like kernel driver for a NIC. One class of devs is seeing a massive performance jump, another class is not.

I don't see how that can be perceived as prejudice, it just may be an opinion you don't agree with or an observation that doesn't match your own experience (both of which are totally valid and understandable).

yetihehe3mo ago

If you give every idiot a worldwide heard voice, you will hear every idiot from the whole world. If you give every idiot a tool to make programs, you will see a lot of programs made by idiots.

meowface3mo ago

Steve Yegge is not an idiot or a bad programmer. Possibly just hypomanic at most. And a good, entertaining writer. https://en.wikipedia.org/wiki/Steve_Yegge

Gas Town is ridiculous and I had to uninstall Beads after seeing it only confuse my agents, but he's not completely insane or a moron. There may be some kernels of good ideas inside of Gas Town which could be extracted out into a better system.

yetihehe3mo ago

> Steve Yegge is not an idiot or a bad programmer.

I don't think he's an idiot, there are almost no actual idiots here on HN in my opinion and they don't write such articles or make systems like Steve Yegge. I'm only commenting about giving more tools to idiots. Even tools made by geniuses will give you idiotic results when used by actual idiots, but a lot of smart people want to lower barriers of entry so that idiots can use more tools. And there are a lot of idiots who were inactive just because they didn't have the tools. Famous quote from a famous Polish essayist/futurist Stanisław Lem: "I didn't know there are so many idiots in this world until I got internet".

fegd853mo ago

Even if I looked past the overwrought, self-indulgent Mad Max LARP (and the poor judgment evidenced by the prioritization of world-building minutia while the basic architecture is imploding), the cost of finding those kernels in a monstrosity of this size negates any ROI. 189k lines in four weeks will inevitably surface interesting pattern combinations — that's not merit, that's sample size. You might as well search the Library of Babel; at least the patterns are guaranteed to exist there.

The other problem with that reasoning is that whatever patterns ARE interesting are more likely to be new to AI-assisted coding generally – meaning a cleaner system built for the same use case will surface them without the archaeological dig, just by virtue of its builder having the skill to design it (and crucially, being more interested in designing it than in creating AI drawings of polecats in steampunk-adjacent garb).

I'm also a bit curious about at which point you start considering someone an idiot when they keep making objectively idiotic moves – the whimsical Disneyfied presentation, the "please don't download this" false modesty while keeping the repo public, the inexplicable code growth all come from the same place. They're not separate quirks: they're the same inability to edit, the same need for immediate audience validation, the same substitution of volume and narrative for actual engineering discipline. Someone who thinks "Polecats" and "Guzzoline" are good names for production abstractions is not suddenly going to develop the editorial rigor to scrap a codebase and rebuild.

Which is why it's worth remembering that Yegge's one successful shipped project was Grok, an internal tool used by Google engineers, so Yegge seems to have bought his own hype, missing how much of that project's success was likely subsidized by its user base comprising people skilled enough to route around its limitations.

These days he seems to be building for developers in general, but critically might be missing that actual developers immediately clock the project's ineptitude + Yegge's immature, narcissistic prioritization and peace the fuck out. The end result of this is filtering for the self-described vibe-coder types, people already Dunning-Krugered enough to believe you can prompt your way into a complete system without knowing how to reason about that system in order to guide the AI.

Which, fittingly, is how you end up with users who can't even follow "please don't download this yet".

sonnig3mo ago

Well put. I can't help thinking of this every time I see the 854594th "agent coordination framework" in GitHub. They all look strangely similar, are obviously themselves vibe-coded, and make no real effort to present any type of evidence that they can help development in any way.

petesergeant3mo ago

> where people’s obvious mental health issues

I think the kids would call this "getting one-shotted by AI"

GrowingSideways3mo ago

> raw coding is now a solved problem

Surely this was solved with fortran. What changed? I think most people just don't know what program they want.

lordnacho3mo ago

You no longer have to be very specific about syntax. There's now an AI that can translate your idea into whatever language you want.

Previously, if you had an idea of what the program needed to do, you needed to learn a new language. This is so hard that we use language itself as a metaphor: It's hard to learn a new language, only a few people can translate from French to English, for example. Likewise, few people can translate English to Fortran.

Now, you can just think about your program in English, and so long as you actually know what you want, you can get a Fortran program.

The issue is now what it was originally for senior programmers: to decide what to make, not how to make it.

hnlmorg3mo ago

The hard part of software development is equivalent to the hard part of engineering:

Anyone can draw a sketch of what a house should look like. But designing a house that is safe, conforms to building regulations, and which wouldn't be uncomfortable to live in (for example, poor choice of heat insulation for the local climate) is the stuff people train on. Not the sketching part.

It's the same for software development. All we've done is replace FORTRAN / Javascript / whatever with a subset of a natural language. But we still need to thoroughly understand the problem and describe it to the LLM. Plus the way we format these markdown prompts, you're basically still programming. Albeit in a less strict syntax and the "compiler" is non-deterministic.

This is why I get so mythed by comments about AI replacing programmers. That's not what's happening. Programming is just shifting to a language that looks more like Jira tickets than source code. And the orgs that think they can replace developers with AI (and I don't for one second believe many of the technology leaders think this, but some smaller orgs likely do) are heading for a very unpleasant realisation soon.

I will caveat this by saying: there are far too many naff developers out there that genuinely aren't any better than an LLM. And maybe what we need is more regulation around software development, just like there is in proper engineering professions.

2 more replies

GrowingSideways3mo ago

Again, I don't think most people are prepared to articulate what behavior they want. Fortran (and any other formal language) used to force this, but now you just kind of jerk off on the keyboard or into the microphone and expect mind-reading.

Reactionarily? Sure. Maybe AI has some role to play there. Maybe you can ask the chatbot to modify settings.

I am no fan of chatbots. But i do have empathy for the people responsible for them when their users start complaining that programs don't do what they want, despite the chatbots delivering precisely the code demanded.

https://youtu.be/5IsSpAOD6K8?si=FtfQZzgRU8K2z4Ub

hahahahhaah3mo ago

Yeah I am definitely trying to stay off hype and just use the damn tool

bkolobara3mo ago

There is a lot of research on how words/language influences what we think, and even what we can observe, like the Sapir-Whorf hypothesis. If in a langauge there is one word for 2 different colors, speakers of it are unable to see the difference between the colors.

I have a suspicion that extensive use of LLMs can result in damage to your brain. That's why we are seeing so many mental health issues surfacing up, and we are getting a bunch of blog posts about "an agentic coding psychosis".

It could be that llms go from bicycles for the brain to smoking for the brain, once we figure out the long term effects of it.

BrenBarn3mo ago

> If in a langauge there is one word for 2 different colors, speakers of it are unable to see the difference between the colors.

That is quite untrue. It is true that people may be slightly slower or less accurate in distinguishing colors that are within a labeled category than those that cross a category boundary, but that's far from saying they can't perceive the difference at all. The latter would imply that, for instance, English speakers cannot distinguish shades of blue or green.

bkolobara3mo ago

The point I was trying to make is that the way our brain works is deeply connected to language and words, including how fast and how accurate you perceive colors [0][1]. And interacting with an LLM could have unexpected side effects on it, because we were never before exposed to "statistically generated language" in such amounts.

[0]: https://youtu.be/RKK7wGAYP6k?si=GK6VPP0yoFoGyOn3 [1]: https://youtu.be/I64RtGofPW8?si=v1FNU06rb5mMYRKj&t=889

1 more reply

jstanley3mo ago

> If in a langauge there is one word for 2 different colors, speakers of it are unable to see the difference between the colors.

Perhaps you mean to say that speakers are unable to name the difference between the colours?

I can easily see differences between (for example) different shades of red. But I can't name them other than "shade of red".

I do happen to subscribe to the Sapir-Whorf hypothesis, in the sense that I think the language you think in constrains your thoughts - but I don't think it is strong enough to prevent you from being able to see different colours.

bkolobara3mo ago

No, if you show them two colors and ask them if they are different, they will tell you no.

EDIT: I have been searching for the source of where I saw this, but can't find it now :(

EDIT2: I found a talk touching in the topic with a study: https://youtu.be/I64RtGofPW8?si=v1FNU06rb5mMYRKj&t=889

3 more replies

skywhopper3mo ago

But the color thing is self-evidently untrue. It’s not even hard to talk about. Unless you yourself are colorblind I think that would be obvious?

bonzini3mo ago

Sort of, at least some degree of relativism exists though how much is debated. Would you ever talk about sea having the same color as wine? But that's exactly what Homer called it.

https://en.wikipedia.org/wiki/Wine-dark_sea

https://en.wikipedia.org/wiki/Linguistic_relativity_and_the_...

1 more reply

bastawhiz3mo ago

The idea of gas town is simultaneously appealing and appalling to me. The waste and lack of control is wild, but at the same time there's at least a nugget of fascinating, useful work in there. In a world where compute is cheap and abundant and the models are a notch smarter, I think it's the start of a useful framework for what the future of augmented work might look like.

I have no interest in using gas town as it is (for a plethora of reasons, not the least of which being that I'm uninterested in spending the money), but I've been fascinated with the idea of slowing it down and having it run with a low concurrency. If you've got a couple A100s, what does it look like if you keep them busy with two agents working concurrently (with 20+ agents total)? What does it mean to have the town focus the scope of work to a series of non-overlapping changesets instead of a continuous stream of work?

If you don't plan to have it YOLO stuff in realtime and you can handle the models being dumber than Claude, I think you can have it do some really practical, useful things that are markedly better than the tools we have today.

_ea1k3mo ago

I put it in a VM and had it build a really simple todo app for me the other day. It wasted so many tokens that I can't help but agree with you right now. And I could certainly have done the same thing with beads and opus in approximately the same amount of time.

However, the gas town one was almost completely hands off. I think my only interventions were due to how beta it was, so I had to help it work around its own bugs to keep from doing stupid things.

Other than that, it implemented exactly what I asked for in a workable fashion with effectively one prompt. It would have taken several prompts and course corrections to get the same result without it.

Other than the riskyness (it runs in dangerous permissions mode) and incredible cost inefficiency, I'd certainly use it.

safety1st3mo ago

If gas town can actually do stuff well at any price it'll have a radical impact on how society is organized, because there are people out there who have practically unlimited money (billions of dollars of their own to spend, plus they can get the government to print more dollars for them if necessary; you probably already know who a few of these people are).

I've only started using coding agents recently and I think they go a long way to explain why different people get different mileage from "AI." My experience with Opencode using its default model, vs. Github Copilot using its default model, is night and day. One is amazing, the other is pretty crappy. That's a product of both the software/interface and the model itself I'd suspect.

Where I think this goes in the medium term is we will absolutely spin up our own teams of agents, probably not conforming to the silly anthropomorphized "town" model with mayors and polecats and so on, but they'll be specialized to particular purposes and respond to specific events within a software architecture or a project or even a business model. Currently the sky's the limit in my mind for all the possible applications of this, and a lot of it can be done with existing and fairly cheap models too, so the bottleneck is, surprise surprise... developer time! The industry won't disappear but it will increasingly revolve around orchestrating these teams of models, and software will continue to eat the world.

eru3mo ago

I guess tokens get cheaper all the time, and we can fix the risk via sufficient sand boxing. (I mean the risk to your computer.)

Avicebron3mo ago

I've been running my own version of what Gas Town seems to be in a couple of proxmox hosts for a while now, it's fine.

condiment3mo ago

If software engineers can agree on anything, it's that LLM experiences are wildly inconsistent. People have similar inconsistencies. We have different experiences, intellects, educations, priorities, motivations, value systems. And in software specifically (and institutions generally) we create methodologies and processes that diminish our inconsistencies and leverage our strengths.

Gas town is a demonstration of a methodology for getting a consistent result from inconsistent agents. The case in point is that Yegge claims to have solved the MAKER problem (tower of Hanoi) via prompting alone. With the right structure, quantity has a quality all its own.

hahahahhaah3mo ago

I feel like each of these things is going to be bitter lessoned by a model who you can just say "yeah get a bunch of agents together and clone twitter, get em to put requirements together first, ya know, measure once and all that. promise em a beer when done".

keyle3mo ago

I'd help build Gas City and Gas State, and Gas Country if that would mean we actually would solve the things AI promised to solve. All sickness, famine, wealth ...

The problem is, we're just fidgeting yolo-fizzbuzz ad nauseam.

The return on investment at the moment is probably one of the worst in the history of human investments.

AI does improve over time, still today, but we're going to run out of planet before we get there...

ViscountPenguin3mo ago

As of yet, the AI models doing important work are still pretty specialized. I'd be happy to pitch in to run something like an open source version of alpha-fold, but I'm not aware of any such projects.

I have trouble seeing LLMs making meaningful progress on those frontiers without reaching ASI, but I'd be happy to be wrong.

Terr_3mo ago

I think part of the problem/difference is that all "important work" needs to be auditable and understood by humans. We need to be able to fix bugs, and not just roll the dice and hoping that a lack of symptoms means everything is cured.

camgunz3mo ago

Even alphafold generated a bunch of slop,like impossible proteins and such.

alecco3mo ago

That doesn't make any sense.

Yegge named it Gas Town as in "refinery" because the main job for the human at this stage is reviewing the generated code and merging. "

The whole point of the project is to be in control. Yegge even says the programmers who can read/review a lot of code fast are the new 10x (paraphrasing).

colin_jack3mo ago

"I’ve never seen the code, and I never care to, which might give you pause"

https://steve-yegge.medium.com/welcome-to-gas-town-4f25ee16d...

alecco3mo ago

Oof, he changed that. I stand corrected.

1 more reply

soulofmischief3mo ago

The Wright brothers are idiots, if it were me I'd have made a supersonic jet from the get go and not waste my time mucking around with prototypes.

ncruces3mo ago

The prototype phase meant data centers are now measured in MW instead of TFLOPS.

At a time where we were desperate to reduce emissions, data centers now consume around 20% of the energy consumed by the entire aviation sector, with consumption is rising at 15% YoY.

Never mind the water required to cool them, or the energy and resources required to build them, the capital allocation, and the opportunity cost of not allocating all of that to something else.

And this is, your words, the prototype phase.

neoromantique3mo ago

Emissions and Energy consumed do not necessarily have to be linked up.

We have plenty of ways to make clean energy, it is only matter of incentives.

As long as burning coal is simply cheaper, business will burn coal.

soulofmischief3mo ago

The computing power in a crappy cheap modern phone used to fill up a warehouse and cost a ton of energy, relatively. Moore's law might not remain steadfast, but if history is any indication, we'll find a way to make the technology more efficient.

So, yes, prototypes often use more energy than the final product. That doesn't mean we shouldn't sustainable build datacenters, but that's conflating issues.

jpfromlondon3mo ago

the Wright brothers sold me a subscription to a supersonic jet and I've got a bundle of matchsticks and some canvas.

soulofmischief3mo ago

On the other hand, flight is ubiquitous and has changed everything.

1 more reply

ares6233mo ago

We were promised supersonic jets today or very soon though and our economies have been held hostage waiting for that promise.

eru3mo ago

The passive voice is doing a lot of work in your sentence.

1 more reply

soulofmischief3mo ago

The first recorded supersonic flight was in 1947.

1 more reply

Kostchei3mo ago

Are you saying that people can't work out what to code using these? Or that code is not a worthy subject to use AI for? 'cause I got news for you... 1. Improving coding improved reasoning in the models. Having a verifiable answer that is not a single thing is a good training test. 2. Software has been used for fairly serious things. We used to have skyscrapers of people doing manual math. Now we have campuses of people doing manual code. You might argue that nobody would trust AI to write code when it matters. History tells us that if that is ever true, it will pass. 3. We are not going to run out of planet. It just feels to folks that there is not enough planet for their dreams and we get population panic, energy panic etc. There is a huge fusion reactor conveniently holding us in it's gravity well and spewing out many orders of magnitude more energy than we can currently use. Chill.

I think at Gas Country levels we will need better networking systems. Maybe that backbone Nvidia just built....

krupan3mo ago

Replacing human computers with electronic computers is nothing like what LLMs do or how they work. The electronic computer is straight up automation. Same input in gives you the same input out every time. Electronic computers are actually pretty simple. They just do simple mathematical operations like add, subtract, multiply, and divide. What makes them so powerful is that they can do billions of those simple operations a second.

LLMs are not simple deterministic machines that automate rote tasks like computers or compilers. People, please stop believing and repeating that they are the next level of abstraction and automation. They aren't.

toephu23mo ago

AI can't even find a cure for the common cold.

jamestimmins3mo ago

I actually love the idea of totally new naming schemes for experimental software.

Certain name types are so normalized (agent, worker, etc) that while they serve their role well, they likely limit our imagination when thinking about software, and it's a worthwhile effort to explore alternatives.

tom_3mo ago

This reminds me of Moldbug's Urbit. I can't be bothered to look it up, but his comment was along the lines of "existing words bring assumptions, so safest to make new ones". To which, my comment would be: perflufflington flibnik qupnux.

vessenes3mo ago

Not just this, but I’ve been thinking that naming things with aggressively strong connotations might help Claude get out of ‘nice/helpful’ mode. “You are the Deacon, grrrrr”. So there might be actually be a bit of effectiveness added by naming an agent appropriately. I offer no opinion on the word polecats.

ivankra3mo ago

Maybe helps the LLM, but at the cost of confusing humans. It would've been better left as an internal implementation detail. I've got better things to keep in my head that remembering wtf deacon is, etc.

3dsnano3mo ago

yes, totally. AI-luddites see this as whimsy, but it's actually wizard-level power of abstractions and context ascension. if u know u know

tptacek3mo ago

I do too, but you can take things too far, which I'd argue has happened the moment "figuring out what the names mean" becomes enough of an intellectual challenge to provide a dopamine hit; at that point, you've (intentionally or otherwise) germinated a cult. It's human nature: people will support the design not on its merits but rather as loss aversion for the work they put into decoding it.

jamestimmins3mo ago

Yes at some point innovative software and naming are at cross purposes, and if your naming gets too extreme ultimately that will get all of the attention.

bonesss3mo ago

Anthropomorphizing chunks of your system is kinda weird given interactive chat as the UI to the LLM.

Akka and others have standardized names for all this stuff (and seem to fully know that a code ‘actor’ is code). These wheels don’t need reinventing (much less as ‘the Marvin’s’, a lovable set of bi-racial quadruplets who always get you where you’re going <rocket emoji>).

In fact, I dare say a lot of LLM fascination for orchestration is people unfamiliar with actor models and the level of elegance a properly expressive language lets them have.

1 more reply

vessenes3mo ago

Very minor nit -- crew could be a person also - in fact that's how you're supposed to hack on a codebase in gas town directly - add yourself as crew.

Other than that, this is a helpful list especially for someone who hasn't been hacking around on this thing as it's in rapid development mode. I find gas town super interesting, and tantalizingly close to being amazingly useful. That said, I wouldn't mind a slightly less 'flavored' set of names for workers.

fdr3mo ago

I use beads quite a bit, but not as steve intended. And definitely the opposite of "Gas Town," where I use the note-taking capability and integration with Git (that is, as something of a glorified Makefile and database) to debug contexts, to close the loop and increase accuracy over time. Nevertheless, it has been useful for large batch runs over my code base: the record has been processing for thirty hours straight while getting something useful, and enough trace data to make further improvements.

Steve has gone "a bit" loopy, in a (so far) self aware manner, but he has some kind of insight into the software engineering process, I think. Yet, I predict beads will break under the weight of no-supervision eventually if he keeps churning it, but some others will pick up where he left off, with more modest goals. He did, to his credit, kill off several generations of project before this one in a similar category.

alexjurkiewicz3mo ago

His latest post is endorsing a crypto exchange because they paid him $50k.

https://steve-yegge.medium.com/bags-and-the-creator-economy-...

bonesss3mo ago

I’m pro LLM and use them, but crikey: if they’re so good at code why aren’t these people with all the attention, branding, and connections in the world unable to capitalize them?

I believe Google that uses their internal Gemini trained on their internal infrastructure to generate boiler plate and insights for older, less mature, code in one of the worlds biggest and most complicated anythings, ever. But I don’t see them saying anything to the effect of “neener neener, we’re using markov chains so 10x our stock ‘cause of the otherwise impossible face melting Google Docs 2026.”

OpenAI is chasing ads, like Reddit, to regurgitate Reddit content. If this stuff is worth the squeeze I need to see the top 10 LLM-fluencers refusing to bend over for $50K. The opposite is on display.

So hypotheses: Google’s s-tier geniuses and PMs are already expressing the mature optimum application. No silver bullets, more gains to be had ditching bad tech and extraneous vendor entanglements (copilot, 365).

coryrc3mo ago

> if they’re so good at code why aren’t these people with all the attention, branding, and connections in the world unable to capitalize them?

Exactly, this is what I'm wanting to see.

lovich3mo ago

That entire article sounds like my friends who think AI is real and keep sending their parents money into crypto scams.

I think I’ll just develop a drinking problem if this is Gas Town becomes something real in the industry and this kind of person is now one of our thought leaders.

lovich3mo ago

Too late to edit,

Who thinks AI is a real person*

_ea1k3mo ago

To be fair, he's always been a little loopy. At least, I think this post of his was loopy: https://steve-yegge.blogspot.com/2007/06/that-old-marshmallo...

It was also one of my favorite posts of his and has aged incredibly well as my experience has grown.

fdr3mo ago

that's one reason I am less worried about him than some, although, I don't want to say that only to have something bad happen to him, that is, a form of complacency. Just because (say) Boltzmann and Cantor had useful insights along the way didn't mean people shouldn't have been looking to support them.

sorenbs3mo ago

> but some others will pick up where he left off, with more modest goals

Already happening :-) https://github.com/Dicklesworthstone/beads_rust

fdr3mo ago

the main area I'd like to see some departure from beads is to use markdown files (or something) to be able to see the issue context/comments better in a diff generated by git.

The other area I'd like to see some software engineering thinking that's more open ended is on regression testing: ways of storing or referencing old versions of texts to see if the agent can complete old transformations properly even with a context change that patches up a weakness in a transformation that is desirable. This is tricky as it interacts with something essential in software engineering, the ability to run test suites and responding to the outcome. I don't think we know yet when to apply what fidelity of testing, e.g. one-shot on snippets versus a more realistic test based on git worktrees.

This is not something you'd want for every context, but a lot of my effort is spent building up prompt fragments to normalize and clean up the code coming out of a model that did some ad-hoc work that meets the test coverage bar, which constrains it decently into having achieved "something." Kind of like a prototype. But often, a lot of ungratifying massaging is required to even cover the annoying but not dangerous tics of the LLM, to bring clarity to where it wrote, well, very bad and unprincipled code...as it does sometimes.

wild_egg3mo ago

I was disappointed to see that this is still 10x the code needed for the feature set and that it still insists on duplicating state into a SQLite index for such minuscule amounts of data.

I've seen 25-30 similar efforts to make a Beads alternative and they all do this for some reason.

krupan3mo ago

Gas town and the like all basically sound to me like, "AI is amazing! Ok, actually it isn't very good, but maybe we can just use more AI with our AI and then it'll be good!"

And I'm not surprised at all to learn that this path took us to a "Maintenance Manager Checker Agent." I wonder what he'll call the inevitable Maintenance Manager Checker Agent Checker Agent?

Maybe I've been in this game too long, but I've encountered managers that think like this before. "We don't need expensive, brilliant, developers, we just need good processes for the cheap inexperienced developers to follow." I think what keeps this idea alive is that it sort of works for simple CRUD apps and other essentially "solved" problems. At least until the app needs to become more than just a simple CRUD app

mjr003mo ago

It's the Jarvis Effect.

For years we had people trying to make voice agents, like Iron Man's Jarvis, a thing. You had people super bought into the idea that if you could talk to your computer and say "Jarvis, book me a flight from New York to Hawaii" and it would just do it just like the movies, that was the future, that was sci-fi, it was awesome.

But it turns out that voice sucks as a user interface. The only time people use voice controls is when they can't use other controls, i.e. while driving. Nobody is voluntarily booking a flight with their Alexa. There's a reason every society on the planet shifted from primarily phone calls to texting once the technology was available!

It's similar with vibe coding. People like Yegge are extremely bought into the idea of being a hyperpowered coder, sitting in a dimly lit basement in front of 8 computer screens, commanding an army of agents with English, sipping coffee between barking out orders. "Agent 1, refactor that method to be more efficient. Agent 5, tighten up the graphics on level 3!"

Whether or not it's effective or better than regular software development is secondary, if it's a concern at all. The purpose is the process. It's the future. It's sci-fi. It's awesome.

AI is an incredible tool and we're still discovering the right way to use it, but boy, "Gas Town" is not it.

bagacrap3mo ago

This is confusing. Voice is not a UI, it's an input device. When I call my bank and have to input some numbers into the automated system, I prefer to say them than to type them. The phone menu system is the UI, fingers or voice are two different input modes for the same UI.

The problem with alexa booking tickets is not the use of my voice but that there are a lot of decisions (comparison shopping, seat selection etc) to be made. Alexa can't read my mind to make the trade-offs I would make, although it could ask me 10 zillion questions. The difference between voice/ears and fingers/eyes is the bandwidth of information transfer, but also the availability of the tools. Hands and eyes may be busy as in your car example, but they are also busy if I'm carrying a toddler around the house or can't be bothered to reach into my pocket or am already using my phone for something else (game, video etc). So voice is a good option for many tasks. And LLMs/agents do have the potential to make more tasks (simple ones, not booking tickets) accessible to voice since "AI as UI" is where it holds the most potential IMHO. And that's great because we need all the help we can get to avoid taking our phones out of our pockets and getting sucked into random tangents like HN comment threads just bc we wanted to check the weather

colin_jack3mo ago

"Agent 1, refactor that method to be more efficient. Agent 5, tighten up the graphics on level 3!"

I'm not sure its even that, his description of his role in this is:

"You are a Product Manager, and Gas Town is an Idea Compiler. You just make up features, design them, file the implementation plans, and then sling the work around to your polecats and crew. Opus 4.5 can handle any reasonably sized task, so your job is to make tasks for it. That’s it."

And he says he isn't reviewing the code, he lets agents review each others code from look of it. I am interested to see the specs/feature definitions he's giving them, that seems to be one interesting part of his flow.

mjr003mo ago

Yeah maybe the refactoring was a bad example because it implies looking at the code. It's more like "Agent 1, change the color of this widget. Agent 9, add a red dot whenever there's a new message. Agent 67, send out a marketing email blast advertising our new feature."

sonnig3mo ago

Assuming both agents are using the same model, what could the reviewer agent add of value to the agent writing the code? It feels like "thinking mode" but with extra steps, and more chance of getting them stuck in a loop trying to overcorrect some small inane detail.

1 more reply

brunoborges3mo ago

> Nobody is voluntarily booking a flight with their Alexa.

Rich people use voice because they have disposable income and they don't care if a flight is $800 or $4,000. They are likely buying business/first class anyways.

Tony Stark certainly doesn't care. Elon Musk certainly uses voice to talk to his management team to book his flights.

The average person doesn't have the privilege of using voice because it doesn't have enough fuck-you-money to not care for prices.

mjr003mo ago

As someone who's friends with executive assistants: rich people use executive assistants (humans) because they are busy and/or value their time more than money and don't want to bother with the details. None of them are using voice assistants.

> Tony Stark certainly doesn't care. Elon Musk certainly uses voice to talk to his management team to book his flights.

Delegating to a human isn't the same as using a voice assistant, this should be obvious, unless you believe that managers are doing all the real work and every IC is a brainless minion. Maybe far in the future when there's AGI, but certainly not today.

> The average person doesn't have the privilege of using voice because it doesn't have enough fuck-you-money to not care for prices.

You can order crap off Amazon for the same price as you would through the website with your Alexa right now, but Amazon themselves have admitted approximately 0% of people actually do this which is why the entire division ended up a minor disaster. It's just a shitty interface in the same way that booking a flight through voice is a shitty interface.

1 more reply

colin_jack3mo ago

This might be worth a read, just as its from a trusted source and is more grounded: https://antirez.com/news/158

krupan3mo ago

Still the same. "Hey look, I got these crappy developers (LLMs) to actually produce working code! This is a game-changer!" When the working code is a very small, limited thing.

colin_jack3mo ago

I don't know, your talking about an incredibly talented engineer saying:

"In the past week, just prompting, and inspecting the code to provide guidance from time to time, in a few hours I did the following four tasks, in hours instead of weeks"

Its up to you to decide how to behave, but I can't see any reasons to completely dismiss this. It ends with good guidance what to do if you can't replicate though.

1 more reply

ryanjshaw3mo ago

What evidence will convince you?

1 more reply

krackers3mo ago

If you add enough agents, you basically recreate the structure of human teams again. And the lessons from mythical man month etc. start applying. Large companies/teams don't seem to produce higher quality software than small ones, in fact they usually seem to be worse.

At best the notion of "subagents" today seems to be a hack to work around context length limits.

hakunin3mo ago

This is a good point, but with ai it’s a little different, because both your process and ai are getting better. You build processes that can aspirationally support inferior AIs, while at the same time AIs themselves improve and meet you half way. This thought does not help my mental well being unfortunately.

jaapz3mo ago

I think in the end people will realise AI is not a silver bullet that will solve all problems and make all software engineers obsolete. Instead it will just become an extra tool in our toolbelt right alongside LSP, linters/fixers, unit test frameworks, well though out text editors and IDE's, etc.

When the bubble has burst in a few years, the managers will have moved on to the next fad.

krupan3mo ago

Yes, if it even becomes as useful as a good linter or LSP. It actually has very little I'm common with those, but maybe it could be as useful.

disgruntledphd23mo ago

It's definitely helpful for search and summarisation.

In terms of prototyping, I can see the benefits but they're negated by the absurd amount of work it takes to get the code into more maintainable form.

I guess you can just do really heavy review throughout, but then you lose a lot of the speed gains.

That being said, putting a textual interface over everything is super cool, and will definitely have large downstream impacts, but probably not enough to justify all the spending.

iosovi3mo ago

The original Gas Town article reads like a terminal entry in Fallout

zmmmmm3mo ago

It seems like one of the key events that needs to happen for any professional domain to take off is for it to develop an "inside" language that nobody else understands. For example, I still don't know what a kanban or a scrum is. So I'm very ill positioned to challenge their use or question how they are done. Hence they got to dodge a whole lot of opposition that would probably have brought it all down. The invention of a new mysterious terminology I think was critical for agile to take off.

The problem with this phenomenon is that the same freedom from critique that is seemingly necessary for new domains to establish themselves also detaches them from necessary criticism. There's simply no way to tell if this isn't a load of baloney. And by the time it's a bullet point requirement on CVs to get employed it's too late for anybody to critique it.

bfrog3mo ago

Claude is ok. Gas town seems like a Claude multiplier. I’m not sure more Claude is what I’d even want!

Not sure I love what it does all the time, it tends to fit whatever box you setup and will easily break out if you aren’t veeeery specific. Is it better than writing a few thousand lines of code myself that I deeply understand that can debug and explain? I don’t know yet. I think it’d be good for writing functions one at a time with massive supervision.

It’s great for writing scripts and things where precision and correctness outside the success path isn’t really needed. If a script fails and it wasn’t deleting a hard drive who cares. If my embedded code fails out in a product in the wild this is a much bigger nuisance and potentially fatal for the device (not the humans) which is wasteful.

oofbey3mo ago

I’d like gastown more if it could run cursor-CLI instead of claude, and thus be able to choose models. Claude is okay. But these things certainly have personalities. I’m not sure which would be best for each role. But gastown’s different actors seem like a great place to take advantage of the different quirks of each. And I certainly don’t choose Claude consistently when given a choice.

e12e3mo ago

Well, according to Yegge, the next step is an SDK/framework for building your own ai orchestrator - so I expect we'll see support for more than Claude soon.

furyofantares3mo ago

A couple/few years ago people were trying to do agents by just putting the LLM in a loop and letting it go, and it was just awful and didn't work at all. I think a bunch of things had to happen over the course of 1-2 years to get to coding agents being a real, useful thing: models had to get quite a bit smarter/cheaper/faster, models had to get good at tool use, and they needed to be executed in well-built harnesses with good tools available.

This feels like the same thing. Too early, but we're definitely headed in the direction of finding ways to use more tokens to get more mileage per prompt.

hota_mazi3mo ago

Show, don't tell.

If you need ten pages to explain your project and even after I read your description, I'm still left confused why I need it at all, then maybe... I don't need it?

devin3mo ago

Maintenance Manager Checker Agent and the rest of the nouns Yegge employs are ironic given his Kingdom of Nouns essay.

dragonwriter3mo ago

“Maintenance Manager Checker Agent is not a noun Yegge employs”, it is Brinker’s term for Yegge’s “Boot the Dog”.

0xbadcafebee3mo ago

Anyone have some kind of central hub of finding out about new tools/techniques? I'm convinced that headless multi-agent coordination is the way to go, but it needs a lot of guard rails, one of the biggest of which will be cost-control. I'm sure there will be a lot more developments in this space, but I don't want to just happen across them by accident...

mccoyb3mo ago

I think Yegge and Huntley are smart guys.

I don't think they're doing a good job incubating their ideas into being precise and clearly useful -- there is something to be said about being careful and methodical before showing your cards.

The message they are spreading feels inevitable, but the things they are showing now are ... for lack of better words, not clear or sharp. In a recent video at AI Engineer, Yegge comments on "the Luddites" - but even for advocates of the technology, it is nigh impossible to buy the story he's telling from his blog posts.

Show, don't tell -- my major complaint about this group is that they are proselytizing about vibe coding tools ... without serious software to show for it.

Let's see some serious fucking software. I'm looking for new compilers, browsers, OSes -- and they better work. Otherwise, what are we talking about? We're counting foxes before the hunt.

In any case, wouldn't trying to develop a serious piece of software like that _at the same time you're developing Gas Town or Loom_ make (what critics might call) the ~Emacs config tweaking for orchestration~ result driven?

mccoyb3mo ago

Here's a separate, optimistic comment about Yegge and Huntley: they are obviously on the right track.

In a recent video about Loom (Huntley's orchestration tool), Huntley comments:

"I've got a single goal and that is autonomous evolutionary software and figuring out what's needed to be there."

which is extremely interesting and sounds like great fun.

When you take these ideas seriously, if the agents get better (by hook and crook or RLVR) -- you can see the implications: "grad student descent" on whatever piece of software you want. RAG over ideas, A/B testing of anything, endless looping, moving software.

It's a nightmare for the model of software development and human organization which is "productive" today, but an extremely compelling vision for those dabbling in the alternative.

PKop3mo ago

> they are obviously on the right track

How can you just assert that? It's fine to say it looks like the right track to you. But in what way is it obvious?

3dsnano3mo ago

yes, and Yegge + Huntley are doing it in an fun and creative way, breaking rules that make folks really mad and huffy puffy. this is a renaissance to those who can see it, those who drink the koolaid willingly, because it makes you trip balls and come up with crazy ideas... just like Hypercard...

why do we drink it? because its awesome and makes software 100X more FUN than it used to be. what yegge + huntley are doing is intensely creative. they are having FUN. and i am have FUN!!!!!

skybrian3mo ago

It's a science project. I think the "I am so crazy" messaging is deliberate to scare most people away while attracting a few like-minded beta testers. He's telling you not to use it, which some people will take as a dare...

vessenes3mo ago

Counterpoint - you can go much faster if you get lots of people engaging with something and testing it. This is exploratory work, not some sort of ivory tower rationalism exercise, (if those even ever truly exist), there’s no compulsion involved, so everyone engaged does so for self-motivated reasons..

Don’t be mad!

Also, beads is genuinely useful. In my estimation, gas town, or a successor built on a similar architecture, will not only be useful, but likely be considered ‘state of the art’ for at least a month sometime in the future. We should be glad this stuff is developed in the open, in my opinion.

bob10293mo ago

> Persistent Worker Agents, which you talk to directly (not through the Mayor),

I had a bit of a chuckle.

I think there is value in anything approximating a proposer-verifier loop, but I don't know that this is the most ideal approach.

ipnon3mo ago

It's like Conway's Law. Both humans and agents arrive at roughly identical hierarchies for organizing labor. There is something inherent in the game of telephone required by limited working memory that requires this structure. Gas Town's only failure is not being familiar with prior art and coming up with very strange names for established patterns that already exist in large hierarchical organizations like governments, corporations and militaries.

cagenut3mo ago

it has been amazing to watch how much of agentic ai is driven by "can you write clear instructions to explain your goals and use cases" and "can you clearly define the rules of each step in your process."

Animats3mo ago

This looks familiar to people who have seen how the more elaborate NPC systems work in major multiplayer games. There are lots of semi-independent NPCs, with some degree of overall coordination. Groups of cops or soldiers may have a commander program for tactical coordination, and there may be a higher level system deploying units for strategic purposes.

In games, what the NPCs can do is usually rather dumb. Move and shoot is usually most of their functionality. This keeps the overhead down so the system is affordable.

Gas Town may be a step towards AIs which have an ongoing sense of what they're doing. I'm not going to get into the "consciousness" debate, but it's closer to liveness.

jrowen3mo ago

What games are notable in this regard? The classic Majesty series comes to mind. UO aspired to complex NPC systems. Fable as well. I always dreamt of a more advanced Sim City-meets-MMO that just went all in on that.

Philpax3mo ago

https://www.rockpapershotgun.com/why-fears-ai-is-still-the-b...

nhinck33mo ago

UO? I don't remember any complex NPC systems there. Ultima 7 had daily schedules and the veneer of a functioning economy though

jrowen3mo ago

Well, I said aspired to. They had ambitions that eventually got walked back to more basic behaviors. Here's an article from Raph Koster on it:

https://www.raphkoster.com/2006/06/09/why-dont-our-npcs/

neilv3mo ago

I haven't read the Yegge post closely, so just commenting that namespaces (or naming conventions) would make the easier-to-casually-read names more practical...

For example, if Polecat becomes GasTown.WorkerAgent (or GasTown.Worker), then you always have both an unambiguous way and a shorthand-in-context way of referring to the concept.

(For naming conventions when you don't have namespaces as a language feature, use prefixes within the identifier, such as `GasTown_Worker`.)

If GasTown.Worker is implemented with framework Foo, using that framework's Worker concept, GasTown.Worker might have a field named fooWorker of type Foo.Worker. (In the context of the implementation of GasTown, the unqualified name always means the GasTown concept, and you always disambiguate concepts from elsewhere that use the sane generic or similar terms.)

Complicated names like GasTown.MaintenanceManagerCheckerAgent might need some creative name shortening, but hopefully are still descriptive, or easy to pick up and remember. Or, if the descriptive and distinguishing name was complicated because the concept is a weird special case within the framework, maybe consider whether it should be rethought.

CuriouslyC3mo ago

I don't understand why people are making this so complicated. We have a battle tested SDLC. We don't need to reinvent this shit. We just need to make some affordances in the tools and processes we set up for the majority of the actors in the system to be agents (such as rationing human attention).

Spec your software like an architect/po, decompose it into a task dag, then orchestrate for each lane and assemble all change sets in a merge branch rather than constantly repointing head.

Kostchei3mo ago

These operate in parallel. Maybe you SDLC does that, the effort of each human developer sitting in a planning meeting, getting jira tickets, doing individual code (or pair or whatever), reporting back in standup, coordinating the next step, getting it QA'ed...

Yes, if your shop is well developed these work (10% of the time every time), but this is a structure to kick that all in to gear, as a repo, where all you need to add is unlimited machine cognitive power/tokens.

Maybe you need to add these gas town personalities to various parts of the existing SDLC, .....but..... you still need to track what they do and how- and you need them to intermediate between each other at 2am when they hit an impasse. Something very rare in most human cognition shops.

And word from the experimenters is.. it sort of works. Which is on par with most human shops. IMO. I don't have the money to burn to test at the scale Yegge is, but the small scale stuff I have done in this direction, this seems plausible.

vessenes3mo ago

It not only sort of works, the 10% of the time it works surprisingly well at scale! Tantalizing.

jkhdigital3mo ago

I can’t stop thinking about this exact notion. The main reason we don’t always use stuff like TLA+ to spec out our software is because it’s tedious AF for anything smaller than, like, mission-critical enterprise-grade systems and we can generally trust the humans to get the details right eventually through carrot-and-stick incentive systems. LLM agents have none of the attentional and motivational constraints of humans so there’s no reason not to do things the right way.

arcanemachiner3mo ago

SDLC = Software Development Life Cycle (?)

CuriouslyC3mo ago

Correct

vivzkestrel3mo ago

- someone really needs to rewrite that entire article without all that jargon

rilindo3mo ago

The overuse of metaphors makes me feel like this person is trying to reinvent Chef, but for AI.

zbyforgotp3mo ago

At some point evolving software instead of designing it will work. Now the evolutionary pressure leads towards churning more tokens.

grebc3mo ago

Steve Yegge used to have interesting, albeit long winded, things to say re software.

1 more reply

the_real_cher3mo ago

> Work in Gas Town can be chaotic and sloppy, which is how it got its name. Some bugs get fixed 2 or 3 times, and someone has to pick the winner. Other fixes get lost. Designs go missing and need to be redone. It doesn’t matter, because you are churning forward relentlessly on huge, huge piles of work, which Gas Town is both generating and consuming. You might not be 100% efficient, but you are flying.

This is hilarious and insane and amazing.

TheRealPomax3mo ago

Thought this would be about Vancouver, but unfortunately it was just about an AI company =(

fasbiner3mo ago

This is not a good line of reasoning since it would just as surely apply to such fanciful cult terms like "I'm using fargate with docker over ECS to integrate with ceph, but I'm considering switching to talos and EKS."

It is valuable to use unique terms of art that are not heavily overloaded and this is what gastown's terminology is intended to do, which also really helps LLMs since they are as much dumb text search as they are vector embedding.

brushfoot3mo ago

Gas Town seems like a more confusing/expensive alternative to GitHub Copilot Agents. https://github.com/copilot/agents

Go to the URL, type what you want done, and a cloud Claude agent creates a PR. $10/month.

Kostchei3mo ago

try the tools. Really. If you are remotely interested in tech or AI, try the tools Copilot this is not. You may be trolling of course. There are huge steps between these various tools, if you try them, for a smidge of investment, it will become obvious what the trajectory is.

It is like saying "I don't handwrite anything, I care too much about line spacing, I only use a dot matrix printer" when some one is trying to sell you a calligraphy pen and coloured inks, and you have only tried a ballpoint pen. You might be the wrong market, but they are not even close in use case and application.

(spelling)

brushfoot3mo ago

I'm not trolling. I'm just not aware of major differences between them.

When I make a change with a Copilot Agent, it checks for issues, builds my project, runs tests, and iterates until things work. Multiple agents can do that in parallel.

My impression was that this does more or less the same thing.

That said, I'm definitely open to learning more about them both.

What are the advantages of this in your experience?

vessenes3mo ago

It is worth an install; it works very differently than an agent in a single loop.

Beads formalizes building a DAG for a given workload. This has a bunch of implications, but one is that you can specify larger workloads and the agents won’t get stuck or confused. At some level gas town is a bunch of scaffolding around the benefits of beads; an orchestrator that is native to dealing with beads opens up many more benefits than one that isn’t custom coded for it.

Think of a human needing to be interacted with as a ‘fault’ in an agentic coding system — a copilot agent might be at 0.5 9s or so - 50% of tasks can complete without intervention, given a certain set of tasks. All the gas town scaffolding is trying to increase the number of 9s, and the size of the task that can be given.

My take - Gas town (as an architecture) certainly has more nines in it than a single agent; the rest is just a lot of fun experimentation.

1 more reply

Capricorn24813mo ago

In your post history you say you have never programmed. Why are you so sure it produces code of value?

This is so prohibitively expensive in its wastefulness that blithely telling strangers to try the tools likely means you either haven't tried it, or have money to burn.

dcmatt3mo ago

Gas Town wasn't satire?

never_inline3mo ago

Poe's law &c.

aswegs83mo ago

Wait, what? When did we come to that conclusion?

SomaticPirate3mo ago

Don’t forget the apparent crypto grift angle now (something related to BAGS)

Ridiculous. Beads might be passable software but gas town just appears to be a good way to burn tokens at the moment

ohazi3mo ago

Real, genuinely confused human here: Can someone please clarify whether or not gas town is/was a joke? I've searched repeatedly and can't find anything that looks like an obvious tell, and I'm not sure if this is because it's actually real and people are taking it seriously, or because the pages and pages of discourse surrounding it is AI generated and taking itself literally.

If it's not a joke... I have no words. You've all gone insane.

danpalmer3mo ago

It's not a joke, but I think it's an example of the same thing we're seeing with folks who think they're talking to god when they talk to ChatGPT, or those who spiral and in some cases, sadly take their own life.

These chatbots create an echo chamber unlike that which we've ever had to deal with before. If we thought social media was bad, this is way worse.

I think Gastown and Beads are examples of this applied to software engineering. Good software is built with input from others. I've seen many junior engineers go off and spend weeks building the wrong thing, and it's a mess, but we learn to get input, we learn to have our ideas critiqued.

LLMs give us the illusion of pair programming, of working with a team, but they're not. LLMs vastly accelerate the rate at which you can spiral spiral down the wrong path, or down a path that doesn't even make sense. Gastown and Beads are that. They're fever dreams. They work, somewhat, but even just a little bit of oversight, critique, input from others, would have made them far better.

singingbard3mo ago

I think the underlying approach seems sensible.

The problem with Gas Town is how it was presented. The heavy metaphor and branding felt distracting.

It’s a bit like reading the Dune book, where you have to learn a whole vocabulary of new terms before you can get to the interesting mechanics, which is a tough ask in an already crowded AI space.

danpalmer3mo ago

I think you have to remove an awful lot of what makes Gastown Gastown to find something sensible – at the minimum you need to restructure and simplify the roles, restructure the memory system, remove tmux, ...

The best bit about it was the agentic coding maturity model he presented. That was actually great.

I don't think it's at all like reading Dune. Dune is creative fiction, Gastown is. Oh ok wait, if you consider Gastown to be creative fiction then I guess I agree. As a software tool though I don't think this analogy works.

nonethewiser3mo ago

It's a double edged sword. If it can lead the uninformed down the wrong path faster, it can lead the informed down the right path faster. It's not only fast in one direction.

bwestergard3mo ago

I believe the author of gas town is very informed, having been a professional software developer for some time. And the premise of the above comment is that he did, despite this, go down the wrong path.

1 more reply

pcthrowaway3mo ago

The difference between light-up arrows pointing the way "forward" for a car turning onto the expressway the wrong way, and doing so with the possibility humans might see and attempt to flag them down before they're too far to turn around.

People will make mistakes, and AI holding their hand and guiding them while they do it can have disastrous consequences.

But it's nice that the arrows will appear to also guide people going the right way I guess.

bobjordan3mo ago

Not sure you’ve actually tried using it, but beads has been an absolute game changer for my projects. “Game changer” is even underselling it.

wild_egg3mo ago

Beads was phenomenal back in October when it was released. Unfortunately it has somehow grown like a cancer. Now 275k lines of Go for task tracking? And no human fully knows what it is all doing. Steve Yegge is quite proud to say he's never looked at any of its code. It installs magic hooks and daemons all over your system and refuses to let go. Most user hostile software I've used in a long time.

Lot of folks rolling their own tools as replacements now. I shared mine [0] a couple weeks ago and quite a few folks have been happy with the change.

Regardless of what you do, I highly recommend to everyone that they get off the Beads bandwagon before it crashes them into a brick wall.

[0] https://github.com/wedow/ticket

2 more replies

mattgreenrocks3mo ago

How do you handle the dogs ignoring the deacons and going after the polecats though? Seems like the mayor should get involved to me.

2 more replies

wenc3mo ago

I'm not entitled to your time of course, but would you mind describing how?

All I know is beads is supposed to help me retain memory from one session to the next. But I'm finding myself having to curate it like a git repo (and I already have a git repo). Also it's quite tied to github, which I cannot use at work. I want to use it but I feel I need to see how others use it to understand how to tailor it for my workflow.

2 more replies

pjm3313mo ago

Gas town is the cackling mad laughter emitting from someone who knows they are being both insane and prescient simultaneously. Today, it is insane. But I fully expect to be hearing about a very serious thing in the near future about which people will say “gas town was an early attempt at this”

jcims3mo ago

This is the best take I've seen in here.

I've been tinkering with it for the past two days. It's a very real system for coordinating work between a plurality of humans and agents. Someone likened it to kubernetes in that it's a complex system that is going to necessitate a lot of invention and opinions, the fact that it *looks* like a meme is immaterial, and might be an effort to avoid people taking it too seriously.

Who knows where it ends up, but we will see more of this and whatever it is will have lessons learned from Gas Town in it.

dunk0103mo ago

I've had to read so far down to get a single non-stupid, ignorant, or inflammatory comment. What's wrong with HN, jeepers. Some actual discussion of the thing itself and not just pearl clutching would be appropriate here.

0xbadcafebee3mo ago

It's a real open source tool Yegge has built and been using for a while now. And no, it's not insane, he's literally written a book with Gene Kim about the fundamental lessons that go into it, and he's been on lots of podcasts where he explains more.

I expect major companies will soon be NIH-ing their own version of it. Even bleeding tokens as it does, the cost is less than an engineer, and produces working software much faster. The more it can be made to scale, the more incentive there is. A competitive business can't justify not using a system like this.

PKop3mo ago

Where is the working software it produces? Do you have a repo you've made with it as an example?

Kostchei3mo ago

yeh the repo is Gas Town

astrange3mo ago

It doesn't have to exclusively be one or the other.

> If it's not a joke... I have no words. You've all gone insane.

I think this is covered by the part in Yegge's post where he says not to run it unless you're so rich you don't care if it works or not.

chrisjj3mo ago

How rich do you have to be not care about the environmental cost?

nl3mo ago

I think Andrew Ng wrote a great piece on this.

For example, in the US, which do you think uses more water: Golf Courses or Data Centers?

  a) Gold Courses use twice as much water as Data Centers
  b) About the same
  c) Data Centers use twice as much water as Gold Courses

The answer is "None of the above": "Golf courses in the U.S. use around 500 billion gallons annually of water to irrigate their turf [snip] data centers consume [snip] 17 billion gallons, or maybe around 10x that if we include water use from energy generation"

Do you think a Google search or a Gemini query produces more carbon?

> Google had estimated that a single web search query produces 0.2 grams of CO2 emissions. [snip] the median Gemini LLM app query produces a surprisingly low 0.03 grams of CO2 emissions), and uses less energy than watching 9 seconds of television

https://www.deeplearning.ai/the-batch/issue-336/

3 more replies

astrange3mo ago

That's an Internet meme and not a real issue.

1 more reply

Retr0id3mo ago

It's kinda like how edgy political takes are often wrapped in seven layers of meta-irony. If the audience reaction is negative you can say it was just a joke that didn't land.

And that's not necessarily a bad thing, if it allows exploring new ideas with relative safety. I think that's what's going on here. It's a crazy idea that might just work, but if it doesn't work it can be retconned as satirical performance art.

AlexCoventry3mo ago

No, not a joke. The author also co-vibe-coded a book, called Vibe Coding, describing and recommending exactly the sort of system he's trying to build as Gas Town.

Quarrelsome3mo ago

> If it's not a joke... I have no words. You've all gone insane.

How is it insane to jump to the logical conclusion of all of this? The article was full of warnings, its not a sensible thing to do but its a cool thing to do. We might ask whether or not it works, but does that actually matter? It read as an experiment using experimental software doing experimental things.

Consider a deterministic life form looking at how we program software today, that might look insane to it and gastown might look considerably more sane.

Everything that ever happens in human creation begins as a thought, then as a prototype before it becomes adopted and maybe (if it works/scales) something we eventually take for granted. I mean I hate it but maybe I've misunderstood my profession when I thought this job was being able to prove the correctness of the system that we release. Maybe the business side of the org was never actually interested in that in the first place. Dev and business have been misaligned with competing interests for decades. Maybe this is actually the fit. Give greater control of software engineering to people higher up the org chart.

Maybe this is how we actually sink c-suite and let their ideas crash against the rocks forcing c-suite to eventually become extremely technical to be able to harness this. Instead of today's reality where c-suite gorge on the majority of the profit with an extremely loosely coupled feedback loop where its incredibly difficult to square cause and effect. Stock went up on Tuesday afternoon did it? I deserve eleventy million dollars for that. I just find it odd to crap on gastown when I think our status quo is kinda insane too.

hota_mazi3mo ago

I mean, Gas Town is 100% vibe coded, and its very own author says AI can't be trusted to write reliable code.

Draw your own conclusion.

never_inline3mo ago

From Steve yegge's post

> Better UIs will come. But tmux is what you have for now. And it’s worth learning.

So brother has 2 claude code accounts and couldn't vibe code a UI, huh?

Cedricgc3mo ago

I'm developing concern for Steve. He's been a well known developer and writer in the industry for years now (See his popular 'Google Platforms Rant' essay from years ago) [0].

Now, Yegge's writing tilts towards the grandoise... see his writing when joining Grab [1] and Sourcegraph [2] respectively versus how things actually played out.

I prefer optimism and I'm not anti AI by any means, but given his observed behavior and how AI can't exacerbate certain pathologies... not great. Adding the recent crypto activities on top and all that entails is the ingredients for a powder keg.

Hope someone is looking out for him.

[0] https://courses.cs.washington.edu/courses/cse452/23wi/papers...

[1] https://steve-yegge.medium.com/why-i-left-google-to-join-gra...

[2] https://sourcegraph.com/blog/introducing-steve-yegge

refulgentis3mo ago

He was right about Google in [1] when I was still drinking the Kool-Aid, in big and tangible ways that aren't discussed publicly.

[2] is 100% accurate, Grok was the backbone / glue of Google's internal developer tools.

I don't disagree on the current situation, and I'm uncomfortable sticking my neck out on this because I'm basically saying "the guy who kinda seems out of it, totally wasn't out of it, when you think he was", but [1] and [2] definitely aren't grandiose, the claims he makes re: Google and his work there are accurate. A small piece of why I feel comfortable in this, is that both of these were public blogs his employer was 100% happy about when hiring him to top positions.

Cedricgc3mo ago

I should be specific. I think the technical analysis is reasonable and I actually enjoy someone staking on a big vision, which is why I saved these pieces.

An example:

"I’ve seen Grab’s hunger. I’ve felt it. I have it. This space is win or die. They will fight to the death, and I am with them. This company, with some 3000 employees I think, is more unified than I’ve seen with most 5-person companies. This is the kind of focused camaraderie, cooperation and discipline that you typically only see in the military, in times of war.

Which should hardly surprise you, because that’s exactly what this is. This is war.

I am giving everything I’ve got to help Grab win. I am all in. You’d be amazed at what you can accomplish when you’re all in."

This is the writing of someone planning to make a capstone career move instead of leaving in 18 months. It's not the worst thing to do (He says he left b/c the time difference to support a team in SE Asia was hard physically, and he's getting older) and I support taking big swings. I'm just saying Yegge's writing has a pattern.

Crypto and what Yegge is doing with $GAS is dangerous because if the token price crashes and people betting their life savings think he didn't deliver on his promises... I like Steve personally which is why I'm saying anything.

tom_3mo ago

This appears to be the coin in question: https://coinmarketcap.com/currencies/gas-town/ - up 222,513.21% in the past week! (And down 25.26% in the last 24 hours. But... suppose it goes back up again?!)

1 more reply

draw_down3mo ago

I didn’t see the source graph thing, but the Grab episode always seemed odd to me. He wrote these breathless rants about how epic it all was, then quit after a year or so. I just figured the long hours eventually stopped being awesome.

driverdan3mo ago

The Gas Town post reads like some type of manic psychosis. I hope he snaps out of it and gets help.

dang3mo ago

Please don't do internet psychiatric diagnosis on HN. I know that often the intention is good, but it leads to bad places.

https://hn.algolia.com/?sort=byDate&type=comment&dateRange=a...

(and I realize the GP was the place the line started getting crossed)

1 more reply

jumploops3mo ago

I ran the Gas Town intro post through ChatGPT 5.2 Pro[0]

Based on my initial read, and a pass at this summary, it seems mostly right. YMMV

Did some further dives into the little public usage data from Gas Town, and found that most of the "Beads" are tasks that are broken down quite small, almost too small imo.

Super interesting project with the goal of keeping Claude "busy" however it feels more like a casino game than something I'd use for production engineering.

[0]https://gist.github.com/jumploops/2e49032438650426aafee6f43d...

j / k navigate · click thread line to collapse

234 comments

dchuk3mo ago

I’m very bought in to the idea that raw coding is now a solved problem with the current models and agentic harnesses. Let alone what’s coming in the near term.

It’s an interesting thing to watch play out.

ben_w3mo ago

Mm.

I'd agree, the code "isn’t necessarily broadly coherent but is locally impressive".

However, I've seen some totally successful, even award-winning, human-written projects where I could say the same.

Ages back, I heard a woodworking analogy:

  LLM code is like MDF. Really useful for cheap furniture, massively cheaper than solid wood, but it would be a mistake to use it as a structural element in a house.

theshrike793mo ago

"Without oversight" is the key here.

You need to define the problem space so that the agent knows what to do. Basically give it the tools to determine when it's "done" as defined by you.

spmurrayzzz3mo ago

This has also been an interesting social experiment in that we get to see what work people think is actually impressive vs trivial.

The gap in reactions says more about the nature of the work than it does about the tools themselves.

aaronblohowiak3mo ago

>Compare that to someone writing embedded firmware for device microcontrollers, who would understandably be underwhelmed by the same.

spmurrayzzz3mo ago

My comment above I hope wasn't read to mean "LLMs are only good at web dev." Only that there are different capability magnitudes.

1 more reply

aprdm3mo ago

This is not true. You can see people who are much older and built a lot of the "internet scale" equally excited about it, e.g: freebsd OG developers, Steve himself (who wrote gas town) etc.

In fact, I would say I've seen more people who are "OG Coders" excited (and in their >50s) then mid generation

spmurrayzzz3mo ago

I think you're shadow-boxing with a point I never made. I never said experienced devs are not or can not be excited about current AI capabilities.

But that in no way suggests that there isn't a gap in what impresses or surprises engineers across any set of domains. Antirez is probably one of the better, more reasoned examples of this.

[1] https://news.ycombinator.com/item?id=46682604

phist_mcgee3mo ago

I think this says a lot about yourself and where your prejudices and preferences lie.

spmurrayzzz3mo ago

Preferences I think I get, but prejudices?

The OED defines prejudice as a "preconceived opinion that is not based on reason or actual experience."

yetihehe3mo ago

If you give every idiot a worldwide heard voice, you will hear every idiot from the whole world. If you give every idiot a tool to make programs, you will see a lot of programs made by idiots.

meowface3mo ago

Steve Yegge is not an idiot or a bad programmer. Possibly just hypomanic at most. And a good, entertaining writer. https://en.wikipedia.org/wiki/Steve_Yegge

yetihehe3mo ago

> Steve Yegge is not an idiot or a bad programmer.

fegd853mo ago

Which, fittingly, is how you end up with users who can't even follow "please don't download this yet".

sonnig3mo ago

petesergeant3mo ago

> where people’s obvious mental health issues

I think the kids would call this "getting one-shotted by AI"

GrowingSideways3mo ago

> raw coding is now a solved problem

Surely this was solved with fortran. What changed? I think most people just don't know what program they want.

lordnacho3mo ago

You no longer have to be very specific about syntax. There's now an AI that can translate your idea into whatever language you want.

Now, you can just think about your program in English, and so long as you actually know what you want, you can get a Fortran program.

The issue is now what it was originally for senior programmers: to decide what to make, not how to make it.

hnlmorg3mo ago

The hard part of software development is equivalent to the hard part of engineering:

2 more replies

GrowingSideways3mo ago

Reactionarily? Sure. Maybe AI has some role to play there. Maybe you can ask the chatbot to modify settings.

https://youtu.be/5IsSpAOD6K8?si=FtfQZzgRU8K2z4Ub

hahahahhaah3mo ago

Yeah I am definitely trying to stay off hype and just use the damn tool

bkolobara3mo ago

It could be that llms go from bicycles for the brain to smoking for the brain, once we figure out the long term effects of it.

BrenBarn3mo ago

> If in a langauge there is one word for 2 different colors, speakers of it are unable to see the difference between the colors.

bkolobara3mo ago

[0]: https://youtu.be/RKK7wGAYP6k?si=GK6VPP0yoFoGyOn3 [1]: https://youtu.be/I64RtGofPW8?si=v1FNU06rb5mMYRKj&t=889

1 more reply

jstanley3mo ago

> If in a langauge there is one word for 2 different colors, speakers of it are unable to see the difference between the colors.

Perhaps you mean to say that speakers are unable to name the difference between the colours?

I can easily see differences between (for example) different shades of red. But I can't name them other than "shade of red".

bkolobara3mo ago

No, if you show them two colors and ask them if they are different, they will tell you no.

EDIT: I have been searching for the source of where I saw this, but can't find it now :(

EDIT2: I found a talk touching in the topic with a study: https://youtu.be/I64RtGofPW8?si=v1FNU06rb5mMYRKj&t=889

3 more replies

skywhopper3mo ago

But the color thing is self-evidently untrue. It’s not even hard to talk about. Unless you yourself are colorblind I think that would be obvious?

bonzini3mo ago

Sort of, at least some degree of relativism exists though how much is debated. Would you ever talk about sea having the same color as wine? But that's exactly what Homer called it.

https://en.wikipedia.org/wiki/Wine-dark_sea

https://en.wikipedia.org/wiki/Linguistic_relativity_and_the_...

1 more reply

bastawhiz3mo ago

_ea1k3mo ago

However, the gas town one was almost completely hands off. I think my only interventions were due to how beta it was, so I had to help it work around its own bugs to keep from doing stupid things.

Other than the riskyness (it runs in dangerous permissions mode) and incredible cost inefficiency, I'd certainly use it.

safety1st3mo ago

eru3mo ago

I guess tokens get cheaper all the time, and we can fix the risk via sufficient sand boxing. (I mean the risk to your computer.)

Avicebron3mo ago

I've been running my own version of what Gas Town seems to be in a couple of proxmox hosts for a while now, it's fine.

condiment3mo ago

hahahahhaah3mo ago

keyle3mo ago

I'd help build Gas City and Gas State, and Gas Country if that would mean we actually would solve the things AI promised to solve. All sickness, famine, wealth ...

The problem is, we're just fidgeting yolo-fizzbuzz ad nauseam.

The return on investment at the moment is probably one of the worst in the history of human investments.

AI does improve over time, still today, but we're going to run out of planet before we get there...

ViscountPenguin3mo ago

I have trouble seeing LLMs making meaningful progress on those frontiers without reaching ASI, but I'd be happy to be wrong.

Terr_3mo ago

camgunz3mo ago

Even alphafold generated a bunch of slop,like impossible proteins and such.

alecco3mo ago

That doesn't make any sense.

Yegge named it Gas Town as in "refinery" because the main job for the human at this stage is reviewing the generated code and merging. "

The whole point of the project is to be in control. Yegge even says the programmers who can read/review a lot of code fast are the new 10x (paraphrasing).

colin_jack3mo ago

"I’ve never seen the code, and I never care to, which might give you pause"

https://steve-yegge.medium.com/welcome-to-gas-town-4f25ee16d...

alecco3mo ago

Oof, he changed that. I stand corrected.

1 more reply

soulofmischief3mo ago

The Wright brothers are idiots, if it were me I'd have made a supersonic jet from the get go and not waste my time mucking around with prototypes.

ncruces3mo ago

The prototype phase meant data centers are now measured in MW instead of TFLOPS.

At a time where we were desperate to reduce emissions, data centers now consume around 20% of the energy consumed by the entire aviation sector, with consumption is rising at 15% YoY.

Never mind the water required to cool them, or the energy and resources required to build them, the capital allocation, and the opportunity cost of not allocating all of that to something else.

And this is, your words, the prototype phase.

neoromantique3mo ago

Emissions and Energy consumed do not necessarily have to be linked up.

We have plenty of ways to make clean energy, it is only matter of incentives.

As long as burning coal is simply cheaper, business will burn coal.

soulofmischief3mo ago

So, yes, prototypes often use more energy than the final product. That doesn't mean we shouldn't sustainable build datacenters, but that's conflating issues.

jpfromlondon3mo ago

the Wright brothers sold me a subscription to a supersonic jet and I've got a bundle of matchsticks and some canvas.

soulofmischief3mo ago

On the other hand, flight is ubiquitous and has changed everything.

1 more reply

ares6233mo ago

We were promised supersonic jets today or very soon though and our economies have been held hostage waiting for that promise.

eru3mo ago

The passive voice is doing a lot of work in your sentence.

1 more reply

soulofmischief3mo ago

The first recorded supersonic flight was in 1947.

1 more reply

Kostchei3mo ago

I think at Gas Country levels we will need better networking systems. Maybe that backbone Nvidia just built....

krupan3mo ago

toephu23mo ago

AI can't even find a cure for the common cold.

jamestimmins3mo ago

I actually love the idea of totally new naming schemes for experimental software.

tom_3mo ago

vessenes3mo ago

ivankra3mo ago

3dsnano3mo ago

yes, totally. AI-luddites see this as whimsy, but it's actually wizard-level power of abstractions and context ascension. if u know u know

tptacek3mo ago

jamestimmins3mo ago

Yes at some point innovative software and naming are at cross purposes, and if your naming gets too extreme ultimately that will get all of the attention.

bonesss3mo ago

Anthropomorphizing chunks of your system is kinda weird given interactive chat as the UI to the LLM.

In fact, I dare say a lot of LLM fascination for orchestration is people unfamiliar with actor models and the level of elegance a properly expressive language lets them have.

1 more reply

vessenes3mo ago

Very minor nit -- crew could be a person also - in fact that's how you're supposed to hack on a codebase in gas town directly - add yourself as crew.

fdr3mo ago

alexjurkiewicz3mo ago

His latest post is endorsing a crypto exchange because they paid him $50k.

https://steve-yegge.medium.com/bags-and-the-creator-economy-...

bonesss3mo ago

I’m pro LLM and use them, but crikey: if they’re so good at code why aren’t these people with all the attention, branding, and connections in the world unable to capitalize them?

coryrc3mo ago

> if they’re so good at code why aren’t these people with all the attention, branding, and connections in the world unable to capitalize them?

Exactly, this is what I'm wanting to see.

lovich3mo ago

That entire article sounds like my friends who think AI is real and keep sending their parents money into crypto scams.

I think I’ll just develop a drinking problem if this is Gas Town becomes something real in the industry and this kind of person is now one of our thought leaders.

lovich3mo ago

Too late to edit,

Who thinks AI is a real person*

_ea1k3mo ago

To be fair, he's always been a little loopy. At least, I think this post of his was loopy: https://steve-yegge.blogspot.com/2007/06/that-old-marshmallo...

It was also one of my favorite posts of his and has aged incredibly well as my experience has grown.

fdr3mo ago

sorenbs3mo ago

> but some others will pick up where he left off, with more modest goals

Already happening :-) https://github.com/Dicklesworthstone/beads_rust

fdr3mo ago

the main area I'd like to see some departure from beads is to use markdown files (or something) to be able to see the issue context/comments better in a diff generated by git.

wild_egg3mo ago

I was disappointed to see that this is still 10x the code needed for the feature set and that it still insists on duplicating state into a SQLite index for such minuscule amounts of data.

I've seen 25-30 similar efforts to make a Beads alternative and they all do this for some reason.

krupan3mo ago

Gas town and the like all basically sound to me like, "AI is amazing! Ok, actually it isn't very good, but maybe we can just use more AI with our AI and then it'll be good!"

And I'm not surprised at all to learn that this path took us to a "Maintenance Manager Checker Agent." I wonder what he'll call the inevitable Maintenance Manager Checker Agent Checker Agent?

mjr003mo ago

It's the Jarvis Effect.

Whether or not it's effective or better than regular software development is secondary, if it's a concern at all. The purpose is the process. It's the future. It's sci-fi. It's awesome.

AI is an incredible tool and we're still discovering the right way to use it, but boy, "Gas Town" is not it.

bagacrap3mo ago

colin_jack3mo ago

"Agent 1, refactor that method to be more efficient. Agent 5, tighten up the graphics on level 3!"

I'm not sure its even that, his description of his role in this is:

mjr003mo ago

sonnig3mo ago

1 more reply

brunoborges3mo ago

> Nobody is voluntarily booking a flight with their Alexa.

Rich people use voice because they have disposable income and they don't care if a flight is $800 or $4,000. They are likely buying business/first class anyways.

Tony Stark certainly doesn't care. Elon Musk certainly uses voice to talk to his management team to book his flights.

The average person doesn't have the privilege of using voice because it doesn't have enough fuck-you-money to not care for prices.

mjr003mo ago

> Tony Stark certainly doesn't care. Elon Musk certainly uses voice to talk to his management team to book his flights.

> The average person doesn't have the privilege of using voice because it doesn't have enough fuck-you-money to not care for prices.

1 more reply

colin_jack3mo ago

This might be worth a read, just as its from a trusted source and is more grounded: https://antirez.com/news/158

krupan3mo ago

Still the same. "Hey look, I got these crappy developers (LLMs) to actually produce working code! This is a game-changer!" When the working code is a very small, limited thing.

colin_jack3mo ago

I don't know, your talking about an incredibly talented engineer saying:

"In the past week, just prompting, and inspecting the code to provide guidance from time to time, in a few hours I did the following four tasks, in hours instead of weeks"

Its up to you to decide how to behave, but I can't see any reasons to completely dismiss this. It ends with good guidance what to do if you can't replicate though.

1 more reply

ryanjshaw3mo ago

What evidence will convince you?

1 more reply

krackers3mo ago

At best the notion of "subagents" today seems to be a hack to work around context length limits.

hakunin3mo ago

jaapz3mo ago

When the bubble has burst in a few years, the managers will have moved on to the next fad.

krupan3mo ago

Yes, if it even becomes as useful as a good linter or LSP. It actually has very little I'm common with those, but maybe it could be as useful.

disgruntledphd23mo ago

It's definitely helpful for search and summarisation.

In terms of prototyping, I can see the benefits but they're negated by the absurd amount of work it takes to get the code into more maintainable form.

I guess you can just do really heavy review throughout, but then you lose a lot of the speed gains.

That being said, putting a textual interface over everything is super cool, and will definitely have large downstream impacts, but probably not enough to justify all the spending.

iosovi3mo ago

The original Gas Town article reads like a terminal entry in Fallout

zmmmmm3mo ago

bfrog3mo ago

Claude is ok. Gas town seems like a Claude multiplier. I’m not sure more Claude is what I’d even want!

oofbey3mo ago

e12e3mo ago

Well, according to Yegge, the next step is an SDK/framework for building your own ai orchestrator - so I expect we'll see support for more than Claude soon.

furyofantares3mo ago

This feels like the same thing. Too early, but we're definitely headed in the direction of finding ways to use more tokens to get more mileage per prompt.

hota_mazi3mo ago

Show, don't tell.

If you need ten pages to explain your project and even after I read your description, I'm still left confused why I need it at all, then maybe... I don't need it?

devin3mo ago

Maintenance Manager Checker Agent and the rest of the nouns Yegge employs are ironic given his Kingdom of Nouns essay.

dragonwriter3mo ago

“Maintenance Manager Checker Agent is not a noun Yegge employs”, it is Brinker’s term for Yegge’s “Boot the Dog”.

0xbadcafebee3mo ago

mccoyb3mo ago

I think Yegge and Huntley are smart guys.

I don't think they're doing a good job incubating their ideas into being precise and clearly useful -- there is something to be said about being careful and methodical before showing your cards.

Show, don't tell -- my major complaint about this group is that they are proselytizing about vibe coding tools ... without serious software to show for it.

Let's see some serious fucking software. I'm looking for new compilers, browsers, OSes -- and they better work. Otherwise, what are we talking about? We're counting foxes before the hunt.

mccoyb3mo ago

Here's a separate, optimistic comment about Yegge and Huntley: they are obviously on the right track.

In a recent video about Loom (Huntley's orchestration tool), Huntley comments:

"I've got a single goal and that is autonomous evolutionary software and figuring out what's needed to be there."

which is extremely interesting and sounds like great fun.

It's a nightmare for the model of software development and human organization which is "productive" today, but an extremely compelling vision for those dabbling in the alternative.

PKop3mo ago

> they are obviously on the right track

How can you just assert that? It's fine to say it looks like the right track to you. But in what way is it obvious?

3dsnano3mo ago

why do we drink it? because its awesome and makes software 100X more FUN than it used to be. what yegge + huntley are doing is intensely creative. they are having FUN. and i am have FUN!!!!!

skybrian3mo ago

vessenes3mo ago

Don’t be mad!

bob10293mo ago

> Persistent Worker Agents, which you talk to directly (not through the Mayor),

I had a bit of a chuckle.

I think there is value in anything approximating a proposer-verifier loop, but I don't know that this is the most ideal approach.

ipnon3mo ago

cagenut3mo ago

Animats3mo ago

In games, what the NPCs can do is usually rather dumb. Move and shoot is usually most of their functionality. This keeps the overhead down so the system is affordable.

Gas Town may be a step towards AIs which have an ongoing sense of what they're doing. I'm not going to get into the "consciousness" debate, but it's closer to liveness.

jrowen3mo ago

Philpax3mo ago

https://www.rockpapershotgun.com/why-fears-ai-is-still-the-b...

nhinck33mo ago

UO? I don't remember any complex NPC systems there. Ultima 7 had daily schedules and the veneer of a functioning economy though

jrowen3mo ago

Well, I said aspired to. They had ambitions that eventually got walked back to more basic behaviors. Here's an article from Raph Koster on it:

https://www.raphkoster.com/2006/06/09/why-dont-our-npcs/

neilv3mo ago

I haven't read the Yegge post closely, so just commenting that namespaces (or naming conventions) would make the easier-to-casually-read names more practical...

For example, if Polecat becomes GasTown.WorkerAgent (or GasTown.Worker), then you always have both an unambiguous way and a shorthand-in-context way of referring to the concept.

(For naming conventions when you don't have namespaces as a language feature, use prefixes within the identifier, such as `GasTown_Worker`.)

CuriouslyC3mo ago

Spec your software like an architect/po, decompose it into a task dag, then orchestrate for each lane and assemble all change sets in a merge branch rather than constantly repointing head.

Kostchei3mo ago

vessenes3mo ago

It not only sort of works, the 10% of the time it works surprisingly well at scale! Tantalizing.

jkhdigital3mo ago

arcanemachiner3mo ago

SDLC = Software Development Life Cycle (?)

CuriouslyC3mo ago

Correct

vivzkestrel3mo ago

- someone really needs to rewrite that entire article without all that jargon

rilindo3mo ago

The overuse of metaphors makes me feel like this person is trying to reinvent Chef, but for AI.

zbyforgotp3mo ago

At some point evolving software instead of designing it will work. Now the evolutionary pressure leads towards churning more tokens.

grebc3mo ago

Steve Yegge used to have interesting, albeit long winded, things to say re software.

1 more reply

the_real_cher3mo ago

This is hilarious and insane and amazing.

TheRealPomax3mo ago

Thought this would be about Vancouver, but unfortunately it was just about an AI company =(

fasbiner3mo ago

brushfoot3mo ago

Gas Town seems like a more confusing/expensive alternative to GitHub Copilot Agents. https://github.com/copilot/agents

Go to the URL, type what you want done, and a cloud Claude agent creates a PR. $10/month.

Kostchei3mo ago

(spelling)

brushfoot3mo ago

I'm not trolling. I'm just not aware of major differences between them.

When I make a change with a Copilot Agent, it checks for issues, builds my project, runs tests, and iterates until things work. Multiple agents can do that in parallel.

My impression was that this does more or less the same thing.

That said, I'm definitely open to learning more about them both.

What are the advantages of this in your experience?

vessenes3mo ago

It is worth an install; it works very differently than an agent in a single loop.

My take - Gas town (as an architecture) certainly has more nines in it than a single agent; the rest is just a lot of fun experimentation.

1 more reply

Capricorn24813mo ago

In your post history you say you have never programmed. Why are you so sure it produces code of value?

This is so prohibitively expensive in its wastefulness that blithely telling strangers to try the tools likely means you either haven't tried it, or have money to burn.

dcmatt3mo ago

Gas Town wasn't satire?

never_inline3mo ago

Poe's law &c.

aswegs83mo ago

Wait, what? When did we come to that conclusion?

SomaticPirate3mo ago

Don’t forget the apparent crypto grift angle now (something related to BAGS)

Ridiculous. Beads might be passable software but gas town just appears to be a good way to burn tokens at the moment

ohazi3mo ago

If it's not a joke... I have no words. You've all gone insane.

danpalmer3mo ago

These chatbots create an echo chamber unlike that which we've ever had to deal with before. If we thought social media was bad, this is way worse.

singingbard3mo ago

I think the underlying approach seems sensible.

The problem with Gas Town is how it was presented. The heavy metaphor and branding felt distracting.

It’s a bit like reading the Dune book, where you have to learn a whole vocabulary of new terms before you can get to the interesting mechanics, which is a tough ask in an already crowded AI space.

danpalmer3mo ago

The best bit about it was the agentic coding maturity model he presented. That was actually great.

nonethewiser3mo ago

It's a double edged sword. If it can lead the uninformed down the wrong path faster, it can lead the informed down the right path faster. It's not only fast in one direction.

bwestergard3mo ago

1 more reply

pcthrowaway3mo ago

People will make mistakes, and AI holding their hand and guiding them while they do it can have disastrous consequences.

But it's nice that the arrows will appear to also guide people going the right way I guess.

bobjordan3mo ago

Not sure you’ve actually tried using it, but beads has been an absolute game changer for my projects. “Game changer” is even underselling it.

wild_egg3mo ago

Lot of folks rolling their own tools as replacements now. I shared mine [0] a couple weeks ago and quite a few folks have been happy with the change.

Regardless of what you do, I highly recommend to everyone that they get off the Beads bandwagon before it crashes them into a brick wall.

[0] https://github.com/wedow/ticket

2 more replies

mattgreenrocks3mo ago

How do you handle the dogs ignoring the deacons and going after the polecats though? Seems like the mayor should get involved to me.

2 more replies

wenc3mo ago

I'm not entitled to your time of course, but would you mind describing how?

2 more replies

pjm3313mo ago

jcims3mo ago

This is the best take I've seen in here.

Who knows where it ends up, but we will see more of this and whatever it is will have lessons learned from Gas Town in it.

dunk0103mo ago

0xbadcafebee3mo ago

PKop3mo ago

Where is the working software it produces? Do you have a repo you've made with it as an example?

Kostchei3mo ago

yeh the repo is Gas Town

astrange3mo ago

It doesn't have to exclusively be one or the other.

> If it's not a joke... I have no words. You've all gone insane.

I think this is covered by the part in Yegge's post where he says not to run it unless you're so rich you don't care if it works or not.

chrisjj3mo ago

How rich do you have to be not care about the environmental cost?

nl3mo ago

I think Andrew Ng wrote a great piece on this.

For example, in the US, which do you think uses more water: Golf Courses or Data Centers?

  a) Gold Courses use twice as much water as Data Centers
  b) About the same
  c) Data Centers use twice as much water as Gold Courses

Do you think a Google search or a Gemini query produces more carbon?

https://www.deeplearning.ai/the-batch/issue-336/

3 more replies

astrange3mo ago

That's an Internet meme and not a real issue.

1 more reply

Retr0id3mo ago

It's kinda like how edgy political takes are often wrapped in seven layers of meta-irony. If the audience reaction is negative you can say it was just a joke that didn't land.

AlexCoventry3mo ago

No, not a joke. The author also co-vibe-coded a book, called Vibe Coding, describing and recommending exactly the sort of system he's trying to build as Gas Town.

Quarrelsome3mo ago

> If it's not a joke... I have no words. You've all gone insane.

Consider a deterministic life form looking at how we program software today, that might look insane to it and gastown might look considerably more sane.

hota_mazi3mo ago

I mean, Gas Town is 100% vibe coded, and its very own author says AI can't be trusted to write reliable code.

Draw your own conclusion.

never_inline3mo ago

From Steve yegge's post

> Better UIs will come. But tmux is what you have for now. And it’s worth learning.

So brother has 2 claude code accounts and couldn't vibe code a UI, huh?

Cedricgc3mo ago

I'm developing concern for Steve. He's been a well known developer and writer in the industry for years now (See his popular 'Google Platforms Rant' essay from years ago) [0].

Now, Yegge's writing tilts towards the grandoise... see his writing when joining Grab [1] and Sourcegraph [2] respectively versus how things actually played out.

Hope someone is looking out for him.

[0] https://courses.cs.washington.edu/courses/cse452/23wi/papers...

[1] https://steve-yegge.medium.com/why-i-left-google-to-join-gra...

[2] https://sourcegraph.com/blog/introducing-steve-yegge

refulgentis3mo ago

He was right about Google in [1] when I was still drinking the Kool-Aid, in big and tangible ways that aren't discussed publicly.

[2] is 100% accurate, Grok was the backbone / glue of Google's internal developer tools.

Cedricgc3mo ago

I should be specific. I think the technical analysis is reasonable and I actually enjoy someone staking on a big vision, which is why I saved these pieces.

An example:

Which should hardly surprise you, because that’s exactly what this is. This is war.

I am giving everything I’ve got to help Grab win. I am all in. You’d be amazed at what you can accomplish when you’re all in."

tom_3mo ago

1 more reply

draw_down3mo ago

driverdan3mo ago

The Gas Town post reads like some type of manic psychosis. I hope he snaps out of it and gets help.

dang3mo ago

Please don't do internet psychiatric diagnosis on HN. I know that often the intention is good, but it leads to bad places.

https://hn.algolia.com/?sort=byDate&type=comment&dateRange=a...

(and I realize the GP was the place the line started getting crossed)

1 more reply

jumploops3mo ago

I ran the Gas Town intro post through ChatGPT 5.2 Pro[0]

Based on my initial read, and a pass at this summary, it seems mostly right. YMMV

Did some further dives into the little public usage data from Gas Town, and found that most of the "Beads" are tasks that are broken down quite small, almost too small imo.

Super interesting project with the goal of keeping Claude "busy" however it feels more like a casino game than something I'd use for production engineering.

[0]https://gist.github.com/jumploops/2e49032438650426aafee6f43d...

j / k navigate · click thread line to collapse