I updated a Svelte component at work, and while I could test it in the browser and see it worked fine, the existing unit test suddenly started failing. I spent about an hour trying to figure out why the results logged in the test didn't match the results in the browser.
I got frustrated, gave in, and asked Claude Code, an AI agent. The tool call loop is something like: it reads my code, then looks up the documentation, then proposes a change to the test which I approve, then it re-runs the test, feeds the output back into the AI, re-checks the documentation, and then proposes another change.
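For anyone who hasn't watched one of these tools run, the loop has roughly this shape. A minimal runnable sketch; every method here is a made-up stand-in, not Claude Code's actual internals:

    # All stand-ins: the real agent shells out to your files, docs, and test runner.
    def read_code  = File.read(__FILE__)
    def read_docs  = "relevant docs excerpt"
    def run_tests  = "1 test failed"                  # imagine `npm test` output
    def tests_pass?(output) = output.include?("0 tests failed")
    def propose_change(ctx) = "a diff based on #{ctx.length} chars of context"
    def approved?(diff)     = true                    # the human-in-the-loop gate
    def apply_change(diff)  = nil

    context = read_code + read_docs
    5.times do                                        # in practice, a token budget
      diff = propose_change(context)
      apply_change(diff) if approved?(diff)
      output = run_tests
      break if tests_pass?(output)
      context += output + read_docs                   # feed results back, re-check docs
    end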
It's all quite impressive, or it would be if it hadn't at one point randomly said "we fixed it! The first element is now active" -- except it wasn't. Claude thought the first element was element [1], when of course the first element of an array is [0]. The test hadn't even actually passed.
An hour and a few thousand Claude tokens my company paid for, with nothing to show for it, lol.
Even in this example the coding agent is short-lived. I am curious about continuously running agents that are never done.
An example of my own, not agentic or running in a loop, but maybe an interesting use case for this stuff: I had a CSV file of old coupon codes I needed to process. Everything would start in limbo, uncategorized. Then I wanted to be able to search for some common substrings and delete them, and search for other common substrings and keep them. I described what I wanted to Claude 3.7 and it built out a Ruby script that gave me an interactive menu of commands: search to select, show all, delete selected, keep selected (something like the sketch below). It was an awesome little throwaway script that would’ve taken me embarrassingly long to write myself. I could’ve done it all by hand in Excel, or at the command line with grep and the like, but I think that would’ve taken longer.
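From memory, what it produced was something in this spirit. A rough sketch, not the actual script it wrote; the filenames and command names are placeholders:

    require "csv"

    # Everything starts in limbo, uncategorized.
    codes    = CSV.read("coupons.csv").flatten    # headerless CSV, one code per row
    status   = codes.to_h { |c| [c, :limbo] }
    selected = []

    loop do
      print "search <substr> | all | delete | keep | quit > "
      cmd, arg = (gets || "quit").strip.split(" ", 2)
      case cmd
      when "search"   # select still-uncategorized codes containing the substring
        selected = codes.select { |c| status[c] == :limbo && c.include?(arg.to_s) }
        puts "#{selected.size} codes selected"
      when "all"      # show every code and its current status
        status.each { |c, st| puts format("%-8s %s", st, c) }
      when "delete"
        selected.each { |c| status[c] = :deleted }
        selected = []
      when "keep"
        selected.each { |c| status[c] = :kept }
        selected = []
      when "quit"
        break
      else
        puts "unknown command"
      end
    end

    # Dump the triaged codes back out for whatever comes next.
    CSV.open("coupons_triaged.csv", "w") do |out|
      status.each { |c, st| out << [c, st.to_s] }
    end

Nothing fancy, just a select/act loop over a hash of statuses, but exactly the kind of throwaway tool that is annoying to write by hand.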
Honestly, one of the hard things about using AI for me is remembering to try it, or coming up with interesting things to try -- building up that new pattern recognition.