>People really don't understand how much better LLM swarms get with more agents. I never hit a point of diminishing returns on text quality
Could you elaborate, please?
One use for swarms is to replace a single agent with one long prompt by multiple agents with shorter prompts, increasing performance by splitting one big task into many. It is very time-consuming, though: it requires experimenting to find the best way to divide the task into subtasks, including writing code to parse and sanitize each subtask's output and plug it back into the rest of the agent graph.
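To make the pattern concrete, here is a minimal sketch of that decomposition. The agent functions are stubs standing in for LLM calls (their names and the outline format are illustrative, not from any particular framework); the point is the glue code that parses and sanitizes one agent's output before feeding it to the next:

```python
def outline_agent(topic: str) -> str:
    # Stub for an LLM call that returns a numbered outline.
    return f"1. Intro to {topic}\n2. Details of {topic}\n3. Summary"

def section_agent(heading: str) -> str:
    # Stub for an LLM call that drafts one section of the text.
    return f"[draft for: {heading}]"

def sanitize_outline(raw: str) -> list[str]:
    # Parse the outline into headings, dropping numbering and blank lines.
    headings = []
    for line in raw.splitlines():
        line = line.strip()
        if not line:
            continue
        # Strip a leading "N. " marker if present.
        head, _, rest = line.partition(". ")
        headings.append(rest if head.isdigit() else line)
    return headings

def run_pipeline(topic: str) -> str:
    # One agent produces the outline, a second drafts each section;
    # sanitize_outline is the hand-written plumbing between them.
    headings = sanitize_outline(outline_agent(topic))
    sections = [section_agent(h) for h in headings]
    return "\n\n".join(sections)

print(run_pipeline("swarms"))
```

In a real swarm each stub would be an API call, and the sanitizer would also handle malformed outputs (retries, format repair), which is where most of the experimentation time goes.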
DSPy [1] seems to target this problem space, but last time I checked it focused only on single-prompt optimization (for instance, selecting which few-shot examples yield the best prompt performance). Although research on the topic exists [2][3][4], I have yet to find a framework that tackles agent graph optimization.
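As a toy illustration of what agent graph optimization could look like, here is the outer search loop only: each candidate "graph" is an ordered subset of processing stages, scored by a mock metric. All names here are hypothetical, and the metric is a stand-in for an LLM-judged score on a validation set; real work such as [4] optimizes edges and prompts jointly with learned methods rather than exhaustive search:

```python
import itertools

# Hypothetical processing stages an agent graph could include.
STAGES = ["outline", "draft", "critique", "revise"]

def mock_score(graph: tuple[str, ...]) -> float:
    # Stand-in for evaluating the graph's output quality on a
    # validation set (e.g. with an LLM judge). Purely illustrative:
    # more stages help, and a critique->revise pair helps extra.
    score = float(len(graph))
    if "critique" in graph and "revise" in graph:
        score += 2.0
    return score

def optimize() -> tuple[str, ...]:
    # Exhaustively score every subset of stages and keep the best.
    best, best_score = (), float("-inf")
    for r in range(1, len(STAGES) + 1):
        for graph in itertools.combinations(STAGES, r):
            s = mock_score(graph)
            if s > best_score:
                best, best_score = graph, s
    return best

print(optimize())  # -> ('outline', 'draft', 'critique', 'revise')
```

The hard part a framework would need to solve is exactly what this sketch dodges: the search space over real graphs (topology plus per-node prompts) is huge, and each evaluation costs real LLM calls.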
[1] DSPy: The framework for programming—not prompting—foundation models: https://github.com/stanfordnlp/dspy
[2] TextGrad: Automatic 'Differentiation' via Text -- using large language models to backpropagate textual gradients: https://github.com/zou-group/textgrad
[3] What's the Magic Word? A Control Theory of LLM Prompting: https://arxiv.org/abs/2310.04444
[4] Language Agents as Optimizable Graphs: https://arxiv.org/abs/2402.16823