Remember nodes and graphs? A comfy user interface allows some pretty incredible wiring among models; local AI is like eurorack. The current graph skews heavily toward a pair of small dense models collaborating, with the large heavyweights pulled in selectively. It's Qwen 3.6 27B with Gemma 4 31B, both unquantized (bf16/fp16), plus Phi 14B and Nemotron Cascade 2, and then those large heavyweights: R1 and subsequent DeepSeek models including Speciale, GPT-OSS 120B, GLM, MiniMax, Kimi, Command R, the Mistrals, everybody, all in one graph, all those LLM nodes patched and interconnected. Slow, resource-intensive, better than non-local AI. I used Matteo's GraphLLM for inspiration (and ComfyUI, and ST), and used the models themselves to roll a new imgui node/graph model compositor. Now what?!
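The patching idea above can be sketched in a few lines. This is a minimal toy, not the actual compositor: the node names and the lambda "models" are hypothetical stand-ins for real inference calls, and the graph is just small drafts flowing downstream into a heavyweight.

```python
# Minimal sketch of an LLM node graph: nodes wrap a model call, patches
# carry one node's output into the next, eurorack-style. All model names
# and the stub run() callables here are hypothetical placeholders.
from dataclasses import dataclass, field
from typing import Callable

@dataclass
class Node:
    name: str
    run: Callable[[str], str]                      # stand-in for a model call
    outputs: list["Node"] = field(default_factory=list)

def patch(src: Node, dst: Node) -> None:
    """Wire src's output jack into dst's input."""
    src.outputs.append(dst)

def drive(node: Node, prompt: str) -> dict[str, str]:
    """Push text through the graph, collecting every node's output."""
    results = {node.name: node.run(prompt)}
    for nxt in node.outputs:
        results.update(drive(nxt, results[node.name]))
    return results

# Two small dense models collaborate; a heavyweight is patched in last.
small_a = Node("qwen-small", lambda p: f"[draft] {p}")
small_b = Node("gemma-small", lambda p: f"[critique] {p}")
heavy = Node("deepseek-heavy", lambda p: f"[final] {p}")
patch(small_a, small_b)
patch(small_b, heavy)

out = drive(small_a, "plan a refactor")
# out["deepseek-heavy"] == "[final] [critique] [draft] plan a refactor"
```

In the real thing each `run` would be an inference call and routing would be conditional, but the shape is the same: a graph walk where edges are patch cables.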
A hardware analogy: an amplifier might have an open-loop gain of a hundred million or more, but try to actually use it without some negative feedback and it will only ever give you one of two possible output levels, or a whole lot of noise.
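The numbers behind that analogy are worth working out once. With open-loop gain A and feedback fraction beta, the standard closed-loop gain is A / (1 + A*beta): the enormous raw gain gets traded away for a stable, usable one set almost entirely by the feedback network. The beta value below is an illustrative assumption, not from the text.

```python
# Closed-loop gain of an amplifier with negative feedback.
A = 1e8          # open-loop gain: "a hundred million", as above
beta = 0.01      # feedback fraction (assumed; set by a resistor divider)

closed_loop = A / (1 + A * beta)
# closed_loop is ~100, i.e. ~1/beta: the feedback network, not A,
# now determines the gain, which is what makes the amp usable at all.
```

The parallel for the model graph: raw capability without some corrective loop between nodes just saturates or produces noise.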