Thanks for clarifying this. Could you clarify whether your chip supports the transformer architecture in general, or only specific models, e.g. Llama 70B? In the latter case, would your ASIC have to be reprogrammed for each model?
This seems to be a novel definition of "smarter" - one could also argue that the printed answer key for a standardized test is smarter than most humans.
I made a request to access their developer cloud. Anyone have any idea when they'll start processing those requests and how many slots they might have?
The more the merrier. If we do need power plants' worth of juice, then it may as well be dedicated custom hardware to optimise this to the max.
I mean, are they? Seems like the industry would prefer these things to become commodities, especially if it helps with portability and reproducibility.
Of course GPUs (and CPUs, and pre-transformer/LLM-era NPUs/TPUs) are also going to evolve to better suit LLM workloads. So we may see a convergence in capabilities.
What is for sure is that LLM chip architecture is not yet settled. Currently available chips were mostly designed before the LLM craze, with only slight adaptations, and they leave large room for improvement: at least 10-100x (maybe 1000x) in power efficiency. And power efficiency is a key factor (in addition to upfront hardware investment), which is the opportunity this startup (and others) have identified.