undefined | Better HN

0 pointsofjcihen11mo ago0 comments

I feel like we get one of these articles that addresses valid AI criticisms with poor arguments every week and at this point I’m ready to write a boilerplate response because I already know what they’re going to say.

Interns don’t cost 20 bucks a month but training users in the specifics of your org is important.

Knowing what is important or pointless comes with understanding the skill set.

0 comments

mountainriver11mo ago

I feel the opposite, and pretty much every metric we have shows basically linear improvement of these models over time.

The criticisms I hear are almost always gotchas, and when confronted with the benchmarks they either don’t actually know how they are built or don’t want to contribute to them. They just want to complain or seem like a contrarian from what I can tell.

Are LLMs perfect? Absolutely not. Do we have metrics to tell us how good they are? Yes

I’ve found very few critics that actually understand ML on a deep level. For instance Gary Marcus didn’t know what a test train split was. Unfortunately, rage bait like this makes money

Night_Thastus11mo ago

Models are absolutely not improving linearly. They improve logarithmically with size, and we've already just about hit the limits of compute without becoming totally unreasonable from a space/money/power/etc standpoint.

We can use little tricks here and there to try to make them better, but fundamentally they're about as good as they're ever going to get. And none of their shortcomings are growing pains - they're fundamental to the way an LLM operates.

2 more replies

nickpsecurity11mo ago

"pretty much every metric we have shows basically linear improvement of these models over time."

They're also trained on random data scraped off the Internet which might include benchmarks, code that looks like them, and AI articles with things like chain of thought. There's been some effort to filter obvious benchmarks but is that enough? I cant know if the AI's are getting smarter on their own or more cheat sheets are in the training data.

Just brainstorming, one thing I came up with is training them on datasets from before the benchmarks or much AI-generated material existed. Keep testing algorithmic improvements on that in addition to models trained on up to date data. That might be a more accurate assessment.

1 more reply

attemptone11mo ago

>I feel the opposite, and pretty much every metric we have shows basically linear improvement of these models over time.

Wait, what kind of metric are you talking about? When I did my masters in 2023 SOTA models where trying to push the boundaries by minuscule amounts. And sometimes blatantly changing the way they measure "success" to beat the previous SOTA

1 more reply

mrkurt11mo ago

PLEASE write your response. We'll publish it on the Fly.io blog. Unedited. If you want.

ofjcihenOP11mo ago

I’m uninterested in giving you content. In particular because of your past behavior.

Thanks for the offer though.

2 more replies

kubb11mo ago

Maybe make a video of how you're vibecoding a valuable project in an existing codebase, and how agents are saving you time by running your tools in a loop.

2 more replies

briandrupieski11mo ago

> with poor arguments every week

This roughly matches my experience too, but I don't think it applies to this one. It has a few novel things that were new ideas to me and I'm glad I read it.

> I’m ready to write a boilerplate response because I already know what they’re going to say

If you have one that addresses what this one talks about I'd be interested in reading it.

slg11mo ago

>> with poor arguments every week

>This roughly matches my experience too, but I don't think it applies to this one.

I'm not so sure. The argument that any good programming language would inherently eliminate the concern for hallucinations seems like a pretty weak argument to me.

2 more replies

kubb11mo ago

There's also the reverse genre: valid criticism of absolutely strawman arguments that nobody makes.

tptacek11mo ago

Which of the arguments in this post hasn't occurred on HN in the past month or so?

2 more replies

csallen11mo ago

Can you direct me somewhere with superior counterarguments? I'm quite curious

calf11mo ago

What valid AI criticisms? Most criticisms of AI are not very deep nor founded in complexity theoretic arguments, whereas Yann LeCun himself gave an excellent 1 slide explanation of the limits of LLMs. Most AI criticisms are low quality arguments.

therealpygon11mo ago

“Valid” criticism rarely come from the people barely capable of understanding the difference between AI and LLMs, and using them interchangeably.

j / k navigate · click thread line to collapse

0 comments

mountainriver11mo ago

I feel the opposite, and pretty much every metric we have shows basically linear improvement of these models over time.

Are LLMs perfect? Absolutely not. Do we have metrics to tell us how good they are? Yes

I’ve found very few critics that actually understand ML on a deep level. For instance Gary Marcus didn’t know what a test train split was. Unfortunately, rage bait like this makes money

Night_Thastus11mo ago

2 more replies

nickpsecurity11mo ago

"pretty much every metric we have shows basically linear improvement of these models over time."

1 more reply

attemptone11mo ago

>I feel the opposite, and pretty much every metric we have shows basically linear improvement of these models over time.

1 more reply

mrkurt11mo ago

PLEASE write your response. We'll publish it on the Fly.io blog. Unedited. If you want.

ofjcihenOP11mo ago

I’m uninterested in giving you content. In particular because of your past behavior.

Thanks for the offer though.

2 more replies

kubb11mo ago

Maybe make a video of how you're vibecoding a valuable project in an existing codebase, and how agents are saving you time by running your tools in a loop.

2 more replies

briandrupieski11mo ago

> with poor arguments every week

This roughly matches my experience too, but I don't think it applies to this one. It has a few novel things that were new ideas to me and I'm glad I read it.

> I’m ready to write a boilerplate response because I already know what they’re going to say

If you have one that addresses what this one talks about I'd be interested in reading it.

slg11mo ago

>> with poor arguments every week

>This roughly matches my experience too, but I don't think it applies to this one.

I'm not so sure. The argument that any good programming language would inherently eliminate the concern for hallucinations seems like a pretty weak argument to me.

2 more replies

kubb11mo ago

There's also the reverse genre: valid criticism of absolutely strawman arguments that nobody makes.

tptacek11mo ago

Which of the arguments in this post hasn't occurred on HN in the past month or so?

2 more replies

csallen11mo ago

Can you direct me somewhere with superior counterarguments? I'm quite curious

calf11mo ago

therealpygon11mo ago

“Valid” criticism rarely come from the people barely capable of understanding the difference between AI and LLMs, and using them interchangeably.

j / k navigate · click thread line to collapse