undefined | Better HN

0 pointsdazzaji1y ago0 comments

[flagged]

0 comments

13 comments · 3 top-level

frotaur1y ago· 9 in thread

I'm very sorry if this isn't the case, but this message really feels LLM-written.

It’s gracious of you to say that you’d be sorry, and I did run my comment through 4o (perhaps ironically) which caught a slew of typos and weird grammar issues and offered some improvements. But the robotic sound and anything else you don’t like are my own responsibility. Do you, perhaps, have any thoughts on the substance of the comment?

3 more replies

jorvi1y ago

Its because of the em dashes (- is a normal dash, — is an em dash). Very few real people use those outside of writing books or longform articles.

There's also some strange wordings like "back-pocket tests."

It's 100% LLM generated.

What is much scarier is that those "quick reply" blurbs on Android/Gmail (and iOS?) will be able to be trained on your entire e-mail and WhatsApp history. That model will have your writing mannerisms and even be a stochastic mimic of your reasoning. So, you won't be able to even realize a model answered you, not a real person. And the initial message the model is responding to might be written by the other person's personal model.

The future of digital interactions might have some sort of cryptographic signing guaranteeing you're talking to a human being, perhaps even with blocked copy-pasting (or well, that part of the text shows up as unverified) and cheat detection.

Going even a layer deeper / more meta: what does it ultimately matter? We humans yearn for connection, but for some reason that connection only feels genuine with another human. Whereas, what is the difference between a human typing a message to you, a human inhabiting a robot body, a model typing a message to you, and a model inhabiting a robot body, if they can all give you unique interactions?

1 more reply

returnInfinity1y ago

And this is the reason, I have choose to write grammatically wrong content online. And basic english only, no fancy words.

1 more reply

tmikaeld1y ago

It may also be deliberate, I know a lot of people that are very dyslexic and are using AI for making themselves understood online.

trash_cat1y ago

It's the dashes that make it a dead-giveaway.

transcriptase1y ago

“ — “ is the giveaway.

1 more reply

thomashop1y ago

Why sorry? So what?

I often write things I want to post in bullets and then have it formulated better than I could by an LLM. But its just applying a style. The content comes from me.

My wife is dyslexic so she passes most things she writes through ChatGPT. Also not everyone is a native speaker.

joaohaas1y ago

TBH I've recently felt like that for ~70% of 'top-level replies' in HN, which has slowly pushed me to other mediums (mastodon and discord).

Could just be that the AI 'boom' brought a less programming-focused crowd into the site and those people lack the vocabulary that is constantly used here, who knows.

1 more reply

Oarch1y ago

I'm a big fan of sprinkling in a little profanity just to pass the LLM bullshit check

iamnotagenius1y ago· 1 in thread

I liked Grok 3 fiction writing style; catches lots of physics of mundane situations such as ringing echo in a closed bathroom we all know well; the prose feels very lively as the result. Kinda like R1 makes situations sharp with details, Grok 3 makes the other way around - rounded by using details.

dazzajiOP1y ago

That sounds like very evocative prose. Would you be up for sharing some of that fiction? I haven’t tried Grok 3 for that purpose and now I’m curious.

2 more replies

dazzajiOP1y ago

Here’s the conclusion of a much more refined initial review by Andrej Karpathy [1] which, I think overall, comports with the substance of my own hot take:

“As far as a quick vibe check over ~2 hours this morning, Grok 3 + Thinking feels somewhere around the state of the art territory of OpenAI's strongest models (o1-pro, $200/month), and slightly better than DeepSeek-R1 and Gemini 2.0 Flash Thinking. Which is quite incredible considering that the team started from scratch ~1 year ago, this timescale to state of the art territory is unprecedented. Do also keep in mind the caveats - the models are stochastic and may give slightly different answers each time, and it is very early, so we'll have to wait for a lot more evaluations over a period of the next few days/weeks. The early LM arena results look quite encouraging indeed. For now, big congrats to the xAI team, they clearly have huge velocity and momentum and I am excited to add Grok 3 to my "LLM council" and hear what it thinks going forward.”

[1] Full review at: https://x.com/karpathy/status/1891720635363254772?s=46&t=91u...

j / k navigate · click thread line to collapse

0 comments

13 comments · 3 top-level

frotaur1y ago· 9 in thread

I'm very sorry if this isn't the case, but this message really feels LLM-written.

dazzajiOP1y ago

3 more replies

jorvi1y ago

Its because of the em dashes (- is a normal dash, — is an em dash). Very few real people use those outside of writing books or longform articles.

There's also some strange wordings like "back-pocket tests."

It's 100% LLM generated.

1 more reply

returnInfinity1y ago

And this is the reason, I have choose to write grammatically wrong content online. And basic english only, no fancy words.

1 more reply

tmikaeld1y ago

It may also be deliberate, I know a lot of people that are very dyslexic and are using AI for making themselves understood online.

trash_cat1y ago

It's the dashes that make it a dead-giveaway.

transcriptase1y ago

“ — “ is the giveaway.

1 more reply

thomashop1y ago

Why sorry? So what?

I often write things I want to post in bullets and then have it formulated better than I could by an LLM. But its just applying a style. The content comes from me.

My wife is dyslexic so she passes most things she writes through ChatGPT. Also not everyone is a native speaker.

joaohaas1y ago

TBH I've recently felt like that for ~70% of 'top-level replies' in HN, which has slowly pushed me to other mediums (mastodon and discord).

Could just be that the AI 'boom' brought a less programming-focused crowd into the site and those people lack the vocabulary that is constantly used here, who knows.

1 more reply

Oarch1y ago

I'm a big fan of sprinkling in a little profanity just to pass the LLM bullshit check

iamnotagenius1y ago· 1 in thread

dazzajiOP1y ago

That sounds like very evocative prose. Would you be up for sharing some of that fiction? I haven’t tried Grok 3 for that purpose and now I’m curious.

2 more replies

dazzajiOP1y ago

Here’s the conclusion of a much more refined initial review by Andrej Karpathy [1] which, I think overall, comports with the substance of my own hot take:

[1] Full review at: https://x.com/karpathy/status/1891720635363254772?s=46&t=91u...

j / k navigate · click thread line to collapse