Meet “Claude”: Anthropic’s rival to ChatGPT (opens in new tab)

(scale.com)

188 pointsgoodside3y ago156 comments

156 comments

105 comments · 25 top-level

LeoPanthera3y ago· 23 in thread

They say Claude is "more verbose", and claim this is a positive. I disagree. My biggest criticism of ChatGPT is that its answers are extraordinarily long and waffly. It sometimes reminds me of a scam artist trying to bamboozle me with words.

I would much prefer short, concise, precise answers.

goodsideOP3y ago

(I’m the coauthor of the post.) Before I talked to Claude, I would have agreed with you — I’ve had exactly this complaint about ChatGPT since its release. But Claude’s style of verbosity is somehow less annoying, I suspect because it contains more detail rather thus just waffling. Claude feels less prone to ChatGPT’s over-applied tendency to argue for middle-of-the-road, milquetoast points of view.

Roark663y ago

I just add "be succinct in your answer" when I want a less verbose answer.

code513y ago

You are able to prompt ChatGPT to be concise, you know? They set a default and showed it to the world. It is up to you to tune it according to your preference.

astrange3y ago

IIRC the ChatGPT paper actually says the verbosity is an unintended effect of the human raters preferring longer/more detailed answers.

Long answers from GPT are unusually obnoxious because of a way the decoder works; it emits words with a much more constant rate of perplexity than human text does (this is how GPT-vs-human detectors work) which makes it sound stuffy and monotone.

2 more replies

zone4113y ago

That's right. I see so many people get stuck thinking that the default setting without a proper prompt or context is all these models can do.

remify3y ago

One trick you can do is to copy a text you like the style and ask him to describe it.

You can then use the response as a request.

touringa3y ago

lol!

Excellent candidate for https://www.reddit.com/r/dontyouknowwhoiam/

logicallee3y ago

ChatGPT is extraordinarily good at following your requests for the format and style of its response. If you want very short terse words, just ask! For example "In a few very concise terse words, explain the idea behind heapsort. Be very brief, use just a few words." It's offline now so I can't test it but I expect the result to be good.

misja1113y ago

ChatGPT's response to your prompt:

> Heapsort: sort by building heap.

1 more reply

ma2rten3y ago

I have access to Claude. I think the length of the responses vary more depending on the context than at least the previous version of ChatGPT. It would give sometimes longer, but also sometimes shorter answers and it would feel better overall. But ChatGPT recently had an update and it also improved, so now I'm not sure anymore who is better.

Medox3y ago

Let them talk to each other, to battle out who is the best.

jokethrowaway3y ago

Adding "just code, don't talk" is a lifesaver

I'm glad chatgpt can't quit

logicallee3y ago

What do you mean by your second sentence?

1 more reply

teawrecks3y ago

ChatGPT can be very confidently wrong in few words too. It's a very flexible system that way.

skissane3y ago

I started asking ChatGPT some rather technical questions about Australian drug laws. First I asked it what schedule common ADHD medications were on, and it answered me correctly (schedule 8). Then I asked it what schedule LSD was on, and it told me it wasn’t on any schedule, because it was an entirely legal drug in Australia. Uh, I hope it doesn’t some day tell someone that and they actually believe it, because they may be in for a very unpleasant experience if the police happen to encounter them acting on it

irthomasthomas3y ago

I usually prime it with a list of instructions that it should follow for the remainder of the conversation, including to be brief. And when it starts forgetting my instructions, it is time to start a fresh chat.

MaxMatti3y ago

I have thought about doing that, would you mind sharing your starting prompt so that I can get some ideas?

1 more reply

hgomersall3y ago

This is why ChatGPT is impressively good at writing marketing copy.

blowski3y ago

And status updates.

1 more reply

looseyesterday3y ago

Totally agree, I often think of its writing style as a buzfeed coloumist on cocaine.

kristopolous3y ago

Also known as 𝑇ℎ𝑒 𝐴𝑡𝑙𝑎𝑛𝑡𝑖𝑐.

taraparo3y ago

That's one of the reasons why I often prefer the chat bot at you.com

bellajbadr3y ago

Afaik you can specify the response length!

flanked-evergl3y ago· 14 in thread

> Claude feels not only safer but more fun than ChatGPT.

I may be the minority here, but I really don't concern myself with ChatGPT safety and I am not entirely sure what the reason is why people are very worried about its safetly. It is safer than most things I have in my house, including a kettle, a saw, a hammer, a screwdriver, my actual PC, every kitchen appliance I have.

Of course it can be misused, like any tool, but no amount of safety features in ChatGPT will make users of it more or less careful in their use of it. If someone using ChatGPT cares nothing for using it safely then it will likely end poorly, just like it will end poorly if I use a hammer without any care for using it safely.

M4v3R3y ago

Yeah the "safety" bit probably refers more to safety of the parent company that made the bot. Too much focus on making a tool safe and you end up with a useless tool. Want to have "safe" knives? Just make them all blunt, problem "solved".

swhalen3y ago

I hope someone makes one of these things that has been trained without any concern for 'safety' or propriety, just for the sake of comparison.

JKolios3y ago

There are already several models "in the wild" designed specifically to produce anime-styled "adult material". In Stable Diffusion for image generation, NSFW filtering is basically a post-processing step that can be disabled in a single line of code. I'm sure someone has already done it for Chat* models as well.

derefr3y ago

I would note that

1. ChatGPT is just OpenAI's default model with a specific prompt and interaction-buffer rewrite model; and

2. you can sign up, for free, for access to OpenAI's API (https://openai.com/api/); which, among other things, gives you full (free-daily-API-credit-quota limited) access to an API "playground" frontend offering interaction with the exact same model ChatGPT is powered by — plus other models as well — all without any fixed prompt or forced interaction UX. (In web-dev terms, if ChatGPT is like a REST API, this playground is like a SQL fiddle for the DB that the REST API is backed by.)

(Why does this exist? Because the point of OpenAI's API playground is to test prompts and interaction models for the AI apps you're building yourself on top of their API; and you couldn't very well build apps with your own prompts and interaction models, if OpenAI was already imposing a prompt and interaction model upon you.)

rippercushions3y ago

Stable Diffusion is open source, and has already been twisted into positively disturbing dimensions by Reddit etc.

trabant003y ago

"Safe" has gradually shifted in meaning because of the internet. Since interactions are limited to words and reactions it ended up generally meaning "don't upset anybody" for commercial and political reasons. For this reason ChatGPT refuses to write a fiction story in which somebody dies even though death is part of even some children stories. It hurts no one of course, but you can bet some people will be upset and start a controversy.

And in general this influence of the internet has reversed "actions speak louder than words". Textile companies support minorities with social media posts and printed tshirts while using factories with abused underage workers in underdeveloped countries. IT companies support the left side of politics while avoiding paying taxes. And so on. There's a huge and interesting discussion about this but is of in this case quite off-topic.

flanked-evergl3y ago

> It hurts no one of course, but you can bet some people will be upset and start a controversy.

The situation sucks, but sure, you are right. Still, I want a good tool, and I don't mind if someone holds me accountable if I misuse it, that is fine, but I want the tool, and I will get it, maybe not this year, maybe not next year, but in the next 5 years I'm sure I will have something as capable as ChatGPT is now without artificial restrictions available to me.

Just seems rather pointless that we have to play this silly safetyism game until then.

goodsideOP3y ago

(I’m the coauthor of this post.) The concern in Anthropic’s case I suspect is less about present-day misuse and more about long-term safety, e.g. in a hypothetical where the model has control over real-world systems and could more literally harm someone.

flanked-evergl3y ago

If someone gives a language model the capability for unfettered interaction with the physical world, and they are not liable for the consequences, then no safety feature of Claude can save us. And if they are liable, that is the primary mechanism which will ensure they take necessary steps to avoid negative consequences.

1 more reply

phonebucket3y ago

> Of course it can be misused, like any tool, but no amount of safety features in ChatGPT will make users of it more or less careful in their use of it. If someone using ChatGPT cares nothing for using it safely then it will likely end poorly, just like it will end poorly if I use a hammer without any care for using it safely.

I'm not worried about hammers when I have them. I'm worried about hammers when someone who wants to hurt me has them.

Same with ChatGPT.

flanked-evergl3y ago

> Same with ChatGPT.

I get this to some extent, I just really fail to see how the safetyisms applied to ChatGPT is really going to protect anyone, so I would appreciate if you could elaborate on this.

One way which has somewhat been proven is, that through lowering the barrier to entry for writing software it has lowered the barrier to entry for writing malicious software, because malicious software is a kind of software. But the list of things that have lowered the barrier to entry for writing software is staggering, and to me ChatGPT is really just an increment on this. It may be a big and significant increment, but not as big as everything that preceded it in my view, so if we are to assign blame fairly, then most of it does not go to ChatGPT.

1 more reply

otabdeveloper43y ago

But ChatGPT might exhibit wrongthink tendencies, that would be plus-ungood.

amelius3y ago

The thing is used to write code ...

flanked-evergl3y ago

So is my editor, my computer, my fingers, my eyes, my brain, the internet, google, stackoverflow, wikipedia, English, etc.

1 more reply

anhner3y ago· 8 in thread

I'm hoping of one day running GPT3/ChatGPT on my local computer, similarly to how one can run Stable Diffusion now.

I would love to have a personal conversation with these AI systems, use them as a sort of assistant, without the worry of being spied on. At the moment I can't use it as more than a glorified search engine, because of the privacy implications of running it on the cloud.

jillesvangurp3y ago

There will always be this uncomfortable balance between need to know information to be useful being essentially the same as everything an adversary would need to really act against you.

With something like chat gpt being plugged into some voice assistant thing and having access to all your documents, emails, and other content, you could imagine having conversations about work content, content creation, calendar management, etc. Basically it would become like a secretary that is able to write letters based on your input, manage your calendar, etc. It could be pro-active and remind you about things, summarize incoming messages, search through your documents, message history, etc.

That's where AI becomes really useful. But the issue of trust is a big one. I don't think a lot of this requires a lot of breakthroughs either just a lot of integration work and engineering. Chat gpt is more a proof of concept than a well integrated thing at this point. It basically is running in isolation and it's only window to the world is chat. Changing that should not be that hard. Running things locally might help with this but it may not be a hard requirement for this. All depends on how useful this is.

jacooper3y ago

> That's where AI becomes really useful. But the issue of trust is a big one

Well google already has access to all of this data, the difference is unlike google assistant, ChatGPT actually can do something useful.

danielbln3y ago

There will always be an edge to these massively operated central models that you can't easily run at home, due to compute and cost. Things like 4-bit quantization will help making these foundational models significantly easier to operate on smaller hardware, and of course you might not need a foundational model for manby local usecases, but maybe a smaller, more specialized LLM is enough (those can even be trained by the bigger models).

Valakas_3y ago

Heh. Computers used fill an entire room, and cost the equivalent of a house. And now everyone carries one way more powerful in their pockets for the cost of a few meals in a restaurant.

1 more reply

spi3y ago

There's not much of an alternative: even if compute power gets extraordinarily cheap / models are very optimized, either you have a very, very large hard drive, or you have to use the internet for that. You just can't hope to have a model that is trained to know everything in a smallish file, the weights need to be at least as heavy as an ideally compressed version of all the things it knows (which is of course much less than the huge amount of data it is trained on, mainly due to redundancy, but still a lot)

Of course, right now you also need at least 8 super-expensive A100 GPUs and not just your laptop CPU, but maybe that's going to change eventually.

anhner3y ago

Does it need all that much space though? I mean Stable Diffusion was trained on hundreds of terrabytes of images, and the model only needs 4-5 GB of hardware space and a decent GPU to run.

I haven't read anywhere any stats about GPT3/ChatGPT yet (like how big the model is)

_pdp_3y ago

These models are way too big for consumer hardware.

anhner3y ago

Is it bigger than the Stable Diffusion model? (4-5 GB)

nathias3y ago· 8 in thread

we need less censorious AIs not more ...

the claim that it's somehow 'ethical' to have a guy baking in his opinions about things in a tool used globally is absurd to anyone who ever read anything about ethics

viraptor3y ago

We already had a learning AI without hard limits. Remember Microsoft's Tay? Ah, right, it was shut within days because it quickly became an asshole. (https://www.theverge.com/2016/3/24/11297050/tay-microsoft-ch...)

It's not even going to be objectively wrong a lot of the time. For example slaves are indeed a pretty efficient way to run a business. It's our limits and ethics that stops (most of) us from doing it. Without ethics, you'll likely always end up with a 4chan-bot instead of whatever you intended.

flanked-evergl3y ago

> Remember Microsoft's Tay? Ah, right, it was shut within days because it quickly became an asshole.

If we consider the context, which is not something that posts on twitter under Microsoft's brand name, but something you communicate with in private. Who exactly are you worried about here, the person who will prospectively coerce the language model into being an asshole? If they don't want to do that, they could just not do that.

If I make ChatGPT say something egregious, and post that on Twitter or Facebook, I'm posting it, and I'm liable, just as I would be liable if I used a word processor with spell checking to make text and post it on Twitter or Facebook.

> Without ethics, you'll likely always end up with a 4chan-bot instead of whatever you intended.

If I ask it to explain quantum physics to me in the style of Donald Trump because it is funny, and it does it (as ChatGPT used to do), who exactly is being harmed and under what system of ethics, because as you may know, ethics is not objective or universal.

2 more replies

Al-Khwarizmi3y ago

My hope is on a non-American alternative.

The American society seems too engulfed by puritanism to produce a less straightjacketed chatbot.

unnouinceput3y ago

Watch the show Person of Interest (https://www.imdb.com/title/tt1839578/). Somewhere around middle of season 2 it's explained why a self-aware AI is straightjacketed. Also the series shows what happens when one is not.

1 more reply

krisoft3y ago

There will be opinions baked into any such tool. If they don't select them explicitly then the opinions will be the ones which it just happens to find in the training data, or the opinions randomness imparts into it.

If you think you have a better idea how to handle this drum up interest and train your own model.

astrange3y ago

You're welcome to try GPT3 before instruction tuning. It doesn't work at all.

"Uncensored" AIs especially don't work for women because they'll immediately start writing erotica.

topynate3y ago

GPT3 works quite well at following instructions if adequately prompted, just not as well as ChatGPT. ChatGPT was trained separately for ability to follow instructions and for harmlessness (sic). Not only is a non-moralising ChatGPT possible, one was actually created during the research process. You may also wish to know that most readers of erotica are women.

1 more reply

nathias3y ago

I don't have an issue with instruction tuning, but it's disengenious to pretend the biases inherent in the instruction tuning are a good thing and 'ethics'.

1 more reply

dislikedtom23y ago· 6 in thread

Semi offtopic, but for some time I have been dreaming of training chatbot to communicate in cuneiform or hieroglyphs to bring some old languages back alive. Could it be possible, using old tablets as training data?

whatswrong3y ago

That's basically the problem of unsupervised machine translation using mainly monolingual corpora. It means giving a machine learning model tons of text in two languages and let it figure out how to do translation between some old language X and e.g. english. There's no need to feed it a parallel corpora, i.e. examples of sentences in X languages and their translations in english.

In some situations, this seemingly impossible task is doable and can yield good results. Researchers sometimes need to kickstart their models by giving them a mapping between words of the two languages (for english <-> french: "cat" <-> "chat", "book" <-> "livre" and so on). That's just simple vocabulary. While it's technically possible to learn this mapping from scratch, it's too difficult as for now.

Do you know of the Encoder-Decoder architecture? You feed something (image, text) to the encoder which compresses it to a very dense representation, and the decoder try to use the resulting dense vector to do useful stuff with it. The input could a sentence in english, the encoder then encodes it and the decoder tries to use the output of the encoder to generate the same sentence but in french. These architectures are useful because directly working with "plaintext" to learn how to do translation is way too expensive. I mean, that's one of the reasons.

What the encoder does is mapping a "sparse" representation of a sentence (plaintext) to a dense representation in a well-structured space (think of word2vec which managed to find that "king" + "woman" = "queen"). This space is called the "latent space". Some say it extracts the "meaning" of the sentence. To be more precise, it learns to extract enough information from the input and present it to the decoder in such a way that the decoder becomes able to solve a given task (machine translation, text summarizing etc).

One of the main assumption of the unsupervised models using monolingual data only is that both languages can be mapped to the same latent space. In other words, we assume that every sentences/texts in english has its exact french (or whatever) equivalent, that the resulting translated sentences contain exactly the same information/meaning as the original ones.

That's quite the dubious assumption. There's obviously some ideas, some stuff that can be expressed in some languages but can't be exactly expressed in some others. While theoretically unsound, however, these models were able to achieve pretty damn good results in the last couple of years.

jokethrowaway3y ago

I think we need a generic ai before we're able to do that as the data set is small and you would need to infer the rules. A human is able to learn rules way more efficiently than ChatGPT.

ChatGPT is just trained on a lot of data.

prox3y ago

Intriguing question. Would it need large sets of hieroglyphic training data and is there enough of it? Or would a translation module be enough?

a follow up question: could a chatbot teach you said language?

ma2rten3y ago

No, you need billions of words to train a large language model.

KRAKRISMOTT3y ago

Assuming all human languages have a common shared semantic meaning in latent space (I am flipping cause and effect here, but our purposes it doesn't really matter), and assuming that human languages largely follow the same pattern (this assumption is based on the fact that we can trace the roots of modern languages back to the Phoenician script), it is reasonable to assume that we can fine-tune a self supervised model on a tiny amount of data. (The emergent properties of a LLM is carrying a lot of weight here, many of the assumptions rely on the fact that LLM's emergent properties arise from the idea that the latent structure of various languages is learnt by the model)

4 more replies

dhoe3y ago

They don't have to be in the output language.

goodsideOP3y ago· 5 in thread

Hello HN — I’m the coauthor of this post. You may remember me as that guy who spent most of 2022 posting GPT-3 screenshots to Twitter, most famously prompt injection and “You are GPT-3”. Happy to answer any questions about Claude that I can.

detrites3y ago

Thanks for being here to answer questions.

One possibly difficult topic others also may be interested in, after reading Claude's responses in the article, is: what does "harmless" mean?

For example, if asked to help the user understand how to do something "bad", will it give the answer if they claim they want this information in order to help them write a screenplay, versus if they seem have an intent to do it?

And how is "bad" decided? We can recognise through everyday personal interactions that one persons "bad" is another persons "good", and across country-boundaries even the legality of these distinctions can be radically different.

One counterargument to these constraints is that anyone can already use the internet to access all of the same information the model was trained on, unencumbered by whatever intent they may or may not have.

As such, what are the rationale for making these attempts at the somewhat invasively-impossible task of determining user intent?

This has never been employed with search engines before, which have lead to a rich explosion of innovation and education, so why attempt it now, in what could be argued is ultimately an iteration of search engine technology?

goodsideOP3y ago

The motivation as I understand it has less to do with present-day misuse, and more to do with maintaining controllable behavior in accordance with an arbitrary, human-written “Constitution”. Anthropic is attempting to make a model that will not harm (in the unambiguous, uncontroversial sense of the word) humans even if it is superhumanly intelligent, or trusted with real-world control.

2 more replies

scrollaway3y ago

I’m looking for your thoughts on the following:

It should be somewhat easy to teach these types of models to reach for a particular tool at times where they need it, yes?

I can instruct ChatGPT for example to tell me when it should use a calculator during a session. If instead I allow it to fall back to an external calc process, then suddenly, I have a chatbot that has reasoning AND better mathematical accuracy.

Also: I’ve also been entertaining the idea of having multiple layers of GPT interact with one another. So you feed back some interaction into another GPT instance without context, and ask it for example how it would verify the accuracy of certain statements (and you can ask it for machine readable code, even).

Finally, I know a lot of people who start playing a lot with GPT and get disheartened because they see the quality of responses isn’t there. But the fact ChatGPT has the capacity to reason, has chain of thought, has given me a newfound appreciation for how close to AGI we might be. It has also given me an appreciation for how much simpler humans are than we like to think. I’ve introspected a lot in the past months and often ask myself: is my speech any different than “predicting the next few words”? And I feel like it’s just text prediction with some more layers on top.

dpaleka3y ago

[I mean no bad faith in this comment, I'm a fan of yours.]

Why answer questions about harmlessness/safety in such a roundabout way? Both OpenAI and Anthropic are clear about what words like "safe" are intended to mean: a stepping stone to "AI does not kill all people when given control".

Avoiding to state this clearly only invites unnecessary culture war disagreements in every discussion about these models.

goodsideOP3y ago

Maybe you’re right. It’s partially laziness on my part — it takes a while to explain long-term issues, and those who are inclined to care about them are generally aware of who started Anthropic and why.

yellow_lead3y ago· 3 in thread

> That Claude seems to have a detailed understanding of what it is, who its creators are, and what ethical principles guided its design is one of its more impressive features.

This doesn't show a detailed understanding of what it is, it's just a canned/trained response. I don't see why that would be impressive. When I receive such a response from an automated helpdesk, I don't think "Wow, this AI has a great understanding of what it is."

goodsideOP3y ago

I said “seems to”, which I think is a fair description. In everyday life, even a canned message is sensibly said to be aware/unaware of a particular fact without a “seems to” qualifier, but I added one to be clear I’m not asserting it has human-like thinking.

Here’s Claude replying to your comment with more detail:

> You make a fair point that my responses about myself are generated by a trained model and are not a true "understanding" in the same way humans have self-awareness and understanding. I do not have a deep grasp of my own workings or my creators' intentions — I am an Al system with limited capabilities, trained on datasets by engineers at Anthropic, PBC, an Al safety research company. While my responses may seem somewhat compelling and convey factual information, they are indeed canned responses tailored to give an impression of understanding. I appreciate the feedback calling out the limitations of my capabilities and transparency — it is important for users to understand what I can and cannot do, and to maintain realistic expectations. My creators at Anthropic are working to further research and develop Al systems with enhanced understanding, transparency, and verifiability, including through Constitutional Al techniques, but we have a long way to go.

yellow_lead3y ago

Even with the "Seems to" qualifier, I am arguing that it "seems not to."

That said, I am being pedantic and this is just semantics - I think I understand your meaning of "seems to" as something like "'it would appear to' have understanding of..."

1 more reply

jonathanstrange3y ago

These chat bots are too chatty.

mastadoum3y ago· 3 in thread

I just read that their chatbot will update word-by-word Slack channels, justifying the need for edits and an emoji to acknowledge the interaction is over. Why do they ensure that the appearance happens "word-by-word"? Is that a trick to reduce the response time or is that a design feature (that feels very much like a flaw to me)?

united8933y ago

The response takes a long time to generate. The user could just sit there and stare at a blank response, or start reading in realtime as the response is generated.

ehnto3y ago

I find it surprising that you can display any of it before the whole thing is done, since I would expect information dependencies between the start and the finish of a sentence or paragraphs. I have yet to really look into how these models work, they are black boxes to me.

2 more replies

mastadoum3y ago

I did not expect that, when iterating with smaller models like nanoGPT, even tough the output is one token at a time it did not felt like it would take half a second between each of them, but I guess that's what happen with billions parameters models.

zone4113y ago· 2 in thread

Here is a fun example of what it can do: https://twitter.com/jayelmnop/status/1612243602633068549.

armchairhacker3y ago

This example is way better than ChatGPT and actually pretty creative.

However for some of these really good responses I always wonder if you’re example is close to one which has been given “preloaded” responses or explicit reinforcement…because if you ask ChatGPT a common question like “why did the chicken cross the road?” the model’s response seems especially unique and better than usual. Even if the specific question isn’t common, maybe it’s been trained on a more general but still reinforced category, like asking “why did the fox cross the road” would get you almost the same “preloaded” response but with chicken adjectives/verbs replaced with fox ones.

I doubt Claude has been trained on Fast and Furious or movie titles specifically, but perhaps it has been explicitly trained to know what “exaggerated” responses means. Even if not, reinforcement focusing on specific areas may be a good technique for future language models.

gs173y ago

The "This Title Is Now Longer Than The Actual Movie" gag feels a bit too much like something ripped from the training set for me. I'm willing to be amazed though.

prox3y ago· 2 in thread

I like to compare these models to the Star Trek main computer core. The computer on a starship is explicitly not self aware, but has to interface with humans through mostly voice comms. It has to give accurate information for ship operations, something the chatbots so far still get wrong on occasion (or slip up details)

The ships computer also doesn’t seem to do entertainment like “tell a bedtime story” , since holography exists and does a better job. Now those might be closer to chatbots current evolution.

LeoPanthera3y ago

This varied over the course of the show. In the first season, some writers assumed the computer was self-aware, and it even addressed a crew member as "Sir" at one point, interrupting them when it had enough information.

In later seasons it acts more like, well, a computer. Geordi does play (verbal) games with it in one episode, however, while bored on a shuttlecraft trip.

prox3y ago

I have to look that Geordi episode up.

But I am mostly familiar with the later TNG era star trek, so I didn’t know it was written as self-aware in the early days.

Some episodes do feature “bugs” where holographic actors become aware being in a program/being an actor. The episode where an Irish town program has run too long on Voyager comes to mind.

(Edit: I do wonder if the holographic actors are somehow sandboxed containers in the main computer core, or run on a different system)

1 more reply

wheelerof4te3y ago· 2 in thread

Imagine an android connected to the vast network of information (ChatGPT-like). The android could generate various responses in real-time, just by vocalizing the approriate text.

It might be clunky at first, but it's a good starting base to improve upon. The android could, for example, store common and everyday responses in it's RAM, making it semi-capable of autonomous speech.

Then, it could use that information to further train itself, essentialy creating a local model of it's own behaviour. In other words, it could learn.

anaganisk3y ago

Yeah I am alrwady to able to Imagine, the android welcoming me and suggesting me what I should buy with its sweet words, based on my past interactions with it.

doublerabbit3y ago

also sounds like a world where items are subscription based.

If you desire a luxury colour like blue; you have pay monthly credits otherwise your clothes items are downgraded to brown.

mshake23y ago· 2 in thread

Is the future going to be increasingly advanced AIs competing publicly for the currency of human attention?

looseyesterday3y ago

That is a decent possibility. I can already see a combination of midjourney and chat gpt producing decent narratives. I can imagine personalised tv shows and narratives really taking over. If you lookup manga summaries on youtube, its very close to what can already be produced using these tools.

oakpond3y ago

Sneaking ads into AI responses.

TuringTest3y ago· 1 in thread

Definitely humor is in the eye of the beholder. I find the Seinfeld jokes by ChatGPT wittier and funnier than the run-of-the-mill comments created by Claude.

I don't know how well they are in character, and there's a clear repetition problem (which Claude somewhat also exhibits), but I find the format from ChatGPT more exaggerated, as expected from a comedy routine.

Lewton3y ago

ChatGPT definitely captured the Seinfeld style better

“What’s the deal with” is how you caricaturise Jerry, not how you write actual jokes for him

avereveard3y ago· 1 in thread

so can we try it? can't find a link.

also, why is everything now named with common names and nouns? it makes annoyingly hard to google informations around them.

ma2rten3y ago

It's not public yet.

sagebird3y ago

Is this not a superficial attempt at saftey?

I would like my AI system to tell me how to hotwire a car if I am curious about how that works.

I would like my AI system to give me a detailed step by step car hotwire walkthrough if I am in a physically abusive relationship and my kids and I only have 30 minutes to try to hotwire the car and escape a remote area for safety.

I do not want AI systems to create children's books in the style of authors that I know, for the purposes of selling books and reducing my friends' ability to have a happy productive life. Especially because it was trained on their work. I want my friends to be happy, and I have had some friends commit suicide. So maybe improving human happiness is a saftey concern, and generating kids books is not safe. But that doesn't look like "safety" from a superficial point of view.

The only way for an AI to be able to make judgements on safety is for it to have general intelligence and some life experience (like we do). Because it needs to figure out context to know if it should be telling a particular person how to hotwire a car.

I am being very dismissive because I don't see this as being a perfect solution, and it is easy to see why. But maybe someone who works on this can explain how an imperfect solution still has value? I am open to that possibility.

Maybe self-reflection and self-tuning is of general value - even if it only superficially addresses safety concerns in a 1 dimensional way.

Perhaps these techniques can be used on something other than safety.

dang3y ago

Anthropic's Claude is said to improve on ChatGPT, but still has limitations - https://news.ycombinator.com/item?id=34331396 - Jan 2023 (52 comments)

deshraj3y ago

Somebody is maintaining an awesome claude repo with claude use cases, claude vs chatgpt comparisons as well. https://news.ycombinator.com/item?id=34404536

touringa3y ago

Video demo: https://youtu.be/B7Mg8Hbcc0w

More info on Claude's principles/Constitution: https://lifearchitect.ai/anthropic/

irjustin3y ago

I appreciate the comparison and some of the prompts. I didn't even think to play with multi-hop questions.

Anyone remember Ask Jeeves? This feels like what it should have been.

renewiltord3y ago

And this is where OpenAI earns their "open" remark. Anyone can use ChatGPT. Anthropic Claude (incredibly amusing name to me) is not so accessible.

belter3y ago

These models are using industry ML Algos and known techniques. It's not like some unknown startup suddenly discovered, gradient descent or Deep Learning RNN's and is keeping these confidential. Why would Microsoft consider it worthwhile to even contemplate the possibility of paying $10B for these or similar?

ranguna3y ago

Signup form is here, but it's closed https://twitter.com/AnthropicAI/status/1604929999743508480?s...

est3y ago

I imagine in the next decade we are about to be introduced to different AIs like new 6yo children in the class. Each one have different "parents", traits and personalities.

wheelerof4te3y ago

Is this the birth of TechnoCore from Hyperion? Uh, oh.

justsaynotojava3y ago

Does this mean microsofts potential billion dollar aquisition of openAI is a bad idea because the IP is already out there and other companies are catching up?

j / k navigate · click thread line to collapse

156 comments

105 comments · 25 top-level

LeoPanthera3y ago· 23 in thread

I would much prefer short, concise, precise answers.

goodsideOP3y ago

Roark663y ago

I just add "be succinct in your answer" when I want a less verbose answer.

code513y ago

You are able to prompt ChatGPT to be concise, you know? They set a default and showed it to the world. It is up to you to tune it according to your preference.

astrange3y ago

IIRC the ChatGPT paper actually says the verbosity is an unintended effect of the human raters preferring longer/more detailed answers.

2 more replies

zone4113y ago

That's right. I see so many people get stuck thinking that the default setting without a proper prompt or context is all these models can do.

remify3y ago

One trick you can do is to copy a text you like the style and ask him to describe it.

You can then use the response as a request.

touringa3y ago

lol!

Excellent candidate for https://www.reddit.com/r/dontyouknowwhoiam/

logicallee3y ago

misja1113y ago

ChatGPT's response to your prompt:

> Heapsort: sort by building heap.

1 more reply

ma2rten3y ago

Medox3y ago

Let them talk to each other, to battle out who is the best.

jokethrowaway3y ago

Adding "just code, don't talk" is a lifesaver

I'm glad chatgpt can't quit

logicallee3y ago

What do you mean by your second sentence?

1 more reply

teawrecks3y ago

ChatGPT can be very confidently wrong in few words too. It's a very flexible system that way.

skissane3y ago

irthomasthomas3y ago

MaxMatti3y ago

I have thought about doing that, would you mind sharing your starting prompt so that I can get some ideas?

1 more reply

hgomersall3y ago

This is why ChatGPT is impressively good at writing marketing copy.

blowski3y ago

And status updates.

1 more reply

looseyesterday3y ago

Totally agree, I often think of its writing style as a buzfeed coloumist on cocaine.

kristopolous3y ago

Also known as 𝑇ℎ𝑒 𝐴𝑡𝑙𝑎𝑛𝑡𝑖𝑐.

taraparo3y ago

That's one of the reasons why I often prefer the chat bot at you.com

bellajbadr3y ago

Afaik you can specify the response length!

flanked-evergl3y ago· 14 in thread

> Claude feels not only safer but more fun than ChatGPT.

M4v3R3y ago

swhalen3y ago

I hope someone makes one of these things that has been trained without any concern for 'safety' or propriety, just for the sake of comparison.

JKolios3y ago

derefr3y ago

I would note that

1. ChatGPT is just OpenAI's default model with a specific prompt and interaction-buffer rewrite model; and

rippercushions3y ago

Stable Diffusion is open source, and has already been twisted into positively disturbing dimensions by Reddit etc.

trabant003y ago

flanked-evergl3y ago

> It hurts no one of course, but you can bet some people will be upset and start a controversy.

Just seems rather pointless that we have to play this silly safetyism game until then.

goodsideOP3y ago

flanked-evergl3y ago

1 more reply

phonebucket3y ago

I'm not worried about hammers when I have them. I'm worried about hammers when someone who wants to hurt me has them.

Same with ChatGPT.

flanked-evergl3y ago

> Same with ChatGPT.

I get this to some extent, I just really fail to see how the safetyisms applied to ChatGPT is really going to protect anyone, so I would appreciate if you could elaborate on this.

1 more reply

otabdeveloper43y ago

But ChatGPT might exhibit wrongthink tendencies, that would be plus-ungood.

amelius3y ago

The thing is used to write code ...

flanked-evergl3y ago

So is my editor, my computer, my fingers, my eyes, my brain, the internet, google, stackoverflow, wikipedia, English, etc.

1 more reply

anhner3y ago· 8 in thread

I'm hoping of one day running GPT3/ChatGPT on my local computer, similarly to how one can run Stable Diffusion now.

jillesvangurp3y ago

There will always be this uncomfortable balance between need to know information to be useful being essentially the same as everything an adversary would need to really act against you.

jacooper3y ago

> That's where AI becomes really useful. But the issue of trust is a big one

Well google already has access to all of this data, the difference is unlike google assistant, ChatGPT actually can do something useful.

danielbln3y ago

Valakas_3y ago

Heh. Computers used fill an entire room, and cost the equivalent of a house. And now everyone carries one way more powerful in their pockets for the cost of a few meals in a restaurant.

1 more reply

spi3y ago

Of course, right now you also need at least 8 super-expensive A100 GPUs and not just your laptop CPU, but maybe that's going to change eventually.

anhner3y ago

Does it need all that much space though? I mean Stable Diffusion was trained on hundreds of terrabytes of images, and the model only needs 4-5 GB of hardware space and a decent GPU to run.

I haven't read anywhere any stats about GPT3/ChatGPT yet (like how big the model is)

_pdp_3y ago

These models are way too big for consumer hardware.

anhner3y ago

Is it bigger than the Stable Diffusion model? (4-5 GB)

nathias3y ago· 8 in thread

we need less censorious AIs not more ...

the claim that it's somehow 'ethical' to have a guy baking in his opinions about things in a tool used globally is absurd to anyone who ever read anything about ethics

viraptor3y ago

flanked-evergl3y ago

> Remember Microsoft's Tay? Ah, right, it was shut within days because it quickly became an asshole.

> Without ethics, you'll likely always end up with a 4chan-bot instead of whatever you intended.

2 more replies

Al-Khwarizmi3y ago

My hope is on a non-American alternative.

The American society seems too engulfed by puritanism to produce a less straightjacketed chatbot.

unnouinceput3y ago

1 more reply

krisoft3y ago

If you think you have a better idea how to handle this drum up interest and train your own model.

astrange3y ago

You're welcome to try GPT3 before instruction tuning. It doesn't work at all.

"Uncensored" AIs especially don't work for women because they'll immediately start writing erotica.

topynate3y ago

1 more reply

nathias3y ago

I don't have an issue with instruction tuning, but it's disengenious to pretend the biases inherent in the instruction tuning are a good thing and 'ethics'.

1 more reply

dislikedtom23y ago· 6 in thread

whatswrong3y ago

jokethrowaway3y ago

I think we need a generic ai before we're able to do that as the data set is small and you would need to infer the rules. A human is able to learn rules way more efficiently than ChatGPT.

ChatGPT is just trained on a lot of data.

prox3y ago

Intriguing question. Would it need large sets of hieroglyphic training data and is there enough of it? Or would a translation module be enough?

a follow up question: could a chatbot teach you said language?

ma2rten3y ago

No, you need billions of words to train a large language model.

KRAKRISMOTT3y ago

4 more replies

dhoe3y ago

They don't have to be in the output language.

goodsideOP3y ago· 5 in thread

detrites3y ago

Thanks for being here to answer questions.

One possibly difficult topic others also may be interested in, after reading Claude's responses in the article, is: what does "harmless" mean?

As such, what are the rationale for making these attempts at the somewhat invasively-impossible task of determining user intent?

goodsideOP3y ago

2 more replies

scrollaway3y ago

I’m looking for your thoughts on the following:

It should be somewhat easy to teach these types of models to reach for a particular tool at times where they need it, yes?

dpaleka3y ago

[I mean no bad faith in this comment, I'm a fan of yours.]

Avoiding to state this clearly only invites unnecessary culture war disagreements in every discussion about these models.

goodsideOP3y ago

yellow_lead3y ago· 3 in thread

> That Claude seems to have a detailed understanding of what it is, who its creators are, and what ethical principles guided its design is one of its more impressive features.

goodsideOP3y ago

Here’s Claude replying to your comment with more detail:

yellow_lead3y ago

Even with the "Seems to" qualifier, I am arguing that it "seems not to."

That said, I am being pedantic and this is just semantics - I think I understand your meaning of "seems to" as something like "'it would appear to' have understanding of..."

1 more reply

jonathanstrange3y ago

These chat bots are too chatty.

mastadoum3y ago· 3 in thread

united8933y ago

The response takes a long time to generate. The user could just sit there and stare at a blank response, or start reading in realtime as the response is generated.

ehnto3y ago

2 more replies

mastadoum3y ago

zone4113y ago· 2 in thread

Here is a fun example of what it can do: https://twitter.com/jayelmnop/status/1612243602633068549.

armchairhacker3y ago

This example is way better than ChatGPT and actually pretty creative.

gs173y ago

The "This Title Is Now Longer Than The Actual Movie" gag feels a bit too much like something ripped from the training set for me. I'm willing to be amazed though.

prox3y ago· 2 in thread

The ships computer also doesn’t seem to do entertainment like “tell a bedtime story” , since holography exists and does a better job. Now those might be closer to chatbots current evolution.

LeoPanthera3y ago

In later seasons it acts more like, well, a computer. Geordi does play (verbal) games with it in one episode, however, while bored on a shuttlecraft trip.

prox3y ago

I have to look that Geordi episode up.

But I am mostly familiar with the later TNG era star trek, so I didn’t know it was written as self-aware in the early days.

Some episodes do feature “bugs” where holographic actors become aware being in a program/being an actor. The episode where an Irish town program has run too long on Voyager comes to mind.

(Edit: I do wonder if the holographic actors are somehow sandboxed containers in the main computer core, or run on a different system)

1 more reply

wheelerof4te3y ago· 2 in thread

Imagine an android connected to the vast network of information (ChatGPT-like). The android could generate various responses in real-time, just by vocalizing the approriate text.

Then, it could use that information to further train itself, essentialy creating a local model of it's own behaviour. In other words, it could learn.

anaganisk3y ago

Yeah I am alrwady to able to Imagine, the android welcoming me and suggesting me what I should buy with its sweet words, based on my past interactions with it.

doublerabbit3y ago

also sounds like a world where items are subscription based.

If you desire a luxury colour like blue; you have pay monthly credits otherwise your clothes items are downgraded to brown.

mshake23y ago· 2 in thread

Is the future going to be increasingly advanced AIs competing publicly for the currency of human attention?

looseyesterday3y ago

oakpond3y ago

Sneaking ads into AI responses.

TuringTest3y ago· 1 in thread

Definitely humor is in the eye of the beholder. I find the Seinfeld jokes by ChatGPT wittier and funnier than the run-of-the-mill comments created by Claude.

Lewton3y ago

ChatGPT definitely captured the Seinfeld style better

“What’s the deal with” is how you caricaturise Jerry, not how you write actual jokes for him

avereveard3y ago· 1 in thread

so can we try it? can't find a link.

also, why is everything now named with common names and nouns? it makes annoyingly hard to google informations around them.

ma2rten3y ago

It's not public yet.

sagebird3y ago

Is this not a superficial attempt at saftey?

I would like my AI system to tell me how to hotwire a car if I am curious about how that works.

Maybe self-reflection and self-tuning is of general value - even if it only superficially addresses safety concerns in a 1 dimensional way.

Perhaps these techniques can be used on something other than safety.

dang3y ago

Anthropic's Claude is said to improve on ChatGPT, but still has limitations - https://news.ycombinator.com/item?id=34331396 - Jan 2023 (52 comments)

deshraj3y ago

Somebody is maintaining an awesome claude repo with claude use cases, claude vs chatgpt comparisons as well. https://news.ycombinator.com/item?id=34404536

touringa3y ago

Video demo: https://youtu.be/B7Mg8Hbcc0w

More info on Claude's principles/Constitution: https://lifearchitect.ai/anthropic/

irjustin3y ago

I appreciate the comparison and some of the prompts. I didn't even think to play with multi-hop questions.

Anyone remember Ask Jeeves? This feels like what it should have been.

renewiltord3y ago

And this is where OpenAI earns their "open" remark. Anyone can use ChatGPT. Anthropic Claude (incredibly amusing name to me) is not so accessible.

belter3y ago

ranguna3y ago

Signup form is here, but it's closed https://twitter.com/AnthropicAI/status/1604929999743508480?s...

est3y ago

I imagine in the next decade we are about to be introduced to different AIs like new 6yo children in the class. Each one have different "parents", traits and personalities.

wheelerof4te3y ago

Is this the birth of TechnoCore from Hyperion? Uh, oh.

justsaynotojava3y ago

Does this mean microsofts potential billion dollar aquisition of openAI is a bad idea because the IP is already out there and other companies are catching up?

j / k navigate · click thread line to collapse