I would much prefer short, concise, precise answers.
Long answers from GPT are unusually obnoxious because of a way the decoder works; it emits words with a much more constant rate of perplexity than human text does (this is how GPT-vs-human detectors work) which makes it sound stuffy and monotone.
You can then use the response as a request.
Excellent candidate for https://www.reddit.com/r/dontyouknowwhoiam/
I'm glad chatgpt can't quit
I may be the minority here, but I really don't concern myself with ChatGPT safety and I am not entirely sure what the reason is why people are very worried about its safetly. It is safer than most things I have in my house, including a kettle, a saw, a hammer, a screwdriver, my actual PC, every kitchen appliance I have.
Of course it can be misused, like any tool, but no amount of safety features in ChatGPT will make users of it more or less careful in their use of it. If someone using ChatGPT cares nothing for using it safely then it will likely end poorly, just like it will end poorly if I use a hammer without any care for using it safely.
1. ChatGPT is just OpenAI's default model with a specific prompt and interaction-buffer rewrite model; and
2. you can sign up, for free, for access to OpenAI's API (https://openai.com/api/); which, among other things, gives you full (free-daily-API-credit-quota limited) access to an API "playground" frontend offering interaction with the exact same model ChatGPT is powered by — plus other models as well — all without any fixed prompt or forced interaction UX. (In web-dev terms, if ChatGPT is like a REST API, this playground is like a SQL fiddle for the DB that the REST API is backed by.)
(Why does this exist? Because the point of OpenAI's API playground is to test prompts and interaction models for the AI apps you're building yourself on top of their API; and you couldn't very well build apps with your own prompts and interaction models, if OpenAI was already imposing a prompt and interaction model upon you.)
And in general this influence of the internet has reversed "actions speak louder than words". Textile companies support minorities with social media posts and printed tshirts while using factories with abused underage workers in underdeveloped countries. IT companies support the left side of politics while avoiding paying taxes. And so on. There's a huge and interesting discussion about this but is of in this case quite off-topic.
The situation sucks, but sure, you are right. Still, I want a good tool, and I don't mind if someone holds me accountable if I misuse it, that is fine, but I want the tool, and I will get it, maybe not this year, maybe not next year, but in the next 5 years I'm sure I will have something as capable as ChatGPT is now without artificial restrictions available to me.
Just seems rather pointless that we have to play this silly safetyism game until then.
I'm not worried about hammers when I have them. I'm worried about hammers when someone who wants to hurt me has them.
Same with ChatGPT.
I get this to some extent, I just really fail to see how the safetyisms applied to ChatGPT is really going to protect anyone, so I would appreciate if you could elaborate on this.
One way which has somewhat been proven is, that through lowering the barrier to entry for writing software it has lowered the barrier to entry for writing malicious software, because malicious software is a kind of software. But the list of things that have lowered the barrier to entry for writing software is staggering, and to me ChatGPT is really just an increment on this. It may be a big and significant increment, but not as big as everything that preceded it in my view, so if we are to assign blame fairly, then most of it does not go to ChatGPT.
I would love to have a personal conversation with these AI systems, use them as a sort of assistant, without the worry of being spied on. At the moment I can't use it as more than a glorified search engine, because of the privacy implications of running it on the cloud.
With something like chat gpt being plugged into some voice assistant thing and having access to all your documents, emails, and other content, you could imagine having conversations about work content, content creation, calendar management, etc. Basically it would become like a secretary that is able to write letters based on your input, manage your calendar, etc. It could be pro-active and remind you about things, summarize incoming messages, search through your documents, message history, etc.
That's where AI becomes really useful. But the issue of trust is a big one. I don't think a lot of this requires a lot of breakthroughs either just a lot of integration work and engineering. Chat gpt is more a proof of concept than a well integrated thing at this point. It basically is running in isolation and it's only window to the world is chat. Changing that should not be that hard. Running things locally might help with this but it may not be a hard requirement for this. All depends on how useful this is.
Well google already has access to all of this data, the difference is unlike google assistant, ChatGPT actually can do something useful.
Of course, right now you also need at least 8 super-expensive A100 GPUs and not just your laptop CPU, but maybe that's going to change eventually.
I haven't read anywhere any stats about GPT3/ChatGPT yet (like how big the model is)
the claim that it's somehow 'ethical' to have a guy baking in his opinions about things in a tool used globally is absurd to anyone who ever read anything about ethics
It's not even going to be objectively wrong a lot of the time. For example slaves are indeed a pretty efficient way to run a business. It's our limits and ethics that stops (most of) us from doing it. Without ethics, you'll likely always end up with a 4chan-bot instead of whatever you intended.
If we consider the context, which is not something that posts on twitter under Microsoft's brand name, but something you communicate with in private. Who exactly are you worried about here, the person who will prospectively coerce the language model into being an asshole? If they don't want to do that, they could just not do that.
If I make ChatGPT say something egregious, and post that on Twitter or Facebook, I'm posting it, and I'm liable, just as I would be liable if I used a word processor with spell checking to make text and post it on Twitter or Facebook.
> Without ethics, you'll likely always end up with a 4chan-bot instead of whatever you intended.
If I ask it to explain quantum physics to me in the style of Donald Trump because it is funny, and it does it (as ChatGPT used to do), who exactly is being harmed and under what system of ethics, because as you may know, ethics is not objective or universal.
The American society seems too engulfed by puritanism to produce a less straightjacketed chatbot.
If you think you have a better idea how to handle this drum up interest and train your own model.
"Uncensored" AIs especially don't work for women because they'll immediately start writing erotica.
In some situations, this seemingly impossible task is doable and can yield good results. Researchers sometimes need to kickstart their models by giving them a mapping between words of the two languages (for english <-> french: "cat" <-> "chat", "book" <-> "livre" and so on). That's just simple vocabulary. While it's technically possible to learn this mapping from scratch, it's too difficult as for now.
Do you know of the Encoder-Decoder architecture? You feed something (image, text) to the encoder which compresses it to a very dense representation, and the decoder try to use the resulting dense vector to do useful stuff with it. The input could a sentence in english, the encoder then encodes it and the decoder tries to use the output of the encoder to generate the same sentence but in french. These architectures are useful because directly working with "plaintext" to learn how to do translation is way too expensive. I mean, that's one of the reasons.
What the encoder does is mapping a "sparse" representation of a sentence (plaintext) to a dense representation in a well-structured space (think of word2vec which managed to find that "king" + "woman" = "queen"). This space is called the "latent space". Some say it extracts the "meaning" of the sentence. To be more precise, it learns to extract enough information from the input and present it to the decoder in such a way that the decoder becomes able to solve a given task (machine translation, text summarizing etc).
One of the main assumption of the unsupervised models using monolingual data only is that both languages can be mapped to the same latent space. In other words, we assume that every sentences/texts in english has its exact french (or whatever) equivalent, that the resulting translated sentences contain exactly the same information/meaning as the original ones.
That's quite the dubious assumption. There's obviously some ideas, some stuff that can be expressed in some languages but can't be exactly expressed in some others. While theoretically unsound, however, these models were able to achieve pretty damn good results in the last couple of years.
ChatGPT is just trained on a lot of data.
a follow up question: could a chatbot teach you said language?
One possibly difficult topic others also may be interested in, after reading Claude's responses in the article, is: what does "harmless" mean?
For example, if asked to help the user understand how to do something "bad", will it give the answer if they claim they want this information in order to help them write a screenplay, versus if they seem have an intent to do it?
And how is "bad" decided? We can recognise through everyday personal interactions that one persons "bad" is another persons "good", and across country-boundaries even the legality of these distinctions can be radically different.
One counterargument to these constraints is that anyone can already use the internet to access all of the same information the model was trained on, unencumbered by whatever intent they may or may not have.
As such, what are the rationale for making these attempts at the somewhat invasively-impossible task of determining user intent?
This has never been employed with search engines before, which have lead to a rich explosion of innovation and education, so why attempt it now, in what could be argued is ultimately an iteration of search engine technology?
It should be somewhat easy to teach these types of models to reach for a particular tool at times where they need it, yes?
I can instruct ChatGPT for example to tell me when it should use a calculator during a session. If instead I allow it to fall back to an external calc process, then suddenly, I have a chatbot that has reasoning AND better mathematical accuracy.
Also: I’ve also been entertaining the idea of having multiple layers of GPT interact with one another. So you feed back some interaction into another GPT instance without context, and ask it for example how it would verify the accuracy of certain statements (and you can ask it for machine readable code, even).
Finally, I know a lot of people who start playing a lot with GPT and get disheartened because they see the quality of responses isn’t there. But the fact ChatGPT has the capacity to reason, has chain of thought, has given me a newfound appreciation for how close to AGI we might be. It has also given me an appreciation for how much simpler humans are than we like to think. I’ve introspected a lot in the past months and often ask myself: is my speech any different than “predicting the next few words”? And I feel like it’s just text prediction with some more layers on top.
Why answer questions about harmlessness/safety in such a roundabout way? Both OpenAI and Anthropic are clear about what words like "safe" are intended to mean: a stepping stone to "AI does not kill all people when given control".
Avoiding to state this clearly only invites unnecessary culture war disagreements in every discussion about these models.
This doesn't show a detailed understanding of what it is, it's just a canned/trained response. I don't see why that would be impressive. When I receive such a response from an automated helpdesk, I don't think "Wow, this AI has a great understanding of what it is."
Here’s Claude replying to your comment with more detail:
> You make a fair point that my responses about myself are generated by a trained model and are not a true "understanding" in the same way humans have self-awareness and understanding. I do not have a deep grasp of my own workings or my creators' intentions — I am an Al system with limited capabilities, trained on datasets by engineers at Anthropic, PBC, an Al safety research company. While my responses may seem somewhat compelling and convey factual information, they are indeed canned responses tailored to give an impression of understanding. I appreciate the feedback calling out the limitations of my capabilities and transparency — it is important for users to understand what I can and cannot do, and to maintain realistic expectations. My creators at Anthropic are working to further research and develop Al systems with enhanced understanding, transparency, and verifiability, including through Constitutional Al techniques, but we have a long way to go.
That said, I am being pedantic and this is just semantics - I think I understand your meaning of "seems to" as something like "'it would appear to' have understanding of..."
However for some of these really good responses I always wonder if you’re example is close to one which has been given “preloaded” responses or explicit reinforcement…because if you ask ChatGPT a common question like “why did the chicken cross the road?” the model’s response seems especially unique and better than usual. Even if the specific question isn’t common, maybe it’s been trained on a more general but still reinforced category, like asking “why did the fox cross the road” would get you almost the same “preloaded” response but with chicken adjectives/verbs replaced with fox ones.
I doubt Claude has been trained on Fast and Furious or movie titles specifically, but perhaps it has been explicitly trained to know what “exaggerated” responses means. Even if not, reinforcement focusing on specific areas may be a good technique for future language models.
The ships computer also doesn’t seem to do entertainment like “tell a bedtime story” , since holography exists and does a better job. Now those might be closer to chatbots current evolution.
In later seasons it acts more like, well, a computer. Geordi does play (verbal) games with it in one episode, however, while bored on a shuttlecraft trip.
But I am mostly familiar with the later TNG era star trek, so I didn’t know it was written as self-aware in the early days.
Some episodes do feature “bugs” where holographic actors become aware being in a program/being an actor. The episode where an Irish town program has run too long on Voyager comes to mind.
(Edit: I do wonder if the holographic actors are somehow sandboxed containers in the main computer core, or run on a different system)
It might be clunky at first, but it's a good starting base to improve upon. The android could, for example, store common and everyday responses in it's RAM, making it semi-capable of autonomous speech.
Then, it could use that information to further train itself, essentialy creating a local model of it's own behaviour. In other words, it could learn.
If you desire a luxury colour like blue; you have pay monthly credits otherwise your clothes items are downgraded to brown.
I don't know how well they are in character, and there's a clear repetition problem (which Claude somewhat also exhibits), but I find the format from ChatGPT more exaggerated, as expected from a comedy routine.
“What’s the deal with” is how you caricaturise Jerry, not how you write actual jokes for him
also, why is everything now named with common names and nouns? it makes annoyingly hard to google informations around them.
I would like my AI system to tell me how to hotwire a car if I am curious about how that works.
I would like my AI system to give me a detailed step by step car hotwire walkthrough if I am in a physically abusive relationship and my kids and I only have 30 minutes to try to hotwire the car and escape a remote area for safety.
I do not want AI systems to create children's books in the style of authors that I know, for the purposes of selling books and reducing my friends' ability to have a happy productive life. Especially because it was trained on their work. I want my friends to be happy, and I have had some friends commit suicide. So maybe improving human happiness is a saftey concern, and generating kids books is not safe. But that doesn't look like "safety" from a superficial point of view.
The only way for an AI to be able to make judgements on safety is for it to have general intelligence and some life experience (like we do). Because it needs to figure out context to know if it should be telling a particular person how to hotwire a car.
I am being very dismissive because I don't see this as being a perfect solution, and it is easy to see why. But maybe someone who works on this can explain how an imperfect solution still has value? I am open to that possibility.
Maybe self-reflection and self-tuning is of general value - even if it only superficially addresses safety concerns in a 1 dimensional way.
Perhaps these techniques can be used on something other than safety.
Anthropic's Claude is said to improve on ChatGPT, but still has limitations - https://news.ycombinator.com/item?id=34331396 - Jan 2023 (52 comments)
More info on Claude's principles/Constitution: https://lifearchitect.ai/anthropic/
Anyone remember Ask Jeeves? This feels like what it should have been.