How can I do that, and where can I download it from?
Though, I too, would rather be able to run the model myself.
I realize it's a charitable interpretation of their behavior, and I used to be very angry about them not being truly open. However, after playing with ChatGPT I think I am beginning to understand and even support their behavior. [1]
My personal sea change was realizing that giving dual-use tools to the global @everyone and hoping for the best might not be the greatest plan. I came to this realization thinking about bio-tech and GNC software, but it may apply to ML products as well. [2]
While I used to think universally/religiously "it should be open, will be one day anyway, etc," I now think about these things on a case by case basis.
[0] https://openai.com/alignment/
[1] https://news.ycombinator.com/item?id=33928400
[2] Imagine how many nodes this bot farm would have if it wasn't limited by the existing C&C bottleneck. ChatGPT is a productivity multiplier. This is productivity I want to put off multiplying as long as possible: https://news.ycombinator.com/item?id=34165350
Also, regarding the text limits: AFAIK there's just an inherent limit in the architecture. Transformers are trained on finite-length sequences (I think their latest uses 4096 tokens). I've been trying to understand how ChatGPT seems able to manage context/understanding beyond this window length.
(Specifically, AI Dungeon type games where ChatGPT is the DM and the human the protagonist, or vice versa. The most common failure mode seems to be that it forgets whether it's playing the DM or the protagonist. To be fair, it performs admirably well despite the limitations.)
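One plausible mechanism (this is a guess, not how OpenAI says it works) is that the frontend simply keeps feeding the model the most recent turns that fit in the window, so older turns silently fall off. A minimal sketch, with whitespace word counts standing in for a real tokenizer and all names being illustrative:

```python
# Sketch of sliding-window "memory": keep only the newest turns that fit the
# model's fixed context budget. Word count crudely approximates token count;
# a real system would use the model's own tokenizer.

def build_prompt(turns, max_tokens=4096):
    """Return the most recent turns whose combined length fits max_tokens."""
    kept = []
    total = 0
    for turn in reversed(turns):       # walk from newest to oldest
        cost = len(turn.split())       # crude stand-in for tokenization
        if total + cost > max_tokens:
            break                      # this turn (and everything older) drops
        kept.append(turn)
        total += cost
    return "\n".join(reversed(kept))   # restore chronological order
```

If something like this is happening, it would explain the failure mode above: facts established early in the session, like which role the model is playing, eventually slide out of the window and are forgotten.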
> It's around 500 GB and requires around 300+ GB of VRAM from my understanding, and runs on one of the largest supercomputers in the world. Stable Diffusion has around 6 billion parameters; GPT-3/ChatGPT has 175 billion.
https://gist.github.com/navjack/32197772df1c0a8dbb8628676bc4...
I mean, yeah, after you set it up like this you still have to prompt-engineer it to behave like a chat, but it's better than GPT-2.
HELLO FRED967, MY NAME IS DOCTOR SBAITSO.
I AM HERE TO HELP YOU.
SAY WHATEVER IS IN YOUR MIND FREELY,
OUR CONVERSATION WILL BE KEPT IN STRICT CONFIDENCE.
MEMORY CONTENTS WILL BE WIPED OFF AFTER YOU LEAVE,
SO, TELL ME ABOUT YOUR PROBLEMS.
That falling intonation is very reassuring.

Open communities with potential for involving yourself include Hugging Face and EleutherAI; the former is perhaps more accessible, the latter runs an active Discord.
It's been a while since I spent time looking at them, so I'm not sure if there is something you can easily get up and running with.
You probably won't be able to run (or especially train) them on typical desktops, though.
GPT-NeoX apparently compares favorably with GPT-3 on some measures, but ChatGPT is a 175 billion parameter GPT-3 model, and the big pretrained GPT-NeoX model available is a 20 billion parameter model. Could you rival ChatGPT with the right settings, training set, and sufficient hardware for training? Well, if you want to try, you can with GPT-NeoX, and you can choose whether or how to filter the output. With OpenAI's models, you get what they give you, on the terms they are willing to give you access to it, with filters that exist to protect OpenAI's image and liability.
To replace ChatGPT? No, they are not good enough.
Since my other account is shadow-banned for some unexplained reason, I just wanted to mention the Petals project. It's an attempt to distribute the load of running these large models, BitTorrent-style. Good luck!
"in Google sheets, I have dates in the first column. How would I make the second column indicate if daylight savings time is active on that date?"
ChatGPT will confidently provide non-working suggestion after non-working suggestion, but ultimately can't make this work.
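Part of the difficulty may be that, as far as I know, Sheets has no built-in DST function, so the model has to improvise. Outside of Sheets the question is straightforward; as a sanity check, Python's stdlib `zoneinfo` answers it directly (the timezone here is my assumption, since the question doesn't specify one):

```python
from datetime import date, datetime, timedelta
from zoneinfo import ZoneInfo  # stdlib since Python 3.9

def dst_active(d: date, tz: str = "America/New_York") -> bool:
    """True if daylight saving time is in effect at noon local time on d."""
    local_noon = datetime(d.year, d.month, d.day, 12, tzinfo=ZoneInfo(tz))
    return local_noon.dst() != timedelta(0)

print(dst_active(date(2023, 7, 1)))  # True  (EDT)
print(dst_active(date(2023, 1, 1)))  # False (EST)
```

Noon is used deliberately to avoid the ambiguous hours around the transitions themselves.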