But please, do share.
Google - T5 - Feb 2020 - https://blog.research.google/2020/02/exploring-transfer-lear...
Both of these were and still are used heavily for on-going research and T5 has been found to be decently useful when fine-tuned.
Weights were available for both.
Google published dozens to promote Tensorflow:
https://experiments.withgoogle.com/font-map https://experiments.withgoogle.com/sketch-rnn-demo https://experiments.withgoogle.com/curator-table https://experiments.withgoogle.com/nsynth-super https://experiments.withgoogle.com/t-sne-map
The list goes on. Many are source-available with weights too.
> can't name one LLM with public access before ChatGPT, much less weights available till LLaMA 1.
Do any of these ring a bell?
- DistilBERT/MobileBERT/DeBERTa/RoBERTa/ALBERT
- FNet
- GPT2/GPT-Neo/GPT-J
- Marian
- MBart
- M2m100
- NLLB
- Electra
- T5/LongT5/T5-flan
- XLNet
- Reformer
- ProphetNet
- Pegasus
That's not comprehensive but may be enough to jog your memory.
The gap in communication is we don't mean _literally_ no one _ever_ open-sourced models. I agree, that would be absurd. [1]
Companies, quite infamously and well-understood, _did_ hold back their "real" generative models, even from being available for pay.
Take a stab at a literal definition: - post-GPT2 LLMs (ex. PALM, PALM2) - art like DaLL-E, Imagen, Parti
Loosely, we had Disco Diffusion for art, and GPT-3 for LLMs, and then Dall-E, then Midjourney. That was over an _entire year_, and the floodgates on private ones didn't open till post SD/ChatGPT.
[1] thank you for the lengths you went to highlight the best over a considered span of time, I would have just said something snarky :)
[2] I did not realize FLAN was open-sourced a month before ChatGPT, that's fascinating: we're stretching a bit, beyond that, IMHO: the BERTs aren't recognizable as LLMs.
CLIP could be used as an image generator, slowly.
> and can't name one LLM with public access before ChatGPT, much less weights available till LLaMA 1
InstructGPT was available on OpenAI playground for months before ChatGPT and was basically as capable as GPT3, people were really missing out. Don't know any good public models though.