0 points
balder1991
1y ago
Another question: do they still publish any research that’s relevant to the field nowadays?
3 comments · 1 top-level
awestroke
1y ago
· 2 in thread
No. They publish PDFs that hype up their models, but they do not publish anything even resembling a high-level overview of the model architecture.
jacobgorm
1y ago
Given that you can download and use the weights, the model architecture has to be included as part of that. And I recently read a paper from them describing their MoE architecture and how it differs from the original GShard.
awestroke
1y ago
Excuse me? What weights can you download from OpenAI? GPT-2 does not count.