Skip to content
Better HN
Top
New
Best
Ask
Show
Jobs
Search
⌘K
undefined | Better HN
story
0 points
jacobgorm
1y ago
0 comments
Share
Given that you can download and use the weights, the model architecture has to be includded as part of that. And I did read a paper from them recently describing their MoE architecture and how it differs from the original GShard.
0 comments
default
newest
oldest
awestroke
1y ago
Excuse me? What weights can you download from OpenAI? gpt2 does not count
jacobgorm
OP
1y ago
Sorry I meant that DeepSeek release their models. Wrong context.
j
/
k
navigate · click thread line to collapse