Skip to content
Better HN
Top
Best
Ask
Show
New
Jobs
Search
⌘K
0 points
Doxin
2y ago
0 comments
Save
Share
I'd assume there's no real state the network can "remember" between iterations, so shuffling will at best just waste time.
0 comments
1 comments · 1 top-level
top
newest
oldest
Two_hands
2y ago
My thoughts had been related to the ordering, but it makes sense that it doesn’t matter. I have read that it is actually better to train the model in separate batches with generated and real images in their own batches before the gradient step.
j
/
k
navigate · click thread line to collapse