visarga · 1y ago · 0 points

You could, in theory, feed the input twice, so that on the second pass the model processes each token with the whole input already summarized in its state. That would fix the problem of not knowing in advance what information to retain. A more complicated approach would be to train the RNN to request a replay of some earlier data when it needs it.
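To make the "run it twice" idea concrete, here is a minimal sketch with a toy scalar RNN. The `rnn_step` recurrence and its weights `w_h` and `w_x` are made up for illustration and are not taken from any real model; the only point is that the second pass starts from a state that already depends on the full input.

```python
import math

def rnn_step(state, x, w_h=0.5, w_x=1.0):
    """One step of a toy scalar RNN (hypothetical weights, illustration only)."""
    return math.tanh(w_h * state + w_x * x)

def run(tokens, state=0.0):
    """Feed tokens one by one; return the final state and every intermediate state."""
    states = []
    for x in tokens:
        state = rnn_step(state, x)
        states.append(state)
    return state, states

def run_twice(tokens):
    """First pass builds a summary of the input; second pass re-reads it from that summary."""
    summary, _ = run(tokens)
    # On the second pass, even the first token is processed with a state
    # that has been influenced by the last token.
    return run(tokens, state=summary)

a = [0.1, 0.2, 0.3]
b = [0.1, 0.2, 0.9]  # differs from `a` only in the last token

# Single pass: the state after the first token cannot know about the last token.
print(run(a)[1][0] == run(b)[1][0])              # True
# Two passes: the state after the first token now reflects the last token.
print(run_twice(a)[1][0] == run_twice(b)[1][0])  # False
```

The cost is one extra pass over the input, while the state size stays the same.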

A great thing about RNNs is that they can easily fork their state and generate trees, so it would be possible to backtrack and work on combinatorial search problems.
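A rough sketch of what forking and backtracking could look like, assuming the recurrent state is a small fixed-size value. The `step` function here is a stand-in (a running sum and a length counter), not a real RNN, and `search` is a hypothetical helper; the shape of the search is the point: every branch gets its own child state, and the parent state is left intact for backtracking.

```python
def step(state, token):
    """Toy recurrence: the state is a fixed-size tuple (running total, length)."""
    total, length = state
    return (total + token, length + 1)

def search(state, choices, target, max_len, path=()):
    """Depth-first search over token sequences, forking the state at each branch."""
    total, length = state
    if total == target:
        return list(path)
    if length == max_len or total > target:
        return None  # dead end: the caller still holds its own state, so it can backtrack
    for tok in choices:
        child = step(state, tok)  # fork: `state` itself is never modified
        found = search(child, choices, target, max_len, path + (tok,))
        if found is not None:
            return found
    return None

print(search((0, 0), (3, 5), 11, 4))  # [3, 3, 5]
```

Because the state has a fixed size, keeping one copy per open branch costs the same no matter how long the prefix is.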

It is also easier to cache demonstrations for free in the initial state: a model that has seen lots of data uses no more memory than a model starting from scratch.
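As an illustration of caching demonstrations, here is a sketch under the assumption of a toy vector RNN with made-up weights. `HIDDEN`, `step`, `encode`, and `answer` are hypothetical names. The demonstrations are encoded once, and every later query starts from the saved state rather than from zeros.

```python
import math

HIDDEN = 4  # hypothetical state size

def step(state, x):
    """Toy recurrence with fixed, made-up weights; the state always has HIDDEN entries."""
    return [math.tanh(0.5 * h + 0.1 * (i + 1) * x) for i, h in enumerate(state)]

def encode(tokens, state=None):
    """Run tokens through the recurrence, starting from `state` (zeros if omitted)."""
    state = [0.0] * HIDDEN if state is None else list(state)
    for x in tokens:
        state = step(state, x)
    return state

demos = [0.3, -0.2, 0.8, 0.1, 0.5]
cached = encode(demos)  # pay for the demonstrations once

def answer(query):
    """Process a query starting from the cached state instead of re-reading the demos."""
    return encode(query, cached)

# Same result as re-processing demos + query from scratch.
print(answer([0.4]) == encode(demos + [0.4]))  # True
# The cached state is no larger than a fresh one.
print(len(cached) == len(encode([])))          # True
```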


2 comments · 1 top-level

imjonse · 1y ago

Something like this?

https://hazyresearch.stanford.edu/blog/2024-07-01-jrt

visarga (OP) · 1y ago

Yes, that's the paper.
