Sources
/MachineLearning: Learning to (Learn at Test Time): RNNs with Expressive Hidden States https://t.co/g83JYT0kVk
Zhengzhong Tu: 🚨Learning to (Learn at Test Time): RNNs with Expressive Hidden States 🌟𝐏𝐫𝐨𝐣: https://t.co/0BXLYRgOUJ 🚀𝐀𝐛𝐬: https://t.co/4ANCg2z2He A new class of sequence modeling layers with linear complexity and an expressive hidden state https://t.co/zMaGui8suE
Arjun Vikram: Proud to share what I've been working on for the past year, "Learning to (Learn at Test Time)"! Our new architecture trains a model to "learn" from its context, replacing Attention's costly KV cache with an expressive hidden state: the weights of a ML model!🤯 🧵by @karansdalal https://t.co/Dq3k4zt0DN



