Sources
fly51fly[CL] Training LLMs over Neurally Compressed Text B Lester, J Lee, A Alemi, J Pennington, A Roberts, J Sohl-Dickstein, N Constant [Google DeepMind] (2024) https://t.co/Iy9SB7vQXv - Training large language models (LLMs) over highly compressed text yields advantages in training and… https://t.co/9Bj1kcTOyW
Noah ConstantEver wonder why we don’t train LLMs over highly compressed text? Turns out it’s hard to make it work. Check out our paper for some progress that we’re hoping others can build on. https://t.co/mceqpUfZQo With @blester125, @hoonkp, @alemi, Jeffrey Pennington, @ada_rob, @jaschasd
Ting-Yun ChangLocalization in LLMs is often mentioned. But do localization methods actually localize correctly? In our #NAACL2024 paper, we (w/ @_jessethomason_, @robinomial) develop two benchmarking ways to directly evaluate how well 5 existing methods can localize memorized data in LLMs. https://t.co/S9CIXlt1xR







