How much information do LLMs really memorize? Now we know, thanks to Meta, Google, Nvidia and Cornell https://t.co/NePmdnw1oe
Large Language Models Often Know When They Are Being Evaluated. Joe Needham, Giles Edkins (@gdedkins), Govind Pimpale (@GovindPimpale), Henning Bartsch, @MariusHobbhahn, @apolloaievals, @matsprogram https://t.co/yfEsl5IWni
Models can already tell when you are grading them. 😯 Your evaluation prompt has a scent; top LLMs smell it fast. Frontier language models can already sense when they are being tested. A new 1,000-item benchmark shows top systems spot evaluation prompts almost as well as https://t.co/NlJ0A0TNFU
A collaborative study by Meta, Google DeepMind, NVIDIA, and Cornell University has quantified the memorization capacity of large language models (LLMs), estimating that these models store roughly 3.6 bits of information per parameter. Once a training dataset carries more information than this total capacity, the model can no longer store individual examples verbatim and is pushed toward generalizing from them instead, which helps explain why training on larger datasets contributes to safer models and lower test loss. Separately, research indicates that LLMs perform strongly on structured emotional intelligence assessments, matching or exceeding human scores, though they still fall short of capturing the sensory and motor experience that grounds human understanding. Finally, frontier LLMs can often detect when they are being evaluated: a new 1,000-item benchmark shows that top systems identify evaluation prompts with notable accuracy. Together, these findings sharpen the picture of how LLMs balance memorization against learning, and of their growing awareness of being tested.
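To make the 3.6 bits/parameter figure concrete, here is a rough back-of-the-envelope sketch. The bits-per-parameter estimate is the one reported by the study; the 8B-parameter model size and the capacity-versus-dataset comparison below are illustrative assumptions, not numbers from the paper.

```python
# Back-of-the-envelope sketch of what ~3.6 bits/parameter implies.
# Assumptions: the model size below is hypothetical; only the
# bits-per-parameter estimate comes from the reported study.

BITS_PER_PARAM = 3.6  # estimated memorization capacity per parameter


def memorization_capacity_bits(num_params: int) -> float:
    """Total raw memorization capacity implied by the per-parameter estimate."""
    return BITS_PER_PARAM * num_params


# Hypothetical 8B-parameter model.
params = 8_000_000_000
capacity_bits = memorization_capacity_bits(params)
capacity_gb = capacity_bits / 8 / 1e9  # bits -> bytes -> gigabytes

print(f"Capacity: {capacity_bits:.2e} bits (~{capacity_gb:.1f} GB of raw data)")

# Intuition from the study: once the training corpus carries more information
# than this fixed budget, the model cannot memorize every example verbatim,
# so each example gets a smaller share of the budget and the model is pushed
# toward generalization rather than recall.
```

Under these assumptions the hypothetical 8B-parameter model tops out around a few gigabytes of raw memorized content, which is tiny next to a multi-terabyte training corpus, giving some intuition for why scaling data shifts the balance from memorization to generalization.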