
The Technology Innovation Institute (TII) in Abu Dhabi has unveiled Falcon Mamba 7B, a 7-billion-parameter model trained on 5.5 trillion tokens. It is billed as the world's first competitive attention-free AI model: in place of a transformer, it uses a state space architecture, which lets it generate each new token in constant time regardless of context size and, in principle, process unlimited sequence lengths while fitting on a single 24GB GPU (its native context length is 8,192 tokens). In several benchmarks, Falcon Mamba 7B outperforms established models such as Llama 3 8B, Llama 3.1 8B, Gemma 7B, and Mistral 7B. The model is open-source and permissively licensed, and appears on the Open LLM Leaderboard with an average score of 15 points.
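The constant-time-per-token claim follows from how state space models work: the entire history is compressed into a fixed-size hidden state, so each step does the same amount of work no matter how long the context is. Here is a toy sketch of that recurrence in NumPy (the matrices, dimensions, and scaling are illustrative assumptions, not Falcon Mamba's actual parameters):

```python
# Toy linear state-space recurrence: h_t = A h_{t-1} + B x_t,  y_t = C h_t.
# Per-token cost and memory are fixed by d_state, not by sequence length --
# unlike attention, which looks back over all previous tokens.
import numpy as np

rng = np.random.default_rng(0)
d_state, d_model = 16, 8                        # hypothetical sizes for the sketch

A = 0.9 * np.eye(d_state)                       # state transition, kept stable (|eigenvalues| < 1)
B = 0.1 * rng.normal(size=(d_state, d_model))   # input projection
C = 0.1 * rng.normal(size=(d_model, d_state))   # output projection

def step(h, x):
    """One recurrence step: same flops for token 10 or token 10,000."""
    h = A @ h + B @ x
    return h, C @ h

h = np.zeros(d_state)
for t in range(10_000):                         # any sequence length: state never grows
    h, y = step(h, rng.normal(size=d_model))

print(h.shape, y.shape)                         # state stays (16,), output stays (8,)
```

Real Mamba layers add input-dependent (selective) parameters and hardware-aware scans on top of this idea, but the fixed-size state is what removes attention's quadratic cost.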


📢 4-bit Llama 3.1 405B, 70B, 8B Now Available! 📢 @AIatMeta's Llama 3.1 models are now quantized to 4 bits by @neuralmagic's research team and available with ~100% recovery. These enable 4X cheaper deployments (405B goes from 2 8x80GB nodes to 1 4x80GB). Continued in next... https://t.co/RkqUzyeDfY
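The memory saving behind that 4X figure can be sketched at a toy level: each FP16 weight is replaced by a 4-bit integer plus a shared scale, cutting storage roughly 4x while keeping dequantized values close to the originals. This is a generic symmetric round-to-nearest sketch, not Neural Magic's actual recipe, which uses more sophisticated calibration to reach the ~100% accuracy recovery mentioned above:

```python
# Minimal sketch of symmetric 4-bit weight quantization (generic round-to-nearest;
# illustrative only -- not the actual method used for the Llama 3.1 checkpoints).
import numpy as np

def quantize_4bit(w):
    """Map float weights to integer levels in [-8, 7] with one per-tensor scale."""
    scale = np.max(np.abs(w)) / 7.0             # signed 4-bit range is -8..7
    q = np.clip(np.round(w / scale), -8, 7).astype(np.int8)
    return q, scale

def dequantize(q, scale):
    return q.astype(np.float32) * scale

rng = np.random.default_rng(0)
w = rng.normal(scale=0.02, size=(64, 64)).astype(np.float32)
q, s = quantize_4bit(w)
w_hat = dequantize(q, s)

# Storage drops from 16 bits to ~4 bits per weight (plus one scale per tensor);
# at 405B parameters, that is the difference between two 8x80GB nodes and one 4x80GB node.
max_err = np.abs(w - w_hat).max()
print(q.min(), q.max(), max_err <= s / 2)       # levels in range; error within half a step
```

Production schemes typically use per-group scales and calibration data to minimize accuracy loss, but the storage arithmetic is the same.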
Falcon Mamba 7B’s powerful new AI architecture offers alternative to transformer models: In several benchmarks, Falcon Mamba 7B convincingly outperformed Llama 3 8B, Llama 3.1 8B, Gemma 7B and Mistral 7B. https://t.co/SnR8jqqvfM #AI #Business