The cost of large language models (LLMs) has plummeted dramatically over the past year, signaling a transformative shift in the artificial intelligence landscape. Meta's Llama 3.3 now offers input tokens at $0.10 per million, representing a 450x cost reduction compared to OpenAI's GPT-4 at the start of the year, which was priced at $45 per million tokens. This trend is further highlighted by Llama 3.3's 96% lower cost compared to GPT-4o, 97% lower than Claude 3.5, and 92% lower than Gemini Pro. Meanwhile, AWS has introduced a near-state-of-the-art LLM at a fraction of the cost, marking a poetic entrance into the era of foundation model commoditization and intensifying competition in the market. The price-performance ratio of models like GPT-4o has also improved, with costs dropping fivefold while performance has doubled over the past year. Additionally, companies like QwQ are catching up to state-of-the-art (SOTA) models. Analysts predict that the commoditization of intelligence will continue in the coming years, with the price of intelligence potentially approaching zero, creating significant disruption across industries.
The cost of an LLM with the quality of the original GPT-4 (MMLU of 86 or greater) has fallen by a factor of 450x this year 🤯 On 1/1 it was @OpenAI GPT-4 ($45/million tokens) On 12/6 it is @AIatMeta Llama 3.3 8b ($0.10/million tokens) Crazy times. https://t.co/2TEZrQ2aQ7
🧵Here's how the latest @AIatMeta llama 3.3 stacks up: For input tokens ($0.1 per 1M tokens for Llama 3.3 70B): - vs GPT-4o ($2.5): 96% lower cost - vs Claude 3.5 ($3.0): 97% lower cost - vs Gemini Pro ($1.3): 92% lower cost 1/x
The next few years will be wild. Intelligence is about to become ubiquitous and affordable—adapt or risk being left behind. 🚀https://t.co/C29lEwvubf