LLM API Market 2024: Key Trends and Model Leaderboard
As we reach the midpoint of 2024, the competitive landscape for large language models (LLMs) is shifting rapidly. The so-called "LLM Wars" are intensifying, with major changes in the LLM API market for text-generation technologies. Is OpenAI losing its lead? Which large language models are developers actually using, and what trends are shaping the global LLM API market?
A recent analysis by X user "karminski"—using comprehensive data from the API aggregator OpenRouter—provides valuable insights into token usage, market share, and developer preferences for the first half of 2024.
Let's break down the findings from the OpenRouter dataset.
What is OpenRouter?
OpenRouter is a widely used API aggregator that allows developers to access and compare various large language models through a unified interface. Its usage data offers a unique perspective on real-world developer preferences and LLM API market trends.
Model Leaderboard: Most Popular LLMs Among Developers
Which large language models are leading the API market in 2024? According to the latest OpenRouter data:
- Google's Gemini-2.0-Flash is the most used model, topping the leaderboard.
- Anthropic's Claude-Sonnet-4 holds second place.
- Google's Gemini-2.5-Flash-Preview-0520 rounds out the top three.
- DeepSeek V3 (both free and paid versions) ranks 4th and 5th; combined, DeepSeek V3 rivals the second-place model in total API usage.
LLM API Market Share Breakdown
The LLM API market share data reveals:
- Google leads with a commanding 43.1% market share.
- DeepSeek and Anthropic follow closely, with 19.6% and 18.4% respectively.
Developer Preferences by Domain
OpenRouter's data also highlights which LLMs developers prefer for specific use cases.
Programming Domain: Top LLMs for Coding
For code generation and programming tasks, developers increasingly choose Anthropic's Claude series. These models are favored for their advanced capabilities in the programming domain.
Translation Domain: LLMs for Language Translation
In translation, the LLM API market is more fragmented. Google's models stand out, leveraging extensive linguistic datasets to provide robust translation capabilities.
Role-Playing Applications: Creative LLM Usage
For creative and role-playing applications, developers experiment with various LLMs to balance creativity, coherence, and cost for narrative-driven apps.
Advertising Domain: LLMs for Marketing Content
In marketing, developers use a range of LLMs to generate ad copy and campaign content, optimizing for creativity and cost-effectiveness.
Developer Interfaces: How LLMs Are Accessed via OpenRouter
Examining how developers access LLMs through OpenRouter reveals integration patterns and workflow preferences.
Key Takeaways: 2024 LLM API Market Insights
- Google's Strategic Dominance: Google leads the LLM API market, capturing nearly half of all usage. Its Gemini-2.0-Flash model is especially popular due to its cost-effectiveness and performance.
- Anthropic's Coding Focus: Anthropic's Claude-3.5, 3.7, and 4 models are gaining traction in the programming domain, appealing to developers seeking specialized coding capabilities.
- OpenAI's Decline: OpenAI's lower API market share may result from strategic decisions, AccessKey requirements, or less competitive pricing for high-volume use.
- DeepSeek's Rise: DeepSeek V3 is a 2024 success story, with strong developer retention. Its faster response times and lower time to first token (TTFT) make it more attractive than DeepSeek-R1.
- Meta's Llama Fades: The Llama series is losing momentum in the LLM API market, with declining developer interest.
- Emerging Players: Mistral AI holds 3% market share, especially among European developers interested in fine-tuning open-source models. X-AI's Grok and Alibaba's Qwen (Tongyi Qianwen) are present but face challenges in gaining broader adoption.
Glossary:
- TTFT (Time to First Token): Measures how quickly an LLM starts generating output.
- SOTA (State-of-the-Art): Refers to the leading models in a specific field.
For more on LLM performance, see How to Choose the Best LLM for Your Application and OpenRouter API Integration Guide.