OpenAI gpt-4-1106-preview
Rank #18
Power: 5750.77

💡 What is Model Power?

Model Power measures how strong and reliable a model is, based on votes from real users. It combines performance scores with user confidence into a single assessment of each model's capabilities.

Looking for the right LLM for your needs?

Discover which model best suits your specific use cases. Our analysis of OpenAI gpt-4-1106-preview and competing text generation LLMs shows how they compare, based on millions of real user votes and the LMarena Pro ranking system. Whether you are building AI applications, creating content, or running research projects, find your match among the world's most advanced language models.

Compare gpt-4-1106-preview with the best text generation LLMs

📊 Comparison with Top 10 Models

This section compares Rank 18 (gpt-4-1106-preview) with the 10 best-performing models. The coefficient shows the performance difference: a positive coefficient means the model is better than Rank 18, and a negative coefficient means it is worse.

Coefficient = (Model Power - Rank 18 Power) / Rank 18 Power × 100
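The coefficient formula can be sketched in a few lines of Python. This is only an illustration of the arithmetic stated above, not the site's actual implementation. The function names `coefficient` and `power_from_coefficient` are hypothetical, and the only data used is the Rank 18 power (5750.77) shown on this page.

```python
REFERENCE_POWER = 5750.77  # Rank 18 (gpt-4-1106-preview) power, as listed on this page


def coefficient(model_power: float, reference_power: float = REFERENCE_POWER) -> float:
    """Percentage difference of a model's power relative to the reference power.

    Positive means stronger than the reference, negative means weaker.
    """
    if reference_power == 0:
        raise ValueError("reference power must be non-zero")
    return (model_power - reference_power) / reference_power * 100


def power_from_coefficient(coeff: float, reference_power: float = REFERENCE_POWER) -> float:
    """Invert the formula: the power implied by a given coefficient (hypothetical helper)."""
    return reference_power * (1 + coeff / 100)


if __name__ == "__main__":
    # The reference model compared with itself always yields a coefficient of zero.
    print(f"{coefficient(REFERENCE_POWER):.2f}%")
```

Because the coefficient is relative to the Rank 18 power, the same model would show a different coefficient on a page built around a different reference model.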

o3-2025-04-16 OpenAI
According to user voting, the o3-2025-04-16 model is 15.48% better at text generation than gpt-4-1106-preview.
Rank #1
+15.48%
gemini-2.5-pro Google
According to user voting, the gemini-2.5-pro model is 14.62% better at text generation than gpt-4-1106-preview.
Rank #2
+14.62%
chatgpt-4o-latest-20250326 OpenAI
According to user voting, the chatgpt-4o-latest-20250326 model is 13.96% better at text generation than gpt-4-1106-preview.
Rank #3
+13.96%
gemini-2.5-flash Google
According to user voting, the gemini-2.5-flash model is 11.18% better at text generation than gpt-4-1106-preview.
Rank #4
+11.18%
grok-3-preview-02-24 xAI
According to user voting, the grok-3-preview-02-24 model is 10.90% better at text generation than gpt-4-1106-preview.
Rank #5
+10.90%
claude-3-7-sonnet-20250219-thinking-32k Anthropic
According to user voting, the claude-3-7-sonnet-20250219-thinking-32k model is 9.02% better at text generation than gpt-4-1106-preview.
Rank #6
+9.02%
claude-opus-4-20250514 Anthropic
According to user voting, the claude-opus-4-20250514 model is 8.96% better at text generation than gpt-4-1106-preview.
Rank #7
+8.96%
gpt-4.1-2025-04-14 OpenAI
According to user voting, the gpt-4.1-2025-04-14 model is 8.33% better at text generation than gpt-4-1106-preview.
Rank #8
+8.33%
deepseek-v3-0324 DeepSeek
According to user voting, the deepseek-v3-0324 model is 7.81% better at text generation than gpt-4-1106-preview.
Rank #9
+7.81%
o1-preview OpenAI
According to user voting, the o1-preview model is 7.22% better at text generation than gpt-4-1106-preview.
Rank #10
+7.22%
Compare gpt-4-1106-preview with similar text generation LLMs

📊 Comparison with Similar Performance Models

This section compares Rank 18 with the 5 models ranked immediately above it (Ranks 13-17) and the 5 models ranked immediately below it (Ranks 19-23). The coefficient shows the performance difference relative to Rank 18.
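The neighbor window described above (5 ranks on each side of the reference) can be sketched as follows. This is a minimal illustration under assumptions: the function name `neighbor_ranks` and its `window` and `total` parameters are hypothetical, and the page does not say how the site handles models near the top or bottom of the list, so the clipping here is a guess.

```python
from typing import List, Optional, Tuple


def neighbor_ranks(
    reference_rank: int, window: int = 5, total: Optional[int] = None
) -> Tuple[List[int], List[int]]:
    """Return the ranks immediately above and below a reference rank.

    Ranks start at 1. If `total` (the number of ranked models) is given,
    ranks past the end of the list are dropped. This clipping behavior
    is an assumption, not something the page documents.
    """
    above = [r for r in range(reference_rank - window, reference_rank) if r >= 1]
    below = [
        r
        for r in range(reference_rank + 1, reference_rank + window + 1)
        if total is None or r <= total
    ]
    return above, below


if __name__ == "__main__":
    above, below = neighbor_ranks(18)
    print(above)  # [13, 14, 15, 16, 17]
    print(below)  # [19, 20, 21, 22, 23]
```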

claude-3-5-haiku-20241022 Anthropic
According to user voting, the claude-3-5-haiku-20241022 model is 4.35% better at text generation than gpt-4-1106-preview.
Rank #13
+4.35%
gpt-4o-2024-05-13 OpenAI
According to user voting, the gpt-4o-2024-05-13 model is 3.48% better at text generation than gpt-4-1106-preview.
Rank #14
+3.48%
claude-3-5-sonnet-20240620 Anthropic
According to user voting, the claude-3-5-sonnet-20240620 model is 2.61% better at text generation than gpt-4-1106-preview.
Rank #15
+2.61%
gpt-4-turbo-2024-04-09 OpenAI
According to user voting, the gpt-4-turbo-2024-04-09 model is 1.74% better at text generation than gpt-4-1106-preview.
Rank #16
+1.74%
claude-3-opus-20240229 Anthropic
According to user voting, the claude-3-opus-20240229 model is 0.87% better at text generation than gpt-4-1106-preview.
Rank #17
+0.87%
gpt-4-1106-preview OpenAI
This is the reference model (Rank #18); all comparisons are relative to it.
Rank #18
0.00%
claude-3-sonnet-20240229 Anthropic
According to user voting, the claude-3-sonnet-20240229 model is 0.87% worse at text generation than gpt-4-1106-preview.
Rank #19
-0.87%
gpt-4-0613 OpenAI
According to user voting, the gpt-4-0613 model is 1.74% worse at text generation than gpt-4-1106-preview.
Rank #20
-1.74%
claude-3-haiku-20240307 Anthropic
According to user voting, the claude-3-haiku-20240307 model is 2.61% worse at text generation than gpt-4-1106-preview.
Rank #21
-2.61%
gpt-3.5-turbo-1106 OpenAI
According to user voting, the gpt-3.5-turbo-1106 model is 3.48% worse at text generation than gpt-4-1106-preview.
Rank #22
-3.48%
claude-2.1 Anthropic
According to user voting, the claude-2.1 model is 4.35% worse at text generation than gpt-4-1106-preview.
Rank #23
-4.35%