Grok 4 is a huge leap from Grok 3, but how good is it compared to other models on the market, such as Gemini 2.5 Pro? We now have answers, thanks to new independent benchmarks. LMArena.ai, which is an ...
Hosted on MSN
Why you can’t trust Grok 4’s benchmarks
Elon Musk’s xAI recently announced that its latest model, Grok 4, has achieved top scores on several major AI benchmarks. Most notably, it reportedly conquered the Abstraction and Reasoning Corpus ...
Elon Musk’s xAI Holdings Corp. has debuted a new large language model, Grok 4, that’s optimized for reasoning tasks such as generating code. The LLM’s late Wednesday launch followed a turbulent week ...
Elon Musk's xAI has launched its new flagship AI model, Grok 4, which demonstrates leading performance in various academic, reasoning, and coding benchmarks. Elon Musk's xAI today announced Grok 4, ...
Yesterday, just as OpenAI celebrated its 10-year anniversary, the AI company launched GPT-5.2, its latest series of AI models to power ChatGPT. The latest release is allegedly in response to OpenAI’s ...
xAI's Grok 4.20 will include enhancements like improved multimodal capabilities (text, images, video), reduced hallucinations via fact-checking tools, advanced ...
Elon Musk-owned xAI is testing Grok 4.20, a new model update to Grok 4, which already competes with GPT-5 in some benchmarks, such as ARC-AGI 2. GPT-5 is one of the best models for coding, and it ...
In what appeared to be a bid to soak up some of Google's limelight prior to the launch of its new Gemini 3 flagship AI model — now recorded as the most powerful LLM in the world by multiple ...
Update: A day after this article was published, xAI unveiled Grok 4.1 access through its API for $0.20 per 1 million input tokens (or $0.05 for cached input) and $0.50 per million output tokens, ...
Last week, Elon Musk’s xAI released the long-awaited Grok 4. And from our perspective, it likely marked the moment AI officially shifted into a higher gear. In the span of just a few months, xAI went ...