Elon Musk’s artificial intelligence company, xAI, has released its latest flagship AI model, Grok 3, alongside new features in the Grok apps for iOS and the web.
Grok, xAI’s answer to models like OpenAI’s GPT-4o and Google’s Gemini, can analyze images and respond to questions, and it powers a number of features on Musk’s social network, X. Grok 3 has been in development for several months. It was initially slated for release in 2024 but missed that deadline.
To train Grok 3, xAI used a massive data center in Memphis containing approximately 200,000 GPUs. According to Musk, Grok 3 was developed with “10x” more computing power than its predecessor, Grok 2, and with an expanded training data set that includes filings from court cases.

During a live-streamed presentation, Musk described Grok 3 as “an order of magnitude more capable” than its predecessor and said the model is meant to seek truth, even when that means challenging politically correct norms.
Grok 3 is not a single model but a family of models. It includes a smaller version, Grok 3 mini, which responds to questions more quickly at the cost of some accuracy. The rollout of these models begins on Monday, with some features still in beta.
xAI claims that Grok 3 outperforms GPT-4o on benchmarks including AIME, which evaluates a model’s performance on math problems, and GPQA, which uses graduate-level physics, biology, and chemistry questions. An early version of Grok 3 also scored competitively in Chatbot Arena, a crowdsourced test that pits different AI models against each other.

Two variations of Grok 3, Grok 3 Reasoning and Grok 3 mini Reasoning, can carefully “think through” problems, similar to “reasoning” models like OpenAI’s o3-mini and DeepSeek’s R1. Reasoning models attempt to fact-check themselves thoroughly before providing results, which helps them avoid some of the pitfalls that normally trip up models.
xAI claims that Grok 3 Reasoning surpasses the best version of o3-mini on several popular benchmarks, including a newer mathematics benchmark called AIME 2025.