Learn to use AI like a Pro. Learn More

xAI's Latest Development

Grok-2 Gets a Speed Boost with Groundbreaking Code Rewrite

Last updated:

Mackenzie Ferguson

Edited By

Mackenzie Ferguson

AI Tools Researcher & Implementation Consultant

Elon Musk's xAI has supercharged its Grok-2 and Grok-2 mini chatbots, thanks to developers Lianmin Zheng and Saeed Maleki, who rewrote the inference code using the open-source SGLang. This update has doubled the speed and slightly improved accuracy, making Grok-2 models formidable contenders in the AI landscape.

Banner for Grok-2 Gets a Speed Boost with Groundbreaking Code Rewrite

Elon Musk's xAI has recently made news with the launch of its Grok-2 large language model (LLM) chatbot, which is available through an $8 monthly subscription on the social network X. The Grok-2 model, along with its less powerful but faster variant Grok-2 mini, has undergone significant enhancements. The improvements were facilitated by developers Lianmin Zheng and Saeed Maleki, who rewrote the inference code stack from scratch using SGLang, an open-source system for executing complex language model programs.

    SGLang, developed by researchers from Stanford University, the University of California, Berkeley, Texas A&M University, and Shanghai Jiao Tong University, enables up to 6.4 times higher throughput than existing systems. It integrates a frontend language with a backend runtime to simplify the programming of language model applications. By leveraging SGLang's capabilities, xAI has managed to significantly enhance the speed and accuracy of both Grok-2 and Grok-2 mini.

      Learn to use AI like a Pro

      Get the latest AI workflows to boost your productivity and business performance, delivered weekly by expert consultants. Enjoy step-by-step guides, weekly Q&A sessions, and full access to our AI workflow archive.

      Canva Logo
      Claude AI Logo
      Google Gemini Logo
      HeyGen Logo
      Hugging Face Logo
      Microsoft Logo
      OpenAI Logo
      Zapier Logo
      Canva Logo
      Claude AI Logo
      Google Gemini Logo
      HeyGen Logo
      Hugging Face Logo
      Microsoft Logo
      OpenAI Logo
      Zapier Logo

      The main Grok-2 model, which requires multi-host inference due to its complexity, now operates at reasonable speeds, thanks to these improvements. According to xAI developer Igor Babuschkin, Grok-2 mini is now twice as fast as it was the previous day. This speed boost, along with slight accuracy improvements, has positioned Grok-2 favorably in the AI community.

        In recent updates to the third-party Lmsys Chatbot Arena leaderboard, the main Grok-2 has secured the #2 spot with an impressive Arena Score of 1293, based on 6686 votes. This places it alongside Google's Gemini-1.5 Pro model and just behind OpenAI's latest version of ChatGPT-4o. The Grok-2-mini has also climbed the ranks to reach the #5 position with an Arena Score of 1268 from 7266 votes, placing it just behind GPT-4o mini and Claude 3.5 Sonnet.

          The success of Grok-2 and Grok-2-mini is a testament to xAI's commitment to advancing AI technology. Grok-2 has particularly excelled in mathematical tasks, where it ranks #1. It also holds strong positions in other categories such as Hard Prompts, Coding, and Instruction-following. These achievements highlight xAI's ongoing innovation and its role in pushing the boundaries of what AI can achieve.

            The primary advantage of Grok-2 mini over its more powerful counterpart is its enhanced speed, which makes it an attractive option for users seeking high performance with lower computational overhead. Babuschkin has pledged further improvements to the processing speed of Grok-2 mini, which could make it an even more appealing choice for developers and end-users alike.

              Learn to use AI like a Pro

              Get the latest AI workflows to boost your productivity and business performance, delivered weekly by expert consultants. Enjoy step-by-step guides, weekly Q&A sessions, and full access to our AI workflow archive.

              Canva Logo
              Claude AI Logo
              Google Gemini Logo
              HeyGen Logo
              Hugging Face Logo
              Microsoft Logo
              OpenAI Logo
              Zapier Logo
              Canva Logo
              Claude AI Logo
              Google Gemini Logo
              HeyGen Logo
              Hugging Face Logo
              Microsoft Logo
              OpenAI Logo
              Zapier Logo

              The integration of Grok-2 and Grok-2 mini into the Chatbot Arena leaderboard has attracted significant attention within the AI community. The models' outstanding performance underscores the rapid advancements being made in AI technology. As xAI continues to refine and upgrade its models, we can expect further enhancements in both speed and accuracy, keeping Grok-2 and Grok-2 mini at the forefront of AI development.

                Overall, the advancements in Grok-2 and Grok-2 mini provide valuable insights for business readers looking to stay up to date on AI technology. The increased speed and accuracy of these models can have significant implications for businesses that rely on advanced AI capabilities. As xAI continues to push the envelope, it sets a precedent for what can be achieved in the realm of artificial intelligence, signaling a bright future for the industry.

                  Recommended Tools

                  News

                    Learn to use AI like a Pro

                    Get the latest AI workflows to boost your productivity and business performance, delivered weekly by expert consultants. Enjoy step-by-step guides, weekly Q&A sessions, and full access to our AI workflow archive.

                    Canva Logo
                    Claude AI Logo
                    Google Gemini Logo
                    HeyGen Logo
                    Hugging Face Logo
                    Microsoft Logo
                    OpenAI Logo
                    Zapier Logo
                    Canva Logo
                    Claude AI Logo
                    Google Gemini Logo
                    HeyGen Logo
                    Hugging Face Logo
                    Microsoft Logo
                    OpenAI Logo
                    Zapier Logo