NATURAL 20
xAI Unveils Powerful Grok 3

PLUS: Step-Video-T2V Powers Advanced Video Generation, NYT Introduces Echo AI Tools and more.

Wes Roth
February 18, 2025

Today:

xAI Unveils Powerful Grok 3
Mistral Unveils Arabic-Focused AI Model
South Korea Secures 10,000 GPUs

xAI Unveils Powerful Grok 3

Elon Musk’s AI company, xAI, has launched Grok 3, its latest AI model, along with updates to its iOS and web apps. xAI says Grok 3, trained on a massive dataset using 200,000 GPUs, outperforms OpenAI’s GPT-4o on math and science benchmarks. A “Reasoning” variant is designed to improve accuracy by fact-checking its own responses. The model will first be available to X’s Premium+ subscribers, with advanced features locked behind a new SuperGrok plan. Planned updates include a voice mode and enterprise API access. xAI plans to open-source Grok 2 once Grok 3 is stable. Musk aims to make the model more politically neutral.

Mistral Unveils Arabic-Focused AI Model

Mistral, a French AI startup, has launched Mistral Saba, a language model designed for Arabic-speaking regions. With 24 billion parameters, it outperforms its general-purpose counterpart in Arabic tasks and shows promise with Indian languages like Tamil and Malayalam. The move signals Mistral's strategic interest in the Middle East and potential regional funding. Saba is available via API and on-premise, catering to sectors like finance, energy, and healthcare.

South Korea Secures 10,000 GPUs

South Korea plans to secure 10,000 high-performance GPUs in 2025 to bolster its national AI computing center. The government is partnering with private companies to stay competitive in the global AI race. While GPU models and budgets will be finalized by September, Nvidia remains the leading supplier. South Korea benefits from U.S. export exemptions, positioning itself as a key player in AI development amid global technological rivalries.

🧠RESEARCH

Region-Adaptive Sampling for Diffusion Transformers

The RAS technique boosts Diffusion Transformers' speed by dynamically updating only key image regions while caching others. Tested on Stable Diffusion 3 and Lumina-Next-T2I, RAS achieved up to 2.51x faster performance with minimal quality loss, enabling more efficient, real-time generative tasks without retraining the model.
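For readers who want the idea in code, here is a minimal sketch of the "update key regions, cache the rest" pattern described above. It is not the paper's implementation: the function name, the importance scores, and `update_fn` (a stand-in for the expensive model call) are all hypothetical.

```python
def region_adaptive_step(tokens, cache, scores, keep_ratio, update_fn):
    """One sampling step that recomputes only the highest-scoring tokens.

    tokens     -- current per-region inputs
    cache      -- outputs saved from the previous step
    scores     -- assumed importance score per region (higher = more important)
    keep_ratio -- fraction of regions to recompute this step
    update_fn  -- hypothetical stand-in for the expensive model call
    """
    n_active = max(1, int(len(tokens) * keep_ratio))
    # Rank regions by score and keep the top n_active as "active".
    ranked = sorted(range(len(tokens)), key=lambda i: scores[i], reverse=True)
    active = sorted(ranked[:n_active])

    out = list(cache)  # inactive regions simply reuse their cached output
    for i in active:
        out[i] = update_fn(tokens[i])  # only active regions pay the compute cost
    return out, active


tokens = [1.0, 2.0, 3.0, 4.0]
cache = [0.0, 0.0, 0.0, 0.0]
scores = [0.1, 0.9, 0.2, 0.8]
out, active = region_adaptive_step(tokens, cache, scores, 0.5, lambda x: x * 10)
```

With a keep ratio of 0.5, only two of the four regions are recomputed; the other two keep whatever was cached. That skipped work is where a speedup would come from.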

Large Language Diffusion Models

LLaDA, a diffusion-based language model, challenges autoregressive dominance: it is trained with a forward process that masks tokens and a reverse process that predicts them. It outperforms its autoregressive (ARM) baselines, rivals LLaMA3 8B in in-context learning, and even beats GPT-4o on a reversal poem-completion task, suggesting diffusion models are a viable alternative for advanced language understanding.
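The masking-and-prediction idea can be illustrated with a toy sketch. This is only an illustration under simplifying assumptions, not LLaDA's code: `mask_tokens`, `fill_masks`, and the `predictor` callback are hypothetical names, and a real model would predict masked tokens with a trained network over several steps.

```python
import random

MASK = "<mask>"


def mask_tokens(tokens, t, rng):
    """Forward process: mask each token independently with probability t."""
    return [MASK if rng.random() < t else tok for tok in tokens]


def fill_masks(tokens, predictor):
    """Reverse step: replace every masked position with a prediction.

    predictor(i, tokens) is a hypothetical stand-in for the trained model.
    """
    return [predictor(i, tokens) if tok == MASK else tok
            for i, tok in enumerate(tokens)]


rng = random.Random(0)
sentence = ["the", "cat", "sat"]
fully_masked = mask_tokens(sentence, 1.0, rng)   # t = 1 masks everything
untouched = mask_tokens(sentence, 0.0, rng)      # t = 0 masks nothing
# A toy "oracle" predictor that already knows the answer.
restored = fill_masks(fully_masked, lambda i, toks: sentence[i])
```

Unlike left-to-right generation, nothing here depends on token order: any masked position can be filled given the rest of the sequence.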

Step-Video-T2V Technical Report: The Practice, Challenges, and Future of Video Foundation Model

Step-Video-T2V, a 30B-parameter text-to-video model, generates high-quality videos up to 204 frames. It uses Video-VAE for compression, bilingual text encoding, and 3D attention with Flow Matching. Achieving state-of-the-art results, it supports content creators with superior video generation while highlighting future improvements for diffusion-based models.

ZeroBench: An Impossible Visual Benchmark for Contemporary Large Multimodal Models

ZeroBench is a new visual benchmark designed to challenge Large Multimodal Models (LMMs) with tasks impossible for current systems. It includes 100 complex questions and 334 simpler subquestions. All 20 tested models scored 0% on the main questions, highlighting gaps in spatial reasoning and pushing for more advanced visual understanding.

MM-RLHF: The Next Step Forward in Multimodal LLM Alignment

MM-RLHF introduces a 120k-pair dataset to better align Multimodal Large Language Models (MLLMs) with human preferences. It features critique-based rewards and dynamic scaling for improved feedback. Fine-tuning with MM-RLHF boosts conversational performance by 19.5% and safety by 60%, advancing MLLM capabilities across diverse tasks.

🛠️TOP TOOLS

Deepfake Video Maker - Cloud-based online software designed to facilitate the creation of deepfake videos using artificial intelligence.

Image To Font Finder - AI-powered tool designed to help users identify fonts from any image.

Bai Chat - AI platform designed to simplify the integration of artificial intelligence into various workflows for professionals, developers, and businesses.

DiagramGPT - AI-powered tool developed by Fraser Xu that enables users to generate a variety of diagram types using natural language input.

JanitorAI - AI tool that integrates chatbot functionality into applications, leveraging technologies such as NLP, ML, and generative AI.

📲SOCIAL MEDIA

Oh boy...
@sama about to drop the bomb on xAI?
— Wes Roth (@WesRothMoney)
5:10 PM • Feb 17, 2025

🗞️MORE NEWS

The New York Times is introducing AI tools like Echo to help staff with editing, summaries, and social media content while ensuring human oversight. Staff receive training, but AI use remains limited and regulated.
South Korea suspended new downloads of DeepSeek AI for privacy violations, citing excessive data collection and encryption flaws. Existing users are advised to avoid entering personal information until compliance improvements are implemented.
Apple's Image Playground app faces criticism for racial bias, as it struggles to render skin tone and hair texture accurately. Despite design restrictions intended to avoid such issues, the AI model still produces problematic, inconsistent results.
Innovaccer launched AI agents to automate repetitive tasks in healthcare, reducing clinician burnout. The agents handle protocol intake, referrals, and patient support via voice activation, enhancing efficiency and improving patient care amid staffing shortages.
