THE NO BS AI REPORT

Apr 22, 2025

MODEL WARS HEAT UP

Google's Gemini 2.5 Pro, released in late March, boasts a 1 million token context window, with plans to expand to 2 million. That's enough to process 750,000 words—longer than the entire Lord of the Rings trilogy. It continues to show strong performance on reasoning and long-context tasks.

OpenAI launched GPT-4.1 last week, optimized for coding, QA, bug testing, and end-to-end application development. The release focuses on what OpenAI CFO Sarah Friar described as an "agentic software engineer." The model is currently accessible via API.

Anthropic's Claude 3.7 Sonnet remains a strong competitor, available via API, Amazon Bedrock, and Google Cloud's Vertex AI. It offers extended reasoning capabilities and Claude "thinking modes" to improve performance across creative, analytical, and professional tasks.

Pricing-wise, Gemini 2.5 Pro sits between OpenAI's o3-mini and Anthropic's higher-tier Claude 3.7 Sonnet. GPT-4.5 remains the most expensive, with early developer feedback largely positive.

TECH COLLABORATION HEATS UP

In a move toward interoperability, Google will support Anthropic's Model Context Protocol (MCP), allowing Gemini models to securely access structured data and external applications. This mirrors a similar announcement by OpenAI last month, signaling convergence around shared standards.

Anthropic also introduced Claude for Education, designed for students, and Claude Research, which integrates with Google Workspace tools like Gmail, Docs, and Calendar, providing personal-context-aware AI assistance.

ROBOTICS & REINFORCEMENT LEARNING BREAKTHROUGHS

Andrew Barto and Richard Sutton received the 2025 ACM Turing Award—computing's highest honor—for foundational work in reinforcement learning, which remains pivotal in robotics, game AI, and autonomous systems.

NVIDIA celebrated National Robotics Week by spotlighting advances in real-world RL, scalable robot learning, and embodied AI from top young researchers.

Noteworthy real-world innovations include Hullbot, an AI-powered underwater robot that reduces marine drag and fuel use by 13%, and a soft robot designed to crawl through earthquake rubble and aid in rescue missions—proving how embodied AI is making a physical difference.

AGENTIC AI CONTINUES TO DOMINATE

Microsoft expanded its Security Copilot platform with new agents for Phishing Triage, Alert Triage, and Vulnerability Remediation—designed to fully automate repetitive security tasks.

Meanwhile, an IBM Research report reveals that although 99% of developers are working on AI agents, most enterprises are not yet "agent-ready." Exposing APIs and building integration layers will be key throughout 2025.

INDUSTRY FORECASTS & CHALLENGES

According to the Stanford 2025 AI Index, the U.S. still leads in producing influential AI models, but China is rapidly narrowing the performance gap, especially on key benchmarks like MMLU, HumanEval, and HellaSwag.

The same report notes a 56.4% year-over-year increase in AI-related incidents, underscoring the ongoing importance of investing in AI safety and alignment.

BOTTOM LINE

The AI world continues evolving at an almost reckless pace:

Language models are gaining memory, reasoning, and coding superpowers.
AI agents are becoming operational in cybersecurity, coding, and productivity.
Robot learning is moving from labs to real-world applications.
Industry leaders are (finally) agreeing on shared standards like MCP.
And yet, AI safety challenges and ethics concerns are rising just as fast.

Buckle up—this ride is just getting started.