Skip to content

2025: The Year AI Stopped Being a Feature and Became the Infrastructure

Reasoning, agents, and agentic commerce moved from pilot to production. Here’s the field guide for Q1 2026.


The Year in Short

2025 wasn’t a year of experiments, it was a year of deployment. What started as cautious pilots in Q1 became production infrastructure by Q4. The marketing technology stack we planned 12 months ago is already obsolete.

Three shifts define the year:

  1. Reasoning models work - not perfectly, but reliably enough for production marketing automation
  2. Coding agents replaced junior engineers - Claude Code hit $1B ARR in 10 months
  3. Agentic commerce processed real revenue - 20% of Cyber Week orders touched AI

This isn’t about what’s possible. It’s about what’s already shipping and what you need to deploy in Q1 2026 before your competitors do.

Below we break down the year month-by-month, map the infrastructure changes to each channel (Search, Programmatic, Social, E-commerce), and provide tactical guidance for 2026 planning.


Month-by-Month: What Actually Shipped

January - DeepSeek R1 Proves Open Weights Can Compete

The Release: DeepSeek’s R1 reasoning model, trained for ~$5.5M, matched GPT-4-class performance while remaining fully open-source under MIT license.

Market Impact:

  • NVIDIA lost $593B in market cap in 48 hours as investors realized AI wasn’t necessarily an American monopoly
  • Chinese open-weight models (DeepSeek, Qwen, Kimi) began dominating leaderboards
  • Cost-per-reasoning-task dropped 10× for teams willing to self-host

What Changed for Marketing Ops:

ChannelBeforeAfter R1
SearchBlack-box API-only bid optimizationSelf-hosted reasoning agents with full auditability
ProgrammaticVendor lock-in for ML modelsMix-and-match open-weight optimizers
SocialCreative QA via expensive API callsLocal fatigue detection + brand safety checks
E-commerceCloud-only price optimizationOn-premise models with CRM data privacy

Tactical Move for 2026: Evaluate whether your most sensitive workflows (pricing, customer segmentation, competitive intelligence) should migrate to self-hosted open-weight models. Cost savings of 60-80% vs API-based reasoning are common.


February - Claude Code Launches, CLI Becomes Production Infrastructure

The Release: Anthropic buried the launch in a Claude 3.7 Sonnet announcement, but Claude Code quietly became one of 2025’s most transformative tools.

By The Numbers:

  • $1B ARR disclosed in December (62% from “asynchronous coding agents”)
  • 110 tools built by Simon Willison alone using vibe-coding patterns
  • 30-50% faster campaign implementation cycles vs manual coding

Channel-Specific Impact:

Use CaseOld ProcessClaude Code Reality
Campaign QAManual spreadsheet checks (3-4 hours)Agent script validates feeds in 8 minutes
Feed EnrichmentEngineering ticket backlog (3-5 days)Self-service schema generation same-day
Bid AutomationQuarterly ML model updatesNightly fine-tuning via agent loops
Creative VariantsDesign team bottleneck (1-2 weeks)Generative pipelines deployed in hours

The Release: Google’s AI Mode with Gemini 2.5 Pro and Deep Search began rolling out, fundamentally restructuring the SERP.

The Pew Data: Users are 24 percentage points less likely to click when AI summaries appear.

What This Means:

The classic “rank #1 for keywords” playbook died in Q1 2025. Ranking still matters, but citation within AI answers matters more.

Old SEOAI Mode SEO
Optimize meta descriptions for clicksOptimize structured data for citations
Build backlinks for PageRankBuild E-E-A-T signals for trust
Target keywords for rankTarget answer-ability for inclusion
Measure: Impressions, CTRMeasure: AI Visibility Share, Citation Rate

Channel Economics Shifted:

  • Search spend concentration increased as fewer clicks meant scarcer inventory
  • CPC inflation of 8-12% on AI Mode-enabled queries
  • Attribution blindness grew as traffic appeared as “Direct/Unknown”

What Worked:

Schema migration campaigns that added HowTo, FAQ, Product markup saw AI citation rates jump from 11% to 45-50% within 30 days.


April - Llama 4 Stumbles, Chinese Models Fill the Gap

The Release: Meta’s Llama 4 Scout (109B) and Maverick (400B) launched to underwhelming reception-too large for laptops, not differentiated enough vs GPT-4.

What Happened Instead:

Chinese open-weight models dominated the practical middle ground:

  • Qwen 3 (14B-72B range): Apache 2.0 licensed, laptop-friendly
  • Kimi K2 (32B): Best-in-class tool calling
  • GLM-4.7: Topped open-weight intelligence benchmarks

Client Adoption Pattern:

Teams that wanted to use Llama for compliance/privacy reasons ended up deploying Qwen 3 14B instead:

  • Runs on M3 Max MacBook Pro (64GB)
  • $0.60/M tokens vs $15+ for Claude API
  • Comparable MMLU scores to GPT-3.5

May - The Agentic Coding Wars Begin

Claude 4 Ships: Anthropic Goes All-In on Developers

The Release: On May 22, 2025, Anthropic released Claude Sonnet 4 and Claude Opus 4, positioning them as “the world’s best coding models.”

What Changed:

  • 72.5% SWE-bench score for Opus 4 (vs ~60% for GPT-4)
  • Extended thinking mode - toggle between fast responses and deep reasoning
  • 7-hour autonomous coding runs (vs 45 minutes for Opus 3.5)
  • New developer tools: Code execution, Model Context Protocol (MCP), Files API

Google I/O 2025 - The “AI Everything” Strategy

The Event: May 20-21, 2025 - Google’s annual developer conference

100+ Announcements, But These 5 Rewired Marketing Operations:

1. AI Mode Expansion Announced

  • Full US rollout confirmed for June 5
  • Custom Gemini 2.5 powering it
  • Coming features: Gmail context, shopping capabilities, Deep Search

2. Gemini 2.5 Flash Goes GA

  • Out of preview, production-ready
  • New default model for free tier
  • #2 on LMArena leaderboard (only behind Gemini 2.5 Pro)

3. New Subscription Tiers Announced

  • Google AI Pro: $19.99/month (renamed from AI Premium)
  • Google AI Ultra: $249.99/month (early access, maximum limits)
  • Free tier for college students in US, Brazil, Indonesia, Japan, UK

4. Imagen 4 & Veo 3 Launch

  • Imagen 4: 10× faster than Imagen 3, 2K resolution, better text rendering
  • Veo 3: Native audio generation with video, state-of-the-art quality

5. Jules - Autonomous Coding Agent in Public Beta

  • Integrates with existing repositories
  • Works asynchronously in background
  • Powered by Gemini 2.5

What Marketing Teams Actually Used:

Within 60 days of I/O:

  • Imagen 4 for display ad creative generation (50-100 variants/day became standard)
  • Veo 3 for social video content (TikTok/Reels production time dropped 80%)
  • AI Mode optimization became new SEO focus (traditional SEO teams scrambled)

The Agent Mode Tease:

Google previewed “Agent Mode” where Gemini could accomplish tasks autonomously - apartment hunting, message replies, booking tickets. Not shipping yet, but signal was clear: 2026 would be the year AI agents moved from demos to deployment.


June - AI Mode Goes Mainstream, Search CTR Collapses

The Release: On June 5, 2025, Google opened AI Mode to all US users without requiring login. On June 17, Gemini 2.5 Pro and Flash became generally available across all platforms.

Market Impact:

  • 100M+ monthly users within 8 weeks of universal launch
  • 24pp CTR decline for traditional blue links (Pew Research)
  • 2× search session length as users engaged with AI-generated responses
  • Google for Startups Gemini Kit launched: $350K in credits + instant API access

What Changed for Marketing Ops:

ChannelBefore AI ModeAfter AI Mode Universal Launch
SearchOptimize for 10 blue linksOptimize for AI Mode citations + Deep Search
ContentTitle tags + meta descriptionsStructured data + entity optimization
MeasurementClick-through rateVisibility in AI responses + citation rate
Budgets100% paid searchSplit: 70% paid, 30% AI-visibility infrastructure

Three Query Types Hit Hardest:

  1. “Best X for Y” comparisons → AI Mode generated comparison tables, no clicks needed
  2. “How to” tutorials → Step-by-step AI responses captured 30-40% of clicks
  3. Local search → AI Mode + Maps integration reduced organic CTR by 30%

What Actually Worked:

Teams that adapted fastest did three things:

  1. Shifted from keyword optimization to entity optimization

    • Made sure brand/product entities were clearly defined with schema markup
    • Built topic authority, not just page authority
  2. Built “AI Mode-first” content

    • Structured data, clear hierarchies, citation-friendly formats
    • Added HowTo, FAQ, Product schema everywhere
  3. Launched dual-web strategy

    • Created /ai mirrors of key pages optimized specifically for LLM consumption
    • Served different versions to LLM crawlers vs human browsers

June Also Brought:

  • Gemini 2.5 Flash-Lite: Entry-level model for real-time tasks at lower cost (June 17)
  • Gemini 2.5 GA: Full production availability for Pro and Flash models (June 17)

The Inflection Point:

June 2025 crystallized something the industry had been dancing around: Search wasn’t evolving anymore, it was bifurcating.

There would be “legacy search” (blue links, declining traffic, price wars) and “AI search” (citations, entity authority, structured data). The two required completely different optimization strategies, completely different measurement frameworks, and completely different budget allocations.


July - Perplexity Comet Launches, AI Browsing Goes Mainstream

The Release: Perplexity launched Comet, its AI-native browser, on July 9, 2025, initially for Perplexity Max subscribers ($200/month), with broader rollout throughout the summer.

The Context:

June had already seen The Browser Company launch Dia browser in beta (June 11), but July marked the moment when AI-native browsers moved from “startup experiments” to “viable alternatives.” Comet’s launch signaled that major AI companies believed browsers were the next battleground.

The Attribution Crisis Begins:

Traditional click-based attribution started breaking as AI browsers handled navigation differently:

  • Agent user-agents proliferated - traffic logs showed new patterns from AI-mediated browsing
  • Click-stream data became unreliable for attribution modeling
  • Zero-click interactions increased - users got answers without visiting sites
  • Server-side tracking became mandatory for accurate measurement

Perplexity Comet’s Key Features:

  • Unified AI Search: Instant answers, summaries, translations directly from any webpage
  • Comet Assistant: Automates form filling, clicking, typing on your behalf
  • Cross-tab intelligence: AI understands context across all open tabs
  • Built on Chromium: Chrome extensions work seamlessly

Real Marketing Impact by August:

Brands that exposed structured data to Perplexity early saw:

  • 8-11% of revenue from AI browser referrals (completely new channel)
  • Higher intent signals - users arriving via AI browsers converted 1.8× better
  • Attribution gaps - traditional analytics undercounted AI-mediated traffic by 40-60%

What Changed for Marketing Ops:

Traditional AttributionAI Browser Reality
Track clicks and sessionsTrack agent requests and API calls
Cookie-based measurementServer-side event tracking
Last-click attributionMulti-touch with AI intermediation
Google Analytics as truthGA + Conversion APIs + custom logging

The Coming Wave:

July was the warning shot. By October, when OpenAI launched ChatGPT Atlas (October 21), the AI browser wars would be in full swing. But the smart teams used July-September to:

  • Build attribution infrastructure for AI-mediated traffic
  • Expose product APIs for AI agent discovery
  • Test creative formats for conversational interfaces

August - Realtime Voice API Drops Latency to 108ms

The Release: OpenAI’s Realtime Voice API achieved 108ms end-to-end latency-20× faster than previous voice solutions.

Why Latency Matters:

At 2-3 seconds, voice AI feels like technology.
At 108ms, it feels like conversation.

Channel-Specific Applications:

ChannelVoice Use CaseEarly Results
SearchVoice-first ad experiences in Mobile Safari2.3× CTR vs text
ProgrammaticAudio companion ads that respond to questions40% lower skip rate
SocialBrand voice companions (24/7 availability)5.6× repeat engagement
E-commerceVoice-guided product selection2.1× basket size

September - Claude Sonnet 4.5 Becomes The Ops Backbone

The Release: Anthropic’s Claude Sonnet 4.5 with extended context windows and checkpointing capabilities.

Why This Mattered:

Claude didn’t win on creativity (that’s GPT-5’s territory). It won on reliability.

The Ops Use Case:

30-hour context windows meant agents could run unattended through weekends without losing state. For marketing ops teams, this was transformative:

TaskManual ProcessClaude Agent
Daily budget pacing30 min manual check3 min automated validation
Feed error detectionReactive (found after launch)Proactive (caught in QA)
Campaign QA2-3 hour checklist8 minute comprehensive scan
Bid logic debugging4-8 hours w/ engineer45 minutes self-service

The GPT-5 + Claude Pattern:

Smart teams split workloads:

  • GPT-5: Ideation, creative, multimodal storytelling
  • Claude Sonnet 4.5: Implementation, ops, long-running optimization

October - Sora 2 Goes Platform, Remix Culture Arrives

The Release: OpenAI’s Sora 2 launched not just as a video generator, but as a social platform with remix capabilities.

The Shift:

Video content stopped being “produced” and started being “forked.”

Key Metrics:

  • 100M weekly active remixers by December
  • 4.7× scene-dwell time vs TikTok baseline
  • $0.87 CPM for AI-generated story fragments vs $4.20 human-shot equivalents

What “Remix” Means for Marketing:

Campaigns evolved from static assets to open-ended narratives that communities expanded.

New Metrics:

  • Remix Velocity: Mutations per hour in your campaign feed
  • Narrative Coherence: Do remixes stay on-brand?
  • Participation Rate: % of viewers who remix vs passively watch

November - Agentic Commerce Protocol Ships

The Release: OpenAI’s Agentic Commerce Protocol (co-developed with Stripe) enabled in-chat purchases within ChatGPT.

The Numbers:

  • 25% of U.S. consumers completed AI-assisted purchases during holiday season (Salesforce)
  • 20% of Cyber Week orders were influenced by AI agents
  • 2.8× conversion rate for in-chat checkout vs mobile web funnel

What Changed:

The “research → visit site → buy” funnel collapsed into a single conversational flow.

Channel Impact:

Traditional E-commerceAgentic Commerce
Optimize product pages for SEOExpose APIs for agent discovery
Drive traffic to owned propertiesEnable checkout in agent environments
Measure: Traffic, conversion rateMeasure: API call quality, agent satisfaction

December - Claude Code Hits $1B ARR, Infrastructure Becomes Line Item

The Milestone: Anthropic disclosed that Claude Code reached $1 billion in annualized recurring revenue, with 62% attributed to “asynchronous campaign agents.”

What This Proves:

AI coding agents aren’t experimental they’re mission-critical infrastructure that companies pay serious money for.

New Line Items in 2026 Budgets:

Category% of Total Marketing TechWhat It Covers
Agent Infrastructure15-20%API costs, MetaMCP, edge compute
Voice Channel5-10%Realtime API, conversation design
Agentic Commerce10-15%Platform integrations, checkout flows
Sora Creative8-12%Generation compute, remix moderation

What Actually Happened: The Practical Takeaways

Stripping away the hype and looking at what actually shipped and scaled in 2025:

1. Reasoning models work now. They’re not perfect, but they’re reliable enough for production marketing automation.

2. Coding agents are real. Marketing ops teams can build custom tools in hours, not quarters.

3. Agentic commerce is processing real revenue. 20% of Cyber Week orders touched AI-that’s not a pilot, that’s a channel.

4. Chinese models compete at the frontier. This means lower costs and more options for self-hosted deployments.

5. Image generation reached production quality. Dynamic creative optimization finally has the tools it always needed.

6. The $200/month tier signals where usage is heading. Plan for 10x growth in token consumption.

7. Privacy and security remain unsolved. Prompt injection, data exfiltration, and model gullibility are still fundamental problems.

The 2026 Forecast: Three Predictions

Based on 2025’s trajectory, here’s what marketing teams should prepare for:

1. Answer Engine Optimization replaces SEO as the primary discipline - When 50%+ of searches never leave AI interfaces, traditional SEO metrics become lagging indicators at best.

2. Agentic checkout becomes table stakes - If your e-commerce platform doesn’t support agent-driven purchases, you’re invisible to a growing segment of high-value consumers.

3. The first major AI-driven security incident hits marketing - Whether it’s prompt injection exfiltrating customer data or an agent misconfiguring an ad campaign.

Conclusion: Infrastructure, Not Features

The fundamental lesson of 2025 is that AI stopped being something you “add” to marketing and became the substrate marketing runs on. Reasoning models don’t enhance workflows, they replace them. Coding agents don’t assist developers, they build the tools developers would have spent months creating.

For marketing practitioners, this means upskilling fast or becoming irrelevant. For ad tech engineers, it means embracing autonomous systems while building in the guardrails to prevent catastrophic failures.

The teams that thrive in 2026 won’t be the ones with the best AI features. They’ll be the ones who rebuilt their infrastructure around AI-native workflows-and who survived long enough to learn which guardrails actually matter.


Channel-by-Channel Tactical Guide

ChannelPriority Moves for Q1 2026Expected Impact
SearchMigrate top-50 pages to agent-ready schema; pilot AI Mode ad placements+25-40% AI visibility share
ProgrammaticDeploy reasoning-powered bid agents; test Sora-native creative units15-20% ROAS lift, -30% production cost
SocialLaunch voice-enabled brand companions; build remix-optimized Sora content3-5× engagement depth vs static
E-commerceExpose APIs to agentic commerce protocols; implement voice checkout flows10-15% GMV from agent-originated purchases

Further Reading from Performics Labs 2025 Coverage

Q3-Q4 Core Infrastructure

Search & Discovery Evolution

Agentic Systems

Creative & Content

Voice & Multimodal

Browser & Attribution

Open-Source & Infrastructure

Next-Gen Paradigms


2026 Budget Planning Template

Based on actual 2025 deployments, here’s the reallocation framework that worked:

Traditional Line Item2025 BaselineRecommended 2026Rationale
Static Creative Production100%30-40%Sora/Nano Banana handles 60-70% of variants
Junior Dev Headcount100%40-50%Claude Code agents handle ops tasks
SERP Keyword Expansion100%60-70%Zero-click reduces inventory; reallocate to schema
Display/Video CPM100%70-80%Shift to conversational slots + voice
New: Agent Infrastructure0%15-20% of totalMetaMCP, API costs, edge inference
New: Voice Channel0%5-10% of totalRealtime API, conversation design
New: Agentic Commerce0%10-15% of totalAPI integrations, checkout flows

What to Watch in Q1 2026

Regulatory Signals:

  • EU AI Act enforcement on agentic systems (March deadline)
  • FTC guidance on AI-powered persuasion (expected Feb)
  • State privacy laws covering voice data (CA, NY, TX in flight)

Technical Milestones:

  • GPT-5.1 with native tool execution (rumored Jan)
  • Claude Opus 5 reasoning + video (Q1 target)
  • Gemini 4 with multi-agent orchestration (Google I/O 2026)

Market Shifts:

  • First major AI-attribution lawsuit (predicted Q1-Q2)
  • Agentic commerce surpasses 30% of e-commerce GMV (optimistic case)
  • Voice channel recognized in MTA models (lagging indicator)