May 13 – May 20, 2026

This week in AI infra

Every deprecation, API change, pricing shift, and model update that could affect builders shipping on AI APIs -- risk-classified and ready to share.

High-impact changes

18 changes across 5 providers.

0 critical17 high1 medium0 low
mediumMicrosoftLatency Update5/14/2026

Advancing enterprise AI: New SAP on Azure announcements from SAP Sapphire 2026

Microsoft is outlining infrastructure and inference changes that can affect serving cost, latency, and deployment architecture for builders.

Action: Re-run latency/cost checks and adjust timeout budgets.

highOpenAIModel Release5/15/2026

Databricks brings GPT-5.5 to enterprise agent workflows

Databricks uses GPT-5.5 for enterprise agent workflows after the model set a new state of the art on the OfficeQA Pro benchmark.

Action: Benchmark candidate model behavior before adopting in production.

highGoogleModel Release5/20/2026

At #GoogleIO, we shared how we’re making AI more helpful for everyone — including releasing two new models, Gemini Omni and Gemini 3.5, and

At #GoogleIO, we shared how we’re making AI more helpful for everyone — including releasing two new models, Gemini Omni and Gemini 3.5, and unlocking agents and agentic experiences across our products. Catch up on all the announcements in under 15 minutes in our recap video ↓ https://t.co/AxcdT0K4xY

highGoogleModel Release5/20/2026

Our Google AI subscriptions offer higher limits in the Gemini app, extra storage, and premium benefits across Google. Yesterday at #GoogleIO

Our Google AI subscriptions offer higher limits in the Gemini app, extra storage, and premium benefits across Google. Yesterday at #GoogleIO, we announced even more benefits at no extra cost. See what’s new — and find which one is right for you 🧵⬇️

highGoogleModel Release5/20/2026

Gemini 3.5 Flash has landed. https://t.co/FiiBrK1De1

Gemini 3.5 Flash has landed. https://t.co/FiiBrK1De1

highGoogleModel Release5/20/2026

Gemini Omni is our new AI model that can create anything from any input, starting with video. 🪄 Hear from @DemisHassabis on how you can mix

Gemini Omni is our new AI model that can create anything from any input, starting with video. 🪄 Hear from @DemisHassabis on how you can mix text, audio, and images to generate and edit high-quality videos just by having a conversation. #GoogleIO https://t.co/RVDUgzfjXA

highCohereModel Release5/20/2026

Introducing: Cohere Command A+ We’ve created our most powerful LLM yet, optimized it to run on as little hardware as possible, and released

Introducing: Cohere Command A+ We’ve created our most powerful LLM yet, optimized it to run on as little hardware as possible, and released it open-source for all. https://t.co/C1KYnvA8JB

highGooglePricing Change5/20/2026

We’re launching a new $100/month AI Ultra plan, specifically tailored for developers, technical leads, knowledge workers and advanced creato

We’re launching a new $100/month AI Ultra plan, specifically tailored for developers, technical leads, knowledge workers and advanced creators. We’re also reducing the monthly price of our top-tier AI Ultra plan from $250 to $200. Both tiers grant access to some of our most

highbytedance-researchModel Release5/20/2026

Lance momentum +30%

bytedance-research model showing momentum in AI Model.

Action: Run model migration checks for quality, latency, and cost.

highSulphurAIModel Release5/20/2026

Sulphur-2-base momentum +1%

SulphurAI model showing momentum in AI Model.

Action: Run model migration checks for quality, latency, and cost.

highopenbmbModel Release5/20/2026

MiniCPM-V-4.6 momentum +5%

openbmb model showing momentum in AI Model.

Action: Run model migration checks for quality, latency, and cost.

highSupertoneModel Release5/20/2026

supertonic-3 momentum +15%

Supertone model showing momentum in TTS.

Action: Run model migration checks for quality, latency, and cost.

highunslothModel Release5/20/2026

Qwen3.6-27B-MTP-GGUF momentum +1%

unsloth model showing momentum in AI Model.

Action: Run model migration checks for quality, latency, and cost.

highcirclestone-labsModel Release5/20/2026

Anima momentum +3%

circlestone-labs model showing momentum in AI Model.

Action: Run model migration checks for quality, latency, and cost.

highunslothModel Release5/20/2026

Qwen3.6-35B-A3B-MTP-GGUF momentum +1%

unsloth model showing momentum in AI Model.

Action: Run model migration checks for quality, latency, and cost.

highResembleAIModel Release5/20/2026

Dramabox momentum +30%

ResembleAI model showing momentum in TTS.

Action: Run model migration checks for quality, latency, and cost.

highsapientincModel Release5/20/2026

HRM-Text-1B momentum +7%

sapientinc model showing momentum in LLM.

Action: Run model migration checks for quality, latency, and cost.

highfroggericModel Release5/20/2026

Qwen-Fixed-Chat-Templates momentum +30%

froggeric model showing momentum in AI Model.

Action: Run model migration checks for quality, latency, and cost.