This week in AI infra
Every deprecation, API change, pricing shift, and model update that could affect builders shipping on AI APIs -- risk-classified and ready to share.
High-impact changes
18 changes across 5 providers.
Advancing enterprise AI: New SAP on Azure announcements from SAP Sapphire 2026
Microsoft is outlining infrastructure and inference changes that can affect serving cost, latency, and deployment architecture for builders.
Action: Re-run latency/cost checks and adjust timeout budgets.
Databricks brings GPT-5.5 to enterprise agent workflows
Databricks uses GPT-5.5 for enterprise agent workflows after the model set a new state of the art on the OfficeQA Pro benchmark.
Action: Benchmark candidate model behavior before adopting in production.
At #GoogleIO, we shared how we’re making AI more helpful for everyone — including releasing two new models, Gemini Omni and Gemini 3.5, and
At #GoogleIO, we shared how we’re making AI more helpful for everyone — including releasing two new models, Gemini Omni and Gemini 3.5, and unlocking agents and agentic experiences across our products. Catch up on all the announcements in under 15 minutes in our recap video ↓ https://t.co/AxcdT0K4xY
Our Google AI subscriptions offer higher limits in the Gemini app, extra storage, and premium benefits across Google. Yesterday at #GoogleIO
Our Google AI subscriptions offer higher limits in the Gemini app, extra storage, and premium benefits across Google. Yesterday at #GoogleIO, we announced even more benefits at no extra cost. See what’s new — and find which one is right for you 🧵⬇️
Gemini 3.5 Flash has landed. https://t.co/FiiBrK1De1
Gemini 3.5 Flash has landed. https://t.co/FiiBrK1De1
Gemini Omni is our new AI model that can create anything from any input, starting with video. 🪄 Hear from @DemisHassabis on how you can mix
Gemini Omni is our new AI model that can create anything from any input, starting with video. 🪄 Hear from @DemisHassabis on how you can mix text, audio, and images to generate and edit high-quality videos just by having a conversation. #GoogleIO https://t.co/RVDUgzfjXA
Introducing: Cohere Command A+ We’ve created our most powerful LLM yet, optimized it to run on as little hardware as possible, and released
Introducing: Cohere Command A+ We’ve created our most powerful LLM yet, optimized it to run on as little hardware as possible, and released it open-source for all. https://t.co/C1KYnvA8JB
We’re launching a new $100/month AI Ultra plan, specifically tailored for developers, technical leads, knowledge workers and advanced creato
We’re launching a new $100/month AI Ultra plan, specifically tailored for developers, technical leads, knowledge workers and advanced creators. We’re also reducing the monthly price of our top-tier AI Ultra plan from $250 to $200. Both tiers grant access to some of our most
Lance momentum +30%
bytedance-research model showing momentum in AI Model.
Action: Run model migration checks for quality, latency, and cost.
Sulphur-2-base momentum +1%
SulphurAI model showing momentum in AI Model.
Action: Run model migration checks for quality, latency, and cost.
MiniCPM-V-4.6 momentum +5%
openbmb model showing momentum in AI Model.
Action: Run model migration checks for quality, latency, and cost.
supertonic-3 momentum +15%
Supertone model showing momentum in TTS.
Action: Run model migration checks for quality, latency, and cost.
Qwen3.6-27B-MTP-GGUF momentum +1%
unsloth model showing momentum in AI Model.
Action: Run model migration checks for quality, latency, and cost.
Anima momentum +3%
circlestone-labs model showing momentum in AI Model.
Action: Run model migration checks for quality, latency, and cost.
Qwen3.6-35B-A3B-MTP-GGUF momentum +1%
unsloth model showing momentum in AI Model.
Action: Run model migration checks for quality, latency, and cost.
Dramabox momentum +30%
ResembleAI model showing momentum in TTS.
Action: Run model migration checks for quality, latency, and cost.
HRM-Text-1B momentum +7%
sapientinc model showing momentum in LLM.
Action: Run model migration checks for quality, latency, and cost.
Qwen-Fixed-Chat-Templates momentum +30%
froggeric model showing momentum in AI Model.
Action: Run model migration checks for quality, latency, and cost.