0
Applied AI·June 30, 2026·1 min read

AWS is spending $1bn to put its engineers inside customers’ offices

Share

AWS turning $1B into forward-deployed engineers is a clear bet that AI/cloud deals will be won in the field, not from the console. If you’re an enterprise CIO, assume your cloud and model vendors will push embedded teams—decide now where you want that influence and where you don’t.

Applied AI

Claude Sonnet 5 costs $2 per 1M input tokens and $10 per 1M output tokens through August 31, after which prices rise to $3 and $15, respectively

Anthropic is using a time-boxed discount window on Sonnet 5 to pull real agent workloads onto its stack before Labor Day—your cost models for pilots vs. production need to reflect the post-August 31 pricing, not the teaser rate. If you’re locking in contracts or budgeting for high-output agents, model against $3 / $15 now and treat the current pricing as a temporary subsidy.

Applied AI

Qualcomm targets Nvidia, AMD, Huawei with Dragonfly AI accelerator rack loaded with 43TB of LPDDR5x, future generations set to smash 7PB/s bandwidth

Qualcomm is betting that LPDDR5x plus its proprietary HBC can win inference on cost and power, not just raw FLOPs—43 TB per rack and future 7 PB/s bandwidth is a direct play at memory-bound workloads. If you’re building large-scale inference services, start tracking HBM-free architectures as a hedge against both cost and supply volatility in HBM-heavy stacks.

Applied AI

Anthropic launches Claude Sonnet 5, saying it nears Opus 4.8 performance at lower prices and is substantially better than Sonnet 4.6 for agentic work

Near-Opus 4.8 performance at Sonnet pricing, tuned for planning and tool use, compresses the premium you pay for top-tier models in agent workflows. If you’ve been reserving your heaviest agents for only the most valuable tasks, this is the moment to re-benchmark and see which can drop to a cheaper-but-capable tier without losing reliability.