Source: OpenAI engineers earlier this month told some colleagues they had figured out a way to more than halve the cost of inference
THE SO WHAT
If OpenAI can more than halve inference costs internally, the medium-term price floor for API access is lower than most current business cases assume—even if the savings aren’t passed through immediately. Treat this as a prompt to stress-test your unit economics against a world where high-quality inference is dramatically cheaper and competitors can afford to be more aggressive on AI usage.
READ THE SOURCE
MORE FROM THE WIRE
Applied AIGoogle’s NotebookLM can sum up your research in a TikTok-style clip
Research workflows are being rebuilt around short-form, auto-produced explainers — 60‑second vertical clips as a first pass over dense source material. Teams that rely on internal docs and knowledge bases should assume video-native consumption and design their knowledge architecture for remixing, not just reading.
Applied AIGrindr Is the Latest Dating App to Hook Up With AI
Dating apps adopting AI move recommendation and conversation from UX tweaks to core product logic — whoever controls the matching and messaging models controls the business. If you run a marketplace or social graph, assume users will expect AI-native onboarding, safety, and communication features within the same app, not bolted on via third parties.
Applied AINewsom strikes Anthropic deal to get California government half price Claude AI access
A 50% discount for statewide Claude access turns AI from pilot spend into line-item infrastructure for California agencies. If you sell into government or large enterprises, expect procurement to start negotiating AI access like cloud contracts — volume deals, standardized models, and pressure on per-seat and per-token pricing.
Applied AIGoogle introduces a faster, cheaper image generator with Nano Banana 2 Lite
Cheaper, faster image generation pushes AI art from creative experiment to default asset pipeline for marketing and product teams. If your business depends on stock imagery, basic design work, or UGC filters, assume margin compression and differentiate on brand, workflow, or rights — not raw generation.