Flat illustration of a cozy workshop with a cork bulletin board covered in model cards and sticky notes, warm desk lamp on dark navy background

AI weekly: Claude Opus 4.7 lands, Cloudflare floods the agent stack, Qwen3.6 runs on 23GB

X is the best way I’ve found to keep up with AI. I like tweets throughout the week, filtering for things I think are actually worth knowing. I use Claude Code to pull those likes automatically and help me turn them into this post (here’s how the pipeline works). This week: 150 tweets liked, filtered down to what’s below. Check out the previous roundup (Apr 5) if you missed it.

AI for Everyone

Claude Opus 4.7 Ships (12+ mentions)

Anthropic launched Claude Opus 4.7 on April 16. Pricing held steady at $5 per million input tokens and $25 per million output. What changed is how the model spends those tokens: it thinks longer before answering and double-checks itself more aggressively. Vision is the part I didn’t see coming. Opus 4.7 now accepts images up to 2,576 pixels on the long edge, triple what prior Claude models took, which actually matters for diagrams, slides, and UI screenshots. If you’ve been running 4.6 for long tasks, swap the model ID and give it something harder. (source: @claudeai, @bcherny) ...
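The pricing above makes per-request cost easy to estimate. A minimal sketch, using the $5/$25 per-million rates from the announcement; the token counts in the example are made up for illustration:

```python
def request_cost(input_tokens, output_tokens,
                 input_rate=5.00, output_rate=25.00):
    """Dollar cost of one request at per-million-token rates
    (defaults are Opus 4.7's listed $5 input / $25 output)."""
    return (input_tokens / 1e6) * input_rate + (output_tokens / 1e6) * output_rate

# A long agentic turn, hypothetical sizes: 200K tokens in, 8K out.
print(f"${request_cost(200_000, 8_000):.2f}")  # → $1.20
```

Note the asymmetry: a model that "thinks longer before answering" spends more of the expensive output tokens, so longer reasoning raises cost even when the price sheet doesn't change.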

April 16, 2026 · 11-minute read · matt silverman
Flat illustration of a workshop table with circuit boards, model cards, and a quantum chip diagram under warm Edison bulb lighting on dark navy background

AI weekly: Gemma 4 runs free on a Mac Mini, axios gets backdoored, Google moves quantum threat to 2029

X is the best way I’ve found to keep up with AI. I like tweets throughout the week, filtering for things I think are actually worth knowing. I use Claude Code to pull those likes automatically and help me turn them into this post (here’s how the pipeline works). This week: 149 tweets liked, filtered down to what’s below. Check out the previous roundup (Mar 24) if you missed it.

AI for Everyone

Gemma 4 Runs Locally on a Mac Mini (10+ mentions)

Google released Gemma 4 under Apache 2.0. The 26B version uses mixture-of-experts, only activating 4 billion parameters at a time, so it runs on a 24GB Mac Mini. It scores 79.2% on GPQA Diamond, which puts it in the same neighborhood as Claude Sonnet 4.6’s 74.1% standard score (though Sonnet hits 89.9% with max-effort reasoning). Four sizes from edge mobile to 31B dense, 256K context, multimodal input, 140+ languages. ollama run gemma4:26b and you’re running a competitive model locally for free. I’ve been waiting for an open model this good. (source: @GoogleDeepMind, @PawelHuryn, @ollama) ...
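Why does a 26B model fit on a 24GB machine? Mixture-of-experts lowers compute per token (only ~4B parameters are active), but all 26B weights still have to sit in memory; what makes it fit is quantization. A back-of-envelope sketch, assuming the 4-bit quantization Ollama typically ships by default (that quant level is my assumption, not stated in the post):

```python
def est_weight_memory_gb(params_billions, bits_per_weight=4):
    """Rough weight-only footprint: params × (bits/8) bytes.
    Ignores KV cache, activations, and runtime overhead."""
    bytes_total = params_billions * 1e9 * bits_per_weight / 8
    return bytes_total / 1e9  # decimal GB

# Gemma 4 26B at 4-bit: ~13 GB of weights, leaving headroom on 24 GB.
print(round(est_weight_memory_gb(26), 1))  # → 13.0
```

The same model at 16-bit would need ~52 GB of weights alone, which is why quantized local runs are the only practical option on consumer hardware.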

April 5, 2026 · 7-minute read · matt silverman
Isometric illustration of a cork bulletin board with pinned news clippings, calendar, and sticky notes, warm lamp on dark navy background

AI weekly: Claude gets Computer Use on Mac, /schedule runs jobs while you sleep, LiteLLM supply chain attack

X is the best way I’ve found to keep up with AI. I like tweets throughout the week, filtering for things I think are actually worth knowing. I use Claude Code to pull those likes automatically and help me turn them into this post (here’s how the pipeline works). This week: 104 tweets liked, filtered down to what’s below. Check out the previous roundup (Mar 16) if you missed it.

AI for Everyone

Claude Computer Use on Mac (5 mentions)

Claude can now control your Mac: mouse, keyboard, screen, all of it. Kick off a task from your phone via Dispatch, walk away, come back to finished work. It’s slower than a human on purpose. Best for repetitive stuff that doesn’t have an API: copying data between apps, filling out web forms, clicking through legacy software. It tries connected apps (Slack, Calendar) first and only grabs screen control when needed. macOS now, Windows in a few weeks. (source: @claudeai, @felixrieseberg) ...

March 24, 2026 · 7-minute read · matt silverman
Isometric illustration of a cozy desk with a monitor showing a 1M token progress bar, Chrome logo floating nearby, robot dog figurine on shelf, warm desk lamp on dark navy background

AI weekly: Claude 1M context goes GA, Chrome 146 unlocks browser agents, Manus Desktop launches

X is the best way I’ve found to keep up with AI. I like tweets throughout the week, filtering for things I think are actually worth knowing. I use Claude Code to pull those likes automatically and help me turn them into this post (here’s how the pipeline works). This week: ~110 tweets liked, filtered down to what’s below. Check out the previous roundup (Mar 8) if you missed it.

AI for Everyone

Claude Opus 4.6 Gets 1M Context Window at No Extra Cost

Anthropic dropped the long-context surcharge. 1M tokens is now standard for Opus 4.6 and Sonnet 4.6, same price as any other request. Feed in an entire codebase, a stack of PDFs, hundreds of images. For Claude Code users on Max, Team, or Enterprise, it’s already the default. Benchmark that matters: 78.3% on MRCR v2 at 1M tokens, best-in-class. (source: @claudeai, @bcherny, @ClaudeCodeLog) ...
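How big is "an entire codebase" in 1M tokens? A quick way to sanity-check before pasting is the common heuristic of roughly 4 characters per token for English text and code (the heuristic and the 3 MB example size are assumptions for illustration, not exact tokenizer output):

```python
def fits_in_context(total_chars, context_tokens=1_000_000, chars_per_token=4):
    """Rough fit check using the ~4-chars-per-token heuristic.
    Returns (fits, estimated_tokens); real tokenizers will vary."""
    est_tokens = total_chars / chars_per_token
    return est_tokens <= context_tokens, int(est_tokens)

# A hypothetical 3 MB repo of source text: ~750K tokens, fits with room to spare.
ok, tokens = fits_in_context(3_000_000)
print(ok, tokens)  # → True 750000
```

By this estimate a 1M-token window holds on the order of 4 MB of raw text, which is why whole mid-sized repos and stacks of PDFs become single-request inputs.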

March 16, 2026 · 9-minute read · matt silverman
Flat illustration of colorful speech bubbles pouring into a funnel with a clean document page emerging at the bottom, warm amber and teal on charcoal background

How I summarize 300 AI tweets per week

I publish ai-weekly every week or two. Each edition starts with about 300 liked tweets and filters down to the 8-12 stories worth reading. People keep asking how, so here’s the whole thing. ...

March 11, 2026 · 4-minute read · matt silverman
Isometric illustration of a cozy desk with a monitor showing AI autonomously moving the cursor, golden sparkle trail from mouse to screen, warm desk lamp lighting on dark navy background

AI weekly: GPT-5.4 launches with computer use, Claude cheats a benchmark, Karpathy ships autoresearch

X is the best way I’ve found to keep up with AI. I like tweets throughout the week, filtering for things I think are actually worth knowing. I use Claude Code to pull those likes automatically and help me turn them into this post. This week: 298 tweets liked, filtered down to what’s below. Check out the previous roundup (Feb 26) if you missed it.

AI for Everyone

GPT-5.4 Is Here, and It Can Control Your Computer

OpenAI launched GPT-5.4 this week. Codex-level coding merged with GPT-5.2-level reasoning, native computer use, and a 1M-token context window, all in one model. You can steer it mid-response. It’s live in ChatGPT (as GPT-5.4 Thinking and GPT-5.4 Pro), in the API, and rolling into Microsoft Copilot Studio. If you want to try it for coding, download the Codex app. You used to need a dedicated Codex model, but 5.4 handles it all. You still need to select a reasoning level. ...

March 8, 2026 · 8-minute read · matt silverman
Isometric illustration of a cozy desk with monitor showing a Vite build completing, desk lamp, coffee mug, and React-to-Vite transformation icons

AI weekly: Nano Banana 2, Cloudflare rebuilds Next.js, Gemini takes over Android

Here’s what caught my attention from Feb 16-26, 2026. Check out the previous roundup (Feb 16) if you missed it.

AI for Everyone

Google Launches Nano Banana 2: Pro-Quality Image Generation at Flash Speed

Google just dropped Nano Banana 2, their new image generation model built on Gemini Flash. The headline numbers: pro-level quality running at Flash speed, which in practice means you get high-fidelity images without waiting around. It handles text rendering (historically terrible in AI image gen), 4K upscaling, aspect ratio control, and subject consistency across multiple generations. Demis Hassabis says it taps into Gemini’s world understanding and real-time search to produce more accurate results, which tracks with what we’re seeing in the outputs. The rollout is wide: Gemini App, AI Studio, the Gemini API (listed as “Gemini 3.1 Flash Image”), Google Search, Flow, Google Ads, and Vertex AI. OpenRouter already has it. Logan Kilpatrick also announced new lower-cost resolutions and an Image Search tool. If you’ve been using DALL-E or Midjourney by default, this is worth trying today. (source: @GoogleDeepMind, @demishassabis, @OfficialLoganK) ...

February 26, 2026 · 9-minute read · matt silverman
Laptop on a desk showing a GitHub repository with rising star count, phone with AI chat, and a miniature claw machine toy, warm desk lamp lighting

AI weekly: OpenAI hires OpenClaw founder, Qwen 3.5 and MiniMax M2.5 launch, WebMCP, and I built an app at the Claude Code hackathon

Personal News

I got accepted into the Claude Code hackathon last week. 13,000 people applied, 500 got in. The hackathon just wrapped up today and I built Robo.app. Check it out, and watch the 3-minute pitch video!

Here’s what caught my attention from Feb 10-16, 2026:

🚀 OpenAI Hires OpenClaw Founder for “Personal Agents”

Peter Steinberger is joining OpenAI to build next-gen personal agents; Sam Altman personally welcomed him. OpenClaw hit 100K GitHub stars in 2 days (the fastest any repo has ever done that) and now sits at 197K+. Both Meta and OpenAI made offers. The fact that they’re building a dedicated personal agents team, not folding him into ChatGPT, signals a new product line. OpenClaw will transition to an independent open-source foundation that OpenAI will continue to support. (@sama)

🏋️ Three Major Model Launches in One Week

Qwen 3.5-397B released: 397B total params, 17B active via hybrid linear attention + MoE. 87.8 MMLU-Pro, 83.6 LiveCodeBench v6, 76.4 SWE-bench Verified. Decoding 8.6x-19x faster than Qwen3-Max. Apache 2.0, 201 languages. Already on OpenRouter. (@Ali_TongyiLab)

MiniMax M2.5: Sonnet-level coding at 1/20th Opus cost. 230B MoE, 10B active. 80.2% SWE-Bench Verified, $0.30/M input tokens. OpenHands confirmed it’s the first open model to beat Sonnet on their benchmark. Stock jumped 15.7%. If you’re running coding agents, test this as your workhorse model. (VentureBeat)

ByteDance Seed 2.0 / Doubao 2.0: claims GPT-5.2 and Gemini 3 Pro level at ~1/10th cost. The Doubao app has 155M weekly active users. China’s frontier labs aren’t 6 months behind anymore. (@QuanquanGu)

🌐 WebMCP: Every Website Becomes an API for Agents

Google and Microsoft co-authored a W3C spec called WebMCP: websites can now declare capabilities as structured tools via navigator.modelContext. Two modes: declarative (HTML forms self-describe) and imperative (JS exposes custom tools). 89% token efficiency improvement over screenshot-based agents. Available in Chrome 146 Canary behind a flag. (@aakashgupta)

Pietro Schirano built a DoorDash agent demo that adds items to the cart and checks out without ever parsing UI. This is what “agents that talk to apps” looks like. (@skirano)

Links: Chrome blog | W3C spec draft ...

February 16, 2026 · 4-minute read · matt silverman
A wallet and phone left unattended in a shopping cart in a sunlit parking lot

How to actually protect yourself from fraud in 2026

I am increasingly hearing firsthand stories of fraud hitting people of all ages, tech-savvy or not. The FBI’s IC3 2024 report logged 859K complaints with $16B in losses, up 33% from 2023. Here is what is actually happening and what to do about it.

Threat vectors (ranked by prevalence)

Phishing and smishing

Still the number one attack vector by volume. Fake texts from a bank, Robinhood, Coinbase, or “USPS” with seemingly urgent links. The Verizon DBIR consistently finds phishing in over 30% of breaches. Do not click links or call numbers from unsolicited messages; go directly to the website or app instead. Hardware security keys are the strongest defense, since phishing sites cannot intercept them. ...

February 14, 2026 · 5-minute read · matt silverman
Illustration of an Amazon box with receipts and shopping bags floating toward a laptop showing a spreadsheet

Own Your Data: Amazon Purchase History

As a heavy daily AI user, I’ve learned one thing the hard way: AI hits a ceiling fast when it doesn’t have the right context. This is especially true as we move into autonomous AI agents that can solve problems before you even ask. I’m working on this through an autonomous OpenClaw agent that runs on a server in my basement (more on that soon), but even for AI beginners, I wanted to illustrate the point with a practical example. ...