AI Dev Essentials #29: Claude Memory & Skills, OpenAI Acquisitions, and new Vercel OSS

Hey Everyone 👋,

John Lindquist here with the 29th issue of AI Dev Essentials!

Claude Memory rolled out to Pro users, providing project-scoped context that keeps work streams separate. OpenAI acquired the team behind Apple Shortcuts, which suggests they're thinking seriously about OS-level integration. Vercel shipped significant updates to the AI SDK and Agent workflows. Google AI Studio added annotation mode for visual debugging, and Claude Code launched on the web with sandbox security features. Every AI tool is getting better and better at doing the boring stuff so that we can focus on the fun stuff.

Personally, I've been diving deep into the new Claude's new "Skills" feature which it taking the AI space by storm. Skills enable you to enhance you Agents with new capabilities based on your specific needs. I'll definitely be covering Claude's Skills in my upcoming Claude Code Power User workshop:

🎓 New egghead.io Lessons This Week

I launched "The Essential Guide to Claude Code Skills" which is a free course that covers the basics of Skills and how to use them to enhance your Agents. Lessons 1-3 are live and I'll be adding more lessons ~daily:

Create Your First Claude Code Skill
Control Claude Skills Output with References and Examples
Stacking Claude Skills to Create Complex Workflows

Check out the course here: https://egghead.io/lessons/create-your-first-claude-code-skill~sds93

🚀 Major Announcements

Claude Memory Rolls Out to Pro and Max Users

Anthropic announced on October 23, 2025, that Claude Memory is now available to Pro and Max plan users, bringing project-scoped context management to individual subscribers after initially launching for Team and Enterprise users in September.

Memory capabilities:

Project-Scoped Memory: Each project maintains its own focused context, preventing cross-contamination between different work streams
Immediate Access: Max users get immediate access; Pro users receiving rollout "over the coming days"
User Control: View, edit, and manually update memory summaries at any time
Import/Export: Transfer memories to/from other AI tools for backup or migration
Privacy Options: Opt-in system with incognito chat mode for conversations that don't save to memory
Safety Testing: Extensive safety testing conducted to prevent harmful pattern reinforcement

(Anthropic Official, MacRumors, Axios)

Memories leaking across projects is the reason I never turn memories on. I've had too many experiences where it would suggest TypeScript in a Swift project or even mention my deadlifts when planning a vacation. It just felt creepy and invasive. Scoping memories sounds like it will finally give me a level of control to actually make it useful.

OpenAI Acquires Software Applications Inc and Apple Shortcuts Team

OpenAI announced on October 23, 2025, the acquisition of Software Applications Incorporated, maker of Sky—a natural language interface for Mac—along with the founding team behind Apple's Shortcuts automation framework.

Acquisition details:

Sky Product: Natural language interface for Mac that understands screen context and executes actions
Key Team Members: Ari Weinstein and Conrad Kramer, former Apple employees who created Workflow (which became Apple Shortcuts)
Design Talent: Kim Beverett, 10-year Apple designer who worked on Safari, Messages, FaceTime, WebKit, and user privacy
Valuation: Deal valued in multibillion-dollar range; specific terms not disclosed
Previous Funding: Software Applications had raised $6.5 million; Sam Altman's investment fund held passive stake
Timing: Acquisition announced 2 days after OpenAI launched ChatGPT Atlas (AI-powered browser for Mac)
Strategic Goal: Integrate Sky's deep macOS integration and product craftsmanship into ChatGPT ecosystem

(OpenAI Official, TechCrunch, CNBC, MacRumors)

Automation specialists + OpenAI Agents working on Mac... I've always loved the general idea of the Shortcuts app on both iOS and Mac, but the execution and UI has always felt clunky. I'm curious to see what Automation/Agent design/UI will look like when they get around to releasing the tools. I'm already deeply aware of the capabilties of Mac automation (thanks to my work on Script Kit), but I hope this isn't just another connect-the-nodes UI.

Anthropic and Google Announce Multi-Billion Dollar Compute Deal

Google and Anthropic announced on October 23, 2025, a major cloud infrastructure deal that will provide Anthropic with up to 1 million custom-designed Tensor Processing Unit (TPU) chips to support training and deployment of next-generation Claude models.

Partnership details:

Scale: Up to 1 million TPU chips worth tens of billions of dollars (industry estimates: ~$50B for 1-gigawatt data center)
Compute Capacity: Over 1 gigawatt of AI compute capacity expected online in 2026
Purpose: Support training and deployment of next-generation Claude models
Rationale: TPUs selected for "price-performance and efficiency"
Business Context: Anthropic's annual revenue run rate approaching $7 billion; Claude powers 300,000+ businesses (300x increase over past 2 years)
Infrastructure Strategy: Builds on initial 2023 partnership; ensures capacity to meet exponentially growing demand
Multi-Cloud Approach: Amazon remains primary cloud provider and training partner with $8 billion investment (more than double Google's $3 billion)

(Google Cloud Press, Bloomberg, CNBC, SiliconANGLE)

It's interesting how OpenAI and Grok always talk about needed more compute, but then Google (with Gemini) can lease compute to direct competitors. I don't really have a good read on the situation and requirements for what these teams are doing, but it makes me think that Google is in a much better position than the other providers.

Vercel v0 Launches iOS App in Public Beta

Vercel announced in October 2025 the public beta launch of the v0 iOS app, extending AI-powered development capabilities to mobile devices with voice and camera inputs.

App features:

Mobile Development: Build apps using voice and camera inputs on iOS
Waitlist Access: Users can join waitlist for mobile beta access
October Updates: Team Templates, Stripe payment integration (public beta), Claude Sonnet 4.5 rollout
Cross-Platform Strategy: Expansion extends v0 beyond web development to mobile platforms
Usage: Enables on-the-go prototyping and development workflow

(Vercel Community, Vercel Blog)

I'll be honest, I switched to Android for all the deep Gemini integrations. It's fine and I've got high hopes for it, but seeing OpenAI only releasing Codex on iOS, Anthropic only releasing Claude Code on iOS, and now Vercel only releasing v0 on iOS is making me sad. I'm super curious to see if my non-AI friends will be able to use v0 to build an app for themselves. Being able to launch it on a phone removes a ton of friction from the casual user to get building. This is a super interesting space to watch over the next year to see how far this can be pushed to letting anyone build whatever they need.

🛠️ Developer Tooling Updates

Vercel Open Sources Workflow Dev Kit for Durable Workflows

Vercel released the Workflow Dev Kit on October 23, 2025, as an open-source TypeScript framework for building durable, long-running workflows with built-in observability and no vendor lock-in.

Framework features:

Simple API: Uses use workflow and use step directives with no manual queue/scheduler setup
Durable Execution: Functions can pause for minutes or months, survive crashes, and resume exactly where they stopped
Built-in Observability: CLI and web UI for monitoring workflow execution
Webhook Support: External data triggers for workflow initiation
Framework Agnostic: Runs on any framework, platform, and runtime
Target Use Cases: AI agents and data pipelines requiring reliable long-running operations
Open Source: MIT licensed and available on GitHub

(Vercel Blog, Vercel Changelog, GitHub)

I don't know how many times I can say "Agents!" in a single newsletter, apparently they're kind of a big deal. Everyone is figuring out how to host them and Vercel is throwing its hat in the ring. I'm definitely all for anything that makes these easier to understand and manage. I'm almost suprised it's not just a text prompt describing the workflow so that it's a "true" Agent, but I understand that comes with massive limitations and unexpected results.

Google AI Studio Adds Annotation Mode for Visual Debugging

Google announced on October 18, 2025, the addition of annotation mode to Google AI Studio, enabling developers to highlight and comment on UI components directly for precise visual debugging.

Annotation capabilities:

Visual Highlighting: Click and mark specific components, buttons, or text on the canvas
Collaborative Comments: Add visible comments, error pointers, and highlights for team collaboration
Screenshot Sharing: Annotated screenshots shareable in chat for reference context
Precise Targeting: Enable exact UI region specifications for Gemini model adjustments
Vibe Coding Integration: Part of broader "vibe coding" mode revamp with context-aware suggestions
One-Click Deployment: Deploy directly to Google Cloud Run from AI Studio

(Google Developers Blog, Testing Catalog)

Awesome feature! Absolutely love this direction of just busting out a red pen and writing all over what you want changed rather than trying to describe it in words. I need to show this to my young kids and see what they make.

Claude Code Launches on Web with Parallel Task Execution

Anthropic announced on October 20, 2025, that Claude Code is now available on the web as a beta research preview for Claude Pro and Max users, enabling parallel coding task delegation on cloud infrastructure.

Web platform features:

Cloud Infrastructure: Delegates multiple coding tasks simultaneously on Anthropic-managed infrastructure
GitHub Integration: Connects to GitHub repositories for seamless code management
Real-Time Progress: Track progress with ability to steer Claude during execution
Parallel Execution: Support for running tasks across different repositories simultaneously
Automatic PRs: Creates pull requests with comprehensive change summaries
Growth Metrics: 10x user growth since May 2025 launch; generates $500 million annualized revenue
Mobile Support: iOS app available in early preview

(Anthropic Official, TechCrunch, VentureBeat)

Claude Code actions have been available in GitHub actions for a long time, so I was curious when they were going to release their own hosted version. I haven't had a chance to poke at it yet, but it's definitely next on my list.

Claude Code Introduces Sandbox Security Model

Anthropic launched sandboxing for Claude Code on October 20, 2025, implementing a two-boundary isolation system for filesystem and network access with OS-level enforcement.

Security features:

Filesystem Isolation: Claude can only access and modify specified directories
Network Isolation: Restricted to approved servers via proxy service
OS-Level Enforcement: Uses Linux bubblewrap and macOS seatbelt for security
Reduced Friction: 84% reduction in permission prompts in internal testing
Open Source: Sandboxing approach open-sourced for other teams building AI agents
Easy Activation: Run /sandbox command in Claude Code to enable
Prompt Injection Protection: Protects against malicious prompt injection attempts

(Anthropic Engineering, WinBuzzer, Claude Docs)

Permissions are one of those areas of AI Agent development that are a pain to manage. You have a lot of files on your system you don't want AI's to read, but you also want your Agent to run autonomously and not have to constantly approve permissions. This is a big step in the right direction.

Gemini CLI Enhances Interactive Shell Experience

Google released Gemini CLI versions 0.9.0 (October 15) and 0.10.0 (October 22), 2025, introducing pseudo-terminal support and interactive shell tool calling for complex command execution.

CLI improvements:

PTY Support: Version 0.9.0 introduced pseudo-terminal for interactive command execution
Interactive Tool Calling: Version 0.10.0 added intelligent shell tool usage without explicit "!" prefix
Complex Commands: Run vim, nano, git rebase -i, top, htop, npm init, and interactive REPLs
Real-Time Visualization: Colors and cursor position maintained within Gemini CLI context
Bidirectional Communication: Keystrokes and window resizing transmitted to processes
Installation: npm install -g @google/gemini-cli@latest

(Google Developers Blog, Testing Catalog, GitHub)

I'm curious when Gemini/Claude/Codex will become a default shell. I know the Warp terminal already takes this kind of approach, but my guess is that in the near future, every command you run will be part of an AI session that can help fix things along the way.

🤖 AI Ecosystem Updates

Google Veo 3.1 Tops Video Arena Leaderboard

Google announced on October 15, 2025, the release of Veo 3.1 with enhanced capabilities, which subsequently topped both Text-to-Video and Image-to-Video leaderboards on Video Arena on October 20.

Model improvements:

Video Generation: Up to 60 seconds of 1080p video generation
Native Audio: Enhanced audio generation including dialogue, sound effects, and ambient audio
Improved Fidelity: Better prompt adherence and texture realism
Availability: Gemini App, Flow, Gemini API, Vertex AI
Pricing: $0.40/second standard tier, $0.15/second fast tier
Leaderboard Performance: First model to break 1400 in Video Arena history

(Google Blog, TechCrunch, LMArena)

Alibaba Qwen Releases Qwen3-VL Vision Models

Alibaba's Qwen team released Qwen3-VL-2B and Qwen3-VL-32B on October 21, 2025, bringing state-of-the-art vision-language capabilities to both edge devices and cloud deployment.

Model variants:

Qwen3-VL-2B: Optimized for edge devices (smartphones, laptops)
Qwen3-VL-32B: Cloud deployment with performance matching models up to 235B parameters
Context Window: Up to 1 million tokens
OCR Support: 32 languages with strong multilingual capabilities
Model Modes: Both Instruct and Thinking variants available
Deployment Options: FP8 quantized versions for efficient deployment
Benchmark Performance: Qwen3-VL-32B reportedly outperforms GPT-5 Mini and Claude 4 Sonnet on STEM benchmarks (MathVista, MMMU)

(GitHub, Alibaba Cloud, TechNode)

Microsoft Launches Mico Avatar for Copilot Mobile

Microsoft introduced Mico, an expressive blob-shaped avatar, at the Microsoft Fall 2025 Copilot Sessions on October 23, 2025, for Copilot on iOS and Android.

Avatar features:

Expressive Design: Blob-shaped avatar with shape-shifting animations and emotive reactions
Optional Feature: Can be disabled; includes Easter egg transformation to Clippy when tapped repeatedly
Voice Integration: Displays during voice conversations with dynamic reactions
Availability: US launch with UK and Canada rollout in coming weeks
Additional Features: Part of broader fall release including group chat and debate modes

(TechCrunch, Windows Central)

Anthropic Launches Claude for Life Sciences

Anthropic announced Claude for Life Sciences on October 20, 2025, built on Claude Sonnet 4.5 with specialized integrations for research and pharmaceutical development.

Life sciences features:

Model Performance: Scores 0.83 vs 0.79 human baseline on Protocol QA benchmark
Platform Integrations: Benchling, PubMed, 10x Genomics, Synapse.org
Use Cases: Literature reviews, hypothesis development, data analysis, regulatory submission drafting
Real-World Impact: Novo Nordisk reduced clinical study documentation from 10+ weeks to 10 minutes
Availability: Claude.com and AWS Marketplace; Google Cloud Marketplace coming soon
Strategic Focus: Formal entry into life sciences sector with domain-specific optimizations

(Anthropic Official, CNBC)

⚡ Quick Updates

OpenRouter Launches Exacto Endpoints for Precision Tool Calling

Announced: October 21, 2025
Precision Routing: Route requests to providers with measurably superior tool-calling accuracy
Access Syntax: model_slug:exacto (e.g., glm-4.6:exacto)
Launch Models: Kimi K2, DeepSeek v3.1 Terminus, GLM 4.6, GPT-OSS 120B, Qwen3 Coder
Selection Criteria: Top accuracy, normal propensity, low user blacklisting
Benchmarks: Shows materially fewer tool-calling failures on tau2-Bench and LiveMCPBench

(OpenRouter, Hacker News)

Google Releases ADK v1.17.0 for Agent Development

Released: October 22, 2025 via PyPI
Framework: Open-source Python toolkit for building, evaluating, and deploying AI agents
Agent Types: LLM agents, workflow agents (Sequential, Parallel, Loop), custom agents
Design Philosophy: Make agent development feel like traditional software development
Model Agnostic: Works with any LLM provider
Installation: pip install google-adk

(Google ADK Docs, Google Cloud)

Google Research Publishes FLAME for Rapid Model Specialization

Published: October 23, 2025
Approach: One-step active learning for model specialization
Speed: Complete specialization in ~1 minute on standard CPU (no GPU required)
Performance: 53.96% AP on DOTA benchmark, 53.21% on DIOR benchmark with 30-shot adaptation
Key Innovation: Lightweight classifier refinement layer on top of frozen base model
Use Cases: Rapid adaptation for specialized object detection tasks

(MarkTechPost, arXiv)

Google Launches Skills Learning Platform

Launched: October 21, 2025
Content: Nearly 3,000 courses, labs, and credentials consolidated into single platform
Sources: Google Cloud, Google DeepMind, Grow with Google, Google for Education
Developer Features: Gemini Code Assist integration in 20+ hands-on labs
Gamification: Achievement streaks, leagues, shareable badges
Pricing: 35 monthly lab credits for developers; many courses free; paid certifications available
Career Integration: 150+ employer partnerships including direct interview pathways

(Google Blog, Google Cloud Blog)

Framecap CLI Screen Recorder Launches for macOS

Launched: October 13, 2025 (v1.0.0); October 22, 2025 (v1.1.1)
Size: Lightweight 1.2 MB native binary with no external dependencies
Features: Multi-track recording (screen, camera, audio to separate files)
Codecs: Lossless H.264 and ProRes formats
Special Modes: Window-only recording with transparent backgrounds
Performance: Handles multiple 5K sources simultaneously
Pricing: $49 USD for lifetime updates

(Framecap Official, Hacker News)

Research: Self-Improving Language Models Survey Published

Published: October 3, 2025 (submitted to EMNLP 2025)
Focus: Self-improvement in Multimodal Large Language Models
Structure: Data collection, data organization, model optimization
Key Finding: Recent advances enhance capabilities without significant cost increases
Related Work: Self-Evolving LLMs paper shows 53.8% relative improvement with less than 1.2% additional data
Paradigm: Models generate and solve their own tasks for data-efficient learning

(arXiv Survey, arXiv Self-Evolving)

✨ Workshop Spotlight

Transform into a Claude Code Power User

Date: November 07, 2025 Time: 9:00 AM - 2:00 PM (PDT) Platform: Zoom

Pricing:

🔥 Early Bird: $300 (jumps to $375 next week)
egghead.io Pro Yearly Member: $225 - Become a Pro member →

What You'll Learn:

Master the essential skills to ship reliable AI-generated code with confidence. This hands-on workshop covers everything from foundational prompting to advanced automation using the Claude Code SDK and custom integrations.

Core Skills:

Prompt for Developers: Learn to communicate effectively with Claude for coding tasks
Context Engineering: Control what context Claude sees for reliable, consistent results
TypeScript SDK: Script Claude programmatically to build custom workflows
Custom Hooks: Automate repetitive tasks with Claude Code hooks
Model Context Protocol: Integrate APIs securely to extend Claude's capabilities
Live Q&A: Get your specific questions answered by John Lindquist

Register: https://egghead.io/workshop/claude-code

Read this far? Share "AI Dev Essentials" with a friend! - https://egghead.io/newsletters/ai-dev-essentials

John Lindquist

https://egghead.io