🔧 Tool Reliability

Google DeepMind publishes ToolFormer v2 with 94% function call accuracy. Key innovation: speculative execution with rollback—agents can "dry-run" tools before committing.

📊 Benchmarks

🔨 GitHub Trending

⚡ Infrastructure News

OpenAI extends function calling to 128 parallel tool calls. New streaming format reduces latency for multi-step agent workflows by 40%.

💡 Lab Takeaway

Tool use is becoming a solved problem. The frontier is now tool discovery—agents that find and integrate new capabilities dynamically.