← All Weekly Issues

Claude-tap inspects agent APIs; benchmarks measure what 'agentic' really means

June 19, 2026

The agent infrastructure layer is crystallizing. New tools let you own your agents' execution (Osaurus), inspect their API calls (Claude-tap), and run them safely in sandboxes. Meanwhile, benchmarks are finally catching up—forcing the question of whether your setup actually qualifies as agentic.

Industry Developments

AWS’s New Agentic Tools Trail Rivals, but Respond to Real Problems

With its latest AI tools and products, the vendor showed it is listening to its customer base even while it fails to offer novelty in the market.

Read Source

Technical Updates

liaohch3/claude-tap: Intercept and inspect Coding Agent API traffic from Claude Code, Codex CLI, Gemini CLI, Cursor CLI,

Intercept and inspect Coding Agent API traffic from Claude Code, Codex CLI, Gemini CLI, Cursor CLI, OpenCode, Kimi, Pi, and Hermes in a local trace viewer.

Read Source

IBM/AssetOpsBench: AssetOpsBench - Industry 4.0: A unified benchmark and framework for building, orchestrating, and eva

AssetOpsBench - Industry 4.0: A unified benchmark and framework for building, orchestrating, and evaluating domain-specific AI agents for Industry 4.0 asset operations and maintenance, with 460+ scenarios, 4 specialist agents (IoT, FMSR, TSFM, Work Order), and multi-agent orchestration blueprints (MetaAgent, AgentHive) over MCP.

Read Source

omnigent-ai/omnigent: A meta-harness for all your AI agents. Omnigent provides a common layer over Claude Code, Codex, Pi

A meta-harness for all your AI agents. Omnigent provides a common layer over Claude Code, Codex, Pi, and the agents you write yourself: swap or combine harnesses without rewriting, keep them in check with policies and sandboxing, and collaborate in real time on the same live session, from any device.

Read Source

opensandbox-group/OpenSandbox: Secure, Fast, and Extensible Sandbox runtime for AI agents.

Secure, Fast, and Extensible Sandbox runtime for AI agents.

Read Source

osaurus-ai/osaurus: Own your AI. The native macOS harness for AI agents -- any model, persistent memory, autonomous exec

Own your AI. The native macOS harness for AI agents -- any model, persistent memory, autonomous execution, cryptographic identity. Built in Swift. Fully offline. Open source.

Read Source

xenodium/agent-shell: A native Emacs buffer to interact with LLM agents powered by ACP

A native Emacs buffer to interact with LLM agents powered by ACP

Read Source

Agentic Resource Discovery: Let agents search

No description available

Read Source

Is it agentic enough? Benchmarking open models on your own tooling

No description available

Read Source