Better benchmarks for evaluating AI agents, fears of rogue agents wreaking havoc across the internet, the impact agents are having on software development, and more
Amazon strips the embattled company, Metaculus hosts an agentic forecasting tournament, an AI agent runs for office, and more
The company's popular LLM agent framework gets a low-code interface, LangChain's CEO talks agents, Cohere gets multi-step tool use, and more
ML-Bench tests agents' abilities to reason across an entire code repository
New research highlights both the power and risks of LLM-based agents
Building agents with Anthropic's models just got significantly easier now that tool use for the Claude API is generally available
Developers will soon be able to build customized LLM agents for business tasks through the company's Copilot Studio
The dueling announcements of OpenAI's GPT-4o and Google's Astra show the two AI giants' growing focus on agent capabilities