Challenges in Governing AI Agents (Noam Kolt, Lawfare)

Leading AI companies have released a new type of AI system: autonomous agents that can plan and execute complex tasks in digital environments with limited human involvement. OpenAI’s Operator, Google’s Project Mariner, and Anthropic’s Computer Use Model all perform a similar function. They type, click, and scroll in a web browser to carry out a variety of online tasks, such as ordering groceries, making restaurant reservations, and booking flights. While the performance of these agents is currently unreliable, improvements are on the horizon. Scores on multiple benchmarks are steadily improving. The aspiration is to create AI agents that can undertake a broad range of personal and professional activities, serving as artificial personal assistants and virtual coworkers.

Challenges in Governing AI Agents | Lawfare

Latest articles

Related articles