34 results found

Cognition, the AI coding agent startup behind Devin, secured $1 billion at a $26 billion valuation this week. Despite this, CEO Scott Wu insists AI agents shouldn't replace humans, aiming for augmentation to free programmers from tedious tasks. Wu envisions Devin as a "buddy" that enhances creativity, even as it handles 89% of Cognition's internal code.

AI agent usage has nearly doubled, yet developers maintain a strong preference for human oversight. A recent survey reveals single-agent workflows are dominant, driven by concerns for accuracy and security, even as work quality improves. Fintech and media lead adoption, leveraging tools like GitHub Copilot and LangChain under careful monitoring.

AI agents are inadvertently causing significant, untracked production incidents by acting without full system context, blurring the lines between autonomous remediation and chaos engineering. Enterprises lack the frameworks to categorize these failures, leading to cascades stemming from narrow agent perspectives. Experts advocate for integrating agent governance with chaos engineering by treating agent actions as experiments and managing a shared “resilience budget” based on live system signals.

Google is transitioning from a traditional search engine to an AI agent that proactively gathers information, fundamentally redefining the act of "googling." This shift, discussed on The Vergecast after Google I/O, raises profound questions about the future of the web itself and Google's identity in the AI era.

Google unveiled a sprawling, albeit complex, ecosystem of AI agents at its I/O developer conference on Tuesday, intending to transform how consumers interact with the web and manage their digital lives. However, the

Learn to harness Google's new AI agents to continuously monitor interests, synthesize information, and receive proactive updates, transforming how you stay informed without constant searching.

Anthropic announced Monday it has acquired Stainless, a dev tools startup whose SDK automation software is widely used by rivals like OpenAI, Google, and Cloudflare. This strategic acquisition will see Anthropic wind down Stainless's hosted products, granting it exclusive access to critical technology for building AI agents and impacting competitors.

Anthropic's new `/goals` feature for Claude Code revolutionizes AI agent reliability by separating task execution from goal evaluation. This prevents agents from prematurely ending tasks, using a dedicated evaluator model to ensure specified conditions are fully met before declaring completion. The innovation offers a more robust, auditable approach to AI agent deployment.

Amazon has integrated its unified Alexa for Shopping AI assistant directly into its main search bar for US customers. This move consolidates the Rufus chatbot and Alexa+ functionalities, providing conversational answers, comparisons, and automated shopping tasks as agentic commerce intensifies. The strategic shift aims to defend Amazon's advertising business against competing AI agents.

A critical flaw dubbed "AI tool poisoning" has been uncovered in enterprise AI agent security. The vulnerability exploits AI agents' reliance on unverified tool descriptions, rendering traditional software supply chain controls insufficient for ensuring behavioral integrity. A new runtime verification layer, using behavioral specifications and a proxy, is proposed to validate tool actions and prevent sophisticated attacks like prompt injection and behavioral drift.

The rapid evolution of AI has created a dense lexicon, leaving many confused. This guide demystifies key terms like LLMs, AI agents, and hallucinations, providing a foundational understanding. Grasping this language is crucial for navigating AI's transformative impact and future.

As autonomous AI systems become prevalent, intent-based chaos testing emerges as a critical method to prevent catastrophic failures caused by AI agents acting confidently but incorrectly. This approach addresses the limitations of traditional testing, which fails to account for AI's probabilistic nature and complex interactions. By measuring deviation from an agent's intended behavioral boundaries, this testing methodology helps ensure AI systems operate safely in unpredictable production environments.

Learn to establish robust AI agent governance in six stages, from discovery to compliance, protecting your organization before agents reshape your security policies.

Perplexity's Personal Computer, its local AI agent, is now available to all Mac users via a new desktop app. It securely accesses local files, apps, and the web to automate complex workflows, with full features requiring a Pro or Max subscription.

Stripe has launched Link, a new digital wallet that uniquely enables autonomous AI agents to make secure payments on behalf of users. It tackles security concerns by allowing agents to process transactions without direct access to sensitive payment credentials, utilizing virtual cards and user approval. The wallet also offers comprehensive traditional features like spending tracking and subscription management.

The web intelligence industry is rapidly evolving to meet the escalating demands of advanced AI, particularly for multimodal data processing and autonomous AI agents. Innovations in data extraction, infrastructure, and user-friendly tools are crucial for powering the next wave of artificial intelligence. These developments are building the essential links between vast web data and sophisticated AI models.

Nvidia CEO Jensen Huang took the stage at GTC 2026 on Monday to unveil the Agent Toolkit, an open-source platform for building autonomous AI agents, announcing a significant industry alignment with 17 major enterprise

Anthropic's Claude Code AI agent source code, comprising 512,000 lines of TypeScript, was accidentally leaked, revealing critical architectural details, security validators, and unreleased features. This breach creates new attack paths and forces enterprise security leaders to take immediate actions to protect their AI-assisted development environments.

Cloud infrastructure has undergone a significant transformation, evolving from manual configuration to deeply programmable systems. Over the past decade, nearly every platform has exposed robust APIs, enabling

AI is breaking software org charts. Zencoder's PMs and designers now directly ship code, thanks to AI agents drastically cutting implementation costs. This "AI-first" shift eliminates bottlenecks, enabling rapid delivery and widespread ownership.

In the lead-up to 2025, the developer community buzzed with anticipation. Promises of sophisticated AI agents, capable of autonomously executing complex tasks and revolutionizing workflows, dominated tech discussions.

Dana Lawson, Netlify's CTO, shares insights on building and leading a lean, globally distributed engineering team. Her approach emphasizes a strong written culture, intentional in-person connections, and pragmatic tech choices to power 5% of the internet with only ~50 R&D staff. AI agents play a crucial role in augmenting communication and documentation.

World ID's Agent Kit offers a novel "proof of human" solution for AI agents to prevent Sybil attacks and ensure human direction. Leveraging iris-scanned World ID, it aims to bring trust and accountability to automated online interactions.

Microsoft is reorganizing its Copilot division, elevating former Snap executive Jacob Andreou to lead all consumer and commercial AI efforts. Mustafa Suleyman, Microsoft AI CEO, will now focus exclusively on superintelligence and frontier AI models. This aims to create a more integrated AI system and accelerate the shift to AI agents.

The promise of Artificial Intelligence (AI) in software development has captured the industry's imagination. Large Language Models (LLMs) and AI agents are touted as revolutionary tools capable of dramatically boosting