The Underdiscussed Risk of AI Agent Memory Across Calls

Lisa Hawkins

Director, Threat Research & Intelligence, Vercon

A contact center recently found itself facing an unusual pattern of fraudulent activity: a multi-stage attempt to exfiltrate funds, initiated across several inbound interactions. The initial calls were innocuous-a payment inquiry, a request for account balance, status updates-each seemingly routine. What made the sequence noteworthy was that the AI agent handling these calls seemed to 'remember' details from previous, disconnected interactions, details that, when aggregated, allowed a highly sophisticated, social engineering attack to progress past the initial authentication layers.

This was not an isolated incident. The persistent storage and retrieval of conversational context by AI agents, particularly across separate call instances, introduces a novel and increasingly urgent risk. It fundamentally alters the attack surface of contact center operations. For security leads, operations directors, and chiefs of staff, understanding this shift is no longer a theoretical exercise but a practical imperative for Monday morning operational reviews.

Why AI Agent Memory is a Critical Risk Now

The discourse surrounding AI agent memory often prioritizes technological specifics over operational realities. This misdirection obscures the core issue: the capacity for a single inbound interaction, when augmented by persistent memory, to trigger significant and unintended workflow decisions without human oversight. The pertinent question is not which AI platform offers the most robust memory management features, but rather, which critical actions can be initiated or advanced purely through the inputs of an AI agent with recall of prior, potentially compromised, interactions.

AI agent security, once a niche concern, has rapidly transitioned into a core operational domain. The drivers are well-understood: the proliferation of affordable attacker tooling, the expansion of production AI channels, and the belated but firm engagement of regulatory bodies. Organizations that deferred these considerations are demonstrably lagging, a gap that widens with every generative AI advance that enables credible impersonations at near-zero cost. The operational burden is manifesting in shifting search query trends: away from general incident reports and towards specific, tactical needs like "memory policy template" or "memory verification workflow." This indicates a quiet, internal scramble to implement tangible controls.

The Evolving Threat Pattern in Practice

When examining this vulnerability with security and operations teams, the scope of affected workflows consistently broadens beyond initial expectations. Password resets, address modifications, refund approvals, service dispatches, and wire confirmations all share a common characteristic: their underlying processes, at some point, rely on the implicit trustworthiness of a single input channel. It is precisely this foundational assumption that crumbles under targeted, multi-interaction attacks.

Such attack patterns typically emerge first in workflows designed for operational flexibility or customer convenience. Recovery flows, manager override protocols, and after-hours intake systems, all engineered to maintain service continuity during exceptions, become prime targets. Adversaries dissect these pathways with the same analytical rigor as internal auditors, often identifying exploitable friction points long before internal teams. The efficacy of an attack is rarely determined by the sophistication of the attacker's toolkit, but rather by the absence of systemic friction once they breach the initial layers of a workflow.

Consider the 'straight-through-processing' (STP) paradigm, particularly in areas like FNOL (First Notice Of Loss) or claims. If an AI agent, informed by disparate, remembered pieces of information from prior calls (e.g., policy number from Call 1, account holder DOB from Call 2, stated incident details from Call 3), can initiate or significantly advance an STP process without human review, the exposure is substantial. This isn't theoretical; we've observed initial probes against such systems attempting to aggregate seemingly benign data points to establish sufficient 'trust' for subsequent, more critical actions.

Architecting Effective Defense

The required mitigations are often unassuming: mandatory second-channel confirmations for high-impact actions (e.g., an SMS code to a registered number for an address change), granular rate limits on sensitive operations, and explicit policies that empower front-line staff to introduce delays for verification without fear of reprimand. The more challenging aspect lies in integrating these changes into the broader business operations, which necessitates executive-level endorsement rather than solely technical implementation.

Our guiding principle for clients is straightforward: 'raise the cost.' Effective controls do not aim for absolute prevention. Instead, they elevate the time, resources, and complexity required for a successful attack to a point where the adversary’s return on investment diminishes, compelling them to seek softer targets. This principle underpins all robust security programs and, when applied consistently rather than as a one-off measure, proves equally effective in this evolving landscape.

Practical Steps for Your Team

Vercon's detailed approach to these challenges is documented within our Threat Frameworks. Most engagements initiate with a review of these frameworks.

As an immediate tangible step, conduct a focused review: enumerate every action a single inbound interaction, facilitated by an AI agent, can authorize within your most sensitive workflows. For each action, critically assess whether it would withstand a determined impersonation attempt leveraging aggregated, remembered information. Invariably, this exercise yields a concise, prioritized list of enhancements that often deliver measurable returns within a single quarter, frequently without requiring new capital expenditure.

What We Are Monitoring Next

Looking forward, the concept of memory risk will increasingly permeate beyond dedicated security teams into operations, legal, and customer experience departments. This migration is a positive development, indicating a maturing understanding of the systemic implications. Proactive planning for this shift now will yield significant dividends compared to reactive responses down the line. We will continue to disseminate field observations here as these patterns evolve and new mitigation techniques emerge.

Sources & Further Reading

#memory#isolation

← Previous

How Restoration Companies Can Harden Their First Notice of Loss

Why You Should Stress-Test Your AI Agent Quarterly