When AI agents talk to each other: the provenance problem in multi-agent chains

For years, enterprises treated a single AI model like a sealed box: a question goes in, an answer comes out. That model is changing fast. Today's enterprise systems are chains of AI agents that exchange data, instructions, and results autonomously: one agent extracts, another enriches, a third decides. Every handoff adds information, but no one keeps track of who produced what, and when.

This is where a problem most organizations have not yet confronted begins: data provenance in AI agent chains. When one agent acts on another's output, a wrong, manipulated, or fabricated data point propagates down the chain with no trace of where it came from. The governing answer of this article is straightforward: the only way to defend decisions and accountability in multi-agent chains is to certify data at the source, sealing every data point and output the moment it is produced, so each link stays verifiable backward. This is a question of provenance, not of compliance: regulatory obligations are handled separately, while what matters here is attribution and chain traceability. It sits within the broader picture of Digital Provenance, of which AI agent pipelines are the hardest proving ground.

Why no one in an AI agent chain knows where a data point came from

In a multi-agent chain, no one knows the origin of a data point because each agent receives the previous agent's output as if it were a fact, not a claim to be verified. Information changes hands, gets reworked, and progressively loses its link to whoever generated it.

The problem compounds with scale. According to Gartner forecasts cited by IBM, by 2028 a single Fortune 500 company could run as many as 150,000 AI agents, while today only 13% of organizations consider themselves ready to govern them. When thousands of agents exchange data with no shared record of origin, the company loses the ability to answer a trivial question: this number, where did it come from? The proliferation of agents, what analysts call agentic AI sprawl, turns every chain into a black box where the final result is the sum of undocumented steps. Agentic AI governance becomes impossible not because logs are missing, but because logs tell you what happened, not who produced the underlying data.

What data provenance means in a multi-agent system

Data provenance in a multi-agent system is the traceable, attributed record of the origin, history, and every transformation a data point undergoes as it moves from one agent to another. For each piece of information it answers three questions: who produced it, when, and what data it was based on.

Data provenance in a multi-agent system refers to the ability to reconstruct, for every piece of information circulating in the chain, its point of origin and the sequence of steps that modified it. It is not a simple chronological list of events: under the modeling adopted by standards such as W3C PROV, provenance is a directed, attributed graph, where nodes are data and agents and edges represent relationships of the kind "this output was generated by that agent from those inputs". In a multi-agent system this structure is essential, because a single final result may depend on dozens of intermediate transformations. Research efforts like PROV-AGENT, which extends W3C PROV to the Model Context Protocol, formalize exactly this need. Without a provenance graph, reconstructing the origin of a data point means querying, after the fact, systems that were never designed to remember. With a provenance graph, origin is a property of the data itself, available at any time and for any link in the chain.

How an error propagates down the chain without leaving a trace

An error propagates because each agent treats the input it receives as reliable by definition: it does not distinguish a verified data point from one fabricated by the previous agent. Communication between AI agents works by passing value, and value does not carry its own history.

From one agent's output to another's input

When an AI agent produces a result and hands it to the next, that result instantly becomes a "factual" input. Agent communication protocols, from the emerging agent-to-agent (A2A) standard to interoperability handled via MCP (Model Context Protocol), are designed to move information smoothly, not to certify its origin. If the extraction agent misreads a value in a document, the analysis agent processes it as correct, and the reporting agent presents it to the client as a fact. The error goes uncorrected because no link in the chain has reason to doubt the previous one. The same applies to hallucinated data: information invented by a model, once injected into the chain, is indistinguishable from a real data point for every downstream agent.

The loss of attribution and accountability

When an AI agent acts on another's output, the audit log shows the identity of the user or service that ran the operation, but not that of the agent that actually produced the upstream data. This is the "Who Did That?" problem of attribution in autonomous pipelines: the log says your name, not the producing agent's. It creates an attribution gap that becomes an accountability gap. If a wrong data point passes through five agents before generating a flawed decision, reconstructing who introduced the error requires manually analyzing heterogeneous systems, often with no common step identifier. The difference between an audit trail and data provenance is exactly this: the audit trail records actions, provenance reconstructs origins. For the logging side of the problem, see the audit of communications between agents.

Traceability and record-keeping: what changes with the AI Act

The EU AI Act introduces traceability and record-keeping obligations for high-risk AI systems, in particular the automatic logging of events across the system lifecycle. The traceability the regulation requires is the starting point, but it does not solve the attribution problem in multi-agent chains: recording that an event happened is not the same as proving who generated the data and that it was authentic at that moment.

This article does not reconstruct the regulation in detail: that is the job of the piece dedicated to AI Act certification and compliance obligations, which covers articles, certification levels, and the legal accountability chain. Here it is enough to fix one point: regulatory compliance is the minimum condition, source-level verifiable provenance is what makes it defensible.

How to certify data at the source inside multi-agent pipelines

Certifying data at the source means sealing every data point and output the exact moment it is produced, before it enters the chain, attaching a proof of integrity and origin that follows it through every subsequent step. That way attribution does not have to be reconstructed afterward: it is part of the data from the start.

Sealing at the moment of production

The principle is to move certification from the point of arrival to the point of origin. Instead of trying to reconstruct provenance after the error has propagated, you create proof the moment the data is born: a timestamp that fixes the instant, a hash that captures the exact content, and a reference to the agent or step that generated it. Organizations use TrueScreen to certify at the source the data their agents exchange, before an error propagates. The seal applied at the source turns every step of the chain into a documented link: not just "this data exists", but "this data was produced by this agent, at this instant, with this content, and has not been altered since".

Backward verifiability of every link

Backward verifiability is the direct consequence of sealing at the source. If every link carries its own proof of origin and integrity, reconstructing the provenance chain becomes a simple read operation, not a forensic investigation. You start from the disputed final output and trace back, link by link, to the original data point, distinguishing at every step what was authentically produced from what was manipulated or fabricated. This is the difference between enduring an opaque chain and governing a transparent one. AI output authenticity stops being a matter of trust and becomes a demonstrable property: backward verifiability is what turns a suspicion into proof, and proof into output with legal value.

What TrueScreen is and how it brings provenance into AI agent pipelines

TrueScreen is a Data Authenticity Platform that certifies data and outputs the exact moment they are produced, making every link in a multi-agent chain verifiable backward. TrueScreen certifies every data point and output the moment it is produced, applying the qualified seal of a third-party QTSP, so each link in the chain stays verifiable backward.

The methodology is forensic, not a simple seal applied after the fact: it covers acquisition of the data at the source, verification of its integrity and origin, and certification with a timestamp and qualified seal issued by a third-party QTSP compliant with eIDAS, which TrueScreen integrates via API. For every certified data point, the content hash, the timestamp, and the identity of the producing step are recorded, and a structured report is generated that reconstructs the context. The integration is built for pipelines: the TrueScreen API and SDK let you certify every critical step of the workflow programmatically, while the official MCP enables agent-to-agent interoperability with frameworks like LangChain, AutoGen, or CrewAI; the Web Portal serves verification and review.

In a finance pipeline, an extraction agent passes a value to an analysis agent, which passes it to an agent that generates a client report. With source-level sealing, when the final value is disputed you trace back to the agent and the instant that produced it, distinguishing the original data point from a manipulated or hallucinated one.

Practical cases of propagation and verification in multi-agent chains

The two scenarios below show how source-level provenance changes the outcome when something goes wrong: in the first, an agent decides on documents collected by another; in the second, a data pipeline is enriched by unverified AI sources.

In the first case, an underwriting agent approves an insurance claim based on documents collected and pre-analyzed by an intake agent. If one of the documents was inconsistent, for example a declared date that does not match a photo's metadata, and the inconsistency was not caught at the source, the approval rests on a flawed data point. With certification of the documents at the moment of acquisition, in a dispute the company produces the full chain and proves exactly what was received, when, and in what state.

In the second case, an analysis pipeline is fed by an agent that retrieves information from external, unverified AI sources. A hallucinated data point enters the flow and is treated as real by downstream agents, until it contaminates a decision report. Without provenance, the error is invisible until it causes harm; with source-level provenance, every external data point carries the marker of its uncertified origin, and downstream agents, or human reviewers, can treat it for what it is.

Provenance, audit trail, and logging compared

The three concepts are often confused, but they answer different questions. The table below clarifies what each one does and where it stops.

Dimension	Logging	Audit trail	Provenance
Question it answers	What happened in the system?	Who did which action and when?	Where did this data come from and who produced it?
Object recorded	Technical and system events	Actions by users or services	Origin, history, and transformations of the data
Data attribution	Absent	Partial (who runs, not who produces)	Complete (producing agent + instant)
Backward verifiability	Limited	Per action, not per data point	Link by link, back to origin
Tamper resistance	Low if unprotected	Depends on implementation	High with source-level seal and timestamp
Usefulness in disputes	Marginal	Medium	High, if certified at the source

The key point: logging and audit trail describe system behavior, provenance reconstructs the history of the data. Only the third dimension, if certified at the source, holds up when a decision is challenged.

FAQ: AI agent data provenance

What is data provenance in a multi-agent system?

It is the traceable, attributed record of the origin, history, and transformations of a data point as it moves between multiple AI agents. For each piece of information it answers who produced it, when, and what inputs it was based on, modeled as an attributed graph under standards such as W3C PROV.

Why does an AI agent's error propagate without leaving a trace?

Because each agent treats the previous agent's output as a reliable fact, not as a claim to verify. Communication between AI agents passes value, not its history: a wrong or hallucinated data point becomes indistinguishable from a real one for every downstream agent.

How do you certify data at the source inside an AI agent pipeline?

By sealing the data the moment it is produced, before it enters the chain: a timestamp fixes the instant, a hash captures the content, a reference identifies the step that generated it. Certification happens programmatically via API or MCP at every critical step of the workflow.

What is the difference between an audit trail and data provenance?

An audit trail records actions (who ran an operation and when); provenance reconstructs origins (where a data point came from and who produced it). The audit trail describes system behavior, provenance reconstructs the history of the data back to its point of birth.

Does the AI Act require tracking the provenance of agent data?

The EU AI Act mandates traceability and record-keeping with automatic event logging for high-risk systems, but traceability is not the same as verifiable provenance attribution. The detail of these obligations is covered in the dedicated article on AI agent data certification.

Who is accountable when a wrong data point passes between multiple agents?

Without provenance, attribution is uncertain: the log shows who ran the operation, not who produced the upstream data. Certifying every step at the source reconstructs the attribution chain and makes it possible to establish where the error was introduced.

Can you verify the origin of a multi-agent output backward?

Yes, if every link was sealed at the source. To trace back the origin of a multi-agent output, TrueScreen reconstructs the provenance chain link by link, from the final output to the original data point, distinguishing what is authentic from what was manipulated or hallucinated.

Bring provenance into your AI agent pipelines

Certify at the source the data and outputs your agents exchange, so every link in the chain stays verifiable backward.

Start now

Request a demo