Deploying AI Agents in a GDPR-Compliant Way
GDPR-compliant AI Agent deployment: legal bases, data flows and technical measures for privacy-compliant operation.
GDPR-compliant AI Agent deployment refers to the legally sound development and operation of LLM-based and agentic AI systems while fully upholding the General Data Protection Regulation — from the legal basis (Art. 6, and where applicable Art. 9), through automated decisions (Art. 22), processing on behalf of a controller (Art. 28) and the Data Protection Impact Assessment (Art. 35), to data residency in the EU. Because with AI Agents nearly every processing step — prompt, agent memory, tool call, vector store, log — constitutes a processing of personal data, controllers must justify and document each layer of the stack individually. This article is informational and does not constitute legal advice.
Key Takeaways
- ✓With AI Agents, nearly every touchpoint is a processing operation within the meaning of the GDPR: prompts, outputs (including hallucinated personal data), agent memory, tool-call payloads, logs and embeddings — the latter being classified by the CNIL and the Hamburg Data Protection Authority as pseudonymous personal data rather than anonymous, due to inversion attacks.
- ✓For internal deployments, training, fine-tuning and RAG, Art. 6(1)(f) (legitimate interest) is the typical legal basis; it requires the three-part test (purpose, necessity, balancing) under EDPB Guidelines 1/2024 and EDPB Opinion 28/2024 (17 Dec 2024).
- ✓Following the CJEU rulings in SCHUFA (C-634/21, 7 Dec 2023) and Dun & Bradstreet (C-203/22, 27 Feb 2025), even a score on which a third party substantially relies falls under Art. 22 — and data subjects have a right to a concrete, comprehensible explanation of the logic that enables them to understand, verify and contest it.
- ✓Enterprise DPAs from the major providers (Microsoft, OpenAI, Anthropic, Google, AWS) contractually commit not to use customer data for model training — yet these commitments are contractual, not statutory, and often reserve narrow rights for abuse monitoring; the sub-processor cascade of 5–8 layers must be covered seamlessly by DPAs (Art. 28(4)).
- ✓AI Agents almost always trigger a Data Protection Impact Assessment (DPIA, Art. 35), since they regularly involve profiling, automated decisions with effect, innovative technology or employee monitoring — as stated in the DSK guidance 'AI and Data Protection' (Version 1.0, 6 May 2024).
- ✓The Italian Garante imposed a fine of EUR 15 million on OpenAI (decision of 2 Nov 2024) for training without a legal basis and non-transparent processing, and EUR 5 million on Luka Inc. (Replika) (10 Apr 2025) — Clearview AI was fined EUR 20 million each in, among others, Italy and France, because public availability does not substitute for a legal basis.
- ✓EU data residency can be achieved via the EU Data Boundary (Microsoft), EU regions (Vertex AI, AWS) or sovereign providers (Aleph Alpha Pharia, IONOS, STACKIT, T-Systems, Swisscom); the EU-US Data Privacy Framework has been in force since 10 Jul 2023 but was challenged in court in 2025, which is why SCCs plus a Transfer Impact Assessment are recommended as a fallback.
- ✓In Switzerland, Art. 21 revFADP is stricter than Art. 22 GDPR — the duty to inform in the case of automated decisions is the rule, not the exception — and intentional violations can hold natural persons (management, DPO) criminally liable for up to CHF 250,000.
GDPR-compliant AI Agent deployment: what it is about
An AI Agent is not a classic software tool but a system that continuously processes personal data — often in places that project teams initially overlook. Anyone operating an AI Agent as a controller in the DACH region must therefore justify and document each processing layer individually under data protection law. This hub page provides an overview of the five central levers: legal basis (Art. 6, and where applicable Art. 9 GDPR), automated individual decisions (Art. 22), processing on behalf of a controller (Art. 28), Data Protection Impact Assessment (Art. 35) and data residency in the EU.
Important upfront: the two legal regimes most relevant to AI Agents — the GDPR and the EU AI Act — overlap but apply independently of one another. A processing operation can be GDPR-relevant without falling under a high-risk classification of the AI Act; conversely, a high-risk system under Annex III must satisfy every GDPR principle on its own. This article is informational and does not constitute legal advice.
Where the GDPR applies within the agent stack
Under Art. 4(1)/(2) GDPR, any operation on personal data triggers the regulation. In an agentic system, these are typically:
- Inference inputs — prompts, system prompts, uploaded files, voice and image samples.
- Inference outputs — texts and summaries that name or describe individuals, including hallucinated personal data (both the Hamburg ChatGPT complaint and the Garante proceeding confirm: fabricated personal information remains personal data).
- Agent memory — short-term context windows, episodic and semantic memory, persistent vector stores (mem0, LangGraph, Letta, custom databases).
- Tool-call payloads — every JSON body sent to an internal API or an MCP server carries personal data into a new processor sphere.
- Multi-agent communication — A2A messages are processing events even when both agents reside with the same controller.
- Logs and traces as well as embeddings.
Particularly underestimated: embeddings are not automatically anonymous. Text inversion attacks (Morris et al., 2023) and membership inference attacks show that embeddings are re-identifiable via similarity search. The CNIL and the Hamburg Data Protection Authority therefore treat embeddings as pseudonymous personal data by default.
Legal basis: Art. 6 GDPR
Every processing operation needs one of the six legal bases under Art. 6(1). In practice, three are especially relevant for AI Agents:
Legal basis | Typical agent application | Practical note |
|---|---|---|
(a) Consent | Consumer assistants, voice cloning, biometric agents | Freely given, specific, informed; in the employment context usually not freely given; hardly practicable for training |
(b) Performance of a contract | Customer service agent for existing customers, employee agents | Narrow: only what is necessary for the contract; fine-tuning on customer data is rarely contractually necessary |
(f) Legitimate interest | Internal deployment, training, fine-tuning, RAG, fraud detection, B2B customer service | Dominant basis; three-part test mandatory; not for public authorities in the performance of their tasks |
The three-part test under Art. 6(1)(f) — pursuant to EDPB Guidelines 1/2024 (8 Oct 2024) and EDPB Opinion 28/2024 — requires: (1) a concretely articulated, real interest ("improving our HR screening agent for our own employees" rather than abstractly "improving AI"), (2) necessity (could it be done with less or anonymized data?) and (3) a balancing against the reasonable expectations of the data subjects.
The "publicly available" trap: A persistent misconception is that freely accessible web data is GDPR-free. Clearview AI was sanctioned in, among others, Italy (EUR 20 million, 10 Feb 2022) and France (EUR 20 million, 17 Oct 2022) — public availability substitutes neither for consent nor for any other legal basis. The CJEU ruling Meta v Bundeskartellamt (C-252/21) confirms: even with public data, the balancing under Art. 6(1)(f) must be carried out separately.
Enforcement benchmark: On 2 Nov 2024, the Italian Garante imposed EUR 15 million on OpenAI — for training without an identified legal basis (violation of Art. 5(1)(a), 5(2), 6), insufficient transparency and missing age verification. The Replika decision against Luka Inc. (10 Apr 2025, EUR 5 million) adds: performance of a contract (Art. 6(1)(b)) does not hold up where the user base includes minors without legal capacity.
Special categories of data: Art. 9 GDPR
As soon as an agent processes health, biometric or other sensitive data, the fundamental prohibition of Art. 9(1) applies. Practically relevant exceptions are explicit consent (Art. 9(2)(a)), employment and social security law grounds (Art. 9(2)(b), e.g. §26(3) BDSG) and substantial public interest (Art. 9(2)(g)). Caution regarding the interplay with the AI Act: its Art. 10(5) permits the processing of sensitive data for bias detection but is not a standalone Art. 9(2) GDPR exception — the legal basis must additionally derive from Art. 9(2) (usually (g)), which in most member states still requires national legislation.
Automated individual decisions: Art. 22 after SCHUFA and Dun & Bradstreet
Art. 22 is the most-litigated GDPR provision in the AI Agent environment. Two CJEU rulings have considerably broadened its scope of application:
- SCHUFA (C-634/21, 7 Dec 2023): Even an automated probability value (score) is itself an automated individual decision within the meaning of Art. 22(1) if a third party (e.g. the bank) substantially relies on it. Art. 22 contains a fundamental prohibition, not merely a right that must be invoked — the controller bears the burden of proof for an exception.
- Dun & Bradstreet (C-203/22, 27 Feb 2025): "Meaningful information about the logic involved" (Art. 15(1)(h)) means a concrete, comprehensible explanation that allows data subjects to understand, verify and contest the decision. Disclosure of the algorithm or of a mathematical formula does not suffice. Trade secrets do not automatically preclude disclosure — in the event of a dispute, the supervisory authority or the court decides on a case-by-case basis.
For practice this means: score-driven agents (HR screening, credit decisions, insurance underwriting, dynamic pricing) trigger Art. 22 almost as standard. The most common audit finding in 2024–2025 is the rubber-stamp review — a merely formal human confirmation. For a decision not to be "solely automated," the human must have the competence and authority to override, must actually examine the data, and must exhibit plausible override rates. As the originating authority of the Dun & Bradstreet case, the Austrian DSB is a direct yardstick-setter here for the DACH region.
Processing on behalf of a controller: Art. 28 GDPR
Most enterprise AI contracts (Microsoft Azure OpenAI, OpenAI Enterprise, Anthropic Claude for Work, Google Vertex AI, AWS Bedrock, Mistral, Aleph Alpha) are structured as controller-to-processor, with the explicit commitment not to use customer data for model training. These commitments are, however, contractual rather than statutory in nature and frequently reserve narrow rights for abuse monitoring or safety review.
A modern agent stack typically generates a five- to eight-layer sub-processor cascade: foundation model provider → cloud/hosting → orchestration runtime → vector store → memory provider → MCP server → observability → evaluation. For each layer a DPA must exist, the chain must be seamless under Art. 28(4), and data residency must be consistent. The most common audit findings: undisclosed MCP server flows, observability providers without a DPA, and evaluation services that collect prompt traces. Beyond the pure processor question, joint controllership (Art. 26) also looms: as soon as a provider uses data for its own purposes — for example when a web search tool (Bing in Copilot, Google Search in Gemini) queries the public web — the data flow leaves the processor sphere.
Data Protection Impact Assessment (DPIA): Art. 35 GDPR
AI Agents almost always trigger a DPIA because they regularly meet at least one of the triggers under Art. 35(3) and the DPA blacklists: automated decision with effect, large-scale profiling, biometric processing, innovative technology application or employee monitoring. The DSK guidance "Artificial Intelligence and Data Protection" (Version 1.0, 6 May 2024) explicitly requires the DPIA for most LLM deployments.
The DPIA methodology (DSK short paper No. 5, BfDI tool, EDPB WP248) comprises: (1) necessity and proportionality assessment, (2) risk identification (for AI Agents: hallucination, bias, prompt injection, memory leakage, output regurgitation, tool-call exfiltration, re-identification), (3) risk evaluation and (4) risk treatment. Required artifacts are data-flow diagrams across every personal-data hop, a roles map (controller, joint controller, processor with the Art. 28(4) chain), a risk register and consultation records (DPO opinion under Art. 35(2); works council / staff representation involvement under §87 BetrVG or the ArbVG). Where Annex III high risk applies, the DPIA can be merged with the Fundamental Rights Impact Assessment (FRIA, Art. 27 AI Act) into a joint artifact (Art. 27(4) AI Act).
EU data residency and cross-border transfers
For DACH decision-makers, EU data residency is a central lever for risk reduction. It is achievable via the EU Data Boundary (Microsoft), EU regions (Vertex AI with EU regional endpoints, AWS) or via sovereign providers such as Aleph Alpha (Pharia), IONOS AI Model Hub, STACKIT, T-Systems Open Telekom Cloud (DE/EU) and Swisscom Sovereign AI (CH).
For transfers to third countries, the Schrems II standard applies: SCCs only with a case-specific Transfer Impact Assessment (TIA). The EU-US Data Privacy Framework (adequacy decision of 10 Jul 2023) is currently valid; on 3 Sep 2025 the EU General Court dismissed the first action in Latombe v Commission, but an appeal before the CJEU is pending. Operational recommendation: use the DPF certification, but keep SCCs as a fallback and carry out TIAs even for DPF-certified recipients. For US providers, the CLOUD Act and FISA 702 remain risks to be addressed in the TIA. Switzerland has been recognized as adequate since 15 Jan 2024; the Swiss-US DPF has been in force since 15 Sep 2024.
DACH specifics and the Swiss tightening
Within the DACH region, the supervisory authorities diverge. The Hamburg Data Protection Authority takes the position that the mere storage of an LLM is not a processing operation (discussion paper, 15 Jul 2024) — EDPB Opinion 28/2024 implicitly disagrees. For deployers, the safe stance is: treat the model as potentially personal and concentrate the compliance work on the deployer-controlled layers (RAG indices, memory, logs), where deletion is technically possible.
In Switzerland, Art. 21 revFADP is stricter than Art. 22 GDPR: the duty to inform in the case of automated individual decisions is the rule, not the exception. In addition, intentional violations carry the threat of criminal sanctions against natural persons (management, DPO) of up to CHF 250,000. For FINMA-regulated financial service providers, outsourcing notifications and banking secrecy analyses are added, which push many deployers toward the sovereign cloud.
Practical outlook
GDPR compliance with AI Agents is not a one-time document but an architectural principle: separate the inference layer (managed API with a "no-training" DPA) from the deployer-controlled layers, which must remain fully deletable, and add a filter layer for DSAR and Art. 17 requests. Anyone who factors in the legal basis, Art. 22 architecture, a seamless DPA cascade, the DPIA and data residency from the outset avoids the most common audit findings — from unprotected memory stores through prompt logs without a deletion concept to undisclosed MCP flows. For the concrete contractual and technical design, controllers should always obtain expert legal advice; this overview does not replace it.
All Articles in this Topic
7 ArticlesData Processing Agreements under Art. 28 GDPR with AI Providers: The DPA Guide
A data processing agreement (DPA) under Art. 28 GDPR is mandatory as soon as an AI provider processes personal data on your behalf under instruction – for example via prompts, embeddings or logs. It bindingly governs instruction-bound processing, security, sub-processors, audit rights and deletion at the end of the contract.
GDPR Legal Basis for AI Agents (Art. 6): When Consent, Contract or Legitimate Interest Applies
The GDPR legal basis for AI determines which ground under Art. 6(1) GDPR a company relies on for data processing by an AI agent. For internal deployments, fine-tuning, RAG and B2B service agents, legitimate interest (Art. 6(1)(f)) is the dominant basis, alongside consent, contract or legal obligation.
EU Data Residency for LLMs: Where Your Data Is Processed
EU data residency for LLMs means that prompts, outputs, embeddings and logs are stored and processed exclusively within an EU region. It is relevant under data protection law because any processing outside the EEA constitutes a third-country transfer under Chapter V GDPR and therefore requires additional safeguards.
Using OpenAI / ChatGPT in a GDPR-Compliant Way: Consumer vs. Enterprise vs. API
Using OpenAI in a GDPR-compliant way means deploying only ChatGPT Enterprise/Team or the OpenAI API with a signed data processing agreement (DPA) for corporate data, where customer data is not used for training by default. Consumer ChatGPT is unsuitable for this because a binding DPA and configurable data residency are missing.
Mistral and Aleph Alpha: The GDPR Advantages of European LLM Providers
EU LLM providers are EU-based language-model providers such as Mistral AI (France) and Aleph Alpha (Germany). Their GDPR advantage: EU data processing, no third-country transfer to the US and therefore no structural CLOUD Act or FISA 702 risk. This simplifies transfer impact assessments but does not replace a full compliance review.
Third-Country Transfers to the USA: Data Privacy Framework and AI
The EU-US Data Privacy Framework (DPF) is the European Commission's adequacy decision in force since 10 July 2023, on the basis of which personal data may be transferred to certified US companies. For AI agents using US providers it is currently the central, though legally contested, basis for third-country transfers under Chapter V GDPR.
Data Protection Impact Assessment (DPIA) for AI Agents: Obligation, Thresholds and a Step-by-Step Approach
A data protection impact assessment (DPIA) is mandatory under Art. 35 GDPR where processing is likely to result in a high risk to the rights and freedoms of natural persons. For AI agents it is almost always required, because they profile, make automated decisions or deploy innovative technology.