The consent question L&D teams are asking in 2025
A Dutch L&D manager at a financial services firm recently asked me: "We want to clone our lead trainer's voice to create an AI coach for customer service scenarios. Do we need written consent? What about employee data when they practice with it?"
She's not alone. Between February and April 2025, I've had variations of this conversation with twelve different training companies across the Netherlands. The EU AI Act took effect in February 2025 with mandatory AI literacy requirements, but specific consent frameworks for voice cloning in training contexts remain fragmented across member states.
That changes in 2026. The European Data Protection Board's working group on biometric data processing is finalising harmonised guidelines for voice cloning consent that will apply across all EU training providers. The draft guidance, circulated to DPAs in March 2025, establishes three consent tiers based on use case and data retention.
Here's what training companies need to know now, while implementation timelines are still flexible.
Why voice cloning requires different consent than other training tools
Most corporate training tools operate under existing GDPR provisions. Learning management systems, video platforms, and assessment tools process employee data under legitimate interest or contractual necessity. Voice cloning sits in a different category.
Under the GDPR's biometric data provisions (Article 9), voice patterns that can identify an individual constitute special category data. When you clone a trainer's voice to create an AI coach, you're processing their unique vocal characteristics. When employees practice with that AI coach, their voice recordings may also constitute biometric data depending on how they're processed and stored.
The distinction matters because special category data requires explicit consent or one of the limited alternative legal bases under Article 9(2). "Legitimate interest" doesn't apply. Contract performance alone isn't sufficient unless the voice processing is strictly necessary for the employment contract, which it rarely is in training contexts.
The 2026 guidelines clarify this with specific consent requirements for three voice cloning scenarios in workplace settings.
Three consent tiers for voice cloning in training
The draft EDPB guidance establishes three consent frameworks based on how voice data is captured, processed, and retained. Dutch training companies implementing AI voice coaching need to understand which tier applies to their use case.
Tier 1: Trainer voice cloning with immediate deletion
This covers the most common training scenario: cloning a trainer's voice to create an AI coach, where the original voice sample is deleted after the cloning model is created, and no voice biometric template is retained separately.
Required consent elements:
- Written consent document specifying that the trainer's voice will be cloned to create an AI coaching agent
- Explicit statement that the original audio sample will be deleted within 30 days of model creation
- Description of use: who will interact with the AI coach, for what training purposes, and for how long
- Right to withdraw: clear process for the trainer to revoke consent and request deletion of the voice model
- No adverse consequences clause: explicit statement that refusal or withdrawal will not impact employment status or trainer relationships
For independent trainers working with multiple clients, the consent document must specify each organisation that will use the cloned voice. For in-house trainers, the consent applies within their employing organisation but requires renewal if the voice model is shared externally (e.g., with partner companies or training networks).
Tier 2: Employee practice sessions with ephemeral storage
When employees practice conversations with an AI voice coach, their responses are typically recorded for immediate transcription and evaluation, then deleted. Under the 2026 guidelines, this qualifies as ephemeral storage if audio is deleted within 72 hours and no voice biometric analysis is performed.
Required consent elements:
- Clear notice at the start of each practice session that voice will be recorded
- Specific retention window: "Your voice will be recorded during this practice session and deleted within 72 hours"
- Purpose limitation: voice data used only for immediate feedback generation, not for identity verification or ongoing monitoring
- Opt-out mechanism: employees can complete text-based alternatives without voice recording
This tier doesn't require separate signed consent if the training participation itself is part of normal employment duties and voice recording is clearly disclosed. However, the opt-out mechanism is mandatory. You cannot require voice participation in AI coaching as a condition of employment or training completion.
Tier 3: Long-term voice data retention for progress tracking
Some training implementations retain voice recordings beyond 72 hours for quality assurance, coaching review, or longitudinal progress tracking. This moves voice processing into permanent biometric data territory and requires the strongest consent framework.
Required consent elements:
- Separate explicit consent beyond general training participation agreement
- Specific retention period with maximum duration (EDPB guidance suggests 12 months maximum for training contexts)
- Named access list: which roles can access stored recordings (e.g., direct manager, L&D team, external consultants)
- Technical safeguards disclosure: encryption, access logging, and data residency location
- Periodic renewal: consent must be re-confirmed annually if recordings are retained beyond initial training period
Most European training implementations avoid Tier 3 entirely by designing for ephemeral storage. The compliance complexity and employee privacy concerns rarely justify long-term voice retention for practice-based learning.
The European data residency requirement for voice cloning
Beyond consent frameworks, the 2026 guidance includes a data localisation preference for voice biometric data processing. While not a hard requirement under GDPR (which allows international transfers with adequate safeguards), the EDPB working group notes that voice cloning for EU employees should use EU-based infrastructure "wherever technically feasible."
This creates a practical compliance advantage for training platforms with European data residency. When voice data never leaves EU servers, you avoid the additional documentation burden of Transfer Impact Assessments and Standard Contractual Clauses for international data transfers.
For Dutch training companies evaluating AI voice coaching platforms, this means asking vendors two specific questions:
- Where is voice data processed during cloning? (Voice model training can happen on non-EU infrastructure if the resulting model doesn't contain extractable voice samples)
- Where is employee practice data processed and stored? (This should remain in the EU to avoid transfer complexity)
The distinction matters. A platform might clone a trainer's voice using US-based AI infrastructure (acceptable under Tier 1 if the audio sample is deleted), but all employee practice sessions should process through EU servers to maintain data residency for the higher-volume, ongoing voice interactions.
What consent looks like in practice: a training company example
A Rotterdam-based sales training company recently implemented AI voice coaching for their team of twelve trainers. Here's how they structured consent across both trainer voice cloning and employee practice sessions.
For trainer voice cloning: Each trainer received a two-page consent document explaining that their voice would be cloned to create an AI coach delivering their sales methodology. The document specified that the original audio sample (three minutes of guided recording) would be deleted within 14 days of voice model creation. It listed the three corporate clients whose sales teams would interact with the AI coach, and included a withdrawal process where the trainer could request voice model deletion with 30 days' notice.
The training company included a financial consideration: trainers receive a quarterly licensing fee when their AI voice coach is used in client implementations. This creates a commercial relationship around the voice licensing, which strengthens the consent framework by demonstrating clear value exchange beyond employment duties.
For employee practice sessions: Each corporate client's sales team receives clear notice when starting an AI practice conversation: "This session will record your voice for immediate feedback generation. Recordings are deleted within 72 hours. You can switch to text-based practice at any time." The platform offers a toggle in the practice interface to disable voice input and use text responses instead.
No sales team members are required to complete voice-based practice. Text alternatives provide equivalent practice opportunities with the same coaching feedback, just without voice analysis of tone, pace, or confidence markers. This opt-out mechanism satisfies the Tier 2 requirement that voice participation cannot be mandatory.
By designing their implementation around Tier 1 and Tier 2 consent requirements, the training company avoids the additional compliance complexity of long-term voice data retention while still delivering effective AI coaching at scale.








