Skip to content
← Research

2026-06-27

Audio · Sonic Identity · Strategy

Sound is the next synthetic surface.

Voice, music, and sonic identity are about to be generated at scale. Brands should be ready.

By AIM Lab· 7 min read· 3 citations· Studio essay

Abstract

Text and image generation dominated the first wave of generative AI. Audio is the next surface. This piece maps the rise of synthetic voice, adaptive music, and AI-assisted sound design, and argues that the brands with a documented sonic identity will be the ones that survive the change intact.

Text and image generation consumed the first wave of public attention around generative AI. Audio is next. In 2024 and 2025, synthetic voice, adaptive music, and AI-assisted sound design crossed from demo to production. The tools are now good enough to ship, and the legal and brand frameworks around them are not.

The three audio surfaces

Audio breaks into three distinct production surfaces, each with its own tooling and risk profile. - Synthetic voice. Voice cloning and text-to-speech models can now produce broadcast-quality narration in multiple languages from a small licensed sample. The risk is likeness rights, disclosure, and the listener's trust [1]. - Adaptive music. Generative music systems can score to picture, loop infinitely, and adapt to mood or interaction. The risk is homogenization and the erosion of composer credit [2]. - Sound design. AI-assisted libraries and procedural synthesis can generate foley, textures, and backgrounds. The risk is that the sound becomes unmoored from a real source, which changes how audiences feel a scene.

The ear is less forgiving than the eye. A slightly wrong image is a style choice. A slightly wrong voice is uncanny.

Why sonic identity matters now

A sonic identity — a documented set of voices, instruments, sonic textures, and cadences — is the only way a brand can generate audio at scale without becoming a different brand in every execution. Without it, every synthetic voiceover, every AI music track, every procedural sound becomes a random draw from a model's training distribution.

The same thing happened to visual identity in the early days of generative images. Brands that had strong visual systems could steer the models. Brands that did not became averages of their categories. Audio is repeating the pattern, compressed into a shorter window [3].

What a brand should document

We advise clients to build a sonic identity document before they generate their first synthetic audio asset. It should cover: 1. Voice profile. If the brand uses a synthetic voice, what does it sound like? What does it never sound like? Who owns the rights? 2. Instrumental palette. The instruments, timbres, and production treatments that are on-brand and off-brand. 3. Cadence and silence. How the brand uses rhythm, pause, and density. Generative audio tends to overfill. The identity document is where the negative space is defined. 4. Disclosure rules. Where and how the brand labels AI-generated audio, per jurisdiction and per platform.

The opportunity for campaigns

Synthetic audio is not only a cost play. It lets a campaign travel across languages, lengths, and contexts without re-recording. The same spot can have a dozen voice variants, each adapted to a market, without losing the original performance's timing. The same music can stretch, loop, and respond to a user's path through an experience [2].

But the opportunity is conditional. The brand has to own the identity first. Otherwise, the scale becomes noise.

The takeaway

Audio is becoming generative. The brands that will benefit are the ones that treated their sonic identity as a real asset before the tools made it cheap to ignore.

Sources

  1. [1]ElevenLabs. Voice cloning and multilingual speech synthesis research https://elevenlabs.io/research(accessed 2026-06-27)
  2. [2]Suno / Udio. Generative music models and artist rights landscape (2024–2025) https://www.suno.ai/(accessed 2026-06-27)
  3. [3]Spotify. Sonic identity and audio branding research https://ads.spotify.com/en-us/insights/(accessed 2026-06-27)