Skip to content

Breaking News

Industry Insider — AEQ schedules February intro webinar for new console

NStories Studios invites radio stations to join global broadcast

CRA welcomes registration of new commercial radio code of practice

DRM releases World Radio Day schedule

Cumulus Media appoints Trey Dolle as VP in Cincinnati

Italian broadcasters to mark World Radio Day with live AI-focused simulcast

Hutton Broadcasting selects Nautel NV20LT for KQBA

Thai Government Radio upgrades FM transmission with Orban Optimod 5950

2026 NAB Show relocates TV and Radio HQ to Central Hall

vCreative and Radio.Cloud integrate to streamline workflows

Friday February 13, 2026
Partners
Newsletter
Contact us
About
Edit Content
RedTech RedTech
  • News & Business
  • Strategy & Views
  • Technology
  • Products
  • All stories
  • Contact
  • Advertise
DAC System Redesigns Website
Trending
DAC System Redesigns Website

Salem Media Group
Featured

New regional GM and GSM for Salem Media Pittsburgh

Jason Mosher and Dave Cuddihy join the Pittsburgh team

Featured

Audacy named exclusive U.S. sales partner for Sonos Radio

The agreement refers to Sonos Radio’s streaming inventory

Inrush Broadcast Services, Andy Gunn, people, appointments, United States
Featured

Gunn joins Inrush Broadcast Services

Newly created position of director of project management and process engineering

Featured Strategy & Views

Bob Orban reflects on 50 years of audio processing solutions

The broadcast pioneer discusses solving multiple technical challenges

AEQ, consoles, webinar
Events Featured News & Business

Industry Insider — AEQ schedules February intro webinar for new console

The online event takes place Feb. 19 at 9:30 a.m. CET

Featured Strategy & Views

When a golden ear meets a neural net

A request to change a girl into a wolf sparked a new approach to AI-driven sound design

  • Contact
  • About RedTech
RedTech RedTech
  • News & Business
  • Strategy & Views
    • Strategy & Views
    • Videos
  • Technology
    • Tech Focus
  • Products
  • Events
    • RedTech Summit 2026
    • Previous RedTech Summits
      • RedTech Summit 2025
      • RedTech Summit 2024
      • RedTech Summit 2023
      • RedTech Summit 2022
    • RadioWeek 2026
      • RadioWeek 2025
      • RadioWeek 2024
      • RadioWeek 2023
    • Global Online Content Series 2024
    • Events
      • 2026 NAB Show
      • World Radio Day 2026
      • IBC2025
      • 2025 NAB Show
      • IBC2024
      • 2024 NAB Show
      • IBC2023
      • 2023 NAB Show
      • IBC2022
    • Events Calendar
  • Publications
  • Advertise
  • News & Business
  • Strategy & Views
    • Strategy & Views
    • Videos
  • Technology
    • Tech Focus
  • Products
  • Events
    • RedTech Summit 2026
    • Previous RedTech Summits
      • RedTech Summit 2025
      • RedTech Summit 2024
      • RedTech Summit 2023
      • RedTech Summit 2022
    • RadioWeek 2026
      • RadioWeek 2025
      • RadioWeek 2024
      • RadioWeek 2023
    • Global Online Content Series 2024
    • Events
      • 2026 NAB Show
      • World Radio Day 2026
      • IBC2025
      • 2025 NAB Show
      • IBC2024
      • 2024 NAB Show
      • IBC2023
      • 2023 NAB Show
      • IBC2022
    • Events Calendar
  • Publications
  • Advertise

Click Here to Subscribe to RedTech's Newsletter

RedTech RedTech
  • News & Business
  • Strategy & Views
    • Strategy & Views
    • Videos
  • Technology
    • Tech Focus
  • Products
  • Events
    • RedTech Summit 2026
    • Previous RedTech Summits
      • RedTech Summit 2025
      • RedTech Summit 2024
      • RedTech Summit 2023
      • RedTech Summit 2022
    • RadioWeek 2026
      • RadioWeek 2025
      • RadioWeek 2024
      • RadioWeek 2023
    • Global Online Content Series 2024
    • Events
      • 2026 NAB Show
      • World Radio Day 2026
      • IBC2025
      • 2025 NAB Show
      • IBC2024
      • 2024 NAB Show
      • IBC2023
      • 2023 NAB Show
      • IBC2022
    • Events Calendar
  • Publications
  • Advertise

Click Here to Subscribe to RedTech's Newsletter

Featured Strategy & Views

The Innovators: AI voice technology for broadcast: A professional audio perspective

by Raoul Wedel June 10, 2024 10 min read
 The Innovators: AI voice technology for broadcast: A professional audio perspective
Print Friendly, PDF & Email

THE HAGUE, Netherlands — In the diverse world of artificial intelligence voice technology, much of the innovation is tailored toward e-learning platforms and social media content, leaving a gap in solutions specifically designed for high broadcast production standards.

This guide aims to bridge that gap for professionals in the audio industry, offering insights into the crucial aspects of AI voice technology that must be scrutinized when selecting a system for broadcast use. It highlights the need for pro audio users to delve deeper into the capabilities of AI voice solutions, ensuring that the technology they choose can rise to the challenges of professional broadcasting.

Sample rate: Pro audio standards and spectrum analysis

The sample rate in AI voice technology is critical for sound quality, especially in professional audio. Most AI voice technologies typically use 16 kHz or 24 kHz, but pro audio standards require 48 kHz. While some systems may produce 48 kHz files, the underlying technology may not. The internal sample rate can be found by analyzing the audio spectrum, where lower rates will show cutoff frequencies at 8 kHz or 12 kHz.

SSML support: Tailoring AI voice for pro audio

Speech Synthesis Markup Language is a crucial tool in professional audio for refining style, delivery and pronunciation. Many text-to-speech providers do not support this feature.

SSML features the following:

  • Prosody: Fine-tunes pitch, rate and volume for emotional depth and content emphasis.
  • Pause: Strategically placed pauses to enhance speech rhythm and realism.
  • Emphasis: Strengthens expression by highlighting keywords or phrases.
  • Phoneme: Precisely dictates pronunciation, ensuring accuracy for complex terms.
  • Say-as: Instructions on the vocal interpretation of specialized content.
  • Voice and language: Adapts voice traits and supports multilingual projects — essential for diverse pro audio applications.
  • Multistyle voice generation.
  • AI voice systems with multistyle generation capabilities can produce various speech styles and emotions, such as cheerful, serious, formal, informal and enthusiastic. The degree of these styles can be adjusted, offering nuanced variations in tone and enhancing the versatility of the voice output.
Raoul Wedel
Wedel Software and Adthos CEO Raoul Wedel

Lexicon: The foundation for accurate pronunciation

A comprehensive lexicon is crucial in AI voice technology for resolving pronunciation issues. The lexicon serves as a reference guide for the AI, ensuring it pronounces words correctly, especially those that are uncommon, technical or borrowed from other languages.

The need for a lexicon arises from the inherent limitations of text-to-speech systems in accurately predicting the pronunciation of every word.

Words with nonstandard pronunciations, industry-specific jargon, names and borrowed words often pose challenges. A well-developed lexicon addresses these challenges by providing specific pronunciation guides for these words.

When pronunciation errors are identified, corrections are applied directly to the lexicon.
This involves specifying the phonetic representation of the word or phrase according to the International Phonetic Alphabet or a similar phonetic system.

Once updated, the AI model references this lexicon to produce the correct pronunciation.

Voice conversion for targeted voice performances

Voice conversion technology in AI is particularly valuable for specific voice performance requirements, like in advertising. It enables the transformation of one voice to another while maintaining a speaker’s accent and pronunciation.

Fixed-length voice outputs in advertising

AI voice technology can produce speech within exact time constraints, which is essential for advertising where fixed-length ads are crucial.

A comprehensive lexicon is crucial in AI voice technology for resolving pronunciation issues. The lexicon serves as a reference guide for the AI, ensuring it pronounces words correctly, especially those that are uncommon, technical or borrowed from other languages.

Training costs vs. operational expenses

The economics of AI voice technology can vary significantly. Some systems may be cheaper to train but come at a higher cost per character. This is because the initial training cost, while potentially lower, doesn’t always translate to lower operational costs. The cost per character generated depends on various factors, including the sophistication of the technology, the quality of the output and the efficiency of the AI algorithms.

Systems with lower training costs might use less sophisticated models, resulting in higher per-character costs due to less efficient processing or the need for more post-processing to reach a desired quality level. Conversely, systems with higher initial training costs often use more advanced models, leading to lower costs per character due to more efficient processing and higher-quality outputs that require less editing.

Audio processing in AI voice technology

Audio processing is crucial for broadcast-quality voice content, involving level adjustments, equalization and compression to meet broadcasting standards.

However, using processed speech to train AI voice models can introduce artifacts, affecting the model’s performance and resulting in a less natural AI voice. It’s advisable to apply audio processing post-text-to-speech creation, ensuring the model trains on clean speech while the final output benefits from enhanced audio quality, maintaining the fidelity essential for broadcast standards.

Summary

AI voice technology, with its diverse capabilities like SSML support, lexicon accuracy, multistyle voice generation and voice conversion, offers transformative potential for digital voice interaction.

nderstanding these aspects, including the importance of audio processing and economic considerations, is crucial for effectively using this technology.

The author is CEO of Wedel Software and Adthos.

More stories about AI

Futuri pushes AI SpotOn into TopLine

Editorial responsibility in times of generative AI

Tags: AI Audio A.I.
Previous post
Next post

Raoul Wedel

contributor


Most Recent
Featured

New regional GM and GSM for Salem Media Pittsburgh

February 12, 2026
Featured

Audacy named exclusive U.S. sales partner for Sonos Radio

February 12, 2026
Featured

Gunn joins Inrush Broadcast Services

February 11, 2026
Latest Newsletters

5 Feb 2026 – Saudi Media Forum Explores Transformation | Free AI Tools | Broadcasters Reunite

1 Feb 2026 – RedTech Magazine Jan/Feb is here!

29 Jan 2026 – Reinventing Content Creation | Philippe Generali Retires | AI Energizes Cumulus

22 Jan 2026 – Rebuilding For Visual | RadioWeek Next Week | Trouble In Italy

15 Jan 2026 – Fishy Collaborative Podcasting | Italian FM Interference | Podcast Growing at Home

8 Jan 2026 – London Calling U.DAB | Audio Listening Habits | Sweden’s FM Race

30 Dec 2025 – The Quiet Engineering Behind Radio’s Next Phase

18 Dec 2025 – Radio 2 Winter Heat | Radio’s Human Advantage | Mediaset Muscles Up

11 Dec 2025 – Growing Nordic Radio | Lighting Up Christmas | A Commemorative Stamp

10 Dec 2025 – Meet The Solutioneers 2025/2026

4 Dec 2025 – Africa IP Shift | MPW Scholarships | LATAM Listener Trends

2 Dec 2025 – RedTech Magazine November/December 2025 Is Here!

27 Nov 2025 – Bright Color Radio | Win For Bauer | Radio Still On Receivers

20 Nov 2025 – Football-Mad Radio | 30 Under 30 Talent | Berlin Online Listening

13 Nov. 2025 – AI Radio News | Debating Radio’s Impact | Immersive Streaming Audio

6 Nov 2025 – Music An Asset |Bold Aussie Radio | DRM Drives India

30 Oct 2025 – Africa’s Collective Voice | AI As PD | Bauer Media Group realigns

23 Oct 2025 – Culture Powers Growth | 60 Years Of Innovation | Marconi Awards Winners

16 Oct 2025 – Is DAB+ The Answer? | Saothair Acquires GatesAir | Rethinking The Radio Console

9 Oct 2025 – Campus Radio Project | In The Club | AI In The Driver’s Seat

8 Oct 2025 – RedTech Magazine September/October 2025

2 Oct 2025 – BBC Mobile Tech | NPO Cuts Jobs | Awards Canned

25 Sept 2025 – AI Revisited | Rádio Rock Powers Up | RTL’s Six Of The Best

18 Sept 2025 – IBC2025 Insights | RedTech Award Winners | 2 Minutes Of Tech

11 Sept 2025 – Hearing Children’s Voices | Broadcast Giants Honored | Virtual Mixing

5 Sept 2025 – Read Now — Radio Futures: AI and Radio

4 Sept 2025 – IBC2025 All Change | Incentivizing Digital Transition | Video Takes The Lead

 

Related Stories for you

When a golden ear meets a neural net

by Davide Moro February 10, 2026 12 min read

A request to change a girl into a wolf sparked a new approach to AI-driven sound design

Super Hi-Fi, Connoisseur Media, AI

Super Hi-Fi and Connoisseur Media partner on AI

by Brett Moss January 27, 2026 4 min read

Connoisseur CEO to join Super Hi-Fi board

Human voices: Radio’s edge in the age of AI

by David Fernández Quijada December 15, 2025 8 min read

Distinctive human values become an asset in an AI-flooded media landscape

RedTech RedTech

RedTech International SAS
250 bis boulevard Saint-Germain
75007 Paris, France

contact@redtech.pro

Subscribe to our newsletter

About

About Us
Work With Us
Contact Us

Advertising

Advertise

Useful Links

Partners
Newsletter

more

Terms and Conditions
Privacy Policy

latest news

Salem Media Group
Featured

New regional GM and GSM for Salem

Featured

Audacy named exclusive U.S. sales partner for

Inrush Broadcast Services, Andy Gunn, people, appointments, United States
Featured

Gunn joins Inrush Broadcast Services

Featured

Bob Orban reflects on 50 years of

AEQ, consoles, webinar
Events

Industry Insider — AEQ schedules February intro

Follow us:

Copyright RedTech International 2026. All Rights Reserved