Skip to content

Breaking News

EBU Names Annsofi Eriksson as Chief Technology Officer

Genelec monitors support Titrafilm’s expansion of Paris post-production facilities

ArtSound FM upgrades Canberra studios with DHD consoles

RCS Worldwide names Susan Larkin president and CEO

NAB Show — Nautel releases speakers list for Radio Technology Forum

AEQ equips Lisbon’s Correio da Manhã Rádio studios with IP-based infrastructure

MIW opens applications for country radio mentorship program

Australian Podcast Ranker enhances audience data tools

Café Nashville relaunches syndicated shows using Radio.Cloud

Malaysian communications minister to open ABU DBS 2026

Friday March 27, 2026
Partners
Newsletter
Contact us
About
RedTech RedTech
  • News & Business
  • Strategy & Views
  • Technology
  • Products
  • All stories
  • Contact
  • Advertise
DAC System Redesigns Website
Trending
DAC System Redesigns Website

Cumulus Media, Darlene Park
Featured

Cumulus Media expands Park’s portfolio

Indianapolis, Kokomo, Muncie stations and digital properties under her watch

Events Featured

Lawo builds anticipation ahead of April 8 “One” reveal

The “one” innovation could make system designs "more efficient, flexible and future-ready"

Dielectric, RF equipment, infrastructure, NAB Show 2026, heat exchanger
2026 NAB Show Featured Products

NAB Show — Dielectric debuts OptiLoad heat exchanger

Much of the equipment remains outside

GatesAir, monitoring, NAB Show 2026, AirWatch365
2026 NAB Show Featured Products

NAB Show — GatesAir adds to AirWatch365 service

Expansion of availability and a launch of hardware portal

Annsofi Eriksson
Featured News & Business

EBU Names Annsofi Eriksson as Chief Technology Officer

Will lead a newly formed department unifying the organization's work across technology, AI and digital platforms

Featured News & Business Technology

Genelec monitors support Titrafilm’s expansion of Paris post-production facilities

The installation includes Atmos-ready mix rooms and Smart IP ceiling channels

  • Contact
  • About RedTech
RedTech RedTech
  • News & Business
  • Strategy & Views
    • Strategy & Views
    • Videos
  • Technology
    • Tech Focus
  • Products
  • Events
    • RedTech Summit 2026
    • Previous RedTech Summits
      • RedTech Summit 2025
      • RedTech Summit 2024
      • RedTech Summit 2023
      • RedTech Summit 2022
    • RadioWeek 2026
      • RadioWeek 2025
      • RadioWeek 2024
      • RadioWeek 2023
    • Global Online Content Series 2024
    • Events
      • 2026 NAB Show
      • World Radio Day 2026
      • IBC2025
      • 2025 NAB Show
      • IBC2024
      • 2024 NAB Show
      • IBC2023
      • 2023 NAB Show
      • IBC2022
    • Events Calendar
  • Publications
  • Advertise
  • News & Business
  • Strategy & Views
    • Strategy & Views
    • Videos
  • Technology
    • Tech Focus
  • Products
  • Events
    • RedTech Summit 2026
    • Previous RedTech Summits
      • RedTech Summit 2025
      • RedTech Summit 2024
      • RedTech Summit 2023
      • RedTech Summit 2022
    • RadioWeek 2026
      • RadioWeek 2025
      • RadioWeek 2024
      • RadioWeek 2023
    • Global Online Content Series 2024
    • Events
      • 2026 NAB Show
      • World Radio Day 2026
      • IBC2025
      • 2025 NAB Show
      • IBC2024
      • 2024 NAB Show
      • IBC2023
      • 2023 NAB Show
      • IBC2022
    • Events Calendar
  • Publications
  • Advertise

Click Here to Subscribe to RedTech's Newsletter

RedTech RedTech
  • News & Business
  • Strategy & Views
    • Strategy & Views
    • Videos
  • Technology
    • Tech Focus
  • Products
  • Events
    • RedTech Summit 2026
    • Previous RedTech Summits
      • RedTech Summit 2025
      • RedTech Summit 2024
      • RedTech Summit 2023
      • RedTech Summit 2022
    • RadioWeek 2026
      • RadioWeek 2025
      • RadioWeek 2024
      • RadioWeek 2023
    • Global Online Content Series 2024
    • Events
      • 2026 NAB Show
      • World Radio Day 2026
      • IBC2025
      • 2025 NAB Show
      • IBC2024
      • 2024 NAB Show
      • IBC2023
      • 2023 NAB Show
      • IBC2022
    • Events Calendar
  • Publications
  • Advertise

Click Here to Subscribe to RedTech's Newsletter

Featured Strategy & Views

When a golden ear meets a neural net

by Davide Moro February 10, 2026 12 min read
 When a golden ear meets a neural net
Pedro Leite, left, and Luiz Fernando Kruszielski in a Globo post production room. Photo: Carlos Eduardo Rocha Miranda
Print Friendly, PDF & Email

RIO DE JANEIRO — It began like many good stories: Two colleagues with entirely different backgrounds, fired by a common, genuine passion for hands-on research, unexpectedly collaborating on an abstract concept. The one, Pedro Leite, machine learning engineer and AI researcher at Grupo Globo, has spent years exploring generative audio systems. The other, Luiz Fernando Kruszielski, is an innovation technologies specialist at Globo and a veteran sound engineer with a reputation for a “golden ear” — the ability to instinctively hear what others miss. 

Kruszielski and Leite were intrigued by early AI technology that enabled singers to create different voices and wondered whether it could produce something suitable for radio broadcasting.

The AI speech transformation process changes a performer’s voice into a target voice using information about timbre, pitch and spectrum — a sort of “voice DNA” — from a voice model. Emotions come from the performer’s voice — the source. This way, “the source plays with the target (voice), and the result is a very reliable voice,” Kruszielski said. “In broadcast applications, listeners shouldn’t be able to perceive the resultant voice as something unnatural.” 

Not all sounds have a fundamental frequency, but in voiced speech, the vocal folds generate a fundamental frequency — the basic rate at which they vibrate — and this determines the perceived pitch of the voice. The harmonic structure built on this base frequency shapes the timbre, the tonal quality that makes one voice sound different from another. Because every speaker has a unique combination of fundamental frequency and harmonic patterns, AI-assisted speech-to-speech systems must analyze these elements to recreate a speaker’s characteristic timbre while preserving the timing, rhythm, melody and emotional contours of the original performance. 

Perhaps the most impactful application for high-volume productions is converting low-quality recordings into studio-grade audio. 

Transforming into a wolf

In the beginning, Kruszielski and Leite explored the technology simply out of interest, running quick tests on open source models with hardware available in their lab. It was interesting but not obviously useful. The time to push their research a step further came when they encountered a particular creative challenge. A key scene of a Globo drama series “Vermelho Sangue” (“Blood Red”) required a girl to transform into a werewolf, and her voice had to transform accordingly. The writing team wanted accuracy — not a movie-style monster or a generic animal sound, but the vocalization of a specific species. They tried every traditional technique, including layering recordings, shifting pitch and formants (the resonant frequencies of the vocal tract that shape the characteristic timbre of a voice or vowel sound, independent of pitch), blending organic and synthetic tones, and using early voice-to-voice models. Yet nothing felt authentic enough.

Promotional poster for Globo’s “Vermelho Sangue” series. Photo: Grupo Globo

Kruszielski and Leite realized that if they wanted authenticity, they had to start with an authentic source. It would become the defining insight of the entire project. That meant real wolf vocalizations — scientifically documented recordings.

So, their next meeting was with a wildlife biologist specializing in animal vocalizations who had an extensive archive of wolf recordings. Unfortunately, the first outputs based on those samples sounded synthetic and unrealistic. Kruszielski and Leite didn’t give up. The field recordings included layers of environmental noise, such as wind, rustling vegetation and distant birds. To train a model capable of producing realistic transformations, they needed isolated vocalizations. So, they carefully cleaned each file, separating harmonic content from wildlife ambience and removing contamination without damaging the integrity of the wolf’s “voice.”

As soon as the refined dataset was fed into the AI model, everything changed. The voice of the transformed girl carried the texture, tension and resonance of a real animal. When they played the result for the director, he stood up from his chair. AI speech-to-speech was no longer an experiment. It was ready for production.

Built for sound engineers

The backbone of the system Kruszielski and Leite designed is the open-source RVC Project AI algorithm. Although powerful and flexible, it is not intended for everyday workflows in a sound department. It required command-line interfaces, cryptic flags, hidden configuration files and robust IT skills.

The purpose-designed graphic interface allows sound engineers to interact with the RVC processing engine in a familiar way.
Photo: Carlos Eduardo Rocha Miranda

Kruszielski and Leite designed a custom GUI specifically for audio specialists, with familiar controls. By reframing the AI system as studio-grade software rather than a technical experiment, they made it accessible to colleagues across the production team. The entire processing runs on a consumer-grade, gamer-level graphics card from the Nvidia RTX 40 family, which retails at a price well within the reach of any production studio and capable of faster-than-real-time processing. What started as a two-person side project became part of Globo’s broader audio production workflow.

The team applied lessons learned from the wolf transformation to everyday production challenges, such as correcting minor dialog mistakes without recalling actors for costly retakes. They have also used the system to modify accents. In one case, an actress needed to perform with a Yiddish accent that proved challenging during the shoot. With a reference sample and a carefully tuned AI audio model, they were able to shift her accent in post while preserving the shape, emotion and timing of her original performance. The result was seamless and expressive.

The risk of collapsing an illusion

Perhaps the most impactful application for high-volume productions is converting low-quality recordings into studio-grade audio. Actors or talents who are traveling or unavailable for booth time can record a line on a phone, in a hotel room, or anywhere convenient. The AI system reperforms the lines using the actor’s or talent’s vocal identity, producing audio that sounds as if it were recorded in ideal studio conditions.

For teams producing large volumes of scripted content, this flexibility can be transformative, improving the quality of everyday productions while saving time.

Kruszielski believes the technology does not support the idea AI might soon replace actors wholesale. While AI models can reproduce tone, timbre and certain expressive gestures, they cannot comprehend the subtle patterns that make a human performance unique. “An actor is defined not just by the sound of their voice but by their micropauses, breathing rhythm, nuanced hesitations, emotional timing and the way tension rises and releases across a line,” he explained.

Current AI models can approximate fragments of this but not sustain these characteristics across a long monolog without drifting into something increasingly unnatural. “As soon as a listener senses that something is off, the immersive experience breaks. The illusion collapses,” Kruszielski warned. For that reason, the team insists the technology is best understood not as a replacement for performers but as a tool that enhances their work, offers flexibility, and preserves creative intent.

After receiving a Master of Science in Engineering, the author worked for Telecom Italia and the Italian public broadcaster, Rai. Based in Bergamo, Italy, he now spends his time as a broadcast consultant for radio stations and equipment manufacturers, specializing in project management, network design and field measurement.

This article first appeared in the January/February 2026 edition of RedTech Magazine. You can read or download this edition for free here. You can access past editions of RedTech Magazine, also for free, here.

You might be interested in these stories

Super Hi-Fi and Connoisseur Media partner on AI

Saudi Media Forum to spotlight the kingdom’s media ambitions

DAB+ expansion gains momentum across Europe

Tags: AI AI Audio Grupo Globo RedTech Magazine January/February 2026
Previous post
Next post

Davide Moro

contributor


Most Recent
Featured

Cumulus Media expands Park’s portfolio

March 27, 2026
Events

Lawo builds anticipation ahead of April 8 “One” reveal

March 27, 2026
2026 NAB Show

NAB Show — Dielectric debuts OptiLoad heat exchanger

March 27, 2026
Latest Newsletters

19 March 2026 – Dual Standard Argument | Making Magic Between Music | Dashboard Dolby Atmos

12 March 2026 – 26 Radio Lessons | Japan Turns To Shortwave | RTL Belgium’s New Boss

5 March 2026 – Radio’s Competitive Advantage | Local News Rules | Radio Mandatory in Cars

25 Feb 2026 – East Africa Rising | Swedish Shake Up | Finnish Radio Strong

19 Feb 2026 – Young African Digital Voices | Bauer Drives Connected Journeys | Audio Campaign Effectiveness

12 Feb 2026 – AI Sound Design | World Radio Day Global Broadcast | New Code of Practice

5 Feb 2026 – Saudi Media Forum Explores Transformation | Free AI Tools | Broadcasters Reunite

1 Feb 2026 – RedTech Magazine Jan/Feb is here!

29 Jan 2026 – Reinventing Content Creation | Philippe Generali Retires | AI Energizes Cumulus

22 Jan 2026 – Rebuilding For Visual | RadioWeek Next Week | Trouble In Italy

15 Jan 2026 – Fishy Collaborative Podcasting | Italian FM Interference | Podcast Growing at Home

8 Jan 2026 – London Calling U.DAB | Audio Listening Habits | Sweden’s FM Race

30 Dec 2025 – The Quiet Engineering Behind Radio’s Next Phase

18 Dec 2025 – Radio 2 Winter Heat | Radio’s Human Advantage | Mediaset Muscles Up

11 Dec 2025 – Growing Nordic Radio | Lighting Up Christmas | A Commemorative Stamp

10 Dec 2025 – Meet The Solutioneers 2025/2026

4 Dec 2025 – Africa IP Shift | MPW Scholarships | LATAM Listener Trends

2 Dec 2025 – RedTech Magazine November/December 2025 Is Here!

27 Nov 2025 – Bright Color Radio | Win For Bauer | Radio Still On Receivers

20 Nov 2025 – Football-Mad Radio | 30 Under 30 Talent | Berlin Online Listening

13 Nov. 2025 – AI Radio News | Debating Radio’s Impact | Immersive Streaming Audio

6 Nov 2025 – Music An Asset |Bold Aussie Radio | DRM Drives India

30 Oct 2025 – Africa’s Collective Voice | AI As PD | Bauer Media Group realigns

23 Oct 2025 – Culture Powers Growth | 60 Years Of Innovation | Marconi Awards Winners

16 Oct 2025 – Is DAB+ The Answer? | Saothair Acquires GatesAir | Rethinking The Radio Console

9 Oct 2025 – Campus Radio Project | In The Club | AI In The Driver’s Seat

8 Oct 2025 – RedTech Magazine September/October 2025

2 Oct 2025 – BBC Mobile Tech | NPO Cuts Jobs | Awards Canned

25 Sept 2025 – AI Revisited | Rádio Rock Powers Up | RTL’s Six Of The Best

18 Sept 2025 – IBC2025 Insights | RedTech Award Winners | 2 Minutes Of Tech

11 Sept 2025 – Hearing Children’s Voices | Broadcast Giants Honored | Virtual Mixing

5 Sept 2025 – Read Now — Radio Futures: AI and Radio

4 Sept 2025 – IBC2025 All Change | Incentivizing Digital Transition | Video Takes The Lead

 

Related Stories for you

Malawi eases financial pressure on community radio stations

by Lameck Masina March 20, 2026 9 min read

A donation of motorbikes and a funding review have provided much-needed support

Guest Commentary: Rethinking regional and multisite workflows

by Jorma Kivelä March 3, 2026 7 min read

As broadcasters push for more localization, mobile tools and unified automation reshape production demands

Tech Focus: Axia Studio Edge provides an edge

by RedTech Staff March 3, 2026 3 min read

The high-density AoIP edge device allows broadcasters to add substantial audio I/O to a Livewire+ AES67 network

RedTech RedTech

RedTech International SAS
250 bis boulevard Saint-Germain
75007 Paris, France

contact@redtech.pro

Subscribe to our newsletter

About

About Us
Work With Us
Contact Us

Advertising

Advertise

Useful Links

Partners
Newsletter

more

Terms and Conditions
Privacy Policy

latest news

Cumulus Media, Darlene Park
Featured

Cumulus Media expands Park’s portfolio

Events

Lawo builds anticipation ahead of April 8

Dielectric, RF equipment, infrastructure, NAB Show 2026, heat exchanger
2026 NAB Show

NAB Show — Dielectric debuts OptiLoad heat

GatesAir, monitoring, NAB Show 2026, AirWatch365
2026 NAB Show

NAB Show — GatesAir adds to AirWatch365

Annsofi Eriksson
Featured

EBU Names Annsofi Eriksson as Chief Technology

Follow us:

Copyright RedTech International 2026. All Rights Reserved