The Use of Artificial Intelligence Enabling Scalable Audio Description on Brazilian Television: A Workflow Proposal

Authors

  • Luiz Fernando Kruszielski, Globo TV Network
  • Pedro H. L. Leite, Grupo Globo
  • Edmundo Hoyle, Grupo Globo
  • Marcelo Lemmer, Grupo Globo

Keywords:

Audio Description, Artificial Intelligence, Voice Synthesis, Accessibility

Abstract

Artificial Intelligence (AI) technologies have recently gained ground in many areas of knowledge, with significant impact on both academic and business spheres. One application that can benefit from AI is making audiovisual content accessible to people with disabilities, where the ability to scale certain processes opens new accessibility opportunities. In this work, we describe a traditional audio description workflow for drama content and, building on it, propose a new workflow that generates audio description tracks for visually impaired people using a synthetic voice created with AI models. The proposed workflow simplifies production and considerably reduces its time and cost. It also allows audio to be generated on a larger scale than the traditional workflow, so that more of the target audience can be reached. In addition, multiple people can work simultaneously on the same project while sound identity is preserved through the synthetic voice and standardized mixing. With this proposal, we believe that accessibility on Brazilian television can be expanded to serve a much larger audience.

Published

2025-01-31

How to Cite

Kruszielski, L. F., Leite, P. H. L., Hoyle, E., & Lemmer, M. (2025). The Use of Artificial Intelligence Enabling Scalable Audio Description on Brazilian Television: A Workflow Proposal. SET International Journal of Broadcast Engineering, 9(1). Retrieved from https://revistas.set.org.br/ijbe/article/view/271

Section

Advanced audio technology and processing