Loading…

Log in to bookmark your favorites and sync them to your phone or calendar.

Monday, October 11
 

11:00am EDT

EIDA Day Introduction
Monday October 11, 2021 11:00am - 11:15am EDT

11:15am EDT

Dead Tech: Is Anything Ever Really Obsolete?
While some choice pieces of audio equipment are thought of as "vintage," many technologies we use become obsolete after relatively short life spans. While we can assume that almost anything to do with a computer will end up in the recycling bin after a few years, there are ways of revisiting some of these technologies and developing strategies for extending their usefulness. This session will present some case studies with examples of how artists and engineers have been able to re-examine the usefulness of some technologies we might consider obsolete.

Moderators
avatar for Michael Bierylo

Michael Bierylo

Chair, Electronic Production and Design, Berklee College of Music
Michael Bierylo is an electronic musician, guitarist, composer, and sound designer. He has been a faculty member at Berklee College of Music since 1995 and is currently Chair of the Electronic Production and Design Department where he led the development of Berklee’s Electronic... Read More →

Speakers
avatar for Devine Lu Linvega

Devine Lu Linvega

Composer, Illustrator, Software Developer
Devine Lu Linvega is a multi-disciplinary artist connecting audio, visual and programming into interactive experiences, installations and live performances
avatar for Brian Crabtree

Brian Crabtree

Monome
Brian Crabtree creates objects, music, and objects that make music. In 2006 with Kelli Cain he founded Monome, pioneering the grid-based performance instrument and exploring new composition systems. His work has been shown at the Museum of Modern art in New York, the Los Angeles County... Read More →


Monday October 11, 2021 11:15am - 12:15pm EDT
Streamcast

12:15pm EDT

MIDI 2.0: What’s New and How it Will Affect the Way We Work
The original MIDI specification was released in 1983 and for 35 years remained at version 1.0. The new revision will be completely backward compatible and will codify many of the technical developments manufacturers have introduced in recent years. This session will provide an overview of MIDI 2.0 and examine the implications for the users and the industry.

Moderators
avatar for Jennifer Hruska

Jennifer Hruska

Berklee College of Music
Jennifer Hruska is a composer, sound designer, electronic instrument designer, entrepreneur, and educator. She is currently on the faculty at Berklee College of Music in the Electronic Production and Design department.  Early in her career, Jennifer was a sound designer for Kurzweil... Read More →

Speakers
avatar for Rick Cohen

Rick Cohen

Antares Audio Technologies
Rick Cohen is the chairman of the Technical Standards Board in the MIDI Association. He was formerly the chair of the Protocol Working Group which developed the MIDI 2.0 protocol specification. Rick is a senior software engineer at Antares Audio Technologies.  He was a lead engineer... Read More →
avatar for Mike Kent

Mike Kent

MK2 Audio
Mike Kent is the Chairman of the MIDI 2.0 Working Group and co-author of USB MIDI. Mike is a consultant with clients including Yamaha, Apple, Roland, Musescore, and Artiphon. Mike previously worked for Roland over 30 years. Mike is a co-founder and CSO of AmeNote, a new technology... Read More →
avatar for Dave Smith

Dave Smith

Sequential Circuits
Dave Smith is a legendary instrument designer and Grammy-winner, and the founder of Sequential Circuits in 1974. In 1977 Dave designed the Prophet-5, the world’s first fully-programmable polyphonic synth, and the first musical instrument with an embedded microprocessor. Sequential... Read More →


Monday October 11, 2021 12:15pm - 1:15pm EDT
Streamcast

1:15pm EDT

AES TechTour: The Power Station/Berklee NYC
Virtual Tour of Power Station/Berklee NYC, produced by Simon Yu, co-produced by Stephen Webber (Berklee).

Speakers
avatar for Roy Hendrickson

Roy Hendrickson

Chief Engineer, Power Station at BerkleeNYC
Roy Hendrickson is an engineer and mixer whose focus for the past 30 years has been on the integration of musical creativity with the evolving studio landscape. His musically rich discography is a testament to the work he has done with many artists and producers including Miles Davis... Read More →
avatar for Gloria Kaba

Gloria Kaba

Studio Manager, Power Station at BerkleeNYC
Gloria Kaba is a Ghanaian-American sound engineer, producer, mixer, and writer with over a decade of experience in the studio, often operating under the moniker Redsoul. She’s worked on A Tribe Called Quest’s final album We Got It From Here...Thank You For Your Service and Solange’s... Read More →
avatar for Ian Kagey

Ian Kagey

Director of Operations, Power Station at BerkleeNYC
Ian Kagey is a Grammy-nominated engineer, musician, and educator who came up at MSR Studios (formerly Right Track Recording) in New York City. He has worked on countless professional recordings ranging from major motion picture soundtracks to Broadway cast albums. Kagey recently returned... Read More →
avatar for Stephen Webber

Stephen Webber

Executive Director of BerkleeNYC, Berklee College of Music
Stephen Webber is Executive Director of BerkleeNYC, overseeing the iconic Power Station studios. Webber has hosted sessions for Janelle Monáe, Lin-Manuel Miranda, Kate McKinnon, Trey Anastasio, DJ Khaled, Chris Martin, John Legend, Bette Midler, Paul Simon, Mick Jagger, Wynton Marsalis... Read More →


Monday October 11, 2021 1:15pm - 2:00pm EDT
Streamcast

2:00pm EDT

User Experience Design in Electronic Instruments
As electronic instruments become increasingly complex, offering users a myriad of options, designers have become increasingly aware of the need for a focus on user experience design. This can range from software to hardware designs that explore effective ways to present this increasing complexity to experienced professionals, to users with limited or no experience with electronic instruments. This session will explore new approaches in UIX designs and whether improvement in this area can lead to wider consumer market penetration.

Moderators
avatar for Jennifer Hruska

Jennifer Hruska

Berklee College of Music
Jennifer Hruska is a composer, sound designer, electronic instrument designer, entrepreneur, and educator. She is currently on the faculty at Berklee College of Music in the Electronic Production and Design department.  Early in her career, Jennifer was a sound designer for Kurzweil... Read More →

Speakers
avatar for Dave Smith

Dave Smith

Sequential Circuits
Dave Smith is a legendary instrument designer and Grammy-winner, and the founder of Sequential Circuits in 1974. In 1977 Dave designed the Prophet-5, the world’s first fully-programmable polyphonic synth, and the first musical instrument with an embedded microprocessor. Sequential... Read More →
avatar for Daniel Clarke

Daniel Clarke

Focusrite
Dan Clarke is a Senior Product Designer at Focusrite and Novation with ten years of experience in the industry.He has worked on a comprehensive range of products such as Novation’s Circuit, Peak, and Components, along with Focusrite’s FAST plugins and RedNet R1.In his role, he... Read More →
avatar for Chris Rudzinski

Chris Rudzinski

InMusic
Chris Rudzinski is a Senior UI/UX designer at InMusic currently located in Seattle. He is primarily assigned to the Akai Pro MPC product series which is a hardware hybrid that can run stand-alone or connect to a computer — used in the studio or live performances. With the Akai team... Read More →


Monday October 11, 2021 2:00pm - 3:00pm EDT
Streamcast

3:00pm EDT

Keynote: The Soul of the Machine: Do Electronic Instruments Have a Personality? A Conversation with Dave Smith and Marcus Ryle
After 50 years, we can say that analog subtractive synthesizers have stood the test of time, and while many new types of synthesis have emerged, manufacturers continue unabated to design and release instruments based on this model. Musicians develop an affinity for different types of instruments that have a common basic design, such as the guitar, based on the nuances of the sound and feel of the instrument. Is that true for synthesizers and what exactly are the characteristics that attract players to these instruments? Since founding Sequential Circuits in 1974, Dave Smith has devoted much of his career to developing instruments that musicians will love to play. This session will be a conversation between Dave and fellow music technology pioneer Marcus Ryle. Together they will explore some of the reasons for the enduring appeal of electronic instruments, each from their own unique perspective.

Speakers
avatar for Marcus Ryle

Marcus Ryle

WRiiG
Marcus began his professional industry career as a design engineer at the synthesizer company Oberheim Electronics. From 1980 through 1985 he was a key architect and designer of such instruments as the OB-Xa, OB-8, Xpander, and Matrix-12, as well as the DSX sequencer and DMX drum... Read More →
avatar for Dave Smith

Dave Smith

Sequential Circuits
Dave Smith is a legendary instrument designer and Grammy-winner, and the founder of Sequential Circuits in 1974. In 1977 Dave designed the Prophet-5, the world’s first fully-programmable polyphonic synth, and the first musical instrument with an embedded microprocessor. Sequential... Read More →


Monday October 11, 2021 3:00pm - 4:00pm EDT
Streamcast

4:00pm EDT

Prototyping New Instrument Designs with Open Source Microprocessor Platforms
While we've seen embedding computing platforms like Arduino and Raspberry Pi used in prototyping electronic instruments, this session will explore open source platforms designed specifically for instrument design, lowering the barrier for independent designers and manufactures to easily bring new designs into the marketplace.

Moderators
avatar for Michael Bierylo

Michael Bierylo

Chair, Electronic Production and Design, Berklee College of Music
Michael Bierylo is an electronic musician, guitarist, composer, and sound designer. He has been a faculty member at Berklee College of Music since 1995 and is currently Chair of the Electronic Production and Design Department where he led the development of Berklee’s Electronic... Read More →

Speakers
avatar for Corry Banks

Corry Banks

Modbap Modular
A pioneer of the Modbap community and movement, Corry Banks coined the term “Modbap”. As the founder of BBoyTechReport, Beatppl, and Modbap Modular, Banks focuses on assuring the voice of hip-hop is present in music technology. Banks’ passion is to bridge the gap between hip-hop... Read More →
avatar for Andrew Ikenberry

Andrew Ikenberry

Qu-Bit Electronix
Andrew Ikenberry is an electronic instrument designer and music tech entrepreneur. He is the co-founder of multiple companies including Qu-Bit Electronix, 2hp, and Electrosmith. His current work is focused on the Daisy audio platform which aims to disrupt the world of digital instrument... Read More →
avatar for Kris Kaiser

Kris Kaiser

Noise Engineering
Stephen McCaul and Kris Kaiser co-own Noise Engineering, a modular-synthesizer and music-software company in Los Angeles, California. Noise Engineering started as a hobby in 2014, but Stephen and Kris abandoned their previous careers to pursue module making full-time in 2017. Today... Read More →
avatar for Stephen McCaul

Stephen McCaul

Noise Engineering
Stephen McCaul and Kris Kaiser co-own Noise Engineering, a modular-synthesizer and music-software company in Los Angeles, California. Noise Engineering started as a hobby in 2014, but Stephen and Kris abandoned their previous careers to pursue module making full-time in 2017. Today... Read More →


Monday October 11, 2021 4:00pm - 5:00pm EDT
Streamcast
 
Tuesday, October 12
 

11:00am EDT

TC-SAA Meeting
Please contact the TC Chair for participation details: https://www.aes.org/technical/saa/ 


Tuesday October 12, 2021 11:00am - 12:00pm EDT

12:00pm EDT

AES Technical Committee for Spatial Audio
https://fau.zoom.us/j/64086741970?pwd=WlZ6bkszQUcwMlFicVZmOFArUjRtQT09

Tuesday October 12, 2021 12:00pm - 1:00pm EDT

1:00pm EDT

Sound System Optimization: The Good, Bad and the Ugly (optimization techniques when you have two days, two hours or two minutes)
How long does it take to fully optimize a sound system? The real answer is all the time you’ve got, but in the real world we need to manage the process into the time allotted. This workshop will focus on how to best allocate the available time to ensure that a successful optimization. Industry veterans Bob McCarthy and Michael Lawrence and others will discuss their approaches to this challenge.

Moderators
avatar for Bob McCarthy

Bob McCarthy

Meyer Sound Laboratories
Bob McCarthy’s involvement in the field of sound system optimization began in 1984 with the development of Source Independent Measurement™ (SIM™) with John Meyer at Meyer Sound Laboratories. Since that time he has been an active force in developing the tools and techniques of... Read More →

Speakers
avatar for Carolina Antón

Carolina Antón

3BH
Carolina Antón is an internationally recognized leader in the field of live sound mixing, systems design and optimization of sound reinforcement. For more than 15 years, Carolina has established a trajectory within her career, collaborating with distinguished artists and productions.Carolina... Read More →
avatar for Michael Lawrence

Michael Lawrence

Application Specialist, Rational Acoustics
Michael is a freelance system engineer and the technical editor of ProSoundWeb and Live Sound International magazine. He is also a Smaart instructor for Rational Acoustics, where he acts as Product Manager for Smaart's SPL toolset. Michael has been interested in audio since age 14... Read More →
avatar for Jessica Paz

Jessica Paz

Sound Designer
Jessica Paz is a Tony Award-winning Sound Designer for theater, film, and music. Most recently, she collaborated with Nevin Steinberg on the sound design for Anaïs Mitchell’s acclaimed production of Hadestown, which earned the duo a Tony Award, Drama Desk Award, and an Outer Critics... Read More →
avatar for Finlay Watt

Finlay Watt

Live Sound Engineer
Finlay Watt is a freelance live sound engineer from Scotland and is currently based on the UK's South Coast. With approximately ten years of experience in the industry, Finlay has worked on shows ranging from club gigs to stadium tours and has a wealth of experience in all roles within... Read More →
avatar for Jim	Yakabuski

Jim Yakabuski

Live Sound Engineer, Yak Sound
Jim Yakabuski has spent more than 35 years as a live sound engineer, working with artists such as Van Halen, Journey, Avril Lavigne, Peter Frampton, and many others. Jim currently serves as Director of Audio Projects for Solotech US. He's also the author of "Professional Sound Reinforcement... Read More →


Tuesday October 12, 2021 1:00pm - 2:30pm EDT
Streamcast

2:30pm EDT

Finding the Path: Knowledge, Learning, and Professional Curiosity in Audio
A "roundtable" discussion featuring veteran audio engineers Robert Scovill, Dave Rat, Grace Royse, and Ryan O. John, and moderated by Signal to Noise Podcast host Michael Lawrence. The discussion will explore the great tradition of learning, exploration, and professional curiosity that is deeply embedded into the history and practice of audio engineering. The expert panelists will also discuss their personal approaches to learning and professional curiosity, as well as how they communicate and teach their ideas to others.

Moderators
avatar for Michael Lawrence

Michael Lawrence

Application Specialist, Rational Acoustics
Michael is a freelance system engineer and the technical editor of ProSoundWeb and Live Sound International magazine. He is also a Smaart instructor for Rational Acoustics, where he acts as Product Manager for Smaart's SPL toolset. Michael has been interested in audio since age 14... Read More →

Speakers
avatar for Dave Rat

Dave Rat

Rat Sound Systems
Dave Rat has mixed FOH for artists such as Red Hot Chili Peppers, Rage Against The Machine, Blink 182, Soundgarden, Foo Fighters, The Offspring, and more. When he was just 18 years of age, he co-founded Rat Sound Systems, which has grown to become a major provider of touring concert sound systems and much more. Dave’s work to encourage and foster professional curiosity and innovation is well known in the industry... Read More →
avatar for Grace Royse

Grace Royse

Grace Royse has clocked nearly two decades as FOH, TM and PM spanning a variety of broadcasted events, festivals and tours. Grace credits her network of top performing colleagues and a strong sense of purpose for making every successful event possible.
avatar for Ryan John

Ryan John

Senior Product Manager, Universal Audio
Ryan John is a FOH engineer who has worked with artists such as Jessie J, Andy Grammer, One Republic, Robin Thicke, and more. He has also served as product manager for companies such as AVID, JBL/Harman, and Universal Audio, and is unquestionably the most handsome member of this... Read More →
avatar for Robert Scovill

Robert Scovill

Robert Scovill is a 35-year veteran of professional concert sound and recording and has mixed more than 3,500 events in his career for noted musical artists such as Tom Petty and the Heartbreakers, Matchbox Twenty, Jackson Browne, Rush, Def Leppard, Prince, and many others. His body... Read More →


Tuesday October 12, 2021 2:30pm - 4:00pm EDT
Streamcast
 
Wednesday, October 13
 

11:30am EDT

“Mission Completed!” Radio Kingston’s Five-Year Design/Build Program Raises the Bar for Broadcast Radio Facility Design
Funded by a grant from The NoVo Foundation, in 2017 Radio Kingston acquired a 153-year-old hexagonal building in Kingston NY, and engaged WSDG to design a 2000 SQ. FT. broadcast complex powered by renewable energy and equipped to provide broadcast, podcast audio (and video) programming for their listeners. Numerous changes in the scope of the design were incorporated as the project expanded its mission statement and, its physical footprint to include retooling an existing two-story garage, as a 325 SQ. FT. multipurpose live performance studio. WKNY-Radio Kingston’s metamorphosis into a template for future radio station design will be explained by a panel co-chaired by WSDG Founding Partner, John Storyk and WSDG Partner,/COO, Joshua Morris. Participants include WSDG Partner/Project Manager, Mathew Ballos; Philanthropist and NoVo Foundation Founding Director, Peter Buffett; Jimmy Buff, Executive Director, Radio Kingston Corp., Kristen Thorne, Station Business Manager, WKNY, and Kale Kaposhilin, Station Technical Director/Chief Engineer, WKNY.


Moderators
avatar for Joshua Morris

Joshua Morris

Partner/COO, WSDG
Joshua Morris graduated from the University of North Carolina in Charlotte with two Degrees, a Bachelor of Arts in Architecture and a Bachelor of Architecture. A love of music led him to seek a combination of architecture and acoustics, beginning with his thesis on acoustics. Morris... Read More →
avatar for John Storyk

John Storyk

Founding Partner, WSDG
Since graduating from Princeton as an architect in 1968, and co-founding WSDG (Walters-Storyk Design Group) with his wife, Beth Walters in 1986, John Storyk has been involved with the design of over 3500 recording studios, corporate conference rooms, performance venues, broadcast... Read More →

Speakers
avatar for Matt Ballos

Matt Ballos

WSDG
Matthew Ballos earned dual degrees in Architecture and Construction Management. A background in civil engineering and a lifelong love of construction and design has enabled Matt to quickly become a valuable member of WSDG’s design and production team, currently as a project designer... Read More →
avatar for Jimmy Buff

Jimmy Buff

Executive Director, Radio Kingston Corp.
Jimmy Buff, Executive Director, Radio Kingston, began his radio journey at NYC’s legendary WNEW-FM. He came to the Catskills in 1993 to work at the legendary WDST Radio and along the way has had stops at K-Rock in New York City and WEHM in East Hampton. Buff has done national TV... Read More →
avatar for Peter Buffett

Peter Buffett

NoVo Foundation
Peter Buffett is the co-president of NoVo Foundation and co-chair of its Board of Directors. In partnership with his wife, Jennifer, he helps to guide NoVo’s vision, strategic mission, and program development. Peter Buffett is a well-established musician, composer, and producer... Read More →
KK

Kale Kaposhilin

Station Technical Director/Chief Engineer, WKNY
KT

Kristen Thorne

Station Business Manager, WKNY


Wednesday October 13, 2021 11:30am - 1:00pm EDT
Streamcast

1:00pm EDT

Maintenance, Troubleshooting and Training
This panel will cover maintenance, troubleshooting, and training at broadcast facilities. Troubleshooting and maintenance will incorporate different situations such as unusual situations where a broadcast engineer needs to cover a wide area to take care of transmitters on mountains and at different sites, AoIP Systems, component or single device maintenance, automation systems, on-air issues, and national and local programs.

Training will encompass overall training for a new technician at a Broadcast Facility, training to learn a system flow at a facility, doing a program, and for making special cables and connectors which may use soldering and, or punch tools techniques.

Moderators
JL

Joyce Lieberman

Radio & Fresh Air, WHYY
Radio Engineering Supervisor at local NPR Station in Philadelphia. Instrumental in setting up the tech side for Fresh Air. Work on maintaining our good sound, troubleshooting and training. Record classical and Jazz.

Speakers
ML

Marcy Lefkovitz

Vice President Technology & Workflow Strategy, Disney Media & Entertainment Distribution
Coordinate implementation of new technologies and new workflows within all business units of Disney Media & Entertainment Distribution and its business partners, with an emphasis on content creation and delivery. Ensure that tools are available and accessible to those who need them... Read More →
GK

Gary Kline

Kline Consulting
Gary Kline has been actively involved with radio broadcasting for over 30 years. Before starting his own successful consulting firm, he was most recently with Cumulus Media for sixteen years as Senior Vice President of Engineering.Kline has also held positions with several radio... Read More →
BM

Barry Mishkind

The Broadcasters' Desktop Resource
TT

Tracy Teagarden

Audacy
I have been in Las Vegas since it was a little town. I transitioned with radio from the analog days, to today, including bringing the first commercial radio station on with HD Radio technology. I relentlessly pursue 5 nines, in every aspect of my life. I have built countless radio... Read More →


Wednesday October 13, 2021 1:00pm - 2:00pm EDT
Streamcast

2:00pm EDT

Spatial Audio in Podcasting
Spatial audio has incredible potential in audio storytelling. Recently it became much more accessible to consumers — podcasting is the natural next step for this technology. But there are so many questions to be answered! What kinds of stories are a good fit for spatial audio? What formats are ready for podcasting? What does it mean for the production process and post-production workflows? How can spatial sound design contribute to an audio story — or detract from it? What are the financial costs? Industry leaders with hands-on production experience will discuss these aspects of spatial audio for podcasting.

Moderators
avatar for Rob Byers

Rob Byers

American Public Media

Speakers
avatar for Sandra Yee Ling

Sandra Yee Ling

Q Code Media
Sandra Yee Ling is VP of Production at Q Code Media where she produced Shana Feste’s narrative podcast DIRTY DIANA starring Demi Moore, GHOST TAPE starring Kiersey Clemons, UNWANTED starring Lamorne Morris and Billy Magnussen, and ELECTRIC EASY starring Kesha, Mason Gooding, and... Read More →
avatar for Matt Yocum

Matt Yocum

Sound Designer / Re-recording Mixer
Matt Yocum is a freelance Sound Designer and Re-recording mixer in Los Angeles. After graduating from Savannah College of Art and Design, Matt moved to LA where he dove into the world of sound for motion pictures and has worked in a variety of mediums from film and TV to commercials... Read More →
avatar for Bob Kellough

Bob Kellough

Bob Kellough is a sound designer and re-recording mixer. He enjoys using sound to develop narratives and enrich storytelling experiences.


Wednesday October 13, 2021 2:00pm - 3:00pm EDT
Streamcast

3:00pm EDT

Using SNMP
SNMP has been around since the 1980s and IT folks have been trying to get rid of it since the early 2000s and before. It’s only recently, relatively speaking, become commonplace in broadcast and is still a bit of a mystery to lots of folks. Once you understand the structure and how it works, it really is a fairly simple protocol (SNMP stands for Simple Network Management Protocol). In this session, we will discuss how SNMP works, what MIBs and OIDS are and how you can use them to monitor and control multiple devices from a single manager. We will discuss some of the premade SNMP management devices out there, as well as some of the software managers that allow you to roll your own. Join us, this session should be a lot of fun, as we sort out sets and traps and all that great stuff!

Speakers
avatar for David Bialik

David Bialik

Consultant, David K. Bialik & Associates
Audio has been an important part of David's professional work, beginning with U.S. major market radio engineering in 1983, becoming chief engineer at United Broadcastings’ WKDM in 1991.  He worked at NPR affiliate WAMU-FM and the NAB Science and Technology Department, publishing... Read More →
TP

Tony Peterle

Worldcast Systems


Wednesday October 13, 2021 3:00pm - 4:00pm EDT
Streamcast

4:00pm EDT

Advantages of using Metadata
Audio-specific metadata can offer new consumer benefits as well as a more compelling listening experience. This presentation discusses the development of systems using audio metadata and explains the types of metadata that monitor key characteristics of audio content, such as loudness, dynamic range, and signal peaks. The metadata is added to the content during encoding for real-time distribution – as in streams, or for file storage – as with podcasts. In the playback device or system, this metadata is decoded along with the audio data frames. It will be shown that audio dynamic range can be controlled according to the noise environment around the listener – quiet parts of a performance can be raised to audibility, but for those listeners who need it. Content producers need to make only one target level for all listeners, rather than one for smart speakers, another for fidelity-conscious listeners, etc. An audio demonstration is planned to allow the audience to hear the same encoded program over a range of playout conditions with the same device, from riding public transit to full dynamic range, for listeners who want highest fidelity.

Speakers
RB

Robert Bleidt

General Manager, Audio and Multimedia Division, Fraunhofer USA Digital Media Technologies
avatar for John Kean

John Kean

Senior Engineer, Cavell & Mertz
John applies 40 years of experience in television and radio technology.  As Senior Technologist at National Public Radio until 2015, he directed network projects and technical studies at NPR Labs.  He started in radio as an FM chief engineer in San Diego, leaving to join NPR in... Read More →


Wednesday October 13, 2021 4:00pm - 5:00pm EDT
Streamcast

5:00pm EDT

Stream Monitoring: Media Operations in the Digital Age
Content creators are in the show business. And the show must go on! Operations teams are a vital part of a content delivery service. As master control and production workflows increasingly are moving online and into the cloud, monitoring of critical paths, workflows, and user experience are items that operations teams need to watch. Is your digital media team in the know about all aspects of your content performance? Let’s explore some of the common techniques used today and where we are going tomorrow.

Speakers
avatar for Eduardo Martinez

Eduardo Martinez

Director of Technology, StreamGuys
Eduardo Martinez brings over 15 years of podcasting and streaming experience to StreamGuys as their Director of Technology. Eduardo oversees the planning, architecture and design of StreamGuys’ fleet of services.  Always on the pulse of the rapidly evolving digital media landscape... Read More →
avatar for Robert Minnix

Robert Minnix

Product Manager, StreamGuys


Wednesday October 13, 2021 5:00pm - 6:00pm EDT
Streamcast
 
Thursday, October 14
 

10:00am EDT

TC-FOA Meeting
Update and talk about Webinar for AES TC-FOA Getting Savvy on Usage of Fiber Optics for Audio

Join Zoom Meeting
https://us02web.zoom.us/j/81469417270?pwd=RjNvblRZWW1XYldJZ3hqa0dpQWNjQT09
Meeting ID: 814 6941 7270
Passcode: 252026

Thursday October 14, 2021 10:00am - 11:00am EDT

11:00am EDT

12:00pm EDT

 
Friday, October 15
 

10:00am EDT

11:00am EDT

TC-ASR Meeting
Please contact the TC Chair for participation details: https://www.aes.org/technical/asr/

Friday October 15, 2021 11:00am - 12:00pm EDT
 
Saturday, October 16
 

2:00pm EDT

YouTube: A DIY Playground
YouTube is a great place to go if you want to share your ideas, knowledge, and experiments. Panelists including the creator of the Furby Organ, Sam Battle (Look Mum No Computer) instrument maker, Tim Sway, and amp designer and repair technician, Colleen Fazio will discuss their backgrounds, teaching and learning on the platform, and how developing a little or a lot of knowledge about tools and designs helped them achieve their goals.

Moderators
avatar for Chris Kincaid

Chris Kincaid

Program Director, Audio Builders Workshop
Chris Kincaid is a musician, audio engineer, and educator spending his free time building synthesizers and audio equipment. If you get excited about taking things apart, building things, breaking things, or unusual sounds then we should be friends!He completed his M.M. in Music Composition... Read More →

Speakers
avatar for Tim Sway

Tim Sway

New Perspectives Music


Saturday October 16, 2021 2:00pm - 3:30pm EDT
Streamcast

3:30pm EDT

Practical Soldering Skills for the Studio
Are you signed up for the “Build Your Own Filter” workshop? Would you like to learn how to solder? Then this is the primer video for you. Christina Milinusic introduces the basics of soldering and shares some great tips to get you started. Then Chris Kincaid will show you how to build one of the most valuable DIY tools you can have in your arsenal, a continuity tester for next to nothing. After this video, you will be ready to begin diagnosing and repairing bad cables instead of buying new ones!

Speakers
avatar for Christina Milinusic

Christina Milinusic

Unity Sound Ltd.
Christina Milinusic is a Calgary based sound artist and theremin player. A professional audiopractitioner for over a decade, she has held technical positions with the Calgary Stampede andhistoric Grand Theatre. Currently, she volunteers as chapter head for SoundGirls Alberta and isan... Read More →
avatar for Chris Kincaid

Chris Kincaid

Program Director, Audio Builders Workshop
Chris Kincaid is a musician, audio engineer, and educator spending his free time building synthesizers and audio equipment. If you get excited about taking things apart, building things, breaking things, or unusual sounds then we should be friends!He completed his M.M. in Music Composition... Read More →


Saturday October 16, 2021 3:30pm - 4:00pm EDT
Streamcast
 
Sunday, October 17
 

1:00pm EDT

Reduce, Reuse, Resolder
Aisha Loe of Loe Sounds designs guitar pedals using salvaged materials. Aisha takes us on a search for inspiration through non-traditional components and enclosures, then back to her workspace to talk about building circuits.

Moderators
avatar for Chris Kincaid

Chris Kincaid

Program Director, Audio Builders Workshop
Chris Kincaid is a musician, audio engineer, and educator spending his free time building synthesizers and audio equipment. If you get excited about taking things apart, building things, breaking things, or unusual sounds then we should be friends!He completed his M.M. in Music Composition... Read More →

Speakers
avatar for Aisha Loe

Aisha Loe

Loe Sounds


Sunday October 17, 2021 1:00pm - 2:30pm EDT
Streamcast

2:30pm EDT

Build Your Own Metronome/Filter: DIY Workshop
Join Audio Builders Workshop for this online DIY build. Purchase a $20 ABW metronome or filter kit at crowdsupply.com before the convention and build it at our first-ever virtual workshop. Follow along with the designers and documenters of the circuit, Michael and Brewster as they take you step by step through the process of building your very own metronome. For the filter we will discuss and walk you through how to modify the circuit, changing its function from a low pass to a high pass filter. We’ll have a team of tech support in the chat to help with any potential troubleshooting. These are great kits for beginners as they are a low part count build and include thorough documentation.

Speakers
avatar for Brewster LaMacchia

Brewster LaMacchia

Clockworks Signal Processing
avatar for Christina Milinusic

Christina Milinusic

Unity Sound Ltd.
Christina Milinusic is a Calgary based sound artist and theremin player. A professional audiopractitioner for over a decade, she has held technical positions with the Calgary Stampede andhistoric Grand Theatre. Currently, she volunteers as chapter head for SoundGirls Alberta and isan... Read More →
avatar for Michael Swanson

Michael Swanson

Audio Builders Workshop
avatar for Chris Kincaid

Chris Kincaid

Program Director, Audio Builders Workshop
Chris Kincaid is a musician, audio engineer, and educator spending his free time building synthesizers and audio equipment. If you get excited about taking things apart, building things, breaking things, or unusual sounds then we should be friends!He completed his M.M. in Music Composition... Read More →


Sunday October 17, 2021 2:30pm - 4:00pm EDT
Streamcast
 
Monday, October 18
 

10:00am EDT

TC-HRA Meeting
https://us02web.zoom.us/j/7157190090

Monday October 18, 2021 10:00am - 11:00am EDT

11:00am EDT

AES Technical Committee on Audio Coding
https://us02web.zoom.us/j/82216397103?pwd=L0VhaUlBK3JjcTJwai9jdjV6VUh2Zz09

Monday October 18, 2021 11:00am - 12:00pm EDT
 
Tuesday, October 19
 

10:00am EDT

TC-ARDL Meeting
Please contact the TC Chair for participation details: https://www.aes.org/technical/ardl/

Tuesday October 19, 2021 10:00am - 11:00am EDT

11:00am EDT

TC-MLAI Meeting
Please contact the TC Chair for participation details: https://www.aes.org/technical/mlai/

Tuesday October 19, 2021 11:00am - 12:00pm EDT

12:00pm EDT

 
Wednesday, October 20
 

11:00am EDT

Fraunhofer Presentation
Wednesday October 20, 2021 11:00am - 12:00pm EDT
Stream A

12:00pm EDT

Opening Ceremonies
Wednesday October 20, 2021 12:00pm - 12:15pm EDT
Stream A

12:15pm EDT

All About Stereo Panning
In this tutorial, I will cover stereo panning techniques and applications.
  • First, I will discuss the difference between stereo pan and stereo balance and demonstrate it with audio examples.
  • Then, I will recommend solutions for those who work with DAWs that do not natively support stereo panning in their mixers.
  • Finally, I will explain frequency-based panning (spectral panning), discuss its applications, and then demonstrate it with audio examples.
I developed this tutorial partly based on my articles published in Production Expert:

Speakers
avatar for Ufuk Onen

Ufuk Onen

Bilkent University
Ufuk Önen is an audio engineer, author, and educator with 30 years of experience in music production and audio post-production. He has published four books, 50 articles, and 250 blog posts on audio and music technologies. He has contributed to more than 300 sound reinforcement, cinema... Read More →


Wednesday October 20, 2021 12:15pm - 12:30pm EDT
Stream A

12:30pm EDT

Keynote: Peter Asher
In this keynote, Asher will share stories and media ranging from his early career as one half of the ‘60s pop duo Peter and Gordon, to managing artists including James Taylor, Linda Ronstadt, Joni Mitchell, Carole King and Randy Newman, and going on to produce hits for acts including Taylor, 10,000 Maniacs, Diana Ross, Robin Williams, Neil Diamond, Ringo Starr, Ronstadt, Hans Zimmer, Ed Sheeran, Miguel and Cher, among many others. Throughout his presentation, Asher will share stories of momentous events from the span of his career, as well as rare photos and recordings, including his own bedroom demo recording of a young Paul McCartney performing a song that turned out to be a number-one hit for Peter and Gordon in 1964.

Speakers
avatar for Peter Asher

Peter Asher

Peter Asher
1964, one half of Peter & Gordon, who amassed nine Top 20 records. 1968, head of A&R for the Beatles’ record company, Apple Records, where he found, signed, and produced James Taylor. 1971, founded Peter Asher Management, representing James Taylor, Linda Ronstadt, Joni Mitchell... Read More →


Wednesday October 20, 2021 12:30pm - 1:30pm EDT
Stream A

1:30pm EDT

Best Paper: Perceptual Evaluation of Interior Panning Algorithms Using Static Auditory Events
Interior panning algorithms enable content authors to position auditory events not only at the periphery of the loudspeaker configuration, but also within the internal space between the listeners and the loudspeakers. In this study such algorithms are rigorously evaluated, comparing rendered static auditory events at various locations against true physical loudspeaker references. Various algorithmic approaches are subjectively assessed in terms of; Overall, Timbral, and Spatial Quality for three different stimuli, at five different positions and three radii. Results show for static positions that standard Vector Base Amplitude Panning performs equal, or better, than all other interior panning algorithms tested here. Timbral Quality is maintained throughout all distances. Ratings for Spatial Quality vary, with some algorithms performing significantly worse at closer distances. Ratings for Overall Quality reduce moderately with respect to reduced reproduction radius and are predominantly influenced by Timbral Quality.

Speakers
TR

Thomas Robotham

International Audio Laboratories Erlangen
AS

Andreas Silzle

International Audio Laboratories Erlangen
AN

Anamaria Nastasa

Aalto University
AP

Alan Pawlak

University of Huddersfield
JH

Jürgen Herre

International Audio Laboratories Erlangen


Wednesday October 20, 2021 1:30pm - 2:00pm EDT
Stream A

2:00pm EDT

RF: Regulatory Considerations & Alternative Spectrum
Experts in RF spectrum and FCC regulatory matters pertaining to wireless microphones and similar low power Broadcast Auxiliary Service RF equipment used in entertainment, news, sports, film and other content capture applications discuss the current state of FCC rules and spectrum status for BAS operations. They will also talk about equipment and workflow pros and cons concerning alternative spectrum currently permitted and soon to be permitted. Moderated by industry veteran James Stoffo.

Moderators
avatar for James Stoffo

James Stoffo

CTO, Principal, Radio Active Designs
James Stoffo has been working in the RF field for his entire professional career. Starting out as a Radio Surveillance Technician in the US Navy Submarine Service, he later literally wrote the book on production wireless as the author of the RF Coordination and Procedures Manual for... Read More →

Speakers
avatar for Joe Ciaudelli

Joe Ciaudelli

Sennheiser
Joe Ciaudelli was hired by Sennheiser in 1987 upon graduating from Columbia University with an electrical engineering degree.  He provided frequency coordination for large multi-channel wireless microphone systems used by Broadway productions, major theme parks, and broadcast networks... Read More →
JG

Jason Glass

Clean Wireless Audio LLC
avatar for Jackie Green

Jackie Green

Nexonic Designs
Jackie Green has enjoyed many opportunities to pursue great sound and innovative technologies. After BS and MBA degrees, Green pursued graduate courses in microprocessor design and digital signal processing in order to support creative work in digital wireless and audio. She is an... Read More →
avatar for Prakash Moorut

Prakash Moorut

Shure, Inc.
Prakash Moorut is the Senior Director of Spectrum and Regulatory Affairs at Shure where he is responsible for leading Shure’s efforts to advocate for audio professionals as it pertains to industry regulations. Before coming to Shure, he spent more than 10 years with Nokia, a leading... Read More →


Wednesday October 20, 2021 2:00pm - 3:00pm EDT
Stream A

3:00pm EDT

The Secret Life of Low Frequencies
The low frequency region is where the bulk of our perception of sonic quality takes place, making it a range requiring meticulous attention. It is also an unforgiving territory where many complications and pitfalls can arise that damage the sound quality in our critical listening spaces - mix rooms, control rooms, studios and the like. In designing these rooms to world class standards, it’s easy to stumble and fail.

In “The Secret Life of Low Frequencies”, Bruce Black explores what can occur in the all-important low frequency realm, and why it happens. Using data collected over decades from the acoustic analyses of rooms ranging from large world class mix stages to small home studios, he shares his surprising discoveries and unexpected answers to sometimes baffling questions.

Controlling the LF requires a deep understanding of its unique attributes and character, and answers these questions -

• How does the LF range work?
• What are the different ways sound propagates?
• What can go wrong in the low end?
• How can we identify and quantify the problems?
• How do we fix the issues?
• Which fixes solve the problems, and which ones lead us astray?
• Why won't adding absorptive panels fix LF problems?


Good LF control requires good, honest and informed answers to all these questions. Some of what Mr. Black presents will go against the “common wisdom”, and some may be controversial. But what he offers is based on years of field experience; it will give you plenty to contemplate.

The attendees will leave with a deeper understanding of how LF works, why many of the solutions offered by the “common wisdom” fail to work or even do more sonic damage, and a comprehensive road map of how to control this sonically critical region.

Speakers
BB

Bruce Black

Director of Engineering, MediaRooms Technology LLC


Wednesday October 20, 2021 3:00pm - 4:00pm EDT
Stream B

3:00pm EDT

4 Bars and More: The Techniques Behind Arranging EDM
Electronic Dance Music employs an arrangement structure that differs from most traditional forms of music, often relying solely on the constant repetition of a 4-bar basic groove running for 5 minutes or longer. But with the streaming platform now the most popular way to digest music, music is becoming shorter, and with it, our listeners' attention span. How do dance music producers excite the listener, and more importantly, maintain their interest for longer periods when the current musical trends dictate the opposite?

In this workshop, we walk through the ideas, concepts, and techniques behind arranging EDM, showing how artists create 5-minute arrangements from just 4 bars of limited instrumentation while also maintaining a listener's interest.

Speakers
RS

Rick Snoman

Altar Studios UK


Wednesday October 20, 2021 3:00pm - 4:00pm EDT
Stream A

4:00pm EDT

Social Media and the Modern EDM Artist
It's estimated that 40,000 tracks are uploaded to Spotify daily. That’s nearly 1.2 million a month. Every month. A large majority are electronic dance music records created by lone producers in their bedrooms and with such a vast and ever expanding catalog of music, its becoming increasingly difficult to be found without an active marketing campaign. For many producers, the majority of the campaign is focused via social media. But is this really the best option for an EDM artist?

In this session, Alexandra Bartles examines the importance of Social media to the modern electronic dance music professional. The role that it plays in a producers success or failure, and the possible lasting impact it may have on our mental health.

Speakers
AB

Alexandra Bartles

Altar Studios UK


Wednesday October 20, 2021 4:00pm - 5:00pm EDT
Stream B

4:00pm EDT

NFT Tutorial: “Let’s Drop An NFT!”
'Let's Drop An NFT!' - moderated by Elijah Walsh, this panel features acclaimed multi-platinum music producer LA+CH, renowned deconstructive pop artist Matt Gondek and the phenomenal creative technologist White Lights.

All of the speakers are engaged in the NFT community. They have a track record and history of NFT drops minted on the blockchain, featuring collections, 1/1 rare works and collaborations with other artists and platforms.

This tutorial/discussion aims to cover a step-by-step format on ‘how to mint’ an NFT on the blockchain. While discussing the artistic process and creative approach, we will learn what is involved technically behind the scenes, from conception to final execution. The subject matter will touch on music, fine/street art, and the intersection of audio and visual art-forms in producing quality NFTs, geared towards collectors and supportive communities all over the metaverse!

Moderators
avatar for Elijah Walsh

Elijah Walsh

Producer, Mixer, Recording & Mixing Things

Speakers
L

LA+CH

Music Producer, Composer, Sideways Music
MG

Matt Gondek

Deconstructive Pop Artist
avatar for White Lights

White Lights

White Lights
sad but it's okay :: web3 engineer, audiovisual cryptoartist, and music producer.


Wednesday October 20, 2021 4:00pm - 5:00pm EDT
Stream A

5:00pm EDT

Harlem School Of The Arts’ New Performance Venue A Multi-Faceted Educational Gem
The Harlem School of the Arts (HSA) has been a mecca for art, music, and dance students to explore for over fifty years. When Grammy winning musician/label exec/philanthropist, Herb Alpert learned HSA faced serious financial difficulties, he had the Herb Alpert Foundation fund ‘The Renaissance Project,’ a $9.5 million-dollar renovation to enable HSA to meet the needs of Harlem’s next generation of artists and performers. Imrey Studio LLC, founder/architect Celia Imrey was retained to lead the physical and aesthetic re-design. This panel will discuss the creative goals and solutions devised to turn an outdated “Brutalist Design” building into a welcoming, performance venue.

Moderators
avatar for John Storyk

John Storyk

Founding Partner, WSDG
Since graduating from Princeton as an architect in 1968, and co-founding WSDG (Walters-Storyk Design Group) with his wife, Beth Walters in 1986, John Storyk has been involved with the design of over 3500 recording studios, corporate conference rooms, performance venues, broadcast... Read More →

Speakers
avatar for Herb Alpert

Herb Alpert

Herb Alpert, is a Grammy-winning recording artist, label exec musician/ philanthropist, and co-founder with his wife, Lani Hall, of the Herb Alpert Foundation.  With five No. 1 albums, nine Grammy Awards, and over 72 million worldwide record sales, Herb Alpert is the only musician to hit No. 1 on the U.S. Billboard Hot 100 pop chart as both a vocalist  (“This Guy’s In Love with You,”) and an instrumentalist (“Rise.”)  Following... Read More →
CI

Celia Imrey

Principal Designer/Director, Imrey Studio
Celia Imrey  has been principal designer/director of Imrey Studio LLC, design practice for almost 20 years. Her experience includes work for I. M. Pei, William McDonough and Partners, and Pei Cobb Freed & Partners. She has taught architectural design at Barnard College for 12 years... Read More →
avatar for Joshua Morris

Joshua Morris

Partner/COO, WSDG
Joshua Morris graduated from the University of North Carolina in Charlotte with two Degrees, a Bachelor of Arts in Architecture and a Bachelor of Architecture. A love of music led him to seek a combination of architecture and acoustics, beginning with his thesis on acoustics. Morris... Read More →
avatar for PK Pandey

PK Pandey

Founder, AVN Systems
PK Pandey is founder of Guitar Center’s GCPro B2B division which eventually grew into a $60+ million division, Pandy is a partner in Symphonic Acoustics and owner of Boston’s Mad Oak Studios.  In 2019 he joined with Dr. Peter D’ Antonio and WSDG Founding Partner John Story... Read More →
FP

Federico Petrone

Buenos Aires-Based Sr. Systems Designer, WSDG
Federico Petrone earned Degrees in Contemporary Music and Audiovisual Communications in Argentina. He served as lead audio engineer for Disney Cruise Lines and joined WSDG in 2007 as Audiovisual Systems Designer/Chief Installer. His assignments have taken him to projects around the... Read More →
EP

Eric Pryor

President, Harlem School of the Arts
Since joining HSA in 2015, Pryor has led a world-class team in strengthening the institution’s financial position, increasing its presence in the community, and establishing partnerships with leading arts organizations.  He is an accomplished nonprofit executive, creative arts... Read More →


Wednesday October 20, 2021 5:00pm - 6:30pm EDT
Stream A

5:00pm EDT

Case Study: Distance Collaboration Demonstration with the Blue Note Entertainment Group
Over the last year, Audinate (makers of Dante), the Blue Note Entertainment Group (of the iconic Blue Note Jazz Clubs) and Peltrix (an integration firm) collaborated on an industry-first demonstration for long-distance, networked audio-video links. In June of 2021, they demonstrated their solution linking three locations up to 750 miles apart with multiple streams of Dante audio and Dante AV video links.

In this demonstration, system latency was so low, musicians were able to comfortably play together as if they were in the same room. No click tracks or guides were used, and timing was natural for performers. Dante AV played an important part in the demo, allowing musicians to see each other, taking visual cues, read body language and engaging with each other seamlessly. The products used are all commercially available today.

Join us for a discussion on the making of this demonstration with the key players behind it, offering their unique perspective and lessons learned. See clips from the bands, hear perspective from the musicians and learn how soon this demonstration could be a reality for you.

Speakers

Wednesday October 20, 2021 5:00pm - Saturday October 23, 2021 6:00pm EDT
Stream B

6:00pm EDT

Audio in the WAN / Cloud
Audio networks are increasingly moving beyond local area networks and spanning even into WANs and the cloud. The presenter will describe how the technology works and should be implemented.

Speakers

Wednesday October 20, 2021 6:00pm - 6:30pm EDT
Stream B

6:30pm EDT

Case Studies: Innovative Live Sound Design with the Dante AV-over-IP Solution
Over the last decade, the Dante AV-over-IP solution has become a staple of our industry. Audinate certainly continues to expand and improve the Dante solution, as does the people who use it in the field.

This session will focus on a series of installations, showing innovative ways designers and operators used Dante to improve sonic clarity and provide better system security, flexibility and reliability. This session will be moderated by Audinate representatives, with the stories and insights provided by guest presenters closer to the project.

Whether you’re a veteran looking for fresh ideas or a newcomer looking for well thought-out advice, join us for some fresh perspective. Time will be provided for questions.

Speakers

Wednesday October 20, 2021 6:30pm - 7:30pm EDT
Stream B

7:30pm EDT

Avoiding the Pigeon-Hole
As a producer or engineer, understanding the nuances of any genre requires a dedication of focus and a passion for the music. Different styles of music have their own specific needs, so becoming a specialist might be a valuable career move. However, engineers are people too, and people tend to enjoy many genres of music. Also, people can excel in more than one area, so expanding into more areas could be even more valuable than specializing. How can a working producer/engineer avoid the pitfalls of being pigeon-holed and enjoy a diverse career?

Panelists: Stephen Dent, Jeanne Montalvo, Neal Pogue

Moderated by: Paul "Willie Green" Womack

Moderators
avatar for Paul Womack

Paul Womack

Owner/Chief Engineer, Willie Green Music
A producer, engineer and sonic artist, Paul "Willie Green" Womack has built a discography boasting names such as Wiz Khalifa, The Alchemist, The Roots, Billy Woods, ELUCID and many more, and established himself as one of the top names in independent Hip-Hop & R&B. Currently residing... Read More →

Speakers
avatar for Stephen Dent

Stephen Dent

Stephen Dent is a recording mix engineer who has worked with the most iconic artists, producers, writers throughout the 90’s, 2000’s, and 2010’s. Education, Studio Management, Facility Design, and Corporate AV have also been Stephens passions.Stephens discography includes Diamond... Read More →
avatar for Jeanne Montalvo

Jeanne Montalvo

Engineer/Producer, The Futuro Media Group
Jeanne Montalvo is a Grammy-nominated audio engineer and radio producer. In 2017, she was nominated for a Grammy as Mastering Engineer for Vladimir Horowitz: The Unreleased Live Recordings, and her recording of Multiverse by Bobby Sanabria's Latin Jazz Big Band was nominated for Best... Read More →
avatar for Neal Pogue

Neal Pogue

Veteran music Producer/Mixer/Songwriter Neal H Pogue, born and raised in a New Jersey Borough called Roselle, has more than 30 years of experience in the music industry and has worked with a multitude of artists across just about every genre. Most notably, in 2004 Pogue won an Album... Read More →


Wednesday October 20, 2021 7:30pm - 8:30pm EDT
Stream B

8:30pm EDT

Next Generation Audio for Advanced Music Creations and Distributions - Part 2
Immersive audio, high-resolution audio, and high-definition audio have become available for recording, creations, and online delivery the music. While the lossy codec such as MPEG4-AAC was the key technologies to enable digital broadcasting and online-delivery, viewers and listeners of digital broadcasting and online-delivery have become noticed the difference of audio quality and musical emotion between the legacy 2ch stereo audio using lossy codec and immersive audio using high-quality audio codec.

This workshop is the part 2 of previous workshop held at AES 150th Convention and will update it. This workshop will introduce recent actual use cases of next generation audio such as immersive audio, high-resolution audio, and high-definition audio for the advanced music creations and online-delivery of music.

Kimio Hamasaki will summarize the history and latest status of next-generation audio and introduce examples of his own researches and development as well as his musical recording works. Hideo Irimajiri will report on the live experiment of music online-delivery with high-resolution audio and immersive audio in Japan done in last October by WOWOW also his latest music recoding reproduction, and discuss the future prospects. Toru Kamekawa will report on music creations using next-generation audio at the Tokyo University of the Arts and introduce some actual examples and discuss the future prospects. Kazuya Nagae will report on the recent productions, distributions and education using immersive audio at Nagoya University of the Arts and discuss the future prospects.

These panels will also discuss the advantages of the next-generation audio for advanced music creations and online-delivery and issues of those they encountered during the actual cases. And finally, they will discuss how to improve the quality and emotion of music recordings and deliveries in the future.

Speakers
avatar for Kimio Hamasaki

Kimio Hamasaki

President, ARTSRIDGE LLC
Kimio Hamasaki, AES fellow, is a producer and balance engineer of music recording, a researcher on spatial audio, an educator on audio engineering and acoustics as well as a consultant in audio engineering. He had recorded and produced many orchestral and operatic works by the Vienna... Read More →
avatar for Toru Kamekawa

Toru Kamekawa

Professor, Tokyo University of the Arts
Toru Kamekawa: After graduating from the Kyushu Institute of Design in 1983, he joined the Japan Broadcasting Corporation (NHK) as a sound engineer. During that period, he gained his experience as a recording engineer, mostly in surround sound programs for HDTV.In 2002, he joined... Read More →
avatar for Hideo Irimajiri

Hideo Irimajiri

Executive Creator, WOWOW Inc.
I was born in 1956, graduated from Kyushu Institute of Design in 1979 with a bachelor's degree in acoustical design and from Kyushu Institute of Design in 1981 with a master's degree in acoustical design, and received a doctorate in design in 2013 for my research on reverberation.In... Read More →
avatar for Kazuya Nagae

Kazuya Nagae

Associate Professor, Nagoya University of the Arts
Graduated the Nagoya University of the Arts in vocal performance in 1996. Served at a recording studio and Radio production company. Established a recording production company in 2000. Lecturer of Department of Music Arts Creation at Nagoya University of the Arts, teaching and researching... Read More →


Wednesday October 20, 2021 8:30pm - 10:00pm EDT
Stream B
 
Thursday, October 21
 

11:00am EDT

A Deep Dive into DAW Digits: Can Floating Point Save You?
Digital Audio Workstations (DAWs) are now using floating-point arithmetic for their signal processing. This allows for a far wider dynamic range within the DAW, and file storage. However, the way this processing might affect the audio signal is not so well known. The older fixed-point methods of processing are much better understood and applying remedies such as “dither” when the signal is rounded, is well known.

However, floating-point numbers are dynamic entities that depend on the instantaneous value of the audio signal. This means that the step size of the underlying digital word is continually changing dependent on the sample value. How does this affect the audio signal? What is the effect of adding such signals together? How might we apply “dither” to such signals?

The purpose of this tutorial is to look at floating point numbers and arithmetic in some detail with a view to gaining some insight into their effect on the audio signal.
We will look at both floating-point and fixed-point number formats in order to understand how they differ. We will also examine the performance of both formats to understand what penalties floating-point numbers impose for their greater dynamic range when compared to the equivalent sized fixed-point number.

We then look at how to add/subtract and multiply floating-point numbers with a view to understanding how add/subtract operations in particular can be an issue because, if they are not carefully implemented, they can cause significant damage to the audio signal.

Finally, we will look at how you might apply “dither” and noise-shaping in a floating point environment.

The tutorial will be at a level that should allow it to be understandable to; students, users of DAWs, and signal processing, and plug-in developers, and will allow them to use and program DAWs more effectively, mindful of audio quality.

Speakers
avatar for Jamie Andrea Shyla Angus-Whiteoak

Jamie Andrea Shyla Angus-Whiteoak

University of Salford/JASA Consultancy
Jamie Angus-Whiteoak Is Emeritus Professor of Audio Technology at Salford University. Her interest in audio was crystallized at age 11 when she visited the WOR studios in NYC on a school trip in 1967. After this she was hooked, and spent much of her free time studying audio, radio... Read More →


Thursday October 21, 2021 11:00am - 12:00pm EDT
Stream A

12:00pm EDT

Loudspeaker Work-Bench Measurements
Transducers are usually clamped in a relatively large baffle and measured under anechoic conditions to approximate perfect acoustical half-space conditions. A new acoustical near field method is presented that uses an extremely small ROUND baffle at least 10 cm around the transducer and can be operated in a normal office, workshop, or any other environment. Holographic post-processing removes the acoustical short-cut and baffle reflections and suppresses room modes and gives accurate data in the near and far-field. Exploiting the symmetry of the transducer reduces the measurement time to 5 minutes while providing full directivity information. The near field scanner is a small add-on to the KLIPPEL laser vibrometer using the same robotics. This seminar discusses important electrical, mechanical, and acoustical measurement performed on the SCN work-bench according to IEC 60268-21 and 22.

Speakers
avatar for Simon Steiner

Simon Steiner

HW Development, Klippel GmbH
avatar for Wolfgang Klippel

Wolfgang Klippel

Klippel GmbH


Thursday October 21, 2021 12:00pm - 1:00pm EDT
Stream A

1:00pm EDT

Platinum Engineers
Rhythm Across the World 
The Whole Is Greater Because of the Sum of its parts—the Producer/Engineer/Composer/Musician Perspective

When you hear a song that has a rhythmic quality that is soulful and emotionally moves you - we say that track is in the “groove." But how did we get there? This panel brings together an international group of award-winning individuals, each sharing their perspective on how their craft approaches each song/project and how they interact with the other players on the track with the goal of obtaining the maximum groove.

Moderators
avatar for Rafael Sardina

Rafael Sardina

Producer/Mixer/Engineer
Los Angeles based 17-Time Grammy® & Latin Grammy® Winner Producer/Mixer/Engineer Rafa Sardina has built an eclectic and impressive client list over the years: Stevie Wonder, Camila Cabello, D’Angelo, Dr. Dre, Rosalía, John Legend, Elvis Costello & The Roots, Lady Gaga, Michael... Read More →

Speakers
avatar for Roland Gajate-Garcia

Roland Gajate-Garcia

Multi-Percussionist/Drummer/Producer/Artist
Roland Gajate-Garcia is a dynamic and versatile multi-percussionist, drummer, producer, and artist based in Los Angeles.Roland played as lead percussionist for Kanye West’s groundbreaking Sunday Service performances and also performed on the Superbowl LIV halftime show with J Lo... Read More →
avatar for Alan Meyerson

Alan Meyerson

Producer/Mixer/Engineer
Brooklyn-born Alan Meyerson was a trumpet performance major at Brooklyn College in 1977 when he discovered it took actual talent to make a living as a musician. On that fateful day when he realized it was time for “Plan B” he was roaming the hallways of The Gershwin Theater and... Read More →
avatar for Arturo O’Farrill

Arturo O’Farrill

Pianist/Composer/Educator
Arturo O’Farrill, pianist, composer, and educator, was born in Mexico and grew up in New York City. Arturo’s professional career began with the Carla Bley Band and continued as a solo performer with a wide spectrum of artists including Dizzy Gillespie, Lester Bowie, Wynton Marsalis... Read More →


Thursday October 21, 2021 1:00pm - 2:00pm EDT
Stream A

2:00pm EDT

Keynote: Poppy Crum - Enhancing Human Capabilities: A Sound Future
Speakers
avatar for Poppy Crum

Poppy Crum

Neuroscientist and Technologist
Poppy Crum, PhD, is a neuroscientist and technologist. She is currently Chief Technology Officer for Trimble Inc. and was previously Chief Scientist at Dolby Laboratories. Poppy is also an Adjunct Professor at Stanford University in the Center for Computer Research in Music and Acoustics... Read More →


Thursday October 21, 2021 2:00pm - 3:00pm EDT
Stream A

3:00pm EDT

How YOU Are Protecting Legacies: Topline Results from the Recording Academy P&E Wing and Iron Mountain Entertainment Services Survey
We’ll share a first look at results from the recent survey of the P&E Wing and AES music production and engineering community on current practices and challenges in audio archiving. This survey followed on the heels of the archiving summit hosted by IMES and the P&E Wing in June 2021: “Protecting Legacies: The Art, Science and Value of Musical Archives.”

Focused on the critical role archiving and preservation play in artists’ careers, the summit featured key stakeholders in the music community from archivists to artists, producers, engineers, managers, lawyers, and publishers. Join us to hear some of the survey results and what your colleagues are saying about what is—and what isn’t—working in archiving, workflow, and preservation technology.

Moderators
avatar for Meg Travis

Meg Travis

Director, Head of Global Marketing, Iron Mountain Entertainment Services
Meg Travis is Director, Global Head of Marketing for Iron Mountain Entertainment Services (IMES). A musician with more than 25 years of marketing and communications experience, Meg spends her time at work helping to spread the word about all the ways in which IMES helps to protect... Read More →

Speakers
avatar for Cheryl Pawelski

Cheryl Pawelski

Co-founder, Omnivore Records
Two-time Grammy® Award-winning producer, Cheryl Pawelski has, for more than 30 years, been entrusted with preserving, curating, and championing some of music’s greatest legacies. Before co-founding Omnivore Entertainment Group, she held positions at Rhino, Concord, and EMI-Capitol... Read More →
avatar for Maureen Droney

Maureen Droney

Sr. Managing Director, The Recording Academy Producers & Engineering Wing
Maureen Droney is the Sr. Managing Director of the Recording Academy (GRAMMY Awards) Producers & Engineers Wing. A former recording engineer herself, Maureen has worked on GRAMMY-winning recordings for artists including Aretha Franklin, Whitney Houston, and Santana—as well as on... Read More →



Thursday October 21, 2021 3:00pm - 4:00pm EDT
Stream B

3:00pm EDT

Guidelines for Stream Loudness
Internet Audio Streaming and on-demand file playback have become major sources of media delivery, affecting the ways that audio is recorded, mixed, post-produced, and delivered. Excessive loudness compromises quality, inconsistent loudness annoys listeners. To resolve these issues, the AES Technical Committee for Broadcast and Online Delivery has created recommended guidelines for establishing and implementing an effective Distribution Loudness for streaming and on-demand audio file playback. This session will have members of the committee discussing the recently released TD1008 Recommendations for Loudness of Internet Audio Streaming and On-Demand Distribution.

Moderators
avatar for David Bialik

David Bialik

Consultant, David K. Bialik & Associates
Audio has been an important part of David's professional work, beginning with U.S. major market radio engineering in 1983, becoming chief engineer at United Broadcastings’ WKDM in 1991.  He worked at NPR affiliate WAMU-FM and the NAB Science and Technology Department, publishing... Read More →

Speakers
avatar for Rob Byers

Rob Byers

American Public Media
avatar for Eelco Grimm

Eelco Grimm

Grimm Audio
avatar for Bob Katz

Bob Katz

Digital Domain, Inc.
Bob Katz, workshop co-chair, has been an audio engineer since 1971 and AES member since 1975. He’s played the Bb Clarinet since age 10. Currently President and mastering engineer at Digital Domain, he is an AES Fellow, engineered 3 Grammy-winning albums and numerous nominees. He... Read More →
avatar for John Kean

John Kean

Senior Engineer, Cavell & Mertz
John applies 40 years of experience in television and radio technology.  As Senior Technologist at National Public Radio until 2015, he directed network projects and technical studies at NPR Labs.  He started in radio as an FM chief engineer in San Diego, leaving to join NPR in... Read More →
RO

Roy Orban

Orban Electronics
avatar for Jim Starzynski

Jim Starzynski

Director and Principal Audio Engineer, NBCUniversal
Jim Starzynski is Director and Principal Audio Engineer for NBCUniversal, overseeing audio technologies and practices for NBCUniversal television properties. Jim is responsible for establishing NBC’s audio strategy for digital television and works on advanced technologies.Jim is... Read More →


Thursday October 21, 2021 3:00pm - 4:00pm EDT
Stream A

4:00pm EDT

AES TechTour: The Power Station/Berklee NYC - Special Encore Presentation
Virtual Tour of Power Station/Berklee NYC, produced by Simon Yu, co-produced by Stephen Webber (Berklee).

Speakers
avatar for Roy Hendrickson

Roy Hendrickson

Chief Engineer, Power Station at BerkleeNYC
Roy Hendrickson is an engineer and mixer whose focus for the past 30 years has been on the integration of musical creativity with the evolving studio landscape. His musically rich discography is a testament to the work he has done with many artists and producers including Miles Davis... Read More →
avatar for Gloria Kaba

Gloria Kaba

Studio Manager, Power Station at BerkleeNYC
Gloria Kaba is a Ghanaian-American sound engineer, producer, mixer, and writer with over a decade of experience in the studio, often operating under the moniker Redsoul. She’s worked on A Tribe Called Quest’s final album We Got It From Here...Thank You For Your Service and Solange’s... Read More →
avatar for Ian Kagey

Ian Kagey

Director of Operations, Power Station at BerkleeNYC
Ian Kagey is a Grammy-nominated engineer, musician, and educator who came up at MSR Studios (formerly Right Track Recording) in New York City. He has worked on countless professional recordings ranging from major motion picture soundtracks to Broadway cast albums. Kagey recently returned... Read More →
avatar for Stephen Webber

Stephen Webber

Executive Director of BerkleeNYC, Berklee College of Music
Stephen Webber is Executive Director of BerkleeNYC, overseeing the iconic Power Station studios. Webber has hosted sessions for Janelle Monáe, Lin-Manuel Miranda, Kate McKinnon, Trey Anastasio, DJ Khaled, Chris Martin, John Legend, Bette Midler, Paul Simon, Mick Jagger, Wynton Marsalis... Read More →


Thursday October 21, 2021 4:00pm - 4:15pm EDT
Stream B

4:00pm EDT

Platinum Producers
Moderators
avatar for Julian Raymond

Julian Raymond

John Varvatos Records/Big Machine
Julian Raymond is a multi-Grammy Award-winning and Academy Award-nominated record producer and songwriter. He arrived in Nashville from Los Angeles in 2014 to work with Scott Borchetta at Big Machine Label Group.  Julian Raymond is the SVP of A&R at Big Machine John Varvatos Records... Read More →

Speakers
avatar for St. Vincent

St. Vincent

Annie Clark made her recorded debut as St. Vincent in 2007 with Marry Me, quickly becoming regarded as one of the most innovative and fascinating presences in modern music. Each of St. Vincent’s subsequent albums has met with critical acclaim and commercial success eclipsing that... Read More →


Thursday October 21, 2021 4:00pm - 5:00pm EDT
Stream A

4:15pm EDT

AESTechTour: Blackbird Studios - Special Encore Presentation
Founded in 2002 by sound engineer John McBride and his wife, country artist Martina McBride, Blackbird has become one of Nashville's preeminent sound studios. The Blackbird complex now includes nine studios and houses The Blackbird Academy, a post-secondary audio engineering school. Driven by a passion for great audio, Blackbird boasts an attentive, professional staff and a gear inventory second to none.

Speakers
avatar for Mark Rubel

Mark Rubel

Director of Education and Instructor, Blackbird Academy


Thursday October 21, 2021 4:15pm - 5:00pm EDT
Stream B

5:00pm EDT

D&I Town Hall
AES Diversity & Inclusion Committee Co-Chair Cecilia Wu will facilitate a discussion on initiatives within the AES focused on Diversity & Inclusion. Multiple panelists from the D&I Committee will participate.

Speakers
avatar for Jonathan Wyner

Jonathan Wyner

Education and Music Engineering, AES
Record Production - technology influences on production aesthetics and extending our music vocabulary
avatar for Jiayue Cecilia Wu

Jiayue Cecilia Wu

Assistant Professor, University of Colorado Denver
Originally from Beijing, Jiayue Cecilia Wu (AKA: 武小慈) is a scholar, composer, vocalist, multimedia technologist, and audio engineer. Cecilia earned her Bachelor of Science degree in Design and Engineering in 2000. She then worked as a professional musician at EMI Records and... Read More →
avatar for Brian (Bt) Gibbs

Brian (Bt) Gibbs

Skyline Entertainment & Publishing
AES member passionate about every and all things immersive! Brian (Bt) "Gibbs" is co-owner & principal at Skyline Entertainment & Publishing in Silicon Valley. With 30+ record credits in 2020, Gibbs uses his B.M. from Berklee College of Music (Jazz Composition) as a record producer... Read More →
avatar for Mary Mazurek

Mary Mazurek

Recording Engineer
GRAMMY-nominated recording engineer based in Chicago.
avatar for Agnieszka Roginska

Agnieszka Roginska

Professor of Music Technology, New York University
Agnieszka Roginska is a Professor of Music Technology at New York University. She conducts research in the simulation and applications of immersive and 3D audio including the capture, analysis and synthesis of auditory environments, auditory displays and applications in augmented... Read More →
avatar for Joey Stuckey

Joey Stuckey

Owner, Producer & Senior Engineer, Shadow Sound Studio
Joey Stuckey is the Official Music Ambassador of his hometown of Macon, Georgia. Joey spends every moment living life to the fullest and sharing his story and inspirational spirit through his musical performances and speaking engagements. Asa toddler, Joey was diagnosed with a brain... Read More →


Thursday October 21, 2021 5:00pm - 6:00pm EDT
Stream B

6:00pm EDT

Case Studies: Deploying Audio Networks for Live Sound Applications Using Milan
Touring audio systems are built under unforgiving circumstances: they must be set up to be configured quickly while scaling to venues of various sizes. With thin financial margins and often constrained physical space, using audio networking will reduce cables and consolidate distribution, but the savings cannot come at the cost of quality, flexibility, or ease of use; the audio data must be precisely timed and utterly reliable for it all to work.

This session will examine IEEE-based audio networking protocol Milan in particular, a widely accepted solution to the audio distribution challenges posed by live sound applications. Through live sound case studies, including Roskilde Festival, Northern Europe’s largest running music festival, Rammstein stadium tour featuring 180+ networked Milan amplifiers/processors, Arcade Fire tour across over 50 locations, the 17,000 seat Hollywood Bowl venue, Metallica tour, and others, the presenters will demonstrate in detail how Milan networks would be deployed and configured in a variety of live sound applications. We will examine the capabilities and constraints of the protocol, clarifying how they impact system design.

The real-world case study examples will also demonstrate rapid deployment of a reliable, deterministic audio network with interoperable audio endpoints from a wide variety of vendors. Attendees will learn about the core challenges of AV networking and how to solve them, as well as when to select Milan for audio distribution – and upon selection, how to design the network, specify components, and deploy the system correctly – crucial considerations under touring timelines.

Visit www.avnu.org/milan to learn more, or visit the Milan public forum to ask questions and connect directly with Avnu’s Pro AV members.

Speakers
avatar for Richard Bugg

Richard Bugg

Meyer Sound
avatar for Morten Lave

Morten Lave

Adamson Systems
avatar for Scott Sugden

Scott Sugden

Product and Technology Outreach Manager, L-Acoustics
Scott Sugden contributes to the success and growth of L-Acoustics by liaising with key stakeholders across geographies and company functions to help L-Acoustics identify and capitalize upon new product opportunities. He is responsible for accompanying L-Acoustics products from launch... Read More →


Thursday October 21, 2021 6:00pm - 7:00pm EDT
Stream B

7:00pm EDT

Tutorial: What does it take to write a paper for the Journal of the Audio Engineering Society?
Do you ever wonder if you could write a paper for the Journal of the Audio Engineering Society (JAES)? Would you like to be known as a published author? Are you curious about how technical papers are prepared and submitted? Ever wonder what "peer review" means? If any of these questions apply to you, then this tutorial will help get you started! JAES--which has been in continuous publication since 1953--presents technical articles in all areas of audio engineering practice. While authors often come from established companies in the audio industry, academic institutions, and government laboratories, individuals of any background are also eligible to submit manuscripts for publication consideration. Submissions need to present prominent, state-of-the-art accomplishments in the domain of audio engineering. This tutorial covers the basic requirements of JAES manuscripts, explains the publication review process, and gives many important tips on what it takes to be the author of a successful technical paper. Come learn how YOU can be a key contributor to the technical knowledge contained in the AES Journal.

Speakers
avatar for Robert C. Maher

Robert C. Maher

Professor, Montana State University
Audio digital signal processing, audio forensics, music analysis and synthesis.
avatar for Joshua Reiss

Joshua Reiss

Queen Mary University of London
Josh Reiss is a Professor with the Centre for Digital Music at Queen Mary University of London. He has published more than 200 scientific papers, and co-authored the book Intelligent Music Production, and textbook Audio Effects: Theory, Implementation and Application. He is the President-Elect... Read More →


Thursday October 21, 2021 7:00pm - 8:00pm EDT
Stream B

8:00pm EDT

New Developments In Small Room Acoustics – Accent of The “e-Studio” + Innovative Predictive Software e.g. NIRO™
New Developments In Small Room Acoustics – Accent of The “e-Studio” + Innovative Predictive Software e.g. NIRO™ - Led by WSDG founding partner, John Storyk , REDIAcoustics co-principal / RPG Diffusor Systems founder, Dr. Peter D’Antonio and Abbott Road Studio owner /Berklee College of Music Music, Boston Music & Engineering Dept. Chair, Rob Jaczko, this virtual panel will address the rise of a new generation of powerful, diverse, compact recording & mixing studios. And, the evolution of NIRO™, an innovative wave-based, computer-modeling tool created to facilitate the design of these new production environments. This 90-minute AES panel (additional participants TBD) will reveal how NIRO’s findings were achieved and their impact on the final design process on the e-Studio and traditional facility design.

Moderators
avatar for John Storyk

John Storyk

Founding Partner, WSDG
Since graduating from Princeton as an architect in 1968, and co-founding WSDG (Walters-Storyk Design Group) with his wife, Beth Walters in 1986, John Storyk has been involved with the design of over 3500 recording studios, corporate conference rooms, performance venues, broadcast... Read More →

Speakers
avatar for Dr. Peter D’Antonio

Dr. Peter D’Antonio

REDIAcoustics
Dr. Peter D’Antonio is a pioneering sound diffusion expert. He received his B.S. from St. John’s University in 1963 and his Ph.D. from the Polytechnic Institute of Brooklyn, in 1967.  During his scientific career as a diffraction physicist, Dr. D’Antonio was a member of the... Read More →
RJ

Rob Jaczko

Berklee College of Music
Rob Jaczko is Chairman of the Berklee College of Music Boston, MA Music Production & Engineering Department and, a Platinum and Gold Record RIAA certified engineer whose credits include Bruce Springsteen, James Taylor, Warren Zevon, Don Henley, and Sheryl Crow. In planning a home... Read More →
avatar for David Molho

David Molho

WSDG
David Molho graduated Magna Cum Laude as a Music Production & Engineering major from Berklee College of Music in 2011. Since then he has worked as an engineer, producer, and composer for world-renowned record labels in his Miami, Fla.-based Groovyland Studios. His WSDG assignments... Read More →
avatar for Rinaldi Petrolli

Rinaldi Petrolli

Acoustical Engineer, REDIAcoustics
Rinaldi Petrolli earned a 2019, B.S. degree in Acoustic Engineering from the Federal University of Santa Maria, Brazil. While at university, his studies included: room acoustics, digital signal processing, electroacoustics, environmental acoustics, psychoacoustics, building acoustics... Read More →


Thursday October 21, 2021 8:00pm - 9:30pm EDT
Stream B
 
Friday, October 22
 

11:00am EDT

The Recording Academy P&E Wing presents "Whose Job Is It Anyway?"
With Maureen Droney (Recording Academy P&E WIng), Cameron Craig (MPG), Craig Rosen (head of A&R Admin at Warner), Sylvia Massy, and manager Jr. Regisford.

Friday October 22, 2021 11:00am - 12:00pm EDT
Stream A

12:00pm EDT

Streaming: Mastering, Preparation, Aggregation, Distribution and Delivery of Files
The relationship between Audio Mastering and Streaming Distribution is still in its infancy. This workshop aims to clear up misconceptions, open up a dialogue between Mastering and Distribution and help bring this relationship to a new level of maturity.

Topics will include:

  1. Acceptable formats for delivery to distributors: WAV, FLAC, ALAC? Wordlengths? Sample rates? What about MQA? Now that more streamers have begun to stream high resolution audio, these concerns become even more important.
  2. Metadata. Is it necessary or desirable to encode source files with metadata, including ISRC, catalogue number, artist, date, loudness?
  3. Sample rate conversion. Who should perform it? What source sample rates are being stored and how are they streamed? What are streamers doing about multiple sources: Lossy, lossless, and multiple sample rates?
  4. Streaming levels: What happens to our master files when they are streamed, played back and loudness normalized?
  5. File Names: Is there a recommended or standardized format for naming master files?

The panel will consist of mastering engineers and representatives of streaming firms and distributors. We expect a lively discussion.

Please note: The discussion will continue in Discord following the session! You can join the server HERE.

Moderators
avatar for Anna Frick

Anna Frick

Mastering Engineer, Airshow Mastering
Anna Frick, workshop co-chair, is a Mastering Engineer at Airshow Mastering in Colorado, working primarily with independent artists across a wide variety of genres, including several Grammy-nominated projects. Anna serves as a Trustee of the Recording Academy, Vice Chair of the AES... Read More →
avatar for Bob Katz

Bob Katz

Digital Domain, Inc.
Bob Katz, workshop co-chair, has been an audio engineer since 1971 and AES member since 1975. He’s played the Bb Clarinet since age 10. Currently President and mastering engineer at Digital Domain, he is an AES Fellow, engineered 3 Grammy-winning albums and numerous nominees. He... Read More →

Speakers
RD

Robert D'Amico

Sr. Director of Product Management, Audio Services, Avid
Rob D'Amico is the Sr. Director of Product Management, Audio Services at Avid. He brings more than 25 years of experience in music and audio production to his role at Avid Technology. He delivers new audio services and solutions to enable and inspire artists to be more creative.
AH

Anna Held

Director of Product Development, CD Baby
Anna Held started in the music industry in 2004 as a project manager at Disc Makers, where she helped independent creators get their content ready for CD manufacturing. She has been at CD Baby since 2008, spending 11 years as Director of Artist Services and now is Director of Product... Read More →
avatar for Buddy Judge

Buddy Judge

Advanced Media Design, Apple
Buddy Judge likes music. He has been on both sides of the microphone during his career — as a recording artist and as a producer/engineer. He currently works at Apple Inc. where he developed the Apple Digital Masters program and recently launched Lossless and Spatial audio on Apple... Read More →
avatar for Dan Mackta

Dan Mackta

Managing Director - Qobuz USA, Qobuz
Dan Mackta joined Qobuz in 2018 after decades spent on the label and artist side of the business. In February 2019, he launched the US version of this French Hi-Res music platform. As VP of Marketing at Jive Records and RCA Records, he oversaw all projects for artists as diverse as... Read More →
KM

Kevin Moo

CEO and Founder, Alpha Pup Records
Kevin Moo (aka Daddy Kev) is an American DJ, Grammy Award-winning audio engineer, record producer and executive from Los Angeles, California. He is CEO and founder of Alpha Pup, a global independent music distributor, and leads development on its cloud-based supply chain platform... Read More →
avatar for Andreas Rossholm

Andreas Rossholm

Senior Research Scientist, Spotify
Andreas Rossholm, Ph.D. in Applied Signal Processing, is a Senior Research Scientist at Spotify focusing on Audio and Video quality from injection to consumption. Earlier Andreas has worked at Microsoft Skype Division, responsible for technical benchmarking and before that he worked... Read More →
SS

Shawn Singh

Senior Engineer for Audio Processing, YouTube
Shawn Singh is a senior engineer for audio processing at YouTube, and has been at Google for 10 years. Shawn has loved making music since childhood, has enjoyed many years as a violinist in youth and college symphony orchestras. He continues to enjoy music as a hobby through virtual... Read More →


Friday October 22, 2021 12:00pm - 2:00pm EDT
Stream A

2:00pm EDT

Keynote: John McBride with Chuck Ainlay
One of the most valuable lessons you can teach people is the importance of giving back. Devoting your time and sharing your knowledge and life experiences for the next generation to learn and grow have been foundational and represent core values for John McBride, owner of the famed Blackbird Studio and Institutional Director of The Blackbird Academy. John has spent his entire life in the pursuit of quality, professional sound as well as giving back to the industry that has given him so much. In 2013, John opened The Blackbird Academy, dedicated to advancing the quality of recording education through a hands-on, professional training curriculum, whereby he regularly appears at classes and shares his real-life experiences and knowledge with the students.

In an interview with Chuck Ainlay, multi-GRAMMY® Award-winning producer/mix engineer (Mark Knopfler, Peter Frampton, George Strait, Miranda Lambert) and founding member of METAlliance, hear how John began in the industry, starting a live sound company in his hometown of Wichita, then coming to Nashville in the 1990s and selling his company to Clair Brothers Audio, opening up Blackbird Studio and finally starting the Blackbird Academy, with all of the challenges he faced along the way, including keeping audio education going during the pandemic.

Moderators
avatar for Chuck Ainlay

Chuck Ainlay

Music Producer/Engineer, Freelance Engineer/Producer
Multiple Grammy and ACM Award-winning Producer/Engineer, Chuck Ainlay has recorded and/or mixed well over 300 albums, including work with legendary and current Country music artists such as George Strait, Taylor Swift, Miranda Lambert, Lee Ann Womack, Steve Earle, Lyle Lovett, Emmylou... Read More →

Speakers
avatar for John McBride

John McBride

Blackbird Studios


Friday October 22, 2021 2:00pm - 3:00pm EDT
Stream A

2:15pm EDT

Examining the tools and techniques for live performance in a DAW
This session will examine the various effects and techniques that can be utilised in a DAW for live performance. It will build upon a previous workshop that focused on preparing the DAW from studio to stage. The presentation will address launch modes, transitions, using third party software and building bespoke effects racks. Using these tools and techniques will allow the performer to remix live on the fly, to create a unique performance.

Speakers

Friday October 22, 2021 2:15pm - 3:00pm EDT
Stream B

3:00pm EDT

Evolving Production Trends
Production choices can define a genre, or even a moment in time. Iconic productions can change the game, and create a sound everyone else is chasing. In this discussion, we will explore trends of the past, what’s current and what’s coming next.

Moderators
avatar for Angela Piva

Angela Piva

Audio/Mix Engineer & Producer / Professor
Angela Piva, music industry ace, is highly skilled in all aspects of music/audio production, recording, mixing, and mastering with over 30 years of professional audio engineering experience and accolades. Recognized by several Grammy award nominations from NARAS as well as RIAA multi-platinum... Read More →

Speakers
avatar for MarioSo De Jesus

MarioSo De Jesus

Mixer/Producer
MarioSo De Jesus, aka MarioSo, is recognized throughout the music industry as one of the premier mixers/producers in the Latin urban genre. He has been called "the revolutionary sound of Latin urban music", "the ear responsible for your favorite hits", and "the man that will transform... Read More →
avatar for Christal Jerez

Christal Jerez

Recording and Mixing Engineer
Christal Jerez is currently working out of Los Angeles as a freelance recording and mixing engineer. After studying audio production at American University for her Bachelor's program and at New York University for her Master's program, she became a senior staff engineer at Platinum... Read More →
avatar for Paul Womack

Paul Womack

Owner/Chief Engineer, Willie Green Music
A producer, engineer and sonic artist, Paul "Willie Green" Womack has built a discography boasting names such as Wiz Khalifa, The Alchemist, The Roots, Billy Woods, ELUCID and many more, and established himself as one of the top names in independent Hip-Hop & R&B. Currently residing... Read More →


Friday October 22, 2021 3:00pm - 4:00pm EDT
Stream A

3:00pm EDT

Heyser Memorial Lecture: Frequency Modulation Sound Synthesis
The Heyser Memorial Lecture at the 151st Fall 2021 online show will be presented as the second of a two-part Heyser series by John Chowning.
 
In this lecture, "Frequency Modulation Sound Synthesis," Dr. Chowning will present the follow-up to his October 2020 introductory Heyser Lecture, "Realizing a Dream, a Discovery, and Lissajous Figures."

Dr. Chowning will explain why the YAMAHA DX7 became the "face" of FM Synthesis (FM), the circumstances of FM's unlikely discovery, and an important technical detail regarding reflected sidebands around 0Hz. He will then demonstrate a powerful synthesis concept, spectral modeling, using FM. Finally, he will present the theory of a composition that found a perfect complement in FM theory and could only be realized by FM synthesis.

https://ccrma.stanford.edu/~jc/heyser/AES-HeyserLecture-2021-1.5.5corrected.pptx


Speakers
avatar for John M. Chowning

John M. Chowning

Professor of Music, Emeritus, Stanford University
John M. Chowning was born in Salem, New Jersey in 1934. Following military and university studies, he studied composition in Paris in 1959 for three years with Nadia Boulanger. In 1966 he received a doctorate in composition from Stanford University, where he studied with Leland Smith... Read More →


Friday October 22, 2021 3:00pm - 4:00pm EDT
Stream B

4:00pm EDT

Alicia Keys: Immersive Re-Imagining
Please join Producer/Engineer Ann Mincieli, along with Mix engineers George Massenburg and Eric schilling, and Mastering engineer Michael Romanowski for a discussion on the approach to revisiting the tremendous catalog of Alicia Keys and re-imagining the releases as immersive audio. This project started more than two years ago and will contain all of her releases. They will be sharing information about the process of getting back to the original recording, and how the process of making her records over time has evolved, and how it affects the re-imagining into Immersive audio. Perspective, best practices, artist approval, and referencing, along with audio examples will all be covered in this presentation. There will be a Q&A portion of the presentation to allow for questions from the audience.

Moderators
avatar for Brian (Bt) Gibbs

Brian (Bt) Gibbs

Skyline Entertainment & Publishing
AES member passionate about every and all things immersive! Brian (Bt) "Gibbs" is co-owner & principal at Skyline Entertainment & Publishing in Silicon Valley. With 30+ record credits in 2020, Gibbs uses his B.M. from Berklee College of Music (Jazz Composition) as a record producer... Read More →

Speakers
avatar for George Massenburg

George Massenburg

Associate Professor of Sound Recording, George Massenburg Labs
...designed, authored and presented the 1972 AES paper on the Parametric Equalizer and is regularly published in professional journals and trade magazines worldwide....chartered several electronics companies, most notably GML, INC (now GML, LLC) in 1982 to produce high-performance... Read More →
avatar for Ann Mincieli

Ann Mincieli

Jungle City Studios
Ann Mincieli, longtime engineer and studio coordinator for Alicia Keys, has channeled her talents into making Jungle a technical prowess featuring the very best in vintage and modern technology. With over twenty years of experience in the music business, Ann Mincieli has traveled... Read More →
avatar for Michael Romanowski

Michael Romanowski

Coast Mastering
Michael Romanowski is a Grammy-nominated mastering engineer and owner/chief engineer of Coast Mastering, based in Berkeley, California. He works in a purposefully built 9.1.6 immersive mastering facility, designed with Bob Hodas and Bob Levy. It was awarded a TEC Nomination for studio... Read More →
avatar for Eric Schilling

Eric Schilling

Norfolk Music
Eric started his audio career at the age of 16 in community radio doing radio production. At 19 years of age, he became an assistant recording engineer at the Record Plant Studios in Sausalito Ca. Over the course of his career, he has worked with Sly Stone, Fleetwood Mac, Natalie... Read More →


Friday October 22, 2021 4:00pm - 5:00pm EDT
Stream B

4:00pm EDT

Mastering Mastering
Women's Audio Mission (WAM) is proud to present a conversation with Mastering engineers: Jett Galindo, Piper Payne, and Maria Rice. This conversation will explore the art of fine tuning final audio products, favorite tools and developing a unique sound.

Moderators
avatar for Mary Ann Zahorsky

Mary Ann Zahorsky

Women's Audio Mission
Mary Ann is an engineer/producer with over 25 years of experience in the recording industry. She has worked on projects ranging from Howard Wiley’s 2nd Line, Afro-Cuban jazz, Whitney Houston, and MC Hammer, to soundtracks for independent films and TV post work on ad campaigns such... Read More →

Speakers
avatar for Jett Galindo

Jett Galindo

Mastering Engineer and Vinyl Cutter, The Bakery
Los Angeles-based mastering engineer and vinyl cutter, Jett Galindo of The Bakery, has worked on albums spanning a wide array of genres and artists (Barbra Streisand, Haley Reinhart, Weezer, the La La Land OST, to name a few). Jett Galindo carries on the legacy left behind by her... Read More →
avatar for Piper Payne

Piper Payne

Mastering Engineer, Infrasonic Mastering
In Spring of 2019 Piper joined the famed Infrasonic Mastering team, merging her successful Oakland studio, Neato Mastering, with the Nashville-based company, owned by Pete Lyman. Piper is a Trustee of the Recording Academy and she is Co-Chair of the P&E Wing Steering Committee. She... Read More →
avatar for Maria Rice

Maria Rice

Chief Mastering Engineer, Peerless Mastering
Maria Rice is a Boston-based mastering and restoration engineer. She has been a Recording Academy member since 2011 and serves as Chapter Governor on the New York Board.  For 12+ years, she has been one of two engineers at Peerless Mastering, working with clientele ranging from indie-famous... Read More →


Friday October 22, 2021 4:00pm - 5:00pm EDT
Stream A

5:00pm EDT

It’s not just audio anymore – understanding how to mix audio, with lighting and video
Many audio engineers were thrust forward into the realms of video production via the pandemic. Even prior, the lines between audio and various forms of media production have blurred. In this workshop, lighting and videographers join up with audio professionals to share the importance of understanding the fundamentals of video production, including timecode, the importance of lighting, work with grips & lighting technicians, as well as various forms of video production capture and production and how they are impacted by audio production and vice versa.

Speakers
EB

Emily Bishai

Minnesota Opera
AK

Alex Kosiorek

Manager, Central Sound at Arizona PBS / Audio Engineering Society
ER

Eric Romani

American Public Media


Friday October 22, 2021 5:00pm - 6:00pm EDT
Stream A

5:00pm EDT

Keynote: FINNEAS with Jack Douglas
Discussion with FINNEAS (Billie Eillish, Halsey, Selena Gomez, Ben Platt, Camila Cabello, Justin Beiber) led by Jack Douglas (John Lennon, Patti Smith, Cheap Trick, Aerosmith).

Moderators
avatar for Jack Douglas

Jack Douglas

Musician, Composer, Arranger, Audio Engineer, Producer
Produced artists as varied as Alan Ginsburg to Zebra. Aerosmith to Patti Smith. Engineered and mixed too many people to list. Played bass with Chuck Berry and many other artists. Signed to major labels as an artist and writer. Composed music for tv and film. Honors include Tech Hall... Read More →

Speakers
avatar for FINNEAS

FINNEAS

Singer-Songwriter, Record Producer, Audio Engineer, Actor
Finneas notices the little things. He was always observant though. As a kid, it wasn’t out of the ordinary for him to ponder existence, worry about the end of the world, and wonder what happens when we die. He was pretty sure everybody else thought of the same things too (much to... Read More →


Friday October 22, 2021 5:00pm - 6:00pm EDT
Stream B

6:00pm EDT

"It SOUNDS Just Like a Telefunken U-47"
Producer/engineer/songwriter Karen Kosowski and Mark Rubel will discuss various aspects and uses of modeling microphones, including comparisons with hardware microphones and their “clones”, and free-standing mic modeling software. Unidentified real-world recorded examples of each are available to download, compare and rate at this link: https://bit.ly/3AQL3po

The results and identities will be revealed at the end of the presentation. Try to tell which are emulations, and which “real” U47’s!

Speakers
avatar for Karen Kosowski

Karen Kosowski

Producer, engineer, songwriter
avatar for Mark Rubel

Mark Rubel

Director of Education and Instructor, Blackbird Academy


Friday October 22, 2021 6:00pm - 7:00pm EDT
Stream B

7:00pm EDT

How to Teach Music Mixing
In the interest of continuing the well-received workshop series on Music Mixing (a five-part group of workshops over several years), the panelists have decided this time to focus more on teaching mixing rather than how we mix. An in-depth discussion will ensue on many topics: How to evaluate each students' basic skillset and approach at the beginning of the school year, how to ensure each student masters the fundamentals of mixing in a reasonable amount of time, and how to customize the learning experience for each candidate. Stereo and multichannel mixing techniques will be discussed, including immersive/3D mixing strategies. What works - and what doesn't work?!? This workshop will be beneficial to current and future educators and students alike. As an interactive online workshop, we expect to solicit many questions and comments from the audience.

Moderators
avatar for Richard King

Richard King

McGill University
Richard King is an Educator, Researcher, and a Grammy Award winning recording engineer. Richard has garnered Grammy Awards in various fields including Best Engineered Album in both the Classical and Non-Classical categories. Richard is an Associate Professor at the Schulich School... Read More →

Speakers
avatar for Alex Case

Alex Case

Sound Recording Technology, university of massachusetts lowell
Alex U. Case, Past President of the Audio Engineering Society, is a total audio junkie who has dedicated his professional life to the study of the history, aesthetics, perception, signal processing, electro-acoustics and room acoustics for creating and enjoying sound recordings. Alex... Read More →
avatar for George Massenburg

George Massenburg

Associate Professor of Sound Recording, George Massenburg Labs
...designed, authored and presented the 1972 AES paper on the Parametric Equalizer and is regularly published in professional journals and trade magazines worldwide....chartered several electronics companies, most notably GML, INC (now GML, LLC) in 1982 to produce high-performance... Read More →
avatar for Susan Rogers

Susan Rogers

Berklee College of Music
Susan Rogers holds a doctoral degree in experimental psychology from McGill University (2010). Prior to her science career, Susan was a multiplatinum-earning record producer, engineer, mixer and audio technician. She is best known for her work with Prince during his peak creative... Read More →


Friday October 22, 2021 7:00pm - 8:00pm EDT
Stream B

7:30pm EDT

Remote Podcast Production: How New Audio Technologies are Opening the Door to More Storytelling
Using a combination of internet and studio technologies today’s podcast producers have more options to bring together casts from across the globe.

SueMedia and Curveballs Productions will host a live performance of selected moments from Episode 1 of "Curveballs", followed by an audience Q&A.

Join Sue Zizza and David Shinn from SueMedia Productions as they demonstrate software options and discuss how to bring an ensemble cast together.

"CURVEBALLS" is a new original comedy podcast series inspired by Co-Creator Jennifer Jiles' memories hosting the NY Met’s baseball show for kids “DYNAMETS!”

Originally conceived for television, the current success of audio fiction podcasts led Jennifer and co-writer Fred Stroppel to partner with award-winning SueMedia Productions. Jiles and Stroppel, with executive producer Michael Guccione, are excited to be working with SueMedia to get this unique project off the ground.

"Curveballs" takes us behind the scenes of a major league baseball TV show for kids, as a female producer does battle with the entrenched forces of network and sports tradition. A classic workplace comedy series, “30 Rock” meets “Major League”, with all the comic craziness that naturally ensues when you put unruly kids, sexy hosts, and horny ballplayers on the same field.

It's a hilarious look at the ever-shifting power plays between men and women in a world where everything is a sport.

In addition to a stellar podcast cast, there will be guest appearances from sportscasters Todd Ant from ABC Radio and Marc Ernay from WINS who have agreed to do cameos throughout the series.

Moderators
avatar for David Shinn

David Shinn

SueMedia Productions
David Shinn is co-owner of SueMedia Productions and Radio Waves Studios. He is an award-winning sound designer/engineer and foley (SFX) artist for live and studio productions. Most recently he mixed the Emmy-nominated series Lucky Chow, as well as the Emmy-nominated documentary The... Read More →
avatar for Sue Zizza

Sue Zizza

Director / Producer / Festival Organizer, SueMedia Productions
Sue Zizza –was named the 2021 Norman Corwin Award for Excellence in Audio Theater, Sue is a transmedia producer, director, writer, and sound designer. She is the owner of SueMedia Productions, a full-service audio production company, and also Radio Waves Studios, a premiere audio... Read More →

Speakers
avatar for Todd Ant

Todd Ant

Todd has been broadcasting in the radio industry for 5 decades beginning in 1987 when he began a 10 year run at WCBS-Newsradio 88 reporting on all the local teams in the New York/New Jersey area.In 1999 He was named Sports Anchor/reporter for ABC Radio Network where he covered all... Read More →
avatar for Marc Ernay

Marc Ernay

Marc is Sports Director and morning drive reporter for 1010 WINS radio.The Hofstra University grad, for over two decades, has been the station’s modern-day “Magellan,” covering almost every sport you can think of.He also hosts the "On the Marc" podcast series and mans the finish... Read More →
avatar for Jennifer Jiles

Jennifer Jiles

Emmy Award-winning host of the NY Mets baseball show for kids “DYNAMETS!” and former Radio City Rockette Jennifer Jiles has appeared on CBS' "Bull" and Marvel's "Jessica Jones", iHeart radio NY Times Top 10 podcast “Lethal Lit”, is featured in the Judd Hirsch web series "Small... Read More →
avatar for Jack Mulcahy

Jack Mulcahy

A 15-time Award winner, Jack Mulcahy is a lifelong New Yorker, with over 200 film, television, commercial, and theater credits to his name, he is the quintessential working New York actor. Memorable roles include the lead of Jack McMullen in the independent breakout film and Sundance... Read More →


Friday October 22, 2021 7:30pm - 8:30pm EDT
Stream A

8:00pm EDT

Tickets For Doomsday - A discussion of a reissue from capture to shrink-wrap
Given the resurgence in vinyl popularity and a burgeoning industry of reissue labels both large and small, Tickets for Doomsday presents a case study of a reissue sourced from vinyl and released on vinyl. Remastering from vinyl is necessary when the original masters have been lost (or taped over) and the only sources available come from commercially released records. This panel will cover optimal capture of the vinyl source and restoration and mastering of the digitized material in preparation for a vinyl release. We will discuss aesthetics of declicking and restoration, balancing historic and contemporary aesthetics, and preparation of the masters for the manufacturing process, from cutting to plating to getting freshly pressed records into shrink wrapped jackets.

Moderators
avatar for Jason Bitner

Jason Bitner

Traffic Entertainment Group
Jason Bitner is a mastering engineer and production supervisor at an independent record distributor in the Boston area. His roots in the Boston music scene go back to repairing brasswinds for Rayburn music. Now, in addition to daily production operations, Jason can be found transferring... Read More →

Speakers
avatar for Jenn D’eugenio

Jenn D’eugenio

Women in Vinyl / Furnace Record Pressing
As the sales + customer service manager at Furnace Record Pressing, and vinyl enthusiast for over 20 years, Jenn is dedicated to the art and creation of vinyl. She is the founder of ‘Women In Vinyl’, and co-host of The Women in Vinyl Podcast. Her background spans that of designing... Read More →
avatar for Margaret K. Luthar

Margaret K. Luthar

Mastering Engineer, National Public Radio (NPR)/Freelance mastering engineer
Margaret Luthar is a broadcast and recording technician for NPR, as well as a freelance mastering/vinyl cutting engineer based out of Los Angeles, California. From 2019 to early 2022 she was the head mastering engineer at Welcome to 1979, and prior to that she was a mastering engineer... Read More →
avatar for Maria Rice

Maria Rice

Chief Mastering Engineer, Peerless Mastering
Maria Rice is a Boston-based mastering and restoration engineer. She has been a Recording Academy member since 2011 and serves as Chapter Governor on the New York Board.  For 12+ years, she has been one of two engineers at Peerless Mastering, working with clientele ranging from indie-famous... Read More →


Friday October 22, 2021 8:00pm - 9:00pm EDT
Stream B
 
Saturday, October 23
 

11:30am EDT

Sensational Envelopment
Reproduced sound has always struggled with the "Inception dilemma", listening to a room in a room. We examine one of the most fundamental qualities of the dilemma, the sensation of envelopment in human listeners, discuss its physiological origin, triggering and stimulating effect, and relate it to other perceptual modalities.

In addition to being an elusive sensation, conveying an enveloping experience is not assured by using a certain reproduction system, for instance "immersive". Factors in the enveloping feeling of spaces, like a church or an excellent concert hall, to survive or vanish during recording, distribution and reproduction are discussed.

Speakers
avatar for Thomas Lund

Thomas Lund

Senior Scientist, Genelec OY
Thomas Lund is senior technologist at Genelec OY, engaged in research, listening tests and measurement. He is author of papers on human perception, spatialisation, loudness, sound exposure and true-peak level. Thomas is convenor of working group TC108X/WG3 under the European Comission... Read More →


Saturday October 23, 2021 11:30am - 12:00pm EDT
Stream A

12:15pm EDT

AES TechTour: Skywalker Sound
Saturday October 23, 2021 12:15pm - 1:00pm EDT
Stream A

1:00pm EDT

Best Student Paper: InSE-NET: A Perceptually Coded Audio Quality Model based on CNN
Automatic coded audio quality assessment is an important task whose progress is hampered by the scarcity of human annotations, poor generalization to unseen codecs, bitrates, content-types, and a lack of flexibility of existing approaches. One of the typical human-perception-related metrics, ViSQOL v3 (ViV3), has been proven to provide a high correlation to the quality scores rated by humans. In this study, we take steps to tackle problems of predicting coded audio quality by completely utilizing programmatically generated data that is informed with expert domain knowledge. We propose a learnable neural network, entitled InSE-NET, with a backbone of Inception and Squeeze-and-Excitation modules to assess the perceived quality of
coded audio at a 48 kHz sample rate. We demonstrate that synthetic data augmentation is capable of enhancing the prediction. Our proposed method is intrusive, i.e. it requires Gammatone spectrograms of unencoded reference signals. Besides a comparable performance to ViV3, our approach provides a more robust prediction towards higher bitrates.

Speakers
GJ

Guanxin Jiang

Dolby Germany GmbH
avatar for Arijit Biswas

Arijit Biswas

Dolby Germany GmbH
CB

Christian Bergler

Pattern Recognition Lab, Friedrich-Alexander University Erlangen-Nuremberg
AM

Andreas Maier

Pattern Recognition Lab, Friedrich-Alexander University Erlangen-Nuremberg


Saturday October 23, 2021 1:00pm - 1:30pm EDT
Stream A

1:30pm EDT

Platinum Mastering: Mastering Immersive Audio
With the ability to bring immersive music to the masses in a way that has never been possible before, engineers, producers, and artists are presented with many new questions, among them being:

  • What is immersive audio? 
  • What format do I choose? 
  • How does one ensure that the music being listened to by the consumer is the best representation of the artist's intent? 
  • Does immersive audio need to be mastered?

This group of experts will talk about what mastering in immersive audio really means.

  • How is it similar or different to working with only one or two channels vs. multiple channels? 
  • How will the music be distributed and what are the considerations for streaming vs. physical media?

They will discuss perspectives and workflow, monitoring in headphone and speakers, formats, and delivery. They will leave time to answer questions from the attendees.

Moderators
avatar for Michael Romanowski

Michael Romanowski

Coast Mastering
Michael Romanowski is a Grammy-nominated mastering engineer and owner/chief engineer of Coast Mastering, based in Berkeley, California. He works in a purposefully built 9.1.6 immersive mastering facility, designed with Bob Hodas and Bob Levy. It was awarded a TEC Nomination for studio... Read More →

Speakers
avatar for Stefan Bock

Stefan Bock

Managing Director, msm-studios GmbH & Co. KG
Stefan Bock, born 20.08.1964 in southern Germany was starting his career in 1987 as an audio engineer. After freelancing in different facilities in Munich, he co-founded msm-studios in 1991 where he was the Chief Mastering Engineer and General Manager.He was leading msm-studios to... Read More →
avatar for Gavin Lurssen

Gavin Lurssen

Lurssen Mastering
Gavin Lurssen has been a mastering engineer since 1991 and has been in the forefront of and contributor to the many technical developments that have taken place in the industry over the past two decades. He is a twelve-time Grammy nominee and a four-time Grammy winner. Artists whose... Read More →
avatar for Darcy Proper

Darcy Proper

Owner / Mastering Engineer, Proper Prent Sound
Darcy Proper is a four-time Grammy award-winning mastering engineer whose work has covered everything from vintage re-issues to cutting-edge immersive audio releases. She has had the pleasure of working for international artists such as Johnny Cash, Dave Brubeck, Tony Bennett, Steely... Read More →


Saturday October 23, 2021 1:30pm - 3:00pm EDT
Stream A

3:00pm EDT

What Defines 'Audio for Cinema' - State of Play
With today's world of streaming, virtual movie premieres, mini-series, and more, how do we define what 'cinematic sound' is? Does it have to be presented or intended for a cinema? This workshop will discuss the current state of cinematic sound, how it has gotten to this point (including influences of technology and the COVID pandemic) and what the future of cinematic audio looks like.

Moderators
avatar for Ying Ying Zhang

Ying Ying Zhang

McGill University | New York University
YIng-Ying Zhang is a music technology researcher and sound engineer. She is the first woman to be admitted to the Sound Recording PhD program at McGill University, where she is projected to graduate in 2024. She has experience in both on-set and post-production film sound, including... Read More →

Speakers
avatar for Mark Camperell

Mark Camperell

Empty Sea Audio
Mark Camperell is the Founder & Creative Director at Empty Sea Audio. Mark’s leadership and ability to assemble uniquely talented audio teams has positioned Empty Sea as a leader in creative audio services ranging from sound effects libraries to original music composition to full-service... Read More →
avatar for Matthew Kulewicz

Matthew Kulewicz

Riot Games
An LA-based sound mixer specializing in VO recording and post-production for animation. Most recently he completed working on Space Jam: A New Legacy where he remotely recorded and edited all animated characters VO. His recent and current work includes projects at Riot Games, Dreamworks... Read More →
avatar for Adam Michalak

Adam Michalak

Hollywood Scoring
As one of the principals of Hollywood Scoring, Adam spends a great deal of time working with Riot Games Studios’ music department as a first call scoring mixer and esports music broadcast producer. His most recent projects include recording the music score for Sony Pictures’ Venom... Read More →
avatar for Ryan Young

Ryan Young

ADR Mixer/Recordist, Warner Bros.
Ryan Young is an experienced mixer and sound supervisor with a long history in the industry who is currently an ADR mixer for Warner Bros Studios. Some highlights of his recent work include Knives Out, Shazam!, Wonder Woman, and Mad Max: Fury Road.


Saturday October 23, 2021 3:00pm - 4:00pm EDT
Stream B

3:00pm EDT

Anatomy of a Hit Album Under Quarantine - Harmonium Symphonique
Histoires sans paroles - Harmonium Symphonique, a symphonic re-interpretation of the music of influential 1970's Montreal-based progressive folk-rock band Harmonium, was recorded, mixed and mastered during the pandemic through a long-distance collaboration between engineers in Montreal and Los Angeles. The studio sessions included recording the 68-piece Orchestre symphonique de Montreal (OSM) and choir at Montreal's 1,900-seat Maison symphonique (Symphony House) - with all involved practicing safe social distancing and other pandemic precautions. The album, within just seven weeks after its release, was certified platinum in Canada, and staggeringly, this album has done the unthinkable; going platinum the OLD way, sticking to physical units and downloads and not releasing the record to streaming services, and being available only via online orders. In this panel, you will hear directly from the artist, and the recording and mastering engineers how they used a number of online tools to conference, collaborate, trade mix notes, monitor progress and see the project to fruition while maintaining social distance during this landmark recording.

Moderators
avatar for Brian (Bt) Gibbs

Brian (Bt) Gibbs

Skyline Entertainment & Publishing
AES member passionate about every and all things immersive! Brian (Bt) "Gibbs" is co-owner & principal at Skyline Entertainment & Publishing in Silicon Valley. With 30+ record credits in 2020, Gibbs uses his B.M. from Berklee College of Music (Jazz Composition) as a record producer... Read More →

Speakers
avatar for Serge Fiori

Serge Fiori

Born in Montreal on March 4, 1952, from a very young age Serge Fiori started to play in his father's Big Band Orchestra, subbing for the “missing musicians of the day”. Thanks to this unique experience, passing from drums, keyboards, flutes, and all sorts of strings instruments... Read More →
avatar for Charles-Émile Beaudin

Charles-Émile Beaudin

Charles-Émile Beaudin is a recording and mixing engineer working in the studio but also on the road in a recording truck.Early in his career, he had the opportunity to start recording live bands and mixing the performances for tv or cinema in surround sound. Working with band and... Read More →
avatar for Reuben Cohen

Reuben Cohen

Lurssen Mastering
Reuben Cohen is an award-winning mastering engineer, winning a Latin Grammy for his work on Diego Torres' album "Distinto," and a TEC Award for his work on the single, "Happy," by Pharrell Williams. He has worked with many notable and award-winning artists including Metallica, Pharrell... Read More →
avatar for Simon Leclerc

Simon Leclerc

Orchestre Symphonique de Montréal
A graduate of Les Petits Chanteurs du Mont-Royal, Simon Leclerc studied at the Conservatoire de musique de Montréal and quickly immersed himself into a multitude of musical spheres: from pop backup singer (Céline Dion) to musicals (Gala, Les Misérables), he quickly became a highly... Read More →
avatar for Gavin Lurssen

Gavin Lurssen

Lurssen Mastering
Gavin Lurssen has been a mastering engineer since 1991 and has been in the forefront of and contributor to the many technical developments that have taken place in the industry over the past two decades. He is a twelve-time Grammy nominee and a four-time Grammy winner. Artists whose... Read More →
avatar for Rick Winquest

Rick Winquest

Rick Winquest


Saturday October 23, 2021 3:00pm - 4:00pm EDT
Stream A

4:00pm EDT

In-car Acoustic Measurements
MOTIVATION: the AES Technical Committee on Automotive Audio [TC-AA] has recognized that currently there is no consensus when it comes to measuring essential audio attributes in a car.

There is a need for alignment between OEMs and Tier1 suppliers for robust and repeatable acoustic measurement methods, data and analysis formats. Hence, a working group has undertaken the task of creating a white paper focused on in-car acoustic measurements, where the measurement stimuli, conditions, and computations would be established.

This will be a living document that will evolve; and could lead to standards down the road.

At the outset, the group has decided on three measurements: Frequency Response, Max SPL, and Impulsive Distortion (squeak and rattle).

FORMAT: this will be in the form of a panel discussion with five parts to it (listed below) – the purpose is to inform the AES community about this effort, provide an update, and solicit feedback

Overview
Frequency Response specifics
Max SPL specifics
Impulsive Distortion specifics
Solicit feedback via Live Q&A

Moderators
avatar for Jayant Datta

Jayant Datta

Consultant
Passionate about all things audio, causing me to get involved in a number of different aspects of this exciting field.DSP, algorithm development, R&D, systems level engineering, management of engineers, as well as strategic direction for organizations.Developed solutions for broadcast... Read More →

Speakers
avatar for Steve Temme

Steve Temme

Listen
Steve Temme is founder and President of Listen, Inc., manufacturer of the SoundCheck audio test system. Steve founded the company in 1995, and for the past 26 years the company has remained on the cutting edge of research into audio measurement, regularly introducing new measurement... Read More →
avatar for Hans Lahti

Hans Lahti

Sr. Principal Acoustic Systems Engineer, Harman
avatar for Stefan Irrgang

Stefan Irrgang

Head of Klippel Analyzer System Development, Klippel
STEFAN IRRGANG studied electrical engineering at the University of Technology in Dresden, Germany and received a Ph.D. in Technical Acoustics in 1997.In 1998, he joined Klippel GmbH as one of the first employees. Since 2016, Stefan is the company’s head of development for measurement... Read More →


Saturday October 23, 2021 4:00pm - 5:00pm EDT
Stream A

4:00pm EDT

Archiving Hip-Hop in Theory and Practice
Roughly forty-five years since hip-hop culture emerged, pioneering artists, life-long fans, industry mavens, and keen collectors have amassed collections of mixtapes, demos, and other recorded artifacts. These collections are essential to the definition of localized hip-hop scenes, providing crucial insights into the people, places, aesthetics, and other often-obscure details that trace the arc of cultural development.

This panel aims to increase the visibility of these collections to recording professionals, imparting an understanding of the emergent role of preservation in hip-hop culture and the cultural, artistic, and historical significance of hip-hop’s early cassette-based recordings (mixtapes, bootleg recordings, “demo” versions, etc.)

We will address the challenges and rewards of maintaining institutional archival practices while supporting the real-world practices of local communities. What are the operational nuances and path dependencies that emerge? Furthermore, we discuss the implementation of technologies needed to preserve, archive, and distribute hip-hop, and how DJs, artists, collectors, archivists, librarians, producers, engineers, and forward-thinking technologists can work together.

Moderators
avatar for Maria Rice

Maria Rice

Chief Mastering Engineer, Peerless Mastering
Maria Rice is a Boston-based mastering and restoration engineer. She has been a Recording Academy member since 2011 and serves as Chapter Governor on the New York Board.  For 12+ years, she has been one of two engineers at Peerless Mastering, working with clientele ranging from indie-famous... Read More →

Speakers
avatar for Mark Campbell

Mark Campbell

Northside Hip Hop Archive/University of Toronto
Mark Campbell [aka DJ Grumps] founder at Northside Hip Hop Archive, is a dj, curator, and scholar. As a co-founder at the Bigger than Hip Hop Show at CHRY 105.5fm, Mark djed on and hosted the radio show from 1997-2015.  Mark’s research interests include hip hop archives, afrosonic... Read More →
avatar for Pacey Foster

Pacey Foster

Massachusetts Hip-Hop Archive/UMass Boston
Pacey Foster is an Associate Professor in Management at the University of Massachusetts, Boston, and founder of the Massachusetts Hip-Hop Archive www.masshiparchive.org. His research focuses on networks and brokerage in creative industries, creative clusters and scenes, and community-engaged archives... Read More →
avatar for Regan Sommer McCoy

Regan Sommer McCoy

Chief Curator and Founder, Mixtape Museum/Columbia University
Regan Sommer McCoy’s professional career started in the music industry almost 20 years ago. She is the founder of The Mixtape Museum (MXM) which celebrates the cultural importance of the mixtape. MXM encourages the research, archiving, preservation, and data analysis of mixtapes... Read More →


Saturday October 23, 2021 4:00pm - 5:00pm EDT
Stream B

5:00pm EDT

Archival Albums: Curation, Preservation, and Mastering
Recording archives encompass facets of heritage and legacy, merging the temporal past with present and future implications. Presented as mass-marketed goods, archival albums are small repositories of these cultural histories. An all-star panel of engineers will discuss the archival album at each phase of inception. We will unpack the distinct perspectives of transfer specialists and mastering engineers while touching on digitization methods, file storage, and retrieval, and remastering aesthetics to present a multifaceted vignette of the forms of expertise involved in preserving music history.

Moderators
avatar for Maria Rice

Maria Rice

Chief Mastering Engineer, Peerless Mastering
Maria Rice is a Boston-based mastering and restoration engineer. She has been a Recording Academy member since 2011 and serves as Chapter Governor on the New York Board.  For 12+ years, she has been one of two engineers at Peerless Mastering, working with clientele ranging from indie-famous... Read More →

Speakers
avatar for Anna Frick

Anna Frick

Mastering Engineer, Airshow Mastering
Anna Frick, workshop co-chair, is a Mastering Engineer at Airshow Mastering in Colorado, working primarily with independent artists across a wide variety of genres, including several Grammy-nominated projects. Anna serves as a Trustee of the Recording Academy, Vice Chair of the AES... Read More →
avatar for Jeff Lipton

Jeff Lipton

Peerless Mastering
Jeff Lipton is a multiple Grammy-nominated mastering engineer and founder of Peerless Mastering. Known for his unrivaled dedication to achieving the best possible sound for every project he works on, Jeff has become one of the most highly respected mastering engineers in the business... Read More →
avatar for Alan Stoker

Alan Stoker

Country Music Hall of Fame
Alan Stoker is the Curator of Recorded Sound and Preservation Engineer at The Country Music Hall of Fame and Museum. He also freelances as a musician in the greater Nashville area. In 1976 Alan took an entry-level job at The Country Music Hall of Fame. From there, he gradually worked... Read More →


Saturday October 23, 2021 5:00pm - 6:00pm EDT
Stream B

6:00pm EDT

AES TechTour: The Power Station/Berklee NYC - Special Encore Presentation
Speakers
avatar for Roy Hendrickson

Roy Hendrickson

Chief Engineer, Power Station at BerkleeNYC
Roy Hendrickson is an engineer and mixer whose focus for the past 30 years has been on the integration of musical creativity with the evolving studio landscape. His musically rich discography is a testament to the work he has done with many artists and producers including Miles Davis... Read More →
avatar for Gloria Kaba

Gloria Kaba

Studio Manager, Power Station at BerkleeNYC
Gloria Kaba is a Ghanaian-American sound engineer, producer, mixer, and writer with over a decade of experience in the studio, often operating under the moniker Redsoul. She’s worked on A Tribe Called Quest’s final album We Got It From Here...Thank You For Your Service and Solange’s... Read More →
avatar for Ian Kagey

Ian Kagey

Director of Operations, Power Station at BerkleeNYC
Ian Kagey is a Grammy-nominated engineer, musician, and educator who came up at MSR Studios (formerly Right Track Recording) in New York City. He has worked on countless professional recordings ranging from major motion picture soundtracks to Broadway cast albums. Kagey recently returned... Read More →
avatar for Stephen Webber

Stephen Webber

Executive Director of BerkleeNYC, Berklee College of Music
Stephen Webber is Executive Director of BerkleeNYC, overseeing the iconic Power Station studios. Webber has hosted sessions for Janelle Monáe, Lin-Manuel Miranda, Kate McKinnon, Trey Anastasio, DJ Khaled, Chris Martin, John Legend, Bette Midler, Paul Simon, Mick Jagger, Wynton Marsalis... Read More →


Saturday October 23, 2021 6:00pm - 6:15pm EDT
Stream B

6:15pm EDT

6:30pm EDT

Hip-Hop Mixing for a Theater Audience
In today's world, while Hip Hop has become more and more popular, it's art form and style have integrated into other audio forms such as audio drama and theater. Hip-hop theater, for example is a form of theater that presents contemporary stories through the use of one or more elements of Hip Hop. Shows like Rent, Jam on the Groove, and even Hamilton are heavily influenced by Hip Hop and the impact it has on an audience. What mixing techniques and styles help to enhance these Hip Hop elements for shows that are usually seen as a separate style? What about this mix of genres makes it so compelling?

Moderators
avatar for Armand Jennings

Armand Jennings

Audio Engineer, Afros and Audio
Armand 'AJ' Jennings is first and foremost a storyteller. Born and raised in the Bronx, Armand uses his skills in audio engineering to work on multiple freelance projects for Afros and Audio, a start-up organization that helps Black people create podcasts and audio dramas. As their... Read More →

Speakers
avatar for Mikal Amin

Mikal Amin

MC/Educator/Essayist/Curator
An established Hip-Hop emcee for the last twenty years and a cultivator of the culture, Mikal is the producer and co-curator of the highly popular “Word. Sound. Power.” The Brooklyn Academy of Music's Hip Hop/Spoken Word performance. Also a scholar and essayist his publishing... Read More →
BI

Baba Israel

Artist/Producer/Educator/Consultant


Saturday October 23, 2021 6:30pm - 7:30pm EDT
Stream B

7:30pm EDT

Understanding Audio for SMPTE 2110
Here’s a trivia question for the next gathering of your favorite audiophiles “How Many Sections of the SMPTE 2110 Standard address audio?” Don’t expect a quick and easy consensus. The 2110 Standard is a large, complex and widely inclusive multi-disciplinary work that is still being improved and expanded. So how does an audio person decide how to build a facility or a piece of equipment that is 2110 compliant or compatible. Join our erstwhile group of experts as they explore the abilities, the possibilities, the land mines, potholes, easy wins and lingering headaches of making your audio work successfully in the 2110 based facilities that are becoming more common each month.

Speakers
PB

Paul Briscoe

Chief Architect, TAG Video Systems
Deep technical expert in media technologies, systems and Standards from SD to UHD, analog to IP. Long-time SMPTE Standards member, contributing author to ST 2110 and 2059 Standards suites (among others), organizer of multi-vendor 2110 interop testing. SMPTE Fellow, member of IEEE... Read More →
AB

Andy Butler

Public Broadcasting Service


Saturday October 23, 2021 7:30pm - 8:30pm EDT
Stream B
 
Tuesday, October 26
 

11:00am EDT

Envelopment in Immersive Productions
Immersive offers the possibility of realistically reproducing envelopment; that is one of its most impressive properties. This gives the listener the feeling of being in the recording room. One of the key parameters in this context is fluctuation: It has a strong influence on the liveliness of the sound and the perceived size of the recording room.

Fluctuation appears most pronounced in large churches and concert halls as envelopment due to room sound at low frequencies. In popular music, however, the proportion of room sound is relatively low compared to classical music. However, we can create spatiality and thus envelopment by moving direct sound.

In his workshop, Lasse Nipkow explains how he creates fluctuation with direct sound from acoustic instruments and synthetic sounds and thus creates envelopment for Immersive. He does this on the basis of psychoacoustic phenomena and impressive sound samples.

Speakers
avatar for Lasse Nipkow

Lasse Nipkow

CEO, Silent Work GmbH
Lasse Nipkow (Dipl. El. Ing. HTL) is regularly holding lectures and giving workshops on multi-channel recordings since 2008 at international conferences like AES, Tonmeistertagung and ICSA. Since 2010 he has improved his empirical research in the field of applied psychoacoustics for... Read More →


Tuesday October 26, 2021 11:00am - 12:00pm EDT
Stream A

12:00pm EDT

The new Spatial Audio Designer Vers. 3.0 upgrade
The Spatial Audio Designer is the leading tool to create multichannel and immersive audio in every audio workstation. It allows creating formats like Surround, 9.1, 7.1.4, and custom configurations. Along with the discrete outputs of the mix channels, binaural processing assists to create immersive audio mixes in real measured virtual studios, just with regular headphones. The new version 3.0 got new features like the newest MPEG-H creation workflow including encoding, scaling, and customization of the interface, new binaural locations like an unechoed chamber, Sony 360 RA 13.1 preset configuration, enhancement of the object-based export, Mac OS 11 / M1 support, the extension of OSC control, and much more. New Audio Technology’s general manager Tom Ammermann offers a quick introduction and tour of the new Spatial Audio Designer Vers.3.

Speakers
avatar for Tom Ammermann

Tom Ammermann

New Audio Technology


Tuesday October 26, 2021 12:00pm - 12:35pm EDT
Stream A

12:35pm EDT

AES TechTour: Worre Studios
Worre Studios is a state-of-the-art 25,000 square foot production studio and event facility located in Las Vegas, boasting a stage-in-the-round footprint, surrounded by a technologically-advanced dual 73-foot wide, 14-foot tall 1.9mm 4k curved LED video walls. Revolutionary in experience, via broadcast quality audio and video equipment, presenters and hosts – whether it’s a business meeting, live television broadcasts, new product launch, live concert, awards show, press conference, or hosting a speaker series, or event – Worre Studios can have a full-interactive/full-participatory style event with hundreds of thousands of attendees in real-time, giving everyone a front row seat.

With the ability to stream to multiple platforms at once (e.g. YouTube, Zoom, private IPs, Twitter, etc.) content channels allow for millions of viewers to join live from anywhere in the world. With that level of global reach, the translation platform can translate both to attendees and back to the host in multiple languages, allowing for the language barrier to disappear and for attendees to have the same live experience. The studio’s ability to expand allows a great footprint for hybrid events, and the technology capabilities continue to grow and now include 3D immersive environments allowing guests and hosts to join from around the world, but appear to be in person; Unreal Engine for XR/VR/AR capabilities; and more to come allowing event producers to reimagine what you can track and measure during an event.

Speakers
LS

Larry Smith

President, Worre Studios
RD

Ryan Durham

Director of Sales, Worre Studios


Tuesday October 26, 2021 12:35pm - 12:55pm EDT
Stream A

1:00pm EDT

Immersive Monitoring Inside Immersive Mixing
In this presentation Will Eggleston will discuss the decision-making process when entering the realm of immersive content creation and what considerations and choices engineers must make for the monitoring system to translate correctly through distribution channels.

Room layout, monitor selection, and bass management will be covered as well as what to expect when monitors are run full range. Monitor placement and listening distances and the interaction of the room dimensions will also be covered.

Lastly the presenter will show some real-world immersive calibration data and how the importance of properly calibrating any immersive system allows the mixer to maintain a sharp edge in creative thinking.

Speakers
avatar for Will Eggleston

Will Eggleston

Marketing Director, Genelec
Will Eggleston likes to refer to himself as “Chief Cook and Bottle Washer” at Genelec Inc in Natick, Massachusetts.  Having spent nearly 50 years in the audio business as a recording engineer for broadcast and music, product development specialist and product manager, Will has... Read More →


Tuesday October 26, 2021 1:00pm - 2:00pm EDT
Stream A

2:00pm EDT

Multi-microphone setups for music, using beamforming technology from single Ambisonics microphone arrays: from recording to mixing
In this workshop we will focus on multi-microphone setups for music production, using single Ambisonics microphone arrays. Ambisonics microphones are often used to record full spherical sound fields for different purposes, including virtual reality. But an especially interesting use of their spatial audio technology is the possibility to create beamformings (aka, virtual microphones) after the recording has already been done. To comment on this, this workshop will address Ambisonics recording, directionality/polar patterns, virtual microphone setup, and mixing. All this will be illustrated by practical audio/in-DAW examples of spatial recordings done in studio of several musical instruments. By the end of this workshop, you will understand the process from start to finish, being able to successfully use Ambisonics microphones as additional tools for music production.

Speakers
EP

Eduardo Patricio

Independent


Tuesday October 26, 2021 2:00pm - 3:00pm EDT
Stream A

3:00pm EDT

Hardware Monitoring Solutions in Immersive
Among the solutions available to the dilemma of immersive mixing are two: analog hardware and digital hardware – each having it’s advantages to assign and route the bussing/monitoring in Spatial/Immersive mixing. The experience is quite different from the "in-the-box" method and yields some surprising results

Moderators
avatar for F. Reid Shippen

F. Reid Shippen

GRAMMY Award Winning Engineer
A creative leader, connector, and innovator, F. Reid Shippen has spent 20+ years in Nashville building an incredibly diverse array of skills.He is a multiple Grammy Award-winning music mixer, engineer, and producer, whose credits include 11 Grammy Award-winning albums and over 100... Read More →

Speakers
avatar for Jeri Palumbo

Jeri Palumbo

Arena Waves
Jeri has 27 years experience as an A1 broadcast mixer and audio, comms and RF tech in live Sports, eSports, and Entertainment broadcasting. She attended The Juilliard School of Music as a Composition/Orchestration major and was a music arranger in the NYC music recording business... Read More →
avatar for Ronald Prent

Ronald Prent

Valhalla Studios
Ronald Prent started his career at Wisseloord Studios in the Netherlands in 1980 as an inexperiencedassistant and has since established himself as one of the most accomplished and innovative recording &mix engineers in the world. In the early years of surround sound, Ronald was a... Read More →
avatar for Jeremy Rodeschini

Jeremy Rodeschini

Merging Technologies
Jeremy Rodeschini works as a senior supervisingengineer at the NFTS (National Film and TelevisionSchool) in the UK.He is in charge of post production technology for the MA inSound Design and MA in Composing for Film and TV.Over the past three years, Jeremy has conducted majorstudios... Read More →
avatar for Paul Wolff

Paul Wolff

Fix Audio Designs
Born on December 30th, 1954 in Traverse City, Michigan.In 1972, I was hanging out at a local TV shop, a guy comes in with a Strat with "no sound". The shop owner had no clue, I pulled the jack out and fixed it, the guitar player said "from now on, your name will be FIX…". That was... Read More →


Tuesday October 26, 2021 3:00pm - 4:15pm EDT
Stream A
 
Wednesday, October 27
 

11:00am EDT

Spatial Audio Rendering for Interoperable XR Applications in the Audio Metaverse
Interactive audio spatialization technology previously developed for video game authoring and rendering has evolved into an essential component of platforms enabling shared immersive virtual experiences for future co-presence, remote collaboration and entertainment applications. New wearable virtual and augmented reality displays employ real-time binaural audio computing engines rendering multiple digital objects and supporting the free navigation of networked participants or their avatars through a juxtaposition of environments, real and virtual, often referred to as the Metaverse. These applications require a parametric audio scene programming interface to facilitate the creation and deployment of shared, dynamic and realistic virtual 3D worlds on mobile computing platforms and remote servers.

This tutorial surveys practical approaches for designing and programming 6-degree-of-freedom interactive audio engines to deliver the perceptually relevant binaural cues necessary for audio/visual and virtual/real congruence in Metaverse experiences. It addresses the effects of room reverberation, acoustic reflectors, and obstacles in both the virtual and real environments, and discusses how such effects may be driven by combinations of pre-computed and real-time acoustic propagation solvers. We envision an open scene description model distilled to facilitate the development of interoperable applications distributed across multiple platforms, where each audio object represents, to the user, a natural sound source having controllable distance, size, orientation, and acoustic radiation properties.

This presentation is an extended version of our recent i3DA conference presentation, aiming to promote industry collaboration towards a standardized representation of audio and acoustic environments in the XR Metaverse. (Paper: ieeexplore.ieee.org/document/9610971 or arxiv.org/pdf/2109.12471.pdf.) (Slides: tinyurl.com/y6nr5o8t.)

Speakers
avatar for Jean-Marc Jot

Jean-Marc Jot

VP Research, iZotope, Inc.
https://sites.google.com/site/jmmjot/
avatar for Rémi Audfray

Rémi Audfray

Owner, Remi Audfray Engineering


Wednesday October 27, 2021 11:00am - 12:00pm EDT
Stream A

12:00pm EDT

Utilizing the ADM-OSC Streaming Protocol for Interoperability Among Immersive Audio Systems
This workshop will present the opportunities offered by the ADM-OSC protocol, its use cases, and its interoperability challenges. The ITU BS.2076-2 Audio Definition Model is a solution for the archiving and interoperability of immersive audio productions. ADM-OSC provides an extension to ADM, allowing to stream positioning information of audio objects in real-time using the Open Sound Control protocol. ADM-OSC offers a universal communication language among object-based audio software for live performances and post-production that has received the support of multiple manufacturers.

Moderators
avatar for Etienne Corteel

Etienne Corteel

Director of Scientific Outreach & Education, L-Acoustics
Governing the scientific outreach strategy, Etienne and his team are the interface between L-Acoustics and the scientific and education communities. Their mission is to develop and maintain an education program tailor-made for the professional sound industry. Etienne also contributes... Read More →

Speakers
avatar for Hervé Dejardin

Hervé Dejardin

audio innovation project manager at Radio France., Radio France, FR
Hervé Déjardin is an audio innovation project manager at Radio France.He works on different aspects of creating, producing, and distributing immersive and object-based audio content, focusing on a wide range of productions for Radio France such as reports, documentaries, drama... Read More →
avatar for Yvan Grabit

Yvan Grabit

R&D Technical Leader, Steinberg Media Technologies GmbH
After a quiet childhood and youth in Pays de Gex—an area jammed between the French Jura mountain and Geneva—Yvan Grabit played in different brass bands as a drummer while completing an Engineering degree in Image Processing and computing at the Institut Supérieur d'Électronique... Read More →
avatar for Hugo Larin

Hugo Larin

Business Director - FLUX:: Immersive, FLUX::
Hugo Larin is a vital collaborator of the FLUX: IRCAM Spat Revolution project and has deep roots in audio mixing, design, operation, and networked control and data distribution. He leads the FLUX:: Immersive consulting group and the business development for FLUX::. Larin was driven toward the live production sphere due to his passion for technolo... Read More →
avatar for Guillaume Le Nost

Guillaume Le Nost

Managing Director L Acoustics Ltd. UK, L-Acoustics, UK
Driven by a passion for sound technology, Guillaume has built a career in the artistic and technical fields of audio.After acquiring two MScs in electrical engineering and signal processing from Ecole Centrale Paris and IRCAM, he started a research career in 3D sound and received... Read More →


Wednesday October 27, 2021 12:00pm - 1:00pm EDT
Stream A

1:00pm EDT

When stars come down to earth
More and more planetariums install or newly build with spatial audio systems. Also, installations and theatres have extensive and not formated loudspeaker environments. So, how to create and how deliver your new planetarium production dealing with these items?

The session will answer these items, shows opportunities to deal with them in the studio as well as with laptop studios. The Grammy-nominated presenter Tom Ammermann will share his experience of over two decades of immersive audio and music production in projects like 5th Element, Kraftwerk 3D, Dubai Expo 2021, and different planetarium shows.

Speakers
avatar for Tom Ammermann

Tom Ammermann

New Audio Technology


Wednesday October 27, 2021 1:00pm - 2:00pm EDT
Stream A

2:00pm EDT

Immersive Audio: Trends & Challenges
This panel will cover the topic of immersive audio and spatial audio. It will seek to examine these specific fields through the lens of the newest audio trends, remixing albums to conform to the Apple/Dolby Atmos and Sony standards. First, the program will explore, as a practical matter, what's required for these remixes to be made and sold. Then, a broader discussion will ensue about the impact this trend might have on audio engineers, musicians, consumers, and the industry at large. Please join us as we seek to uncover the secrets of the ever-evolving immersive audio field.

Moderators
avatar for Heather Rafter

Heather Rafter

Principal, RafterMarsh Law
Heather Dembert Rafter has been providing legal and business development services to the audio, music technology, and digital media industries for over twenty-five years. As principal counsel at RafterMarsh US, she leads the RM team in providing sophisticated corporate and IP advice... Read More →
avatar for Phil Wagner

Phil Wagner

Senior Vice President, Solid State Logic
Phil Wagner started his career as a recording engineer. He is currently Senior Vice President of Solid State Logic, Inc. (SSL), a wholly-owned subsidiary of Audiotonix. He is a respected business development executive with a long history of operating US professional audio companies... Read More →

Speakers
avatar for Leslie Gaston-Bird

Leslie Gaston-Bird

Owner, Mix Messiah Productions
Leslie Gaston-Bird (AMPS, MPSE) is author of the book "Women in Audio", part of the AES Presents series and published by Focal Press (Routledge). She is a voting member of the Recording Academy (The Grammys®) and its P&E (Producers and Engineers) Wing. Currently, she is a freelance... Read More →
avatar for Mike Larson

Mike Larson

Currently based in Miami, Larson works overtime engineering, producing, and DJ’ing.  As Pharrell Williams’ personal Sound Engineer for the past decade, Mike has been molded into one of the industry’s brightest Engineers.After traveling the world with Pharrell and N*E*R*D as... Read More →
avatar for Kevin Madigan

Kevin Madigan

Touring FOH Live & Studio
I've been very fortunate to work with a wide range of clients in diverse musical genresmixing in studios and on tour including;Santana, Lana Del Rey, Lizzo, HER, Post Malone, Migos, Alessia Cara, CamilaCabello, Crosby, Stills and Nash, Neil Young, David Gilmour, Ray LaMontagne,Smashing... Read More →


Wednesday October 27, 2021 2:00pm - 3:00pm EDT
Stream A

3:00pm EDT

Metasounds in Unreal5
A look at the audio core of the new audio engine for game Devs.

Speakers
avatar for Aaron McLeran

Aaron McLeran

Lead Audio Programmer, Epic Games
Aaron is the lead audio programmer at Epic Games where he is working with a team creating a multiplatform realtime audio renderer for the Unreal Engine, which is currently used on Fortnite and by many Unreal Engine licensees. He has a background in physics and music and was a sound... Read More →


Wednesday October 27, 2021 3:00pm - 4:00pm EDT
Stream A
 
Thursday, October 28
 

9:00pm EDT

Acoustic Decoupling Device in Coaxial Compression Driver
Coaxial loudspeakers are designed to reproduce a broad frequency range while keeping a compact form factor. Correct driver integration requires to properly deal, at the design phase, with the presence of multiple radiating units and with the interference between their acoustic emissions; this is essential to obtain a smooth response and a wide crossover region suited to flexibly accommodate different filter designs. Due to the high frequencies they reproduce, recently-appeared coaxial compression drivers require particular care to ensure excellent acoustic performance at short wavelengths. Adding an appropriate decoupling device in the structure allows to effectively manage the acoustic emission of the two transducers with respect to each other, improving response regularity and increasing the available bandwidth for the crossover versus historical approaches.
This Engineering Brief presents the results of the research carried out at B&C Speakers during the development of the patent-pending acoustic decoupling device integrated into the Company's latest two-way coaxial compression driver, the B&C Speakers DCX464.

Speakers
FB

Filippo Bartolozzi

B&C Speakers s.p.a.


Thursday October 28, 2021 9:00pm - Friday December 3, 2021 6:00pm EST
On-Demand

9:00pm EDT

An Approach for Capturing Multi-Directional Radiation Characteristics of Sound Sources for 3D
In the general practice of immersive audio recording, there is a focus on how to capture the direct sound source. However, a sound source’s complex radiation properties help define that source in three-dimensional spaces. In this research, the idea of a “holographic sound recording” or HSR is explored. We choose the term “holographic” due to its uncanny ability to create a rendering of a real 3D sound source in space, similar to holographic visual experiences.

HSRs can be defined as the capturing of complex radiation characteristics of sound sources with the intention of being played back in either a virtual environment with six-degrees of freedom or in real life through a multi-directional coincidental speaker array. To research techniques on creating sonic holographic reproductions, two recording sessions were conducted at NYU in the summer of 2021. Through documenting and reflecting on the miking, mixing, and spatialization of the audio objects in Unity with Google Resonance, experimental realizations were made in search of best practices to consider when creating HSRs.

Concepts such as acoustical points of interest, adequate number of microphones, pickup patterns and angles will be explored, as well as capturing room tone and the benefits of player isolation. These aspects are introduced to create a holographic miking system called the Multi-Channel Pyramid Array (MPA) as a starting point for users who would like to create a holographic reproduction of any instrument. The MPA can theoretically be fine-tuned and customized based on the instrument and/or to the user’s desired results.

Speakers
MM

Michael Matsakis

New York University
avatar for Parichat Songmuang

Parichat Songmuang

Studio Manager/PhD Student, New York University
Parichat Songmuang graduated from New York University with her Master of Music degree in Music Technology at New York University and Advanced Certificate in Tonmeister Studies. As an undergraduate, she studied for her Bachelor of Science in Electronics Media and Film with a concentration... Read More →
PG

Paul Geluso

New York University


Thursday October 28, 2021 9:00pm - Friday December 3, 2021 6:00pm EST
On-Demand

9:00pm EDT

An Open Source Turntable for Electro-Acoustical Devices Characterization
This work introduces an Open Source turntable for the measurement of electro-acoustical devices. The idea is to provide an inexpensive and highly customizable device that can be adjusted according to specific measurement needs. Development of such turntable devices in the past required significant investment. Specific mechanical and motor control design skills were needed, leading to both costly and time-consuming processes. Recent developments in mechatronics and 3D printing allow to design and build a cost-effective solution.

Speakers
avatar for Daniele Ponteggia

Daniele Ponteggia

Audiomatica srl


Thursday October 28, 2021 9:00pm - Friday December 3, 2021 6:00pm EST
On-Demand

9:00pm EDT

Audio Watermarking Technique Integrating Spread Spectrum and CNN-autoencoder
This e-Brief proposes a novel approach of audio watermarking based on Spread Spectrum (SS) involves the psychoacoustic model and deep learning Convolutional Neural Networks (CNN)-autoencoder. Moreover, logistic chaotic maps are employed to enhance the security level of the method. First, a compressed image produced from the CNN-autoencoder is fed to the image encryption stage to yield an encrypted image to be used as the watermark. To apply image encryption, the plain image is, at first 8-bit binary-coded and shuffled by M-sequence. Next, each encoded image is diffused with a different chaotic sequence. Within the embedding phase, the psychoacoustic model is employed to shape the amplitude of the watermark signal which guarantees high inaudibility, whereas a logistic chaotic map is used to determine the positions for watermark embedding in a random manner. This scheme offers an extremely efficient and practical method as it can be used by institutions and companies for embedding their logos or trademarks as a watermark in audio products as the scheme utilizes RGB images. Experimental results show that the transparency and imperceptibility of the proposed algorithm are satisfactory also good image quality even against various attacks. The validity of the proposed audio watermarking method is demonstrated by simulation results.

Speakers
NK

Noha Korany

Electrical Engineering Department, Faculty of Engineering, Alexandria
NE

Namat Elboghdadly

Electrical Engineering Department, Faculty of Engineering, Alexandria
ME

Mohamed Elabdein

Instructor, Electrical Engineering Department, Alexandria Higher Institute of Engineering and Technology, Alexandria
Audio watermarkingAcoustics


Thursday October 28, 2021 9:00pm - Friday December 3, 2021 6:00pm EST
On-Demand

9:00pm EDT

Automatic Classification of Enclosure-Types for Electrodynamic Loudspeakers
It is investigated whether an automated classification of loudspeaker enclosures can be realized. The acoustic load of the enclosure is reflected in the electrical impedance of the loudspeaker and is hence detectable from the point of view of the power amplifier. In order to classify the enclosures of passive one-way speakers, an artificial neural network is trained with synthetic impedance spectra based on equivalent electrical circuit models. The generalization capability is tested with measured test sets of closed, vented, band-pass and transmission-line enclosures. The resulting classification procedure works well within a synthetic test set. However, a generalization to the measured test-data has shown to require deeper investigations to achieve better separation between the different vented enclosures types.

Speakers
JW

Johannes Werner

Hochschule Mittweida
avatar for Tobias Fritsch

Tobias Fritsch

Research Engineer, Fraunhofer IDMT


Thursday October 28, 2021 9:00pm - Friday December 3, 2021 6:00pm EST
On-Demand

9:00pm EDT

Binaural mixing of popular music
In this engineering brief, we present an initial experiment that was conducted to gain insights into what kinds of sound sources would benefit from binaural rendering for typical pop, rock and electronic dance music (EDM) tracks. Original multi-tracks for three different songs (Pop, Rock and EDM) were divided into four elements: drums, bass, guitar/synth and vocals/lead. Eight different mixes of these elements were created in 3rd-order Ambisonics using the RoomEncoder and BinauralDecoder of the IEM Plugin Suite, with different combinations of binauralised and non-binauralised (i.e., stereo) elements within the mixes, ranging from a full stereo mix to a full binaural mix.

A multiple comparison listening test was conducted online, with 21 subjects participating. Their task was to rate the eight mixes in terms of overall immersive experience as well as perceived spatial and timbral qualities. Results showed that mixes with non-binauralised drums were commonly rated higher than mixes with binauralised drums for all three attributes. The full binaural mixes were rated lowest in general, whereas the mixes closest to a full stereo mix tended to be rated highest for and Rock, but less so for EDM. These results suggest that (i) simply panning all sources in binaural would not necessarily lead to a more immersive experience compared to a traditional stereo mix, (ii) a spatial contrast between binauralised and non-binauralised sources might help improve immersiveness (e.g., drums in stereo and guitars widely panned in binaural), and (iii) optimal binaural mixing techniques would tend to depend on the genre of music.

Speakers
avatar for Hyunkook Lee

Hyunkook Lee

Director, University of Huddersfield
I am a Reader (Associate Professor) in Music Technology and Director of the Centre for Audio and Psychoacoustic Engineering (CAPE)/Applied Psychoacoustics Lab (APL) at the University of Huddersfield, UK. Past and current research areas include the perception of auditory height and... Read More →
PA

Pablo Abehsera Morell

University of Huddersfield


Thursday October 28, 2021 9:00pm - Friday December 3, 2021 6:00pm EST
On-Demand

9:00pm EDT

Developing plugins for your ears
We present a new intuitive development platform that allows algorithm developers to put plugins in our ears. The growing number of advanced audio processing plugins developed for DAWs is enabling highly creative sound experiences. We explain how plugins for DAWs can be easily ported to embedded platforms used in ear worn products and other audio devices. This includes signal processing targeting low latency, low power, high compute and large memory plugins. We describe an open platform to bring machine learning based algorithms directly to the end user. This will also give plugin developers access to data streams from additional sensors and multichannel audio data beyond stereo music streaming. The next generation of hearables for gaming, music, movies, AR/VR will require processing techniques currently only available to professionals in studios. This new platform allows end users to select, download and control plugins to unlock innovation the fits their individual needs and personal preferences.

Speakers
GA

Gary A. Spittle

Audicus Inc.


Thursday October 28, 2021 9:00pm - Friday December 3, 2021 6:00pm EST
On-Demand

9:00pm EDT

Impression evaluation of mixed acoustic signals with different spatial acoustic characteristics
Teleconference systems have been widely used and mixed acoustic signals with different spatial acoustic characteristics for each place are heard. However, the perceptual impression of mixed acoustic signals containing multiple different spatial acoustic characteristics has not been sufficiently investigated. In this study, a listening test was performed to survey the difference of impression regarding the mixed acoustic signals with single and different spatial acoustic characteristics. Three instrumental signals (guitar, bass and drums) were played with a loudspeaker and recorded in three rooms. For the listening test, four mixed acoustic signals were set:
1. All instruments were captured in the small reflection room.
2. All instruments were captured in the middle reflection room.
3. All instruments were captured in the large reflection room.
4. Bass was captured in the small reflection room, drum in the middle reflection room and guitar in the large reflection room.
Participants listened to No. 4 and compared it with Nos. 1, 2 and 3 and responded to which of the seven evaluation items (resonant, pleasant, natural, coherent, clear, likeable, noisy) matched. Participants perceived 1 as a pleasant acoustic signal with little reverberation and 2 and 3 as unpleasant acoustic signals with more reverberation compared to 4. It is suggested that mixed acoustic signals recorded in the small reflection room are considered the least reverberant and the most comfortable. The homogenization of spatial acoustic characteristics by suppressing reverberation from the acoustic signals captured in multiple spaces is considered to be useful in giving a pleasant impression.

Speakers
avatar for Shota Okubo

Shota Okubo

KDDI Research, Inc.
avatar for Toshiharu Horiuchi

Toshiharu Horiuchi

KDDI Research, Inc.


Thursday October 28, 2021 9:00pm - Friday December 3, 2021 6:00pm EST
On-Demand

9:00pm EDT

Integrating Vibrato Into Artificial Reverberation
This paper discovers methods to create vibrato effects with artificial reverberation. Feedback delay networks have been used for many reverb effects. Past work with time varying feedback delay networks has focused primarily on small modulations of the delays and or feedback matrices in order to create a more natural sounding reverb. In this paper, we consider the possibility of using wider modulations of these reverbs for the purposes of sound effect generation. Specifically, amplitude modulation and frequency modulation can be obtained by varying feedback matrices or delay lines respectively. The results showed a convincing vibrato effect with minor artifacts and promise for using FDNs in sound effect generation. Future work will include reducing artifacts and fine tuning control parameters.

Speakers
SR

Sarah R. Smith

University of Rochester
SF

Senyuan Fan

University of Rochester


Thursday October 28, 2021 9:00pm - Friday December 3, 2021 6:00pm EST
On-Demand

9:00pm EDT

Measuring Voice Coil Temperature using Ultrasonic Pilot Tones
Measuring the voice coil temperature of speakers during audio playback is useful for optimizing thermal design and preventing overheating. The established DC method uses a constant current to measure resistance and determine temperature. Therefore, it suffers from noise induced by low frequency audio signals and creates an unwanted constant voice coil displacement. These problems are solved by the HF method, where the voice coil temperature is derived from a high frequency (HF) impedance measurement with an ultrasonic pilot tone. This study extends on the previous research by Gautama and Anazawa on the HF method, for example by showing that in addition to the voice coil temperature the pole plate temperature can be measured.

In this study, impedance measurements at different temperatures of an example microspeaker are used to calibrate the HF method. A comparison to the DC method with different test signals heating the speaker demonstrates that the HF method works well in this case. However, it is susceptible to errors from the skin effect in large diameter voice coil wire, changes in cabling, close metallic objects or drifting of the average voice coil position over time or with large amplitudes. The measured pole plate temperature rises with applied high frequency audio signals, which may be explained by induction heating. Overall, the HF method seems especially suited for applications where impedance measurement and non-linear excursion control are part of the design.

Speakers
avatar for Tobias Fritsch

Tobias Fritsch

Research Engineer, Fraunhofer IDMT
avatar for Johannes Fried

Johannes Fried

TU Ilmenau
Johannes Fried received a Bachelor in Technical Physics from TU Ilmenau, Germany, in 2021. He has had an interest in loudspeaker technology since his teenage years and is now a Master student in Media Technology at the same university.


Thursday October 28, 2021 9:00pm - Friday December 3, 2021 6:00pm EST
On-Demand

9:00pm EDT

MPEG-H Audio production workflows for a Next Generation Audio experience in broadcast, streaming and music
MPEG-H Audio is a Next Generation Audio (NGA) system offering a new audio experience for the audience in various applications: Object-based immersive sound delivers a new degree of realism and artistic freedom for immersive music applications, such as the 360 Reality Audio music service. Advanced interactivity options enable improved personalization and accessibility, including solutions to create object-based features out of legacy material, e.g. deep-learning-based dialogue enhancement. 'Universal delivery' allows for optimal rendering of one production over all kinds of devices and various distribution ways, e.g. broadcast or streaming. All of these new features are achieved by adding metadata to the audio, which is defined during production and offers content providers flexible control of interaction and rendering options. Thus, new possibilities and requirements during the production process are imposed. In this paper, examples of state-of-the-art NGA production workflows are detailed and discussed, with special focus on immersive music, broadcast, and accessibility.

Speakers
YG

Yannik Grewe

Fraunhofer Institute for Integrated Circuits IIS
PE

Philipp Eibl

Fraunhofer Institute for Integrated Circuits IIS
CS

Christian Simon

Fraunhofer Institute for Integrated Circuits IIS
MT

Matteo Torcoli

Fraunhofer Institute for Integrated Circuits IIS
DR

Daniela Rieger

Fraunhofer Institute for Integrated Circuits IIS
US

Ulli Scuda

Fraunhofer Institute for Integrated Circuits IIS


Thursday October 28, 2021 9:00pm - Friday December 3, 2021 6:00pm EST
On-Demand

9:00pm EDT

Perceptual evaluation of a new, portable three-dimensional recording technique: ``W-Ambisonics''
In order to exploit strengths and avoid weaknesses of the First Order Ambisonics (FOA) microphone technique, we devised a new, portable 3D microphone recording technique (``W-Ambisonics''). This new technique incorporates a stereo cardioid microphone pair (for frontal information) with two FOA microphone arrays (for lateral, rear, and height information). The design focus of this technique was ``accessibility'' in the recording stage and ``scalability'' in the reproduction stage. Our proposed portable 3D recording technique enables audio reproduction over multiple configurations including immersive platforms.
First, we evaluated lateral localization of the proposed method compared with a conventional 5-channel surround microphone technique. Second, we devised a new binauralization method utilizing two interaural-distant FOA microphone arrays for headphone-based reproduction. Each FOA microphone array renders precise spherical harmonics at each ear position. Lastly, we made a solo piano recording using 5-channel surround technique, 7-channel immersive technique (5-channel plus two height channels), and the ``W-Ambisonic'' array, and subsequently conducted subjective listening evaluations of the sound qualities of the two techniques. The results of our study show that (1) the ``W-Ambisonics'' method enables improved lateral localization over the conventional spaced array technique; (2) the binauralized headphone translation from the ``W-Ambisonics'' recording provided spacious yet precise sound images in listening evaluations; and (3) the ``W-Ambisonics'' recording produced comparable sound quality to the 7-channel recording technique for immersive experience of the concert hall. The proposed ``W-Ambisonics'' microphone technique is practical, precise, and scalable across multiple reproduction scenarios, from binaural to multichannel systems.

Speakers
avatar for Dr. Doyuen Ko

Dr. Doyuen Ko

Associate Professor, Belmont University
Dr. Doyuen Ko is an Associate Professor of Audio Engineering Technology at Belmont University in Nashville, Tennessee. He received his Ph.D. and Master of Music from the Sound Recording Department at McGill University, Canada. Before studying at McGill, he has worked as a sound designer... Read More →
LX

Lu Xuan

Rochester Institute of Technology
SK

Sungyoung Kim

Rochester Institute of Technology
MK

Miriam Kolar

Department of Music, Amherst College, Amherst, MA 01002, USA


Thursday October 28, 2021 9:00pm - Friday December 3, 2021 6:00pm EST
On-Demand

9:00pm EDT

Power Efficiency of Capacitive MEMS Speakers
Recently introduced MEMS micro speakers target in-ear audio applications such as true wireless earbuds, hearables and hearing aids. A key requirement in these ultra-mobile applications is a high energy efficiency of the components. While wireless communication standards continue to improve and offer low power profiles, audio components remain at the same level in absolute terms, but the share of the total energy budget increases. MEMS speakers, seeking a competitive advantage in the micro speaker market, claim to be more efficient than voice coil speakers, whose efficiency is limited by resistive and magnetization losses. The most advanced approaches to MEMS speakers are based on capacitive transducer technology. Therefore, these speakers can be modeled - to a first approximation - as capacitors and therefore offer negligible resistive losses but introduce a high component of reactive power. This reactive power must be provided by an electronic circuit which has a limited efficiency. Thus, it is desirable to minimize the required reactive power by the transducer. In this contribution, we discuss some fundamental energetic relations that such electroacoustic transducers are subject to, and draw some implications for the requirements of a transducer that can operate at high efficiency.

Speakers
LE

Lutz Ehrig

Arioso Systems GmbH
HS

Hermann Schenk

Arioso Systems GmbH
JM

Jorge Mario Monsalve Guaracao

Fraunhofer Institute for Photonic Microsystems
AM

Anton Melnikov

Fraunhofer Institute for Photonic Microsystems


Thursday October 28, 2021 9:00pm - Friday December 3, 2021 6:00pm EST
On-Demand

9:00pm EDT

Production Tools for the MPEG-H Audio System
Next Generation Audio Systems, such as MPEG-H Audio, rely on metadata to enable a wide variety of features. Information such as channel layouts, the position and properties of audio objects or user interactivity options are only some of the data that can be used to improve consumer experience.
Creating these metadata requires suitable tools, which are used in a process known as "authoring", where interactive features and the options for 3D immersive sound rendering are defined by the content creator.
Different types of productions each impose their own requirements on these authoring tools, which lead to a number of solutions appearing in the market. Using the example of MPEG-H Audio, this paper will detail some of the latest developments and authoring solutions designed to enable immersive and interactive live-and post productions.

Speakers
YG

Yannik Grewe

Fraunhofer Institute for Integrated Circuits IIS
PE

Philipp Eibl

Fraunhofer Institute for Integrated Circuits IIS
DR

Daniela Rieger

Fraunhofer Institute for Integrated Circuits IIS
US

Ulli Scuda

Fraunhofer Institute for Integrated Circuits IIS


Thursday October 28, 2021 9:00pm - Friday December 3, 2021 6:00pm EST
On-Demand

9:00pm EDT

Quantifying HRTF Time and Spectral Precision using Interaural Transfer Function
Generating auditory localization cues is an important feature in spatial audio processing engines that help generate a sense of plausibility of virtual sounds to the user, especially in XR applications (VR/AR/MR). Algorithmic approaches have been proposed to quantify the engine’s ability to reproduce interaural level difference (ILD) cues through regression and statistical methods, which provides a useful standardization and automation method to estimate the localization accuracy potential of a spatial audio engine. This engineering brief builds off this approach to include interaural time difference (ITD) cues as part of the analysis through the use of the interaural transfer function (ITF). The use of the ITF, comprising both ILD and ITD information, is demonstrated and discussed. Even though this approach can substitute critical listening studies as an evaluation method, it has not been validated through comparisons with localization user studies. This brief concludes with a review of listening studies that can be used to gain confidence in this algorithmic approach to measure localization accuracy potential in a spatial audio engine.

Speakers
JM

Justin Mathew

Magic Leap
LJ

Lukasz Januszkiewicz

Magic Leap/SoftServe Inc.
MP

Maria Pensko

Magic Leap/SoftServe Inc.
RA

Remi Audray

Facebook Reality Labs


Thursday October 28, 2021 9:00pm - Friday December 3, 2021 6:00pm EST
On-Demand

9:00pm EDT

Real-time Implementation of the Spectral Division Method for Binaural Personal Audio Delivery with Head Tracking
A framework for implementing the Spectral Division Method (SDM) in real-time for delivering binaural personal audio to multiple listeners with head tracking is presented. The SDM, as an analytical approach for sound field reproduction, has been applied to generating personal audio filters that create acoustically bright and dark zones. However, only the case of static listening positions has been investigated. In realistic situations, the performance of such personal audio delivery systems will degrade significantly when the listeners move out of the "sweet spots". In order to achieve dynamic personal audio delivery that compensates for listeners' head movements, the SDM-based filters are updated in real-time through simple multiplications in the wavenumber domain, by utilizing the shifting theorem of the spatial Fourier transform along the x-axis. Furthermore, by selecting two spatial window functions targeted at two ears, the generated filters are able to deliver separate binaural personal audio to multiple listeners. The proposed framework offers an intuitive and efficient solution for binaural personal audio delivery with head tracking, at a moderate computation cost.

Speakers
avatar for Yue Qiao

Yue Qiao

PhD candidate, Princeton University
Yue Qiao is a third-year Ph.D. candidate in the 3D Audio and Applied Acoustics (3D3A) Laboratory at Princeton University. His research interests include personal sound zone reproduction, binaural audio reproduction with loudspeakers and/or headphones, and general spatial audio processing... Read More →
EC

Edgar Choueiri

Princeton University


Thursday October 28, 2021 9:00pm - Friday December 3, 2021 6:00pm EST
On-Demand

9:00pm EDT

Recreating complex soundscapes for audio quality evaluation
A method for creating complex soundscapes for tasks related to audio quality evaluation is presented. This approach uses an ambisonics-derived basis for recreating dynamic noise that follows ETSI standard EG 202 396-1 for background noise reproduction. Recordings were captured with the mh acoustics Eigenmike spherical microphone array and processed to match the two-dimensional four-loudspeaker array by creating four directional beams, each feeding an individual channel. As a result, a spatial background noise ambience is recreated, preserving the transient characteristics of the original recording.

Speakers
DK

Damian Koszewski

Intel Technology Poland
JB

Jan Banas

Intel Technology Poland
PM

Przemyslaw Maziewski

Intel Technology Poland
PT

Pawel Trella

Intel Technology Poland
PP

Pawel Pach

Intel Technology Poland
PK

Piotr Klinke

Intel Technology Poland
DS

Dominik Stanczak

Intel Technology Poland
MK

Maciej Kuklinowski

Intel Technology Poland


Thursday October 28, 2021 9:00pm - Friday December 3, 2021 6:00pm EST
On-Demand

9:00pm EDT

Subjective Assessment of Multichannel Audio on a Tablet Computer
Handheld electronic devices like tablet computers are commonly used for the playback and streaming of music. With the growing popularity of multichannel and immersive audio technologies, it is important to know if they offer any improvement over traditional stereo and mono in terms of audio quality and user experience on such devices. This paper shows the results of four MUSHRA based listening tests that were conducted for the subjective assessment of multichannel audio versus stereo and mono while played back on a tablet computer with two different sets of headphones. BAQ (basic audio quality) and QoE (quality of experience) were the attributes measured. The results show that multichannel audio out performs stereo and mono for both the attributes and a repeated measures ANOVA (analysis of variance) also confirms that the audio format has a large bearing on the results. Though the use of different headphones does change the user ratings, the consolidated results for each test follow a similar trend.

Speakers
avatar for Joshua Reiss

Joshua Reiss

Queen Mary University of London
Josh Reiss is a Professor with the Centre for Digital Music at Queen Mary University of London. He has published more than 200 scientific papers, and co-authored the book Intelligent Music Production, and textbook Audio Effects: Theory, Implementation and Application. He is the President-Elect... Read More →
FT

Fesal Toosy

University of Central Punjab
MS

Muhammad Sarwar Ehsan

University of Central Punjab


Thursday October 28, 2021 9:00pm - Friday December 3, 2021 6:00pm EST
On-Demand

9:00pm EDT

Transdisciplinarity in Sound Design and Music Composition for Film Post-Production: an Experiential Remote Learning Case Study
This e-brief presents a case study of a one-semester transdisciplinary experience involving three courses into the Herberger Institute for Design and the Arts at Arizona State University. Participating students from the Film and Media Post-Production Capstone II and Advanced Sound Design courses in the Sidney Poitier New American Film School’s undergraduate program, and a Music Composition course in the School of Music, Dance and Theatre, experienced during the Spring semester of the 2020-2021 academic year a completely remote Post Production process of approximately twenty, approx 10-minute long short films, showcased at a virtual live event on April 30th and May 1st, 2021, due to the Covid-19 pandemic. On a student-centric constructivist standpoint, inspired by Paulo Freire’s pedagogy approach of raising students to be subjects and not objects of the world, the experience explores remote learning tools to develop not only hands-on immersion into techniques and workflows, but most importantly agency ecosystems where students can explore film departments’ power dynamics and develop interpersonal skills aside to a diverse set of cultural and socio-political mindfulness. Focusing on sharing the strategies and findings of a work in progress, this initial study aims for further developments and to establish hybrid remote and in-person experiential frameworks for the transdisciplinary engagement of sound and music in the visual arts.

Speakers
avatar for Rodrigo Meirelles

Rodrigo Meirelles

Professor - Sound Design, Arizona State University. The Sidney Poitier New American Film School.
Rodrigo Meirelles is an awarded multidisciplinary professional passionate about sound, music and education. He built his career combining his skills and passions, what gave him a unique background and an international reputation as an educator and audio engineer.      He started... Read More →
FN

Fernanda Navarro

Arizona State University. School of Music, Dance and Theatre.
JC

Janaki Cedanna

Arizona State University. The Sidney Poitier New American Film School.


Thursday October 28, 2021 9:00pm - Friday December 3, 2021 6:00pm EST
On-Demand

9:00pm EDT

Using Ambisonic Microphone as Spot Microphones in an Ensemble Recording
Ambisonic microphones have provided a flexible and eloquent method to capture surround sound with a compact microphone array. However, tetrahedral microphones are typically applied to free-field capture applications paired with traditional mono or stereo spot microphones, where enhanced control over individual sound source balancing and timbre is required. This paper investigates the use of tetrahedral microphones as a versatile spot microphone technique that can render both direct sound and supporting room impressions for individual sound sources. These multiple perspectives can then be mixed together in a surround environment for great effect. Multiple recordings were made at NYU in the summer of 2021 to explore these techniques. A jazz quartet was recorded using three separate miking systems; an ambisonic-only system, an ambisonic and omnidirectional coincidental spot system, and a “traditional” jazz recording system. In this paper, the techniques we used will be explained and evaluated based on discrete ATMOS mixes with each system.

Speakers
PG

Paul Geluso

New York University
KC

Kerri Chandler

New York University | University of Trinidad & Tobago
CM

Cale Michaels Bonderman

New York University


Thursday October 28, 2021 9:00pm - Friday December 3, 2021 6:00pm EST
On-Demand

9:00pm EDT

WaveBeat: End-to-end beat and downbeat tracking in the time domain
While deep learning approaches for beat and downbeat tracking have brought advancements, these approaches continue to rely on hand-crafted, subsampled spectral features as input, restricting the information available to the model. In this work, we propose WaveBeat, an end-to-end approach for joint beat and downbeat tracking operating directly in the time domain. This method forgoes engineered spectral features, and instead produces beat and downbeat predictions directly from the waveform, the first of its kind for this task. Our model utilizes temporal convolutional networks (TCNs) operating on waveforms that achieve a very large receptive field (~30s) at audio sample rates. This is achieved in a memory efficient manner by employing rapidly growing dilation factors, which enable a relatively shallower network architecture. Combined with a straightforward data augmentation strategy, our method outperforms previous state-of-the-art methods on some datasets, while producing comparable results on others.

Speakers
avatar for Christian J. Steinmetz

Christian J. Steinmetz

PhD Researcher, Queen Mary University of London
I am a PhD student working with Joshua D. Reiss within the Centre for Digital Music at Queen Mary University of London. Currently, I am researching applications of machine learning for audio signal processing with a focus on high fidelity audio and audio/music production. This involves... Read More →
avatar for Joshua Reiss

Joshua Reiss

Queen Mary University of London
Josh Reiss is a Professor with the Centre for Digital Music at Queen Mary University of London. He has published more than 200 scientific papers, and co-authored the book Intelligent Music Production, and textbook Audio Effects: Theory, Implementation and Application. He is the President-Elect... Read More →


Thursday October 28, 2021 9:00pm - Friday December 3, 2021 6:00pm EST
On-Demand

9:00pm EDT

3D Impulse Response Convolution with Multichannel Direct Sound: Assessing Perceptual Equivalency between Room- and Source- Impression for Music Production
A method for representing the three-dimensional radiation patterns of instruments/performers within artificial reverberation using multichannel direct sound files convolved with channel-based spatial room impulse responses (SRIRs) is presented. Two reverb conditions are studied in a controlled listening test: a) all SRIR channel positions are convolved with a single monophonic direct sound file, and b) each SRIR channel position is convolved with a unique direct sound file taken from a microphone array surrounding the performer. Participants were asked to adjust the level of each reverberation condition (relative to a fixed direct sound stream) to three perceptual thresholds relating to source- and room- impression. Results of separate three-way within-subject ANOVAs and post-hoc analysis show significant interactions between instrument / room type, and instrument / reverb condition on each of the three thresholds. Most notably, reverb condition b) required less level than condition a) to yield perceptual equivalency between source- and room- impression, suggesting that the inclusion of multichannel direct sound in SRIR convolution may increase the salience of room impression in the immersive reproduction of acoustic music.

Speakers
avatar for Jack Kelly

Jack Kelly

McGill University
Jack Kelly is. Ph.D. candidate at the Schulich School of Music, McGill University. His thesis research centers on the influence of spatial room impulse response convolution technologies (channel-based and HOA arrays) on the sensation of physical presence in immersive music production. He... Read More →
avatar for Richard King

Richard King

McGill University
Richard King is an Educator, Researcher, and a Grammy Award winning recording engineer. Richard has garnered Grammy Awards in various fields including Best Engineered Album in both the Classical and Non-Classical categories. Richard is an Associate Professor at the Schulich School... Read More →
WW

Wieslaw Woszczyk

McGill University


Thursday October 28, 2021 9:00pm - Friday December 3, 2021 6:00pm EST
On-Demand

9:00pm EDT

A Neural Beamforming Front-end for Distributed Microphone Arrays
Robust real-time audio signal enhancement increasingly relies on multichannel microphone arrays for signal acquisition. Sophisticated beamforming algorithms have been developed to maximize the benefit of multiple microphones. With the recent success of deep learning models created for audio signal processing, the task of Neural Beamforming remains an open research topic. This paper presents a Neural Beamformer architecture capable of performing spatial beamforming with microphones randomly distributed over very large areas, even in negative signal-to-noise ratio environments with multiple noise sources and reverberation. The proposed method combines adaptive, nonlinear filtering and the computation of spatial relations with state-of-the-art mask estimation networks. The resulting End-to-End network architecture is fully differentiable and provides excellent signal separation performance. Combining a small number of principal building blocks, the method is capable of low-latency, domain-specific signal enhancement even in challenging environments.

Speakers
JZ

Jonathan Ziegler

Stuttgart Media University
LS

Leon Schröder

Stuttgart Media University
AK

Andreas Koch

HdM Stuttgart
AS

Andreas Schilling

Eberhard Karls University Tuebingen


Thursday October 28, 2021 9:00pm - Friday December 3, 2021 6:00pm EST
On-Demand

9:00pm EDT

AI 3D immersive audio codec based on content-adaptive dynamic down-mixing and up-mixing framework
Recently, people who prefer to consume media contents via over the top (OTT) platform, such as YouTube, Netflix etc., rather than a conventional broadcasting get increased more and more. To deliver an immersive audio experience to them more effectively, we propose a unified framework for AI-based 3D immersive audio codec. In this framework, to maximize the original immersiveness even at a down-mixed audio, while enabling to precisely reproduce the original 3D audio from the down-mixed audio, content-adaptive dynamic down-mixing and up-mixing scheme is newly proposed. The experimental results show that the proposed framework can render more improved down-mixed audio compared to the conventional method as well as successfully reproduce the original 3D audio.

Speakers
avatar for Woo Hyun Nam

Woo Hyun Nam

Principal Engineer, Samsung Research, Samsung Electronics
Woo Hyun Nam received the Ph.D. degree in electrical engineering from the Korea Advanced Institute of Science and Technology (KAIST), Daejeon, Korea in 2013. Since 2013, he has been with the Samsung Research, Samsung Electronics, where he is currently a Principal Engineer. His research... Read More →
TL

Tammy Lee

Samsung Research, Samsung Electronics
SC

Sang Chul Ko

Samsung Research, Samsung Electronics
YS

Yoonjae Son

Samsung Research, Samsung Electronics
HK

Hyun Kwon Chung

Samsung Research, Samsung Electronics
KK

Kyung-Rae Kim

Samsung Research, Samsung Electronics
JK

Jungkyu Kim

Samsung Research, Samsung Electronics
SH

Sunghee Hwang

Samsung Research, Samsung Electronics
KL

Kyunggeun Lee

Samsung Research, Samsung Electronics


Thursday October 28, 2021 9:00pm - Friday December 3, 2021 6:00pm EST
On-Demand

9:00pm EDT

Analysis of a Unique Pingable Circuit: The Gamelan Resonator
This paper offers a study of the circuits developed by artist Paul DeMarinis for the touring version of his work Pygmy Gamelan. Each of the six copies of the original circuit, developed June-July 1973, produce a carefully tuned and unique five-tone scale. These are obtained by five resonator circuits which pitch pings produced by a crude antenna fed into clocked bit-shift registers. While this resonator circuit may seem related to classic Bridged-T and Twin-T designs, common in analog drum machines, DeMarinis' work actually presents a unique and previously undocumented variation on those canonical circuits. We present an analysis of his third-order resonator (which we name the Gamelan Resonator), deriving its transfer function, time domain response, poles, and zeros. This model enables us to do two things: first, based on recordings of one of the copies, we can deduce which standard resistor and capacitor values DeMarinis is likely to have used in that specific copy, since DeMarinis' schematic purposefully omits these details to reflect their variability. Second, we can better understand what makes this filter unique. We conclude by outlining future projects which build on the present findings for technical development.

Speakers
EJ

Ezra J. Teboul

Paris
Historian of electronic music technology, its users, and its makers.CHSTM sound and technology group co-convener:https://www.chstm.org/content/sound-and-technology
avatar for Kurt James Werner

Kurt James Werner

Research Engineer, iZotope, Inc.


Thursday October 28, 2021 9:00pm - Friday December 3, 2021 6:00pm EST
On-Demand

9:00pm EDT

Application of AI techniques for nonlinear control of loudspeakers
To obtain high loudness, with good bass extension while keeping distortion low, and ensure mechanical protection, one needs to control accurately the motion of the loudspeaker diaphragm. Actual solutions for nonlinear control of loudspeakers are complex, difficult to implement and to tune. They are limited in accuracy due to insufficient physical models, that do not completely capture the complexity of the loudspeaker. Furthermore, the physical model parameters are difficult to estimate.
We present here a novel approach that uses a Neural Network to map directly the diaphragm displacement to the input voltage, allowing us to invert the loudspeaker. This technique allows to control and linearize the loudspeaker without theoretical assumptions and with better accuracy than a model-based approach. It is also simpler to implement.

Speakers
YL

Yuan Li

Samsung Research America
avatar for Pascal Brunet

Pascal Brunet

Samsung Research America
Pascal Brunet obtained his Bachelor's in Sound Engineering from Ecole Louis Lumiere, Paris, in 1981, his Master's in Electrical Engineering from CNAM, Paris, in 1989 and a PhD degree in EE from Northeastern University, Boston, in 2014. His thesis was on nonlinear modeling of loudspeakers... Read More →
GK

Glenn Kubota

Samsung Research America
AM

Aaquila Mariajohn

Samsung Research America


Thursday October 28, 2021 9:00pm - Friday December 3, 2021 6:00pm EST
On-Demand

9:00pm EDT

Audio-Source Rendering on Flat-Panel Loudspeakers with Non-Uniform Boundary Conditions
Devices from smartphones to televisions are beginning to employ dual purpose displays, where the display serves as both a video screen and a loudspeaker. In this paper we demonstrate a method to generate localized sound-radiating regions on a flat-panel display. An array of force actuators affixed to the back of the panel is driven by appropriately filtered audio signals so the total response of the panel due to the actuator array approximates a target spatial acceleration profile. The response of the panel to each actuator individually is initially measured via a laser vibrometer, and the required actuator filters for each source position are determined by an optimization procedure that minimizes the mean squared error between the reconstructed and targeted acceleration profiles. Since the single-actuator panel responses are determined empirically, the method does not require analytical or numerical models of the system’s modal response, and thus is well-suited to panels having the complex boundary conditions typical of television screens, mobile devices, and tablets. The method is demonstrated on two panels with differing boundary conditions. When integrated with display technology, the localized audio source rendering method may transform traditional displays into multimodal audio-visual interfaces by colocating localized audio sources and objects in the video stream.

Speakers
MC

Michael C. Heilemann

University of Rochester
TD

Tre DiPassio

University of Rochester
MB

Mark Bocko

University of Rochester


Thursday October 28, 2021 9:00pm - Friday December 3, 2021 6:00pm EST
On-Demand

9:00pm EDT

Automatic Loudspeaker Room Equalization Based On Sound Field Estimation with Artificial Intelligence Models
In-room loudspeaker equalization requires a significant amount of microphone positions in order to characterize the sound field in the room. This can be a cumbersome task for the user. This paper proposes the use of artificial intelligence to automatically estimate and equalize, without user interaction, the in-room response. To learn the relationship between loudspeaker near-field response and total sound power, or energy average over the listening area, a neural network was trained using room measurement data. Loudspeaker near-field SPL at discrete frequencies was the input data to the neural network. The approach has been tested in a subwoofer, a full-range loudspeaker, and a TV. Results showed that the in-room sound field can be estimated within 1--2 dB average standard deviation.

Speakers
avatar for Adrian Celestinos

Adrian Celestinos

Samsung Research America
YL

Yuan Li

Samsung Research America
VM

Victor Manuel Chin Lopez

Samsung Research Tijuana


Thursday October 28, 2021 9:00pm - Friday December 3, 2021 6:00pm EST
On-Demand

9:00pm EDT

Best Paper
The livestream can be viewed HERE. Please note you will need to log into aesshow.com using your AES credentials.

Thursday October 28, 2021 9:00pm - Friday December 3, 2021 6:00pm EST
On-Demand

9:00pm EDT

Binaural Audio Externalization Processing
Headphone or earbud listening scenarios span from the home or office to mobile and automotive environments, with audio source content formats including two-channel stereo, multi-channel surround, immersive or object-based material. Post-processing methods have been developed with the intent of restoring, during headphone playback, the spatial audio cues experienced in natural or loudspeaker listening, remediating known effects of headphone-mediated audio reproduction: the perceived localization of sounds in or near the head, accompanied by timbre or balance distortions and spatial image blurring or warping. The intended benefits include alleviating listening fatigue and cognitive load. In this E-Brief presentation, we review previously reported binaural audio post-processing methods and consider a strategy emphasizing minimal signal modification, applicable to enhancing conventionally produced stereo recordings.

This is a work-in-progress report on an investigation that we plan to report on in a future paper. The slides and audio demonstrations are posted at izotope.com/tech/aes_extern.

Speakers
avatar for Jean-Marc Jot

Jean-Marc Jot

VP Research, iZotope, Inc.
https://sites.google.com/site/jmmjot/
avatar for Alexey Lukin

Alexey Lukin

iZotope, Inc.
avatar for Kurt James Werner

Kurt James Werner

Research Engineer, iZotope, Inc.
EA

Evan Allen

iZotope, Inc.


Thursday October 28, 2021 9:00pm - Friday December 3, 2021 6:00pm EST
On-Demand

9:00pm EDT

Bit Rate Requirements for an Audio Codec for Stereo, Surround and Immersive Formats
This paper describes a comprehensive study on the sound quality of the Opus codec for stereo, surround and immersive audio formats for music and cinematic content. We conducted three listening tests on Opus encoded stereo, 5.1 and 7.1.4 test samples taken from music, cinematic and EBU files encoded at bit rates of 32, 48 and 64 kbps per channel. Preliminary results indicate that a bit rate of 64 kbps per channel or higher is required for stereo, but 48 kbps per channel may be sufficient for surround and immersive audio formats.

Speakers
avatar for Sunil Bharitkar

Sunil Bharitkar

Samsung Research America
avatar for Allan Devantier

Allan Devantier

Samsung Research America
CT

Carlos Tejeda-Ocampo

Samsung Research Tijuana
CW

Carren Wang

Samsung Research America
WS

Will Saba

Samsung Research America


Thursday October 28, 2021 9:00pm - Friday December 3, 2021 6:00pm EST
On-Demand

9:00pm EDT

Comparison of different techniques for recording and postproduction using main-microphone arrays for binaural reproduction.
We present a subjective evaluation of various 3D main-microphone techniques for three-dimensional binaural music production. Forty-seven subjects participated in the survey, listening on headphones. Results suggest that ESMA-3D, followed by Decca tree with height, work best of the included 3D arrays. However, the dummy head and a stereo AB microphone performed as well if not better than any of the arrays. Though not implemented for this study, our workflow allows the possibility to include individualized HRTF's and head-tracking; their impact will be considered in a future study.

Speakers
avatar for Josua Dillier

Josua Dillier

Zürcher Hochschule der Künste ZHdK
Josua Dillier is a young audio engineer and producer living in Zurich, Switzerland. His works range from CD- or Videoproduction to live mixing. He is specialized in the recording of acoustic instruments.Before his studies as a Tonmeister at University of the Arts Zurich he studied... Read More →
HJ

Hanna Järveläinen

Zürcher Hochschule der Künste ZHdK


Thursday October 28, 2021 9:00pm - Friday December 3, 2021 6:00pm EST
On-Demand

9:00pm EDT

Deconvolution of Room Impulse Responses from Simultaneous Excitation of Loudspeakers
Traditional room-equalization involves exciting one loudspeaker at a time and deconvolving the loudspeaker-room response from the recording. As the number of loudspeakers and positions increase, the time required to measure loudspeaker-room responses will increase. In this paper, we present a technique to deconvolve impulse responses after exciting all loudspeakers at the same time. The stimuli are shifted relative to a base-stimuli and are optionally pre-processed with arbitrary filters to create specific sounding signals. The stimuli shift ensures capture of the low-frequency reverberation tail after deconvolution. Various deconvolution techniques including correlation-based, and adaptive filter-based, are presented. The performance is characterized in terms of plots and objective metrics using responses from the Multichannel Acoustic Reverberation Dataset (MARDY) dataset.

Speakers
avatar for Sunil Bharitkar

Sunil Bharitkar

Samsung Research America


Thursday October 28, 2021 9:00pm - Friday December 3, 2021 6:00pm EST
On-Demand

9:00pm EDT

Defining reverberation plugin structure: A comparative exploration of system design and expert knowledge in an audio education context
Reverberation plugin designs differ significantly between manufacturers. The use of abstract terminology, individually stylised interfaces, and a manufacturers preferred lexicon increases complexity and decreases skill transference for novice users. Two studies were undertaken to explore the degree of complexity within the reverberation domain. In study one, the extent of both lexical and functional aspects of 46 reverberation plugins were examined through in-vivo coding of manufacturer documentation. From this, parameter labels were identified and inducted into nine higher level categories based on function. In study two, a free elicitation task was undertaken by seven experienced reverberation plugin users. This study identified the most salient parameters within their underlying knowledge structures, allowing the overlap between system and user to be viewed. The results from both studies establish the lexicon used within existing reverberation plugins, and the breadth of parameters discovered suggests that recognising and understanding parameters across designs may be challenging for novice users. The findings also provide an overview of the reverberation domain whilst highlighting the core parameters identified by expert users. This data could potentially act as the basis for a novice training system.

Speakers
avatar for Kevin Garland

Kevin Garland

LIT
Kevin Garland is a Postgraduate Researcher at The Centre for Creative Informatics, Technological University of the Shannon: Midlands Midwest (TUS), Ireland. He holds a BSc degree in Music Technology & Production, and is currently pursuing a MSc degree with the intention to transfer... Read More →


Thursday October 28, 2021 9:00pm - Friday December 3, 2021 6:00pm EST
On-Demand

9:00pm EDT

Effect of Flicker Noise on Audio Signal Reproduction
The effect of multiplicative flicker noise superimposed on audio equipment on the sense of hearing was considered. Variable resistors used for volumes generate flicker noise, which indicates that it acts multiplicatively on the signal flowing through it. Flicker noise measurements were made for some variable resistors. In addition, the audition test was conducted to investigate the perceptible magnitude of the case where the flicker noise acts on the signal in a multiplicative manner. As a result, it was concluded that untrained individuals rarely could discern the multiplicative effect of volume flicker noise.

Speakers
AY

Akihiko Yoneya

Nagoya Institute of Technology


Thursday October 28, 2021 9:00pm - Friday December 3, 2021 6:00pm EST
On-Demand

9:00pm EDT

Effects of Near-field Sources on Ambisonics Recording and Playback
Ambisonic recording with spherical microphone arrays (SMAs) is based on a far-field assumption which determines how microphone signals are encoded into Ambisonic signals. In the presence of a near-field source, low-frequency distance-dependent boosts arise in SMAs in similar nature to proximity effects in far-field equalized directional microphones. In this study, the effects of near-field sources on Ambisonic signals are modelled analytically, their interaction with regularization stages is observed, and then traced further across to two basic ambisonic processing operations: virtual microphones, and binaural decoding.

Speakers
RG

Raimundo Gonzalez

Department of Signal Processing and Acoustics, Aalto University
AP

Archontis Politis

Audio & Speech Processing Group, Tampere University of Technology
TL

Tapio Lokki

Department of Signal Processing and Acoustics, Aalto University


Thursday October 28, 2021 9:00pm - Friday December 3, 2021 6:00pm EST
On-Demand

9:00pm EDT

Evaluating the Relationship Between Kurtosis Loss and Spectral Insertion Loss for Musicians' Hearing Protection Devices
Hearing protection devices (HPDs) are essential for musicians during loud performances to avoid hearing damage, but the standard Noise Reduction Rating (NRR) performance metric for HPDs metric says little about their behavior in a musical setting. One analysis tool being used to evaluate HPDs in the noise exposure research community is kurtosis measured in the ear and the reduction of noise kurtosis through an HPD. A musical signal, especially live music, will often have a high crest factor and kurtosis, so evaluating kurtosis loss will be important for an objective evaluation of musicians' HPDs. In this paper, a background on kurtosis and filters affecting kurtosis is described, as well as a setup for generating high-kurtosis signals and measuring in-ear kurtosis loss through an HPD. Measurement results on a variety of musicians' HPDs show that 83% of devices measured strongly reduce kurtosis, and that the kurtosis loss is likely an independent metric for performance because it is not correlated to the mean or standard deviation of the spectral insertion loss.

Speakers
DA

David Anderson

Applied Research Associates
TA

Theodore Argo

Applied Research Associates


Thursday October 28, 2021 9:00pm - Friday December 3, 2021 6:00pm EST
On-Demand

9:00pm EDT

Forensic Handling of User Generated Audio Recordings
User generated recordings (UGRs) are common in audio forensic examination. The prevalence of handheld private recording devices, stationary doorbell cameras, law enforcement body cameras, and other systems capable of creating UGRs at public incidents is only expected to increase with the development of new and less expensive recording technology. It is increasingly likely that an audio forensic examiner will have to deal with an ad hoc collection of unsynchronized UGRs from mobile and stationary audio recording devices. The examiner’s tasks will include proper time synchronization, deducing microphone positions, and reducing the presence of competing sound sources and noise. We propose a standard forensic methodology for handling UGRs, including best practices for assessing authenticity and timeline synchronization.

Speakers
avatar for Robert C. Maher

Robert C. Maher

Professor, Montana State University
Audio digital signal processing, audio forensics, music analysis and synthesis.
BM

Benjamin Miller

Montana State University
FR

Fraser Robertson

Montana State University


Thursday October 28, 2021 9:00pm - Friday December 3, 2021 6:00pm EST
On-Demand

9:00pm EDT

Gunshot Detection Systems: Methods, Challenges, and Can they be Trusted?
Many communities which are experiencing increased gun violence are turning to acoustic gunshot detection systems (GSDS) with the hope that their deployment would provide increased 24/7 monitoring and the potential for more rapid response by law enforcement to the scene. In addition to real-time monitoring, data collected by gunshot detection systems have been used alongside witness testimonies in criminal prosecutions. Because of their potential benefit, it would be appropriate to ask– how effective are GSDS in both lab/controlled settings vs. deployed real-world city scenarios? How reliable are outputs produced by GSDS? What is system performance
trade-off in gunshot detection vs. source localization of the gunshot? Should they be used only for early alerts or can they be relied upon in courtroom settings? Are resources spent on GSDS operational costs well utilized or could these resources be better invested to improve community safety? This study does not attempt to address many of these questions including social or economic questions of GSDS, but provides a reflective survey of hardware and algorithmic operations of the technology to better understand its potential as well as limitations. Specifically, challenges are discussed regarding environmental and other mismatch conditions, as well as emphasis on validation procedures used and their expected reliability. Many concepts discussed in this paper are general and will be likely utilized in or have impact on any gunshot detection technology. For this study, we refer to the ShotSpotter system to provide specific examples of system infrastructure and validation procedures.

Speakers
JH

John Hansen

Center for Robust Speech Systems; The University of Texas at Dallas
HB

Hynek Boril

University of Wisconsin - Platteville


Thursday October 28, 2021 9:00pm - Friday December 3, 2021 6:00pm EST
On-Demand

9:00pm EDT

Implementing and Evaluating a Higher-order Ambisonic Sound System in a Multi-purpose Facility: A Lab Report
Although Ambisonic sound reproduction has an extensive history, it started finding more widespread use in the past decade due to the advances in computer hardware that enable real-time encoding and decoding of Ambisonic sound fields, availability of user-friendly software that facilitate the rendering of such sound fields, and recent developments in immersive media technologies, such as AR and VR systems, that prompt new research into spatial audio. In this paper, we discuss the design, implementation, and evaluation of a third-order Ambisonic system in an academic facility that is built to serve a range of functions including instruction, research, and artistic performances. Due to the multi-purpose nature of this space, there are numerous limitations to consider when designing an Ambisonic sound system that can operate efficiently without interfering with the variety of activities regularly carried out in it. We discuss our approach to working around such limitations and evaluating the resulting system. To that end, we present a user study conducted to assess the performance of this system in terms of perceived spatial accuracy. Based on the growing number of such facilities around the world, we believe that the design and evaluation methods presented here can be of use in the implementation of spatial audio systems in similar multi-purpose environments.

Speakers
avatar for Anıl Çamcı

Anıl Çamcı

Assistant Professor of Performing Arts Technology, University of Michigan
SS

Sam Smith

University of Michigan
SH

Seth Helman

University of Michigan


Thursday October 28, 2021 9:00pm - Friday December 3, 2021 6:00pm EST
On-Demand

9:00pm EDT

Informed postprocessing for auditory roughness removal for low-bitrate audio coders
In perceptual audio coding using very low bitrates, modulation artifacts can be introduced onto tonal signal components, which are often perceived as auditory roughness. These artifacts may occur for instance due to quantization errors or may be added when using an audio bandwidth extension, which sometimes causes an irregular harmonic structure at the borders of replicated bands. Especially, the roughness artifacts due to quantization errors are difficult to mitigate without investing considerably more bits in encoding of tonal components. We propose a novel technique to remove these roughness artifacts at the decoder side controlled by a small amount of guidance information transmitted by the encoder.

Speakers
SV

Steven Van De Par

Carl von Ossietzky University, Department of Medical Physics and Acoustics
SD

Sascha Disch

Fraunhofer IIS, Erlangen
AN

Andreas Niedermeier

Fraunhofer IIS, Erlangen
BE

Bernd Edler

Fraunhofer Institute for Integrated Circuits IIS, Erlangen


Thursday October 28, 2021 9:00pm - Friday December 3, 2021 6:00pm EST
On-Demand

9:00pm EDT

Interactive Application to Control and Rapid-prototype in a Collaborative Immersive Environment
Human-scale immersive environments offer rich, often interactive, experiences and their potential has been demonstrated across areas of research, teaching, and art. The variety of these spaces and their bespoke configurations leads to a requirement for content highly-tailored to individual environments and/or interfaces requiring complicated installations. These introduce hurdles which burden users with tedious and difficult learning curves, leaving less time for project development and rapid prototyping. This project demonstrates an interactive application to control and rapid-prototype within the CRAIVE-Lab at Rensselaer. Application Programming Interfaces (APIs) render complex functions of the immersive environment, such as audio spatialization, accessible via the Internet. A front-end interface configured to communicate with these APIs gives users simple and intuitive control over these functions from their personal devices (e.g. laptops, smartphones). While bespoke systems will often require bespoke solutions, this interface allows users to create content on day one, from their own devices, without set up, content-tailoring, or training. Three examples utilizing some or all of these functions are discussed.

Speakers
JB

Jonas Braasch

Professor, Rensselaer Polytechnic Institute
SC

Samuel Chabot

Rensselaer Polytechnic Institute


Thursday October 28, 2021 9:00pm - Friday December 3, 2021 6:00pm EST
On-Demand

9:00pm EDT

Material models in loudspeakers using frictional elements
The compliance of a moving coil loudspeaker is known to depend on the level of the input signal. This effect is visible as a drop in resonance frequency. A nonlinear frictional element with hysteresis, and thus a level dependent compliance and damping, is added to the standard lumped parameter model. A comparison of simulation results and measurements reveals that the frictional model is able to explain the nonlinear behavior seen in the measurements.
The paper presents a scheme for fitting the model parameters to measured data. Results suggest that strong interaction between the frictional elements and the linear parameters is complicating this fitting, and strategies for solving this problem is presented and discussed

Speakers
RB

Rasmus Bølge Sørensen

Technical University of Denmark
FA

Finn Agerkvist

Technical University of Denmark


Thursday October 28, 2021 9:00pm - Friday December 3, 2021 6:00pm EST
On-Demand

9:00pm EDT

Mayflower & The Seven Seas: Sonification of The Ocean
Created in conjunction with the Marine Institute at the University of Plymouth, the intention of this project was to use data transmitted by the on-board sensors of the Mayflower Autonomous Ship (MAS), to manipulate specially created pieces of music, based on sea shanties and folk ballads. Technical issues and Covid delays forced a late change, and the project was switched to using data from the university’s weather stations. This paper will illustrate how the music was produced and recorded, and the software configured to make the musical pieces vary in real-time, according to the changing sea conditions, so that the public will be able to view the current conditions and listen to the music evolve in real-time.

Speakers
ER

Eduardo Reck Miranda

University Of Plymouth
CM

Clive Mead

University Of Plymouth
DH

Dieter Hearle

University Of Plymouth


Thursday October 28, 2021 9:00pm - Friday December 3, 2021 6:00pm EST
On-Demand

9:00pm EDT

Objective-oriented method for uniformation of various directivity representations
Over recent years, numerous attempts were taken to provide efficient methods of directivity representation, either regarding sound sources or head-related transfer functions. Because of the wide variety of programming tools and scripts used by different researchers, the resulting representations are inconvevnient to reproduce and compare with each other, hampering the development of the subject. Within this paper, an objective-oriented method is proposed to deal with this issue. The suggested approach bases on defining classes for different directivity models that share some general properties of directivity functions, allowing for easy comparison between different representations. A basic Matlab toolbox utlizing this method is presented alongside exemplary implementations of directivity models based on spherical and hyperspherical harmonics.

Speakers
AS

Adam Szwajcowski

AGH University of Science and Technology


Thursday October 28, 2021 9:00pm - Friday December 3, 2021 6:00pm EST
On-Demand

9:00pm EDT

On the comparison of flown and ground-stacked subwoofer configurations regarding noise pollution
In addition to audience experience and hearing health concerns, noise pollution issues are increasingly considered in large scale sound reinforcement for outdoor events. Among other factors, subwoofer positioning relative to the main system influences sound pressure levels at large distances, which may be considered as noise pollution.
In this paper, free field simulations are first performed showing that subwoofers positioning affects rear and side rejections but has a limited impact on noise level in front of the system. Then, the impact of wind on sound propagation at low frequencies is investigated. Simulation results show that the wind impacts more ground-stacked subwoofers than flown subwoofers, leading to higher sound levels downwind in the case of ground-stacked subwoofers.

Speakers
avatar for Etienne Corteel

Etienne Corteel

Director of Scientific Outreach & Education, L-Acoustics
Governing the scientific outreach strategy, Etienne and his team are the interface between L-Acoustics and the scientific and education communities. Their mission is to develop and maintain an education program tailor-made for the professional sound industry. Etienne also contributes... Read More →
avatar for Thomas Mouterde

Thomas Mouterde

Field application research engineer, L-Acoustics
Thomas Mouterde is a field application research engineer at L-Acoustics, a French manufacturer of loudspeakers, amplifiers, and signal processing devices. He is a member of the “Education and Scientific Outreach” department that aims at developing the education program of the... Read More →


Thursday October 28, 2021 9:00pm - Friday December 3, 2021 6:00pm EST
On-Demand

9:00pm EDT

Overview and Comparison of Acoustical Characteristics in Three Historically Significant Nashville Recording Studios
Several key studios in Nashville, TN served as the focus for the creation of the recorded music experience known as “the Nashville Sound.” Recordings were notable for their songwriting style, musical arrangement, and the nature of the technical processes employed, including the specific recording spaces themselves. Three historically significant studios were selected as representative of this era. This study reviewed the historical background of the studios and investigated whether there may be similarities in these studios’ acoustical properties that resulted in a particular recording approach within these environments. Standard acoustic measurements were obtained and analysed in each of these three recording spaces.

Speakers
avatar for Dr. Doyuen Ko

Dr. Doyuen Ko

Associate Professor, Belmont University
Dr. Doyuen Ko is an Associate Professor of Audio Engineering Technology at Belmont University in Nashville, Tennessee. He received his Ph.D. and Master of Music from the Sound Recording Department at McGill University, Canada. Before studying at McGill, he has worked as a sound designer... Read More →
avatar for Jim Kaiser

Jim Kaiser

Belmont University
Jim Kaiser is an Instructor of Audio Engineering Technology at Belmont University in Nashville, TN.  He serves on the AES Technical Council, the Recording Academy Producers & Engineers Wing, and the Nashville Engineer Relief Fund Board.  Jim is a Past President of the International... Read More →
WB

Wesley Bulla

Belmont University


Thursday October 28, 2021 9:00pm - Friday December 3, 2021 6:00pm EST
On-Demand

9:00pm EDT

Parametric Array Using Amplitude Modulated Pulse Trains: Experimental Evaluation of Beamforming and Single Sideband Modulation
We present a parametric array system realized with a microcontroller and MOSFET drivers. Pulse train signals with fundamental frequency of 40 kHz are generated by the microcontroller. The pulse trains are amplitude modulated by exploiting the switching mechanism of the MOSFETs. The higher-order harmonics are attenuated by the band-pass characteristic of the ultrasonic transducers, emitting only the carrier frequency and the sideband components. The sound beam can be steered by applying phase shifts to the pulse signals, which can be implemented by relatively inexpensive hardware. A new single sideband modulation is also introduced, where the upper sidebands of two double sideband modulation signals are acoustically cancelled. The proposed approaches for beamforming and single sideband modulation are evaluated by anechoic measurements.

Speakers
NH

Nara Hahn

Institute of Communications Engineering, University of Rostock
JA

Jens Ahrens

Division of Applied Acoustics, Chalmers University of Technology
CA

Carl Andersson

Division of Applied Acoustics, Chalmers University of Technology


Thursday October 28, 2021 9:00pm - Friday December 3, 2021 6:00pm EST
On-Demand

9:00pm EDT

Phoneme Mappings for Online Vocal Percussion Transcription
Vocal Percussion Transcription (VPT) aims at detecting vocal percussion sound events in a beatboxing performance and classifying them into the correct drum instrument class (kick, snare, or hi-hat). To do this in an online (real-time) setting, however, algorithms are forced to classify these events within just a few milliseconds after they are detected. The purpose of this study was to investigate which phoneme-to-instrument mappings are the most robust for online transcription purposes. We used three different evaluation criteria to base our decision upon: frequency of use of phonemes among different performers, spectral similarity to reference drum sounds, and classification separability. With these criteria applied, the recommended mappings would potentially feel natural for performers to articulate while enabling the classification algorithms to achieve the best performance possible. Given the final results, we provided a detailed discussion on which phonemes to choose given different contexts and applications.

Speakers
AL

Alejandro Luezas

Roli / Queen Mary University of London
CS

Charalampos Saitis

Queen Mary University of London
MS

Mark Sandler

Queen Mary University of London


Thursday October 28, 2021 9:00pm - Friday December 3, 2021 6:00pm EST
On-Demand

9:00pm EDT

Response clustering in loudspeaker radiation balloons
The measurement of the radiation balloon of a loudspeaker involves the acquisition of 2664 responses when acquired with 5º resolution in Theta and Phi angles, each response with magnitude and phase at a high number of frequencies which depends on the measurement spectral resolution. This large amount of information causes, many times, that its analysis is limited to certain frequencies and to certain planes (horizontal and vertical polar plots or isobars). In order to help to investigate radiation balloons, unsupervised machine learning data analysis tools have been applied to automatically group the loudspeaker responses that conforms a full balloon measurement according to their similarity, in order to extract meaningful patterns. Similar algorithms have also been applied to reduce the number of involved frequencies, keeping the same radiation information.

Speakers

Thursday October 28, 2021 9:00pm - Friday December 3, 2021 6:00pm EST
On-Demand

9:00pm EDT

Spatial auditory masking between real sound signals and virtual sound images
In augmented reality (AR) environment, audio signals of real world and virtual world are simultaneously presented to a listener. It is desirable that a virtual sound content and a real sound source do not interfere each other. In order to make it possible, we have examined spatial auditory masking between maskers and maskees, where maskers are real sound signals emitted from loudspeakers, and maskees are virtual sound images, generated by using head related transfer functions (HRTFs), emitted from headphones. Open-ear headphones were used for the experiment, which allow us to listen to the audio content while hearing the environmental sound. The results are very similar to those of the previous experiment [1, 2] where masker and maskee were both real signals emitted from loudspeakers. That is, with a given masker location, masking threshold levels as a function of maskee locations have symmetric property with respect to the frontal plane of a subject. Masking threshold level is, however, lowered than the previous experiment perhaps because of limitation of sound image localization by HRTFs. The results indicate that spatial auditory masking of human hearing occurs with virtually localized sound images in the same way as real sound signals.

Speakers
MN

Masayuki Nishiguchi

Akita Prefectural University
SI

Soma Ishihara

Akita Prefectural University
KW

Kanji Watanabe

Akita Prefectural University
KA

Koji Abe

Akita Prefectural University
ST

Shouichi Takane

Akita Prefectural University


Thursday October 28, 2021 9:00pm - Friday December 3, 2021 6:00pm EST
On-Demand

9:00pm EDT

Synthesizing Reverberation Impulse Responses from Audio Signals: Auto-Reverberation and Interactive Environments
A method for creating reverberation impulse responses from a variety of audio source materials forms the basis of a family of novel reverberation effects. In auto-reverberation, segments of audio are selected and processed to form an evolving sequence of reverberation impulse responses that are applied to the original source material—that is, the audio is reverberating itself. In cross-reverberation, impulse responses derived from one audio track are applied to another audio track. The reverberation impulse responses are formed by summing randomly selected segments of the source audio, and imposing reverberation characteristics, including reverberation time and wet equalization. By controlling the number and timing of the selected source audio segments, the method produces an array of impulse responses that represent a trajectory through the source material. In so doing, the evolving impulse responses will have the character of room reverberation while also expressing the changing timbre and dynamics of the source audio. Processing architectures are described, and off-line and real-time virtual acoustic sound examples derived from the music of Bach and Dick Dale are presented.

Speakers
avatar for Eoin Callery

Eoin Callery

Irish World Academy of Music and Dance, University of Limerick
Eoin Callery is an Irish artist and researcher who develops electroacoustic systems relating to chamber music, performance space augmentation, and sound installation. This often involves exploring acoustic phenomena – especially feedback and virtual acoustics – in live situations... Read More →
JA

Jonathan Abel

CCRMA, Stanford University
KS

Kyle Spratt

Applied Research Laboratories, The University of Texas at Austin


Thursday October 28, 2021 9:00pm - Friday December 3, 2021 6:00pm EST
On-Demand

9:00pm EDT

Teaching Modular Synth & Sound Design Online During COVID-19: Maximizing Learning Outcomes Through Open-source Software and Student-centered Pedagogy
This study introduces an inclusive and innovative online teaching pedagogy in sound design and modular synthesis using open-source software to achieve ideal student-centered learning outcomes and experience during the COVID-19 pandemic. This pedagogy proved to be effective after offering the course, conducting human subject research, and analyzing class evaluation data. The teaching strategies include comprehensive analysis in sound synthesis theory using sample patches, introduction to primary electronics, collaborative learning, hands-on lab experiments, student presentations, and alternative reading assignments in the form of educational videos. Online teaching software solutions were implemented to track student engagement. From a transformative perspective, the authors aim to cultivate student-centered learning, inclusive education, and equal opportunity in higher education in an online classroom setting. The goal is to achieve the same level of engagement as in-person classes, inspire a diverse student body, offer ample technical and mental support, as well as open the possibility of learning sound design through Eurorack modular synthesizers without investing money in expensive hardware. Students’ assignments, midterms, and final projects demonstrated their thorough understanding of the course material, strong motivation, and vibrant creativity. Human subject research was conducted during the course to improve the students’ learning experience and further shape the pedagogy. Three surveys and one-on-one interviews were given to a class of 25 students. The qualitative and quantitative data indicates the satisfaction and effectiveness of this student-centered learning pedagogy. Promoting social interaction and student well-being while teaching challenging topics during challenging times was also achieved.

Speakers
avatar for Jiayue Cecilia Wu

Jiayue Cecilia Wu

Assistant Professor, University of Colorado Denver
Originally from Beijing, Jiayue Cecilia Wu (AKA: 武小慈) is a scholar, composer, vocalist, multimedia technologist, and audio engineer. Cecilia earned her Bachelor of Science degree in Design and Engineering in 2000. She then worked as a professional musician at EMI Records and... Read More →
AF

Ashell Fox

University of Colorado Denver


Thursday October 28, 2021 9:00pm - Friday December 3, 2021 6:00pm EST
On-Demand

9:00pm EDT

The effect of user's hands on mobile device frequency response, Part 1
First results of a study of the effects of the user’s hands on the frequency response and channel balance of a mobile phone hands-free loudspeaker are presented. The results show that the response variation caused by the user’s hands is high (up to 10 dB boost in narrow ranges) and highly user dependent, although general trends can be observed. The variation between users is strong especially above 5 kHz. The acoustical causes for the observed response shape are studied using a FEM model, indicating that especially the shape of the palm explains the observed features of the frequency responses. A conclusion of the results is that developing more realistic measurement methods is needed if more natural tonal balance is attempted in handheld devices.

Speakers
JR

Juha R. Backman

AAC Technologies Solutions Finland OY
LV

Lauri Veko

AAC Technologies Solutions Finland Oy
YJ

Yuheng Jiang

AAC Technologies Holdings Inc


Thursday October 28, 2021 9:00pm - Friday December 3, 2021 6:00pm EST
On-Demand

9:00pm EDT

The influence of stage acoustics on the singers' performance and perception: A pilot study
It is known that musicians tend to adjust their performance to the acoustical properties of the hall as they perceive. In a large reverberant hall, for example, they may play staccato notes even shorter than they would in a less reverberant hall to make the music more clearly understandable by the audience. In this study, four singers were invited to sing two (slow and fast) pieces of music in three venues, of which the reverberation times were 0.3, 1.8, and 3.4 seconds. Singers were surveyed with questions regarding the tempo, intonation, resonance and diction of their performance in each venue. Also, the singing voice was recorded by using a headset microphone and analyzed to relate the audio features to the characteristics of the venues. The results showed that the singers’ perception of the vocal resonance was significantly related to the venue (p=0.024), and so were the average sound level and the dynamic range of the sound level (p=0.040 for both dependent variables), which could partly be explained in relation to the reverberation time.

Speakers
KK

Kajornsak Kittimathaveenan

Institute of Music, Science and Engineering, King Mongkut's Institute of Technology Ladkrabang
MP

Munhum Park

Institute of Music, Science and Engineering, King Mongkut's Institute of Technology Ladkrabang


Thursday October 28, 2021 9:00pm - Friday December 3, 2021 6:00pm EST
On-Demand

9:00pm EDT

Tools For Visual Thinking: Teaching Electronic Music
Teaching the history and compositional techniques of electronic music can be challenging because there are few practical resources available for developing course curriculums, and current music styles are constantly changing. Here we explain the benefits of a few assignments that help students connect the analysis of classic Electronic Dance Music (EDM) songs with creating their own compositions that “nail the style.” Creating timeline analyses of classic EDM songs form a visual representation of how the elements of an arrangement develop. Students later use these timeline analyses as visual blueprints for EDM song arrangements that they compose. Critical listening plays a vital role in creating these detailed timeline analyses that encourage self-discovery of each element’s musical characteristics. This work positively influences the composer’s ability to “nail the style.” Pedagogical experiences based on self-discovery offer greater permanence through structured learning.

Speakers
avatar for Graham Spice

Graham Spice

Assistant Professor of Music Production and Recording Technology, Shenandoah University


Thursday October 28, 2021 9:00pm - Friday December 3, 2021 6:00pm EST
On-Demand

9:00pm EDT

Transducer design considerations for slim TV applications.
With the development of new space-efficient display technologies over the last years has come the trend to overall decrease thickness of consumer electronics such as televisions. This slim form factor creates a challenge for the design of integrated audio systems because it severely limits the physical performance possibilities of any acoustic transducer. Modern DSP and amplifier technologies have been able to utilize the transducer up to its performance limits and thus maintained the audio quality, however this will not be enough if a further thickness reduction is desired. This paper discusses the physical limits of the current designs and suggests a new layout of a moving coil transducer for ultra-slim applications.

Speakers
avatar for Felix C. Kochendörfer

Felix C. Kochendörfer

Samsung Research America
Felix Kochendörfer was born in 1985 in Weimar, Germany. He received a M.Sc. Degree in Acoustics and Signal Processing from Aalborg University, Denmark in 2010 and a Diploma Degree in Electrical Engineering from Dresden University of Technology in 2011. After a short time at Klippel... Read More →


Thursday October 28, 2021 9:00pm - Friday December 3, 2021 6:00pm EST
On-Demand
 
Friday, October 29
 

9:00pm EDT

Deep Learning for Audio Signal Processing, with Python and Pytorch Examples Tutorial
In this tutorial, we will show some basic building blocks of deep learning, particularly for audio, from the perspective of signal processing. The idea is to show some similarities between familiar signal processing structures and deep learning architectures. For that, we use examples in Python and Pytorch.

We start with best practices for deep learning, then exploring convolutional neural networks as filter banks (analysis and synthesis) and autoencoders as a filter bank-based audio coder, and finally, we discuss recurrent neural networks as IIR (infinite impulse response) filters. This is done using audio examples and Python Pytorch program examples.

Content:

  • Best Practices for machine learning in audio
  • Specific properties of audio signals and typical features
  • Convolutional layers as filter banks
  • Autoencoders as Filter bank with optimization
  • Variational autoencoder as audio coder with quantization
  • Recurrent Neural Networks as Infinite Impulse Response filters

The Jupyter notebook file for the tutorial slides can be found at github.com/TUIlmenauAMS/AES_Tutorial_2021.

Speakers
avatar for Renato de Castro Rabelo Profeta

Renato de Castro Rabelo Profeta

Technische Universität Ilmenau
Renato Profeta is a Ph.D. Candidate in Audio Signal Processing at the Ilmenau University of Technology.He received a Master of Engineering degree in Electrical Engineering from Kempten University of Applied Sciences and a Bachelor of Engineering in Electrical Engineering from Riga... Read More →
SS

Sascha Spors

University of Rostock
avatar for Gerald Schuller

Gerald Schuller

Professor, Ilmenau University of Technology
Audio Coding, Machine Learning for Audio. Short Bio: Gerald Schuller is a full professor at the Institute for Media Technology of the Technical University of Ilmenau, since 2008. He was head of the Audio Coding for Special Applications group of the Fraunhofer Institute for Digital... Read More →


Friday October 29, 2021 9:00pm - Friday December 3, 2021 5:45pm EST
On-Demand

9:00pm EDT

Archiving the '90s (the '80s Edition)
Archival practice often spotlights the challenges of working with magnetic tape and grooved media. This panel shifts focus to the formats used frequently in 1980s recording production: Small format analog multitrack such as 8 track 1/2” and 16 track 1”, DBX and Dolby noise reduction,  DASH, ProDigi and PCM F-1. Loads of great records were made on these formats, frequently in project studios with smaller budgets. Sadly, they are some of the most at-risk formats, both because the carriers are awful and the because playback machines in working order are hard to find and maintain. The fact that most of the studios using these formats were smaller project studios with minimal budgets only heightens the urgency of preserving this content. Panelists will talk about playback and preservation of these formats, specific considerations in capturing audio, timecode and other data, sourcing and maintaining playback machines, and noise reduction techniques with tapes that often have less than optimal notes and are degrading faster than they can be cataloged.

Moderators
avatar for Jason Bitner

Jason Bitner

Traffic Entertainment Group
Jason Bitner is a mastering engineer and production supervisor at an independent record distributor in the Boston area. His roots in the Boston music scene go back to repairing brasswinds for Rayburn music. Now, in addition to daily production operations, Jason can be found transferring... Read More →

Speakers
avatar for Eddie Ciletti

Eddie Ciletti

Audio Engineering Society
Eddie has been a self-employed audio tech (and occasional recording engineer) for most of his career, with stints at MCI, Bearsville, Atlantic, Record Plant, and R/Greenberg Associates along the way. He began his professional career in 1975 as a keyboard technician for Hall and Oates... Read More →
avatar for Dan Johnson

Dan Johnson

Owner, Audio Archiving Services, Inc
Dan Johnson is the owner of Audio Archiving Services in Burbank, CA. Over the course of his career, he has worked with priceless masters by high-profile legacy artists such as Led Zeppelin, John Lennon, KISS, The Doors, Eagles, Prince, The Ramones, Van Halen, De La Soul, WAR, and... Read More →
avatar for Kelly Pribble

Kelly Pribble

IMES (Iron Mountain Entertainment Services)
Inventor, Director of the Media Recovery Technology Program at IMES (Iron Mountain Entertainment Services) Kelly Pribble is a veteran audio engineer, inventor, media archivist, and studio builder who has worked extensively in Nashville, London, New York City, and Rio de Janeiro... Read More →
avatar for Catherine Vericolli

Catherine Vericolli

Infrasonic Transfers & Archival
Catherine Vericolli is a transfer and archival engineer based in Nashville, TN at Infrasonic Transfers & Archival. With over 15 years of experience with analog media, she specializes in the preservation of historic records and collections across a wide range of mediums, often in need... Read More →


Friday October 29, 2021 9:00pm - Friday December 3, 2021 5:45pm EST
On-Demand

9:00pm EDT

Smaller, louder, smarter - loudspeaker design in the 21st century
Electronic recording and reproduction of sound is roughly 100 years old. Historical milestones include magnetic tape recording, the transition from vacuum tubes to transistors, the introduction of the digital compact disc, and modern digital signal processing. The full impact of digital technology has taken hold since the turn of the century, and its influence on loudspeaker design is evident in at least two applications: directional control of loudspeaker arrays, and squeezing low frequency output from very small devices. Although the two technologies are quite different, they are combined in some smart speaker designs. This informal review highlights some notable achievements, common misconceptions, and practical guidelines.

Speakers
avatar for George Augspurger

George Augspurger

Perception Inc.
George L. Augspurger received his M.A. degree from UCLA and has done postgraduate work at Northwestern University. After working in sound contracting and television production he joined James B. Lansing Sound, Inc. in 1958 where he served as Technical Service Manager and later as... Read More →


Friday October 29, 2021 9:00pm - Friday December 3, 2021 5:45pm EST
On-Demand
 
Saturday, October 30
 

12:00pm EDT

AES Student Recording Competition Final Presentations: Sound for Visual Media
The AES Student Recording Competition is a unique opportunity for student attendees of AES International Conventions to receive feedback and recognition for their audio production work.

Finalists will be announced, and prizes awarded during this presentation.

Judge Panelists: Ricardo Escallón G., Luke Klingensmith, Ufuk Önen & Fei Yu

Speakers
avatar for Ufuk Onen

Ufuk Onen

Bilkent University
Ufuk Önen is an audio engineer, author, and educator with 30 years of experience in music production and audio post-production. He has published four books, 50 articles, and 250 blog posts on audio and music technologies. He has contributed to more than 300 sound reinforcement, cinema... Read More →


Saturday October 30, 2021 12:00pm - 12:30pm EDT
Streamcast

12:30pm EDT

AES Student Recording Competition Final Presentations: Traditional Acoustic Recording
The AES Student Recording Competition is a unique opportunity for student attendees of AES International Conventions to receive feedback and recognition for their audio production work.

Finalists will be announced, and prizes awarded during this presentation.

Judge Panelists: David Bowles, Martha de Francisco & Gary Call Hanley


Saturday October 30, 2021 12:30pm - 1:30pm EDT
Streamcast

1:30pm EDT

AES Student Recording Competition Final Presentations: Traditional Studio Recording
The AES Student Recording Competition is a unique opportunity for student attendees of AES International Conventions to receive feedback and recognition for their audio production work.

Finalists will be announced, and prizes awarded during this presentation.

Judge Panelists: Bobbi Giel, Cesar Lamschtein & Éliézer Oubda


Saturday October 30, 2021 1:30pm - 2:30pm EDT
Streamcast

2:30pm EDT

AES Student Recording Competition Final Presentations: Modern Studio Recording & Electronic Music
The AES Student Recording Competition is a unique opportunity for student attendees of AES International Conventions to receive feedback and recognition for their audio production work.

Finalists will be announced, and prizes awarded during this presentation.

Judge Panelists: Daniel Cantor, Emiliano Caballero, & Marek Walaszek


Saturday October 30, 2021 2:30pm - 3:30pm EDT
Streamcast
 
Sunday, October 31
 

11:00am EDT

Building Community in Education
Vibrant members of the audio education community are employing unique and creative solutions for building bridges that connect higher education to high schools, to audio professionals, to other colleges and to the companies that are creating the tools that we all work with. What’s in it for the educators? What’s in it for the colleges? What’s in it for the high schools? What’s in it for the companies? Join us to hear of some effective creative strategies that bring educators out of their unique silos to participate in mutually beneficial partnerships that rise above diversity of cultures and languages, national borders, levels of study and organizational missions, motivations, and purpose.

Moderators
JK

John Krivit

Emerson College

Speakers
DF

Daniel Fox

Wondersmith Audio
avatar for Gabriel Herman

Gabriel Herman

University of Hartford
Gabe Herman is a music producer, audio engineer, sound designer and educator.Gabe has engineered and produced numerous music recordings in a variety of genres spanning from classical ensembles to modern rock, contemporary world beat music, jazz, blues, R&B and more on just about every... Read More →
avatar for Cesar Lamschtein

Cesar Lamschtein

KAPS Audio
WM

Wanda Mitchell

Director of Choirs, Lead for The Academy of Technology and Engineering, Hampton High School
avatar for Samantha Potter

Samantha Potter

Senior Editor - ProSoundWeb; Manager - Commercial Audio, A&H USA, ProSoundWeb, Allen & Heath USA
Samantha Potter is an audio engineer currently at Allen & Heath USA serving as the manager of commercial and install audio. She is also a senior editor and author for ProSoundWeb, Church Sound Magazine, and Live Sound International. Her show, Church Sound Podcast, can be found on... Read More →


Sunday October 31, 2021 11:00am - 12:00pm EDT
Streamcast

12:00pm EDT

AES TechTour: The Hideout Recording Studio
Participate in a technical tour of the four-room commercial recording facility The Hideout. This state-of-the-art, analog/digital recording and mixing studio is home to producers and artists of all genres. Tour will be conducted by studio owners Kevin and Kane Churko. After spending four years in Switzerland under the tutelage of world-renowned producer, Mutt Lange, Kevin Churko has emerged as one of the music industry’s leading engineers/producers in his own right. The Churko’s have worked with some of the biggest names in music, including Ozzy Osbourne, Disturbed, Rob Zombie, Shania Twain, Five Finger Death Punch, In This Moment, and Modern Science.

Moderators
avatar for Zoe Thrall

Zoe Thrall

Director of Studio Operations, The Hideout Recording Studio
Zoe Thrall is a music industry professional with decades of experience in recording studio management.Her career started at the famed Power Station Studios in New York City where she learned both sides of studio operations as both an assistant engineer and in studio management. There... Read More →

Speakers
avatar for Kane Churko

Kane Churko

Producer, Songwriter, Mixer, Artist, The Hideout Recording Studio
Kane Churko is a multi-platinum Grammy Nominated and Juno award-winning Canadian producer & songwriter. He is best known for his work with artists such as Papa Roach, Five Finger Death Punch, In This Moment, Ozzy Osbourne, Skillet, Gemini Syndrome, As Lions, VAMPS, and more. He currently... Read More →
avatar for Kevin Churko

Kevin Churko

Owner, The Hideout Recording Studio
After spending four years in Switzerland under the tutelage of world-renowned producer, Mutt Lange, Kevin Churko has emerged as one of the music industry’s leading engineers/producers in his own right. Canadian-bred but currently residing in Las Vegas where he works out of his private... Read More →


Sunday October 31, 2021 12:00pm - 12:30pm EDT
Streamcast

12:30pm EDT

Discussion of Coding in Audio Education
Continuing the discussion begun at the AES 150th convention on how to start implementing audio coding in an audio curriculum, this workshop/panel discussion will move to more advanced topics surrounding coding languages and their relative strengths/weakness in educational practice with an eye towards how best to prepare students to pursue careers as audio coders upon graduation.

Led by professors Tarr and Bennett, this panel will include a diverse group of audio coders that represent male and female audio coding educators from across the globe.

Moderators
avatar for Christopher Bennett

Christopher Bennett

Professor, University of Miami
avatar for Eric Tarr

Eric Tarr

Belmont University

Speakers
avatar for Jamie Andrea Shyla Angus-Whiteoak

Jamie Andrea Shyla Angus-Whiteoak

University of Salford/JASA Consultancy
Jamie Angus-Whiteoak Is Emeritus Professor of Audio Technology at Salford University. Her interest in audio was crystallized at age 11 when she visited the WOR studios in NYC on a school trip in 1967. After this she was hooked, and spent much of her free time studying audio, radio... Read More →
avatar for Gabriel Herman

Gabriel Herman

University of Hartford
Gabe Herman is a music producer, audio engineer, sound designer and educator.Gabe has engineered and produced numerous music recordings in a variety of genres spanning from classical ensembles to modern rock, contemporary world beat music, jazz, blues, R&B and more on just about every... Read More →


Sunday October 31, 2021 12:30pm - 2:00pm EDT
Streamcast

2:00pm EDT

Immersive Audio in Education
In this panel discussion led by Professor Miles Fulwider, educators who have adopted immersive audio systems into their audio curriculum will discuss some of the most recent developments in this technology and strategies and best practices for how to deliver this information in the classroom. Topics will range from pedagogical applications to technical installation, and also address the things learned by teaching immersive technologies to students.

This panel includes a diverse group of representatives from audio programs across the world who are leading the charge in immersive audio in education.

Moderators
MF

Miles Fulwider

University of St. Francis

Speakers
PG

Paul Geluso

New York University
avatar for Leslie Gaston-Bird

Leslie Gaston-Bird

Owner, Mix Messiah Productions
Leslie Gaston-Bird (AMPS, MPSE) is author of the book "Women in Audio", part of the AES Presents series and published by Focal Press (Routledge). She is a voting member of the Recording Academy (The Grammys®) and its P&E (Producers and Engineers) Wing. Currently, she is a freelance... Read More →
avatar for Ed Tetrault

Ed Tetrault

Johns Hopkins


Sunday October 31, 2021 2:00pm - 3:30pm EDT
Streamcast

3:30pm EDT

DEI in Audio Engineering in Higher Education
A panel discussion about the importance of Diversity, Equity and Inclusion in higher education, featuring the perspectives of faculty at institutions around the world, non-profit leaders whose work addresses this issue, and current/recently graduated students.

Moderators
avatar for Gabriel Herman

Gabriel Herman

University of Hartford
Gabe Herman is a music producer, audio engineer, sound designer and educator.Gabe has engineered and produced numerous music recordings in a variety of genres spanning from classical ensembles to modern rock, contemporary world beat music, jazz, blues, R&B and more on just about every... Read More →
avatar for Jiayue Cecilia Wu

Jiayue Cecilia Wu

Assistant Professor, University of Colorado Denver
Originally from Beijing, Jiayue Cecilia Wu (AKA: 武小慈) is a scholar, composer, vocalist, multimedia technologist, and audio engineer. Cecilia earned her Bachelor of Science degree in Design and Engineering in 2000. She then worked as a professional musician at EMI Records and... Read More →

Speakers
avatar for Agnieszka Roginska

Agnieszka Roginska

Professor of Music Technology, New York University
Agnieszka Roginska is a Professor of Music Technology at New York University. She conducts research in the simulation and applications of immersive and 3D audio including the capture, analysis and synthesis of auditory environments, auditory displays and applications in augmented... Read More →
AF

Ashell Fox

University of Colorado Denver


Sunday October 31, 2021 3:30pm - 5:00pm EDT
Streamcast

5:00pm EDT

AES TechTour: Grundman Mastering
Sunday October 31, 2021 5:00pm - 5:45pm EDT
Stream A
 
  • Timezone
  • Filter By Date AES Fall Online Convention Oct 11 -31, 2021
  • Filter By Venue Online
  • Filter By Type
  • Acoustics & Psychoacoustics
  • Applications in Audio
  • Archiving & Restoration
  • Audio Builders Workshop
  • Audio for Cinema
  • Broadcast & Online Delivery
  • Diversity & Inclusion
  • E-Briefs on-demand
  • Education
  • Electronic Dance Music
  • Electronic Instrument Design & Applications
  • Game Audio & XR
  • Hip-Hop/R&B
  • Historical
  • Immersive Music
  • Immersive & Spatial Audio
  • Networked Audio
  • Papers on-demand
  • Recording & Production
  • Sound Reinforcement
  • Special Event
  • Tech Tours
  • Technical Committee Meeting
  • Company
  • Subject
  • Area