
Quality boost for user-generated sound


Thursday 22 October 2015

 

SOUND quality on phones, video recorders and dictaphones is often poor: distorted or noisy, with garbled speech or indistinct music.

Now, acoustic scientists at the University of Salford have developed an algorithm to improve user-generated recordings, after tests revealed the extent to which consumers are struggling to control quality.

 

A team led by Professor Trevor Cox asked thousands of volunteers to explain what they thought was interfering with the quality of sound on clips recorded in living rooms, on the street and at gigs, including at the Glastonbury Festival.

“People are often disappointed when they play their recordings back, after a concert or a party, but there is a real lack of understanding as to why,” explains Cox, professor of acoustic engineering and author of Sonic Wonderland.

Tag sound quality

“It could be microphone handling noise, distortion, wind noise or a range of other conditions. What we have worked out is a way of automatically assessing the relative impact of these sound errors.”

The algorithm, which makes it possible to tag content and its quality, has already been applied in an app that assesses wind noise and alerts the user when there is a significant risk of the sound being affected.
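As a rough illustration of how such an alert might work (a minimal sketch under assumed parameters, not the Salford team's published algorithm), a recorder could flag likely wind noise by checking how much of a clip's energy sits in the low-frequency band where wind rumble typically dominates; the 100 Hz cutoff and 0.4 threshold below are illustrative assumptions.

```python
# Minimal illustrative sketch (assumption, not the Good Recording Project
# algorithm): flag likely wind noise by measuring how much of a clip's
# energy sits below ~100 Hz, where wind rumble typically dominates.
import numpy as np

def wind_noise_risk(samples, sample_rate, cutoff_hz=100.0, threshold=0.4):
    """Return True when the low-frequency energy fraction exceeds the threshold."""
    spectrum = np.abs(np.fft.rfft(samples)) ** 2              # power spectrum
    freqs = np.fft.rfftfreq(len(samples), d=1.0 / sample_rate)
    low_energy = spectrum[freqs < cutoff_hz].sum()
    total_energy = spectrum.sum() + 1e-12                     # guard against silence
    return (low_energy / total_energy) > threshold

# Example use: evaluate one-second chunks and warn as soon as one is flagged.
# sr = 44100
# flagged = any(wind_noise_risk(chunk, sr)
#               for chunk in np.array_split(recording, max(1, len(recording) // sr)))
```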

The three-year Good Recording Project, led by Salford University, is a response to increasing demand from consumers and from broadcasters, who often use amateur footage that is compromised by poor sound quality.

Lagging behind cameras

“We’re used to visual processing improving our photos, such as the camera that spots faces and changes exposure, but we have not had the same tools to do the audio equivalent,” added Cox.

Rapid quality assessment could determine whether the sound is of broadcast quality without time-consuming manual auditioning.

The £0.5 million project was funded by the Engineering and Physical Sciences Research Council and run in collaboration with BBC R&D and The British Sound Archive. The research is published in the journal PLOS ONE and in the Journal of the Audio Engineering Society, and will be presented at the Audio Engineering Society Conference in New York (October 29 – November 1).

The research project was carried out by Prof Trevor Cox, Dr Bruno Fazenda, Dr Iain Jackson, Dr Francis Li and Paul Kendrick.

For further information, please contact Professor Trevor Cox on 0161 295 5474 / 07986 557419 (mobile) / t.j.cox@salford.ac.uk, or Gareth Hollyman in the University of Salford press office on 0161 295 6895 / 07725 192767 / g.b.hollyman@salford.ac.uk

http://journals.plos.org/plosone/article?id=10.1371/journal.pone.0140256