Modelling the Microphone-Related Timbral Brightness of Recorded Signals
Abstract
:1. Introduction
2. Ratings of Brightness
2.1. Selecting Microphones
2.2. Selecting Sources
2.3. Recording Stimuli
2.4. Ratings of Brightness
2.5. Analysis of Listening Test Results
3. Brightness Modelling
3.1. Correlates of Brightness
3.1.1. Metric Parameter Values
3.1.2. Metrics Summary
3.2. Initial Modelling of Brightness
3.2.1. Automated Metric Selection: Across All Sources
3.2.2. Selecting One Metric: Each Source Independently
3.3. Model Refinement
4. Model Validation
4.1. Creating Validation Stimuli
4.1.1. Selecting Microphones
4.1.2. Selecting Sources
4.2. Validation Ratings of Brightness
4.3. Validation of Model
4.4. Comparison to Existing Models
5. Discussion
6. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
References
- Pearce, A.; Brookes, T.; Mason, R. Timbral attributes for sound effect library searching. In Proceedings of the AES Conference on Semantic Audio, Erlangen, Germany, 22–24 June 2017. [Google Scholar]
- Pearce, A.; Brookes, T.; Mason, R.; Dewhirst, M. Eliciting the most prominent perceived differences between microphones. J. Acoust. Soc. Am. 2016, 139, 2970–2981. [Google Scholar] [CrossRef] [Green Version]
- Schubert, E.; Wolfe, J. Does Timbral Brightness Scale with Frequency and Spectral Centroid? Acta Acoust. United Acoust. 2006, 92, 820–825. [Google Scholar]
- Schubert, E.; Wolfe, J.; Tarnopolsky, A. Spectral centroid and timbre in complex, multiple instrumental textures. In Proceedings of the 8th International Conference on Music Perception and Cognition, Chicago, IL, USA, 3–7 August 2004; pp. 654–657. [Google Scholar]
- Grey, J.; Gordon, G. Perceptual effects of spectral modifications on musical timbres. J. Acoust. Soc. Am. 1978, 63, 1493–1500. [Google Scholar] [CrossRef]
- Poirson, E.; Petiot, J.; Gilbert, J. Study of the brightness of trumpet tones. J. Acoust. Soc. Am. 2005, 118, 2656–2666. [Google Scholar] [CrossRef]
- Juslin, P. Cue utilization in communication of emotion in music performance: Relating performance to perception. J. Exp. Psychol. Hum. Percept. Perform. 2000, 26, 1797–1813. [Google Scholar] [CrossRef]
- Laukka, P.; Juslin, P.; Bresin, R. A dimensional approach to vocal expression of emotion. Cogn. Emot. 2005, 19, 633–653. [Google Scholar] [CrossRef]
- Pearce, A.; Brookes, T.; Dewhirst, M. Validation of experimental methods to record stimuli for microphone comparisons. In Proceedings of the 139th Audio Engineering Society Convention, New York, NY, USA, 29 October–1 November 2015. [Google Scholar]
- ITU-R BS.1116. ITU-R BS.1116-3 Methods for the Subjective Assessment of Small Impairments in Audio Systems; Recommendation BS.1116-3; International Telecommunication Union: Geneva, Switzerland, 2015. [Google Scholar]
- Hermes, K.; Brookes, T.; Hummersone, C. The Harmonic Centroid as a Predictor of String Instrument Timbral Clarity. In Proceedings of the 140th Audio Engineering Society Convention, Paris, France, 4–7 June 2016. [Google Scholar]
- Lartillot, O.; Toiviainen, P. MIR in Matlab (II): A Toolbox for Musical Feature Extraction From Audio. In Proceedings of the 8th International Conference on Music Information Retrieval, Vienna, Austria, 23–27 September 2007. [Google Scholar]
- Olive, S. The Preservation of Timbre: Microphones, Loudspeakers, Sound Sources and Acoustical Spaces. In Proceedings of the 8th International Conference of the Audio Engineering Society, Washington, DC, USA, 3–6 May 1990. paper 8-018. [Google Scholar]
- ITU-T. ITU-T P.1401 Methods, Metrics and Procedures for Statistical Evaluation, Qualification and Comparison of Objective Quality Prediction Models; Recommendation P.1401; International Telecommunication Union: Geneva, Switzerland, 2012. [Google Scholar]
Metric | Variables | Total Number |
---|---|---|
Audio weighting: 3 options | 168 | |
Spectral smoothing: 7 options | ||
Frequency scale: 4 options | ||
Log transform: 2 options | ||
Audio weighting: 3 options | 1512 | |
Spectral smoothing: 7 options | ||
Frequency scale: 4 options | ||
Crossover frequency: 9 options | ||
Log transform: 2 options | ||
Ratio1 & Ratio2 | Ratio type: 2 options | 756 |
Audio weighting: 3 options | ||
Spectral smoothing: 7 options | ||
Crossover frequency: 9 options | ||
Log transform: 2 options | ||
/ and Ratio Combination | type: 2 options | 6048 |
Ratio type: 2 options | ||
Audio weighting: 3 options | ||
Spectral smoothing: 7 options | ||
Frequency scale: 4 options | ||
Crossover frequency: 9 options | ||
Log transform: 2 options |
Best Pearson’s r & RMSE | Best Spearman’s rho | ||
---|---|---|---|
Metric | & Ratio2 | & Ratio1 | & Ratio1 |
Audio weighting | None | None | None |
Frequency scale | Hertz | Hertz | Hertz |
Crossover freq. () | 500 Hz | 1250 Hz | 1250 Hz |
Spectral smoothing | -octave | -octave | -octave |
Log transform | Yes | Yes | No |
Pearson’s r | 0.922 | 0.919 | 0.646 |
Spearman’s rho | 0.902 | 0.906 | 0.906 |
RMSE | 8.52 | 9.928 | 16.779 |
Model | Pearson’s r | Spearman’s rho | RMSE |
---|---|---|---|
Tested against training data | |||
New model | 0.922 | 0.902 | 8.52 |
0.760 | 0.789 | 14.30 | |
0.811 | 0.789 | 12.87 | |
Ratio1 (1 kHz crossover) | 0.734 | 0.867 | 14.93 |
log Ratio1 (2 kHz crossover) | 0.907 | 0.887 | 9.28 |
Ratio2 (2 kHz crossover) | 0.888 | 0.887 | 10.11 |
log Ratio2 (3 kHz crossover) | 0.900 | 0.892 | 9.59 |
Tested against validation data | |||
New model | 0.955 | 0.954 | 14.46 |
0.830 | 0.801 | 19.33 | |
0.864 | 0.801 | 24.00 | |
Ratio1 (1 kHz crossover) | 0.713 | 0.901 | 17.73 |
log Ratio1 (2 kHz crossover) | 0.862 | 0.822 | 23.62 |
Ratio2 (2 kHz crossover) | 0.851 | 0.838 | 24.05 |
log Ratio2 (3 kHz crossover) | 0.798 | 0.787 | 34.58 |
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations. |
© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Pearce, A.; Brookes, T.; Mason, R. Modelling the Microphone-Related Timbral Brightness of Recorded Signals. Appl. Sci. 2021, 11, 6461. https://doi.org/10.3390/app11146461
Pearce A, Brookes T, Mason R. Modelling the Microphone-Related Timbral Brightness of Recorded Signals. Applied Sciences. 2021; 11(14):6461. https://doi.org/10.3390/app11146461
Chicago/Turabian StylePearce, Andy, Tim Brookes, and Russell Mason. 2021. "Modelling the Microphone-Related Timbral Brightness of Recorded Signals" Applied Sciences 11, no. 14: 6461. https://doi.org/10.3390/app11146461
APA StylePearce, A., Brookes, T., & Mason, R. (2021). Modelling the Microphone-Related Timbral Brightness of Recorded Signals. Applied Sciences, 11(14), 6461. https://doi.org/10.3390/app11146461