1. Introduction
The soundscapes in the places where we eat and drink—such as restaurants, bars, and airplanes—exert a significant influence over our perception of the taste and flavor of food and drink ([
1]; see [
2,
3] for reviews). Methods and techniques are undergoing increasing innovation in order to further evaluate how a certain sound can alter the perception of different food attributes, such as sweetness (e.g., [
4,
5,
6,
7,
8,
9,
10]; see [
11] for a review).
That there might be crossmodal links between sounds (e.g., pure tones) and basic tastes (e.g., sweet, bitter, sour, and salty) is by no means obvious. However, a number of recent studies have highlighted the robustness of such associations, enriching the discussion on why it is that people associate specific sounds with particular tastes, aromas, textures, and flavors (see [
11] for a review). Beyond the basic matching of seemingly-unrelated stimuli across the senses, the latest research has also demonstrated how sound can influence the multisensory tasting experience (see [
5,
11] for overviews). Besides researching the potential influences of the sound of food being consumed, researchers have also examined how sounds that are unrelated to food and drink can nevertheless exert a significant influence over multisensory tasting experiences (see [
12,
13] for reviews). For instance, Reinoso Carvalho and his colleagues [
7] conducted an experiment in which the customers in a chocolate shop rated a chocolate before eating it, versus eating it while listening to different soundscapes (including a song). The results revealed that these participants were willing to pay significantly more for the song-chocolate experience, as compared to their ratings before having such a multisensory experience.
A pioneering study from the late 1960s has become a general source of inspiration for many of those working in this area of research: Holt-Hansen [
14] reported that participants made consistent associations between audible frequency ranges (pitch) and two beers. Here, the participants were presented with iterative frequency sweeps on a tone generator, to select the pitch that they thought best matched the flavor of each of the two beers. The participants matched Carlsberg Elephant Beer, a beer with higher alcohol content, to a significantly higher range of frequencies (640–670 Hz), when compared to regular Carlsberg lager (510–520 Hz). Some years later, Rudmin and Cappelli [
15] reported a partial replication of the aforementioned experiment with 10 participants. In the latter study, though, beer was not the only stimulus that the participants were given to match. Here, a range of five foods/drinks was presented, comprising non-alcoholic beer, Carlsberg lager beer, grapefruit juice, a hard candy, and a dill pickle. When comparing the studies of Holt-Hansen [
14] to that of Rudmin and Cappelli [
15], the same beer (Carlsberg lager) that was matched around a pitch of 510–520 Hz in [
14] was matched to a pitch of 600 Hz in [
15].
More recently, various studies have highlighted consistent and robust crossmodal associations between basic taste stimuli and auditory tones. Generally speaking, sweet and sour tastes tend to be matched with a higher pitch range than either bitter or salty tastes [
16,
17].
The research reported here analyzed whether participants would consistently associate different flavors of beer with different pitches. We reassess the hypothesis raised by the studies of Holt-Hansen and Rudmin and Cappelli [
4,
14,
15], but now focusing on better disentangling the main flavor attributes that may be behind these beer-pitch associations. Four different Belgian beers were used in this ‘tuning’ study. The study was subdivided in two experiments. Each experiment consisted of ‘tuning’ three different beers. In the first experiment, the beers were chosen to provide a range of alcohol contents. Here, we were motivated by the fact that in Holt-Hansen’s [
14] results, the participants matched the beer that just so happened to have the higher alcohol content to a significantly higher pitch. In the second experiment, however, the beers were chosen based on their differences in levels of sweetness and bitterness, as it has been previously argued that sweet tastes tend to be matched with a higher pitch range than bitter tastes [
16,
17].
In other words, we wondered if people would match different pitches based on high differences in alcohol content, as Holt-Hansen discussed in his studies, or, perhaps, if they would rely more on a clear difference in whatever the most salient taste differences between the beers happen to be as the main reference for these multisensory matching tasks.
2. Materials and Methods
2.1. Participants
Two experiments were conducted for this study. The first on the 8 December 2015, at the Vrije Universiteit Brussel—VUB (Experiment 1), and the second, on the 27 August 2016, at Savoie’s Cantina, in Brussels (Experiment 2). The Social and Societal Ethics Committee at KU Leuven (SMEC) approved this protocol (Registered as G-2015 07 282). In both experiments, general visitors or customers were invited to take part in a short experiment. They were informed that they would be given complimentary beer to taste while listening to sounds and answering a survey. A total of 75 participants took part in the two experiments (29% females, 71% males, mean (M) age of 24.8 years, standard deviation (SD) of 7.0). All of the participants were at least, 16 years of age (the minimum legal age to drink alcohol in Belgium), and gave their informed consent prior to taking part in the study. None of the participants reported having a cold or any other impairment of their senses of smell, taste, or hearing at the time of the study. In general, the participants were Belgian residents, especially from Brussels and its surroundings. In particular, they reported having a mid-high knowledge of the types and brands of Belgian beer (M = 3.17, SD = 1.21)—these ratings were based on five-point scales, with 1 being ‘Not at all’, and 5 ‘Very much’. They also reported as drinking beer often (M = 4.70, SD = 1.71)—these ratings based on seven-point scales, with 1 being ‘Never’, and 7 ‘Very often’. Finally, when asked about their music ‘expertise’, most people reported as being intermediate (corresponding to less than 10 years of music-playing experience).
2.2. Stimuli
Three Belgian beers were used in Experiment 1. Taras Boulba (Beer A) is a dry Pale Ale beer labeled as 4.5% in alcohol strength and 43.1 IBU (IBU is the standardized scale for bitterness in beers). Zinnebir (Beer B) is a golden blond Pale Ale beer labeled as 6% in alcohol strength. Jambe de Bois (Beer C) is a full-bodied
1 blond and copper-colored Tripel beer. Beer C is the strongest of the three (labeled as 8% in alcohol strength), and has 47.7 IBU. Beer B is the highest within the IBU scale (51.5), although its difference with Beer C may be insignificant. Overall, these three beers are within the bitter beer scope (all beers with an IBU over 25 are usually considered as high in bitterness). In particular, their differences can be described in terms of their alcohol content (with Beer A being the lightest, and Beer C the strongest), and, to a lesser degree, by their bitterness (with Beer A having the lowest IBU). Beer A may also be considered as the least sweet due to its dryness, and Beer C the one with more body, typical to a Tripel
2 style beer. Finally, the pH and acidity of these beers are quite similar, which in principle does not allow them to be chemically ranked in terms of sourness. A fourth beer was used in Experiment 2, along with Beers A and C (Belle-Vue Traditional Kriek Lambic-Fruit, namely Beer D). This is a very fruity and sweet beer, where fruits are added after the spontaneous fermentation (in this case, of cherries). It is labeled as having 5.1% of alcohol content. Kriek beers usually have an IBU of between 15 and 21 (information retrieved from [
20]), which makes this beer much less bitter than the other three beers.
All of the beers were served in 5 cl samples in opaque black plastic cups that obscured the color of the drinks being tasted.
Auditory stimuli: The auditory stimuli used in this study consisted of a digital version of an adjustable pitch tone generator (also known as a frequency sweeper). Using the Online Tone Generator (retrieved from
http://www.szynalski.com/tone-generator/) the participants had the opportunity to choose different tone frequencies while hearing them in real-time, in order to find the pitch that was most suitable for the flavor of each beer being tasted.
Figure 1 shows an image of the corresponding graphic interface.
In order to define the available pitch range for participants to choose from, a pilot study was conducted. In this study, participants (
n = 75) chose a pitch that matched the flavor of three bitter Belgian beers (beers A, B, and C) compared to a sweet carbonated beverage (namely ‘sweet Cola’). The initial range for the pilot was between 50–1500 Hz, and this choice was primarily based on three factors. First, the fact that the headphones started to distort the sounds below 50 Hz. Second, Holt-Hansen [
14] highlighted beer-pitch matches under 1000 Hz. Third, the formulas of these beers are more bitter, when compared to lager beers, which is the beer type that was used in Holt-Hansen’s [
14] research. Hence, in this pilot, we would also expect that participants would be rating even more towards the lower part of the frequency range [
16,
17].
The results of the pilot study revealed that the majority of the bitter beer ratings were within a much narrower range than the available choices (see
Figure 2). For instance, the boxplots show that the medians of Beers A, B, and C are situated around 200 Hz, whereas the median of the cola beverage seems to be around 400 Hz (twice as high). These median values also show us that the differences in the three bitter beer rates seem to be minimally influenced by their differences in alcohol content. The ratings of Beer C would seem, though, to be more dispersed than the other beers. In fact, the corresponding boxplot shows approximately twice the number of outliers. This probably has to do with its higher alcohol content, when compared to the other two beers (as in [
14]). Considering the results of this pilot (along with the revised literature), it was decided to define the pitch ranges for Experiment 1 as between 50 and 500 Hz, and, for Experiment 2, between 50 and 1500 Hz (here, a sweeter beer was being added as a taste stimulus).
2.3. Experimental Design
The objective of Experiment 1 was to assess whether the participants would be able to match each of the three beers with different pitch ranges (in this case, Beers A, B, and C). This experiment was conducted at the Stoa (an annexed room of the VUB’s restaurant), with a total of 55 participants. Considering the results of the pilot, we chose to give the participants a reduced range of frequencies to choose from (50–500 Hz).
In order to test whether participants were able to place beers with very different flavors within different pitch ranges, another beer was included in Experiment 2. Here, one of the three beers from the previous experimental set-up was replaced with a beer having a very different—and sweeter—taste (Beer D). Since beers A and C have the biggest chemical differences, it was decided to keep them along with the new Beer D. With the addition of a sweeter beer, and, again, considering the results of the pilot, here, the participants were given a wider range of frequencies to choose from (50–1500 Hz). Twenty participants took part in this study, developed at the Savoie’s Cantina.
2.4. Experimental Procedure
During both experiments, it was possible to have a well-controlled environment. The experiments were performed using computers placed on tables. The participants sat together, although fairly well separated, in order to prevent them from interacting. The intensity of the artificial lighting in the experimental area was dimmed to a minimum, in order to provide a more ‘intimate’ atmosphere.
Each participant was seated in front of a computer screen with a pair of headphones, a computer mouse, and a keyboard to complete the survey. Each computer was set to a listening level of approximately 70 (+/− 5 dBA). The pitch task was presented over SONY MDRZX310 headphones. The participants were not able to hear the sounds from the others participants’ headphones. The survey consisted of an electronic form containing three main steps, with five- or seven-point Likert scales, multiple choice questions, and YES/NO questions. In the first step of the survey, the participants were instructed to read and accept the conditions of the informed consent before entering their demographic details. In a second step, they responded to a pre-questionnaire, in which they outlined their basic knowledge of Belgian beers. The third step of the experiment consisted of matching the flavor of the three different beers available with pure tones, while manipulating the sweep until they reached the ‘ideal pitch match’. This matching process was made while participants tasted the beer, that is, while having the beer in their mouths. Once the match had been made, the participants entered the corresponding frequency value into a text box. Each participant was provided with three labeled cups, and the pitch-matching process was repeated three times, once for each beer. Separate cups with tap water for mouth rinsing were available as well. After each pitch-beer matching exercise, the participants answered a number of questions, including taste/flavor ratings and hedonic evaluations.
In order to shed more light on how this tuning process took place, in Experiment 2, a second round of questions was added. Here, the participants were asked to rate basic taste attributes of the beers (sweetness and bitterness), alcohol strength, as well as how much they liked each of the beers. All of these ratings were based on 7-point scales, with 1 representing ‘Not at all’ and 7 representing ‘Very much’ (i.e., ‘Not at all sweet’, compared to ‘Very sweet’, and so on). All the data from Experiments 1 and 2 were analyzed using SPSS.
As a final qualitative evaluation, in Experiment 2, the participants were also asked to explain (in their own words, using a comment box), how they had gone about making their beer-tuning choices.
The order of presentation of the beers was fully randomized. Hence, the participants were advised to follow the survey instructions carefully, in order to drink the corresponding beer at all times. At least one supervisor was present during the entire process to provide extra guidance, coordination, and support, in addition to the written guidelines concerning the experiment. Upon finishing the study, the participants were instructed to leave the room without discussing any details with the next group of participants.
4. Discussion
In these two experiments, we first measured people’s ability to choose the pitch for three Belgian bitter beers with considerable differences in alcohol strength (Experiment 1). Having obtained a similar pitch range for all three of the beers, participants made significantly different pitch choices between beers with very different taste attributes (Experiment 2). The results of Experiment 2, with the inclusion of a sweeter beer (Beer D), supported the interpretation that the participants were primarily rating the bitter beers towards the lower end of the pitch range (Beers A and C). By contrast, Beer D was systematically matched to a higher pitch (when compared to Beer A). Therefore, considering the significant contrast in the ratings between Beers A and D, the hypothesis that people tend to associate bitterness with lower pitch ranges and sweetness with higher pitch ranges is supported [
16,
17]. The general taste and flavor differences between Beers A and D were most likely the easiest to define for naïve participants. In particular, Beer A was bitter, light, and dry, whereas Beer D was, comparatively speaking, much sweeter. These results may be useful in discussing the fact that crossmodal correspondences may typically rely on explicit contrast for their effectiveness [
22]. Specifically, clear differences in taste attributes (such as bitterness versus sweetness) can lead to clear differences in the pitch of sound that is matched to those taste attributes [
23].
Incidentally, the resultant pitches of Beers A, B, and C (in Experiment 1) and Beer A (in Experiment 2) were contained in the range of the bitter soundtrack proposed by Reinoso Carvalho and his colleagues in [
8] (Link to access the soundtrack [
24]). This soundtrack has most of its sound power distributed between 50 and 350 Hz. Intriguingly, playing this ‘bitter’ soundtrack enhanced the perceived alcohol content of a Belgian bitter beer.
The pitch range available for selection was different in the two experiments reported here. The ratings corresponding to Beer A appeared to be remarkably stable across both experiments (with the mean varying from 213 Hz to 214 Hz), whereas the mean of Beer C (which was also present in both experiments) varied from 250 Hz to 400 Hz. Note that it has previously been reported that pitch-matching tasks can depend on the set of frequencies in which those sounds are presented, making choices of pitch somewhat relative to the bandwidth that is available [
25]. Even though the means of the pitch of Beer A were quite similar during both experiments, we believe that in this exercise, the participants were most likely comparing the beers and their corresponding pitch in a simpler way, rather than assigning an absolute fixed frequency to each beer. For instance, their associating tasks were perhaps assigning “low”, “medium”, and “high” values to their comparisons [
22]. Such behavioral tasks may be related to the range frequency theory, proposed by Parducci [
26], which argues that a judgment range can be psychologically sub-divided into relative sub-ranges. With the qualitative assessment that we included as part of Experiment 2, most of the participants associated bitterness with lower pitch ranges when compared with sweetness, which goes in line with part of the aforementioned theory [
16,
17]. However, strength also appeared to be an aspect in these comments, with some participants having reported that the stronger the beer, the higher the pitch that should be associated with it
4. Interestingly, this is indeed similar to the results reported by Holt-Hansen [
14], where the participants matched the lager beer with the higher alcohol content to a sound of a significantly higher pitch. This would perhaps explain why Beer C was found to be statistically within the same pitch ranges as beers A and D (this is visible by the high SD associated with its ratings, especially in Experiment 2). Beer C’s pitch was mostly rated towards the lower limit of the frequency range available in Experiment 2, since most of the associations were likely being driven by its bitterness. Still, since Beer C is by far the strongest among them, and this might have caused its corresponding mean to be drawn, in parallel, towards the upper limit of the frequency range available in Experiment 2 (50–1500 Hz); that is, some of the participants were using relative differences in strength to define their beer-pitch associations. In other words, the pitch associated with Beer C was being associated with its high bitterness by most participants (deriving in lower pitch matches) but, at the same time, it was being associated with its high strength by some participants (deriving in higher pitch matches), and this may have caused the resultant higher spreading of its ratings.
Another important observation in Experiment 2 was the fact that Beer A was rated as significantly stronger than Beer D, when actually Beer A has 4.5% alcohol strength, and Beer D 5.1% (see
Figure 2). These results may argue with the finding that people are generally poor at estimating alcohol strength in drinks [
8,
27]. It seems that people tend to associate alcohol strength with flavor intensity (in this case a more bitter beer is perceived as stronger when compared to a much sweeter one), which would be another reason why these beer-pitch matching tasks seem to be more robust when relying on differences in taste attributes, rather than differences in alcohol strength.
In summary, all three of the beers present in Experiment 1 were Belgian blond bitter types. They are, indeed, quite different and with distinguishable flavors (especially when it comes to their strength). However, this sound-flavor matching process was mostly driven by the contrast of perceived bitterness versus sweetness
5, and, to a lesser degree, by their differences in alcohol content (cf. [
8]).
A few of the limitations of the present study are worth highlighting. For instance, the graphic interface of the frequency tone generator used in the present study was always loaded with the same starting frequency (440 Hz), which presents a possible source for anchoring effects [
28]. In future studies, it would therefore be worthwhile to have the frequency starting point set randomly (or, perhaps, a scale that does not have a starting frequency at all), in order to eliminate any potential anchoring effects, should they exist in this situation.
In general, further studies are needed in order to uncover the basis on which such beer-pitch matches are made, and the perceptual effects of matching/non-matching tones on the drinking experience. For example, Holt-Hansen [
4] reported some interesting sensations described by the participants when they heard frequencies that were not the ones chosen by them as the ideal match (think of watery, strong and bitter; see [
29] for a review). In both of Holt-Hansen’s [
4,
14] studies, the beer apparently tasted better at the “pitch of harmony” than when evaluated in silence. Moreover, future research could assess whether this type of multisensory matching task could be controlled by simply reorienting the participant’s attention towards the attribute that is being manipulated (e.g., variations in alcohol strength versus differences in different taste attributes).
Nevertheless, it would be interesting, in future studies, to continue researching effective ways of better disentangling taste attributes and alcohol as factors. When it comes to sound-flavor associations, and as mentioned before, people commonly tend to associate alcohol strength with flavor intensity and/or flavor contrast (cf. [
8,
27]). Perhaps a future similar experiment could compare, for example, different-flavored alcohol-carbonated drinks with the same alcohol level (i.e., a very sweet drink versus a very bitter beer, and so on). Actually, carbonation (including its sound) can also be an important factor to be taken into consideration during future experiments. As a matter of fact, recent research has been expanding the flavor wheel in order to place mouthfeel characteristics (including carbonation) on an equal level with aroma and taste characteristics [
18,
19]. In our case, different levels and types of carbonation lead to different sounds, and this may have a significant effect during beer-pitch matching tasks ([
30]; see [
31] for an overview on the influence that the sound of carbonation can have on beverage perception).
Similar future assessments could also count on test-retest techniques for a better understanding of the liability of these types of experimental designs, especially considering the large standard deviation that is being reported here. Another interesting point to be considered in future similar assessments is the fact that higher frequency tones (when presented as single pure-tones) tend to be generally perceived as unpleasant because of their apparent loudness
6, which may also be an important factor while rating
7. Therefore, future experiments could rely on embedded loudness correction curves as part of the pitch tone generator (see [
32] for an overview on the measurement of loudness). This may allow the participants to position their pitch ratings at higher ranges, with the possible mitigation of pleasure as a confounding factor.
We believe that the use of sound as a sensory enhancement in multisensory tasting experiences could provide new ways of rethinking the design and consumption of food and drinks. For example, a bitter beer accompanied by high-pitched sounds/music (such as a high-pitched piano melody) may be perceived as less bitter, making its consumption more pleasant—and potentially with less added sugar—for those who prefer sweeter beers
8. This approach could be considered as complimentary to others, such as that reported by Deroy and Valentin in [
34]. The analysis in the latter study used beers as taste stimuli, and revealed that there is a significant correspondence between sweetness, voluminousness, and roundness; and between bitterness, thinness, and angular shapes. Moreover, flavor-pitch associations may end up having an impact on the way people choose food and drinks, since it has been previously argued that people tend to associate different types of foods with specific types of music and soundscapes [
9]. In other words, there is a whole range of crossmodal correspondences and multisensory experiential designs to be explored with beers. Actually, exploring our senses and the way in which they interact (not to mention the surprising connections between them) while we experience a food or beverage is certainly aligned with the interests of the younger generation, who are constantly looking for new ways to “promote their senses” [
35]. As a matter of fact, some brewing companies (such as Inbev and The Brussels Beer Project) are already relying on similar multisensory beer experiences as a potential commercial strategy [
36,
37].