Developmental dyslexia (henceforth, dyslexia) affects the acquisition of reading and writing skills despite adequate cognitive and motoric abilities and appropriate access to education. Beyond literacy, dyslexia is also characterized by deficits in spoken language processing, particularly in processing phonological information. For this reason, researchers have proposed that deficits in the processing of phonological information may be the bridge connecting the deficits in spoken and written language, e.g., [
1,
2,
3,
4]. One prominent theory of dyslexia proposes that phonological processing difficulties are a consequence of impaired auditory processing abilities, in particular when processing rhythm information in speech and music [
5]. The present paper aims to connect these hypotheses by investigating the processing of one specific type of phonological information, namely, rhythm information in speech, and its potential associations with literacy and the ability to perceive musical rhythms in dyslexia.
1.1. The Phonological Deficit Hypothesis
The original phonological deficit hypothesis proposes that a deficit in phonological skills underlies dyslexia, as evidenced by difficulties with tasks that tap into phoneme awareness, letter-sound knowledge, verbal short-term memory, and rapid automatized naming. For a recent review see [
6]. Research on this hypothesis has primarily concentrated on deficits regarding segmental (i.e., phoneme) information and has, for example, established that children with dyslexia do not seem to perceive phonemes in the same way as children without dyslexia. Specifically, they have a reduced sensitivity to phonemically relevant distinctions (e.g., when discriminating /p/ from /b/) and an enhanced sensitivity to allophonic variants (e.g., when discriminating different realizations of /b/) compared to listeners without dyslexia, who show clear effects of categorical perception of consonants (for a meta-analysis see [
7]). As categorical perception is assumed to result from effects of the native language phonological system to speech perception, e.g., [
8], weak categorical perception may indicate that the language-specific phoneme categories are not sufficiently well-established. In the case of dyslexia, less well-defined phoneme categories may create difficulties in the phoneme-grapheme mappings that are relevant for the acquisition and/or processing of written language.
1.2. Rhythm Perception Deficits in Dyslexia
More recent developments in dyslexia research have shown that the phonological deficits in dyslexia are not restricted to processing segmental information, but also affect the processing of suprasegmental (i.e., prosodic) information, and, in particular, the processing of rhythm. Rhythm is established by the regular occurrence of an element or a pattern in time. Rhythm is an important feature of languages’ prosody, and can be characterized by, for example, an alternation of more prominent (i.e., strong) syllables with less prominent (i.e., weak) ones. Languages differ in their rhythmic structure as the organization of speech in alternations of strong and weak syllables is determined by language-specific “metrical stress” rules [
9,
10,
11]. For example, in English and German, the basic rhythmic unit has a strong-weak (i.e., trochaic) pattern, but in other languages such as Hebrew, the basic rhythmic unit is weak-strong (i.e., iambic) [
9]. Compared to groups without dyslexia, groups of individuals with dyslexia show lower performance in tasks that require perceptual sensitivity to and/or knowledge of stress rules. For example, this is the case in discrimination tasks with words or phrases pronounced with correct or incorrect stress patterns, e.g., [
12,
13,
14]–an effect that is even present in young children with a familial risk for dyslexia [
15]. In addition, these abilities have been found to correlate with reading skills [
16,
17,
18].
Goswami [
5,
19] has proposed that a fundamental deficit in the processing of rhythmic information is associated with dyslexia. This account focuses on the periodic modulations of amplitude (amplitude envelope) that are crucial to establish speech rhythm with amplitude peaks being aligned with the strong (stressed) syllables of a speech sequence and Goswami assumes that the processing of this amplitude envelope is impeded in dyslexia. These difficulties may result from atypical basic auditory processing: numerous studies have found that individuals with dyslexia show low performance in the perception of rise time (i.e., the velocity of the amplitude increase) and that the perception of rise time is related to the discrimination of word stress patterns (for a review see [
20]. Research on the neural basis of this impairment suggests that in dyslexia, neural oscillations are not synchronized with auditory rhythms in the same way as in populations without dyslexia [
21,
22,
23,
24]. Independent of whether the basis of the impairment is perceptual or neural, according to Goswami, the problem in the processing of rhythm hinders the segmentation of speech into syllables and also the perception of subsyllabic units like rhymes and single phonemes, the latter case explaining the segmental phonological problems in dyslexia. Although at this point any causal interpretations of associations in neural rhythmic entrainment and dyslexia have to be taken with care, it is relevant to note that Goswami’s theory has the potential to account for a broader range of deficits that have been observed in dyslexia. Low performance in the perception of speech rhythm seems to extend to non-linguistic domains such as beat perception in music [
25,
26,
27], and even to motor synchronization abilities such as rhythmic tapping [
28,
29,
30], which suggests a domain-general rhythm processing deficit in dyslexia.
With its focus on rhythm processing, Goswami and colleagues’ work offers a substantial approach to the potential mechanisms underlying the performance of individuals with dyslexia in different domains. In our study, we intend to broaden the view on rhythm perception in dyslexia by looking at duration and intensity as acoustic cues of speech rhythm perception. Acoustically, strong and weak syllables can be distinguished on the basis of specific cues such as intensity, duration, and pitch, with strong syllables often being louder, longer, and higher than weak ones [
31,
32]. Interestingly, these different cues have different effects on rhythmic grouping and segmentation: while alternations in syllables’ duration lead to the perception of weak-strong patterns, alternations in intensity and pitch lead to the perception of strong-weak patterns [
9,
33,
34,
35,
36,
37,
38]; for more details see 1.3. The main goal of this paper is to investigate rhythmic grouping according to this bias in individuals with dyslexia.
Of course, not only speech is rhythmically structured. Rhythm is a domain-general phenomenon. Similar organizational rhythmic principles with regular alternations of strong and weak elements are also found in music [
9,
39], where the same acoustic cues (intensity, duration, and pitch) are relevant for conveying rhythm, and the same tendency to use these cues differently at the beginning or the end of a unit is often exhibited [
39,
40,
41]. If rhythm perception in speech and music relies on shared perceptual mechanisms or shared rhythm representations, then it should be the case that individuals with better music abilities should also show enhanced language abilities [
42,
43,
44]. In line with this, [
45] reported that, within a group of adults with dyslexia, musicians outperformed non-musicians on several auditory measures, including rise time, frequency, intensity, and timing perception, even reaching the same levels of performance as musicians without dyslexia. However, the advantage that musicians with dyslexia experienced in the auditory perception tasks did not extend to their literacy and phonological awareness. Accordingly, other researchers doubt that dyslexia relates to poor rhythm perception, e.g., [
46]. A second goal of the present paper, therefore, is to further examine whether rhythm processing deficits in speech and music are linked with reading deficits in dyslexia.
1.3. Biases on Auditory Rhythmic Grouping
In this paper, we will investigate for the first time how biases on auditory rhythm perception affect speech rhythm perception by adults with dyslexia. For this, we focus on rhythmic grouping of speech following the Iambic/Trochaic Law (ITL) [
9]. According to the ITL, rhythmic perception is guided by universal biases. These biases have the effect that sequences of sounds varying in intensity tend to be perceived as trochees (strong-weak), whereas sound sequences varying in duration tend to be perceived as iambs (weak-strong; e.g., [
33,
34,
35,
36]). These biases have been attested for speakers of various languages, including English [
33,
34,
36,
47,
48], German, French [
38,
49], Spanish [
48], and Italian [
37]. Since rhythmic grouping preferences are asymmetrical between the perceived acoustic cues, these biases cannot simply be accounted for by a tracking of acoustic cues to prominence in the signal. Importantly, asymmetries in rhythmic grouping are mirrored in the rhythm structures in language and music where final prominence is usually marked by a long syllable or note, and initial prominence by a loud syllable or beat, which supports the assumption of the ITL as universal [
19]. More recent research, however, indicates that rhythmic grouping preferences are subject to individual variation and depend to some degree on aspects such as individuals’ language background [
38,
47,
48,
49,
50] and their musical abilities [
51,
52].
1.3.1. Effects of Language Background on Rhythmic Grouping
Language background’s effects on perception may relate to differences in the function of stress between the languages: [
38] hypothesized that when perceiving speech, the ability to draw on abstract phonological representations of lexical stress would facilitate German speakers’ rhythm processing. This is because German uses lexical stress contrastively (e.g., /’te,nor/ ‘common sense’ vs. /te’nor/ ‘singer’), while French does not. In [
38]’s rhythmic grouping experiment, German and French listeners listened to syllable streams, in which syllables alternated in intensity (loud-soft-loud-soft…), pitch (high-low-high-low…), duration (long-short-long-short…), or neither (flat control condition). Participants were asked to indicate via button presses whether they perceived strong-weak or weak-strong groupings. The result was that both groups perceived iambs and trochees as predicted by the ITL, but the German listeners were more consistent and had clearer rhythmic grouping preferences than the French listeners. Moreover, German but not French listeners experienced the illusion of hearing strong-weak groupings when listening to the control sequences that did not contain any acoustic cues to rhythm. Ref. [
38] argue that this effect is likely to also be driven by the presence of abstract phonological representations of stress in German: As words in German are pre-dominantly trochaic, German listeners might apply a default grouping to sound sequences based on their linguistic experience. Since French has no lexical stress, there may be no reason for a default grouping based on their experience.
1.3.2. Effects of Musical Background on Rhythmic Grouping
Musical
experience as defined by the number of acquired musical instruments, the duration of musical training, and the earliest age of acquiring a musical instrument has been found to influence rhythmic grouping. However, this seems to be modulated by the individuals’ language background and has, to this point, only been found to affect native speakers of French and not native speakers of German [
38,
49,
51,
52]. French speakers who are musically experienced have clearer preferences for grouping acoustically complex non-speech sounds [
49] as well as for grouping speech, though only if they are also proficient speakers of German [
51]. While general musical experience never predicted monolingual German speakers’ grouping of speech, their ability to perceive musical rhythm as measured by a standardized musical ability test (the Musical Ear Test, henceforward MET [
53]) did (though their ability to perceive melodies did not) [
52]. Musical abilities can be, but not necessarily, correlated with musical experience [
46]. Instead, they may relate to more general auditory perception abilities, which vary widely among individuals [
29]. In addition, the abilities to perceive and discriminate musical rhythms do not always correlate with musical melody perception abilities [
54]. Together, these results suggest a specific connection between language and music via rhythmical properties.
1.4. Hypotheses and Predictions
Individuals with dyslexia have repeatedly exhibited relatively weak stress and rhythm processing abilities, even in domains other than language (e.g., in tapping and music perception). This suggests that their rhythmic grouping preferences will also be weak, especially since rhythmic grouping depends on native language phonological knowledge. Given the findings that musical ability also influences rhythmic speech grouping, the present study set out to investigate the relations among speech rhythm processing, musical rhythm perception ability, reading ability, and dyslexia in German listeners. We aimed at investigating the following research questions:
- (1)
Do adults with dyslexia have less consistent rhythmic speech grouping preferences than adults without dyslexia?
- (2)
Do adults with dyslexia show lower musical rhythm perception abilities than adults without dyslexia?
- (3)
Does musical rhythm perception ability predict rhythmic speech grouping in dyslexia?
- (4)
Does musical rhythm perception ability predict reading ability in dyslexia?
We hypothesized the following:
- (1)
Based on the hypothesis that adults with dyslexia have difficulties in processing rhythm, we expect them to show weak grouping preferences. Hence, they should show less asymmetrical grouping preferences when hearing sequences varying in intensity or duration than adults without dyslexia. Further, if this rhythmic deficit hinders the establishment of phonological representations for metrical structure, adults with dyslexia should not show grouping preferences when hearing rhythmically invariant sequences.
- (2)
We assumed that musical rhythm perception ability would be lower in individuals with dyslexia than in individuals without.
- (3)
If rhythm perception in music and speech share cognitive underpinnings, we expect that higher musical rhythm perception ability would be associated with more consistent preferences in rhythmic grouping of speech in adults with dyslexia.
- (4)
If reading difficulties are linked with general underlying difficulties with rhythm processing, then musical rhythm perception ability should predict reading ability in dyslexia.
To investigate these hypotheses, we conducted a rhythmic grouping experiment with adults with and without dyslexia and measured their musical rhythm ability by means of the MET [
53], and their reading ability by means of the Salzburger Lese- und Rechtschreibtest SLRT-II [
55]. In order to avoid pre-selecting or grouping participants based on their musicality and cognitive abilities, we applied regression modeling for data analysis, with musical rhythm ability, musical experience, and cognitive abilities as covariates.