1. Introduction
Syntactic priming (SP) in psycholinguistics—also referred to as structural priming, structural persistence, adaptation, or alignment—occurs when exposure to a syntactic structure influences a person’s subsequent language comprehension and production, making them more likely to process or produce a similar structure (
Bock 1986). Research has established syntactic priming effects outside a discourse context (e.g.,
Bock 1986,
1989;
Bock and Loebell 1990;
Hartsuiker et al. 1999), as well as in a conversation (e.g.,
Branigan et al. 2000;
Branigan et al. 2007;
Balcetis and Dale 2005;
Schoot et al. 2014). Speakers demonstrate an implicit tendency for linguistic convergence, starting to express themselves in similar ways as the conversation unfolds (
Branigan et al. 2000,
2007). What an individual hears as a listener in one conversation turn influences their subsequent contributions as a speaker in the following exchange. Over conversational turns, alignment can emerge across multiple linguistic levels during a conversation (
Pickering and Garrod 2004), at the semantic, lexical, and syntactic levels (
Garrod and Anderson 1987;
Brennan and Clark 1996;
Clark and Schaefer 1989;
Clark and Wilkes-Gibbs 1986;
Branigan et al. 2000).
The Interactive Alignment Model (IAM) of dialogue draws on the psycholinguistic observations of priming effects between speakers and posits that as speakers influence each other and generate similar linguistic representations, their level of alignment with one another increases. The model predicts that conversation is successful to the extent that the situation representations of conversation partners become aligned (
Pickering and Garrod 2004). Similar to IAM, the Shared-Workspace Framework (
Pickering and Garrod 2021) emphasizes the importance of alignment between interlocutors. Speakers’ linguistic choices can influence listeners’ comprehension processes, and vice versa. This reciprocal influence contributes to the ongoing alignment of mental representations and a successful conversation. The extent of syntactic alignment increases when speakers interact with a conversational partner, as opposed to performing the experiment alone, supporting a mediated account of syntactic priming, in which the communicative intention influences linguistic behavior (
Schoot et al. 2019).
Branigan et al. (
2000) reported larger syntactic priming effects in dialogue versus monologue situations, providing additional evidence that speakers are sensitive to the communicative context and the linguistic choices of their conversational partners during language production.
Less is known about how differences in the communicative environment influence language production. One study showed that speakers’ tendency to clarify their utterances increased when the visual context was potentially ambiguous, in contrast to cases without such ambiguity, reflecting awareness of the importance of facilitating comprehension. Findings highlighted the significance of speakers’ attention to the addressees’ perceived capacity for ease of comprehension (
Haywood et al. 2005).
Reitter et al. (
2006) investigated the effect of syntactic priming in a dialogue where interlocutors either aimed solely to communicate casually or to collaborate on a task. Syntactic persistence effects were more robust in task-oriented interactions compared to casual conversations. Moreover, various characteristics of communication, such as individuals’ perception of their addressees’ comprehension ease, influenced the alignment between the conversational partners. For instance, speakers tend to align their syntactic choices to a greater extent in situations where the conversational partner benefits from language adaptations tailored to the audience. This adaptation phenomenon became more pronounced when the recipient was less likely to understand the speaker (
Branigan et al. 2003). Together, these studies emphasize the importance of communicative context and suggest that speakers’ communicative intent influences linguistic behavior.
Priming from Multiple Speakers
Most psycholinguistic research on dialogue has investigated dyadic communication (e.g.,
Brennan and Clark 1996;
Garrod and Anderson 1987), with an emphasis on exploring the cognitive mechanisms that underpin language use in dyadic dialogues (e.g.,
Horton and Gerrig 2005). However, less is known about priming in more complex communicative situations, notably the cognitive processes engaged during multi-party dialogues.
Branigan et al. (
2007) investigated the role of the speaker in multi-party dialogue and showed that individuals who were formerly in the role of addressee exhibited stronger syntactic priming effects from their conversation partners, compared to individuals who were previously overhearers. Their results suggested that the alignment of syntactic structures is influenced by the participant’s specific role during comprehension in a multi-party dialogue.
The purpose of this study is to examine a dimension of priming in multi-party communication that, to our knowledge, has not been explored before, notably speaker source (also speaker variation). We asked whether the syntactic priming effect is boosted when experienced by different sources (here, individuals), compared to a single source/individual, everything else being equal.
This question derives from considering language behavior as a form of social behavior that is adopted via a process termed complex contagion in the sociology literature (
Centola 2019;
Centola and Macy 2007). Complex behavioral contagions are behaviors, beliefs, or attitudes for which adoption by an individual requires contact with multiple sources of activation and are contrasted with simple contagions that require a single source (
Centola 2019;
Centola and Macy 2007). In that literature, the emphasis is on
single versus multiple
sources (i.e., speaker source/variation) rather than merely multiple exposures (which may still be required for priming to take effect). Complex behavioral contagions have been shown empirically in various forms of human behavior, for example, in the adoption of healthier behavior practices (
Centola 2010), people’s diffusion of political hashtags in social media (
Romero et al. 2011), and other forms of innovation diffusion (for a review, see
Guilbeault et al. 2018). This concept of complex behavioral contagion in the sociology literature may, at first blush, appear distant from the phenomenon of priming in psycholinguistics as used here. However, linguistic productions such as utterances in dialogue are a spontaneous human behavior, so the concept of behavioral contagion should, in principle, extend to language behavior as well. In addition, the empirical evidence for complex contagions via multiple sources all occurred with participants using language, i.e., ideas were adopted via linguistic messages, thus possibly with the help of linguistic priming. Finally, the term ‘contagion’, as used in the sociology literature, only alludes to an analogy of the idea of pathogenic contagions and lacks a cognitive psychological explanation. Perhaps implicit priming is the cognitive mechanism that drives the complex contagion of ideas and behaviors and this links directly to our proposed hypothesis that linguistic priming could be mediated by speaker variability.
Our proposed hypothesis that speaker variability could boost linguistic priming also aligns with the idea that adapting one’s language to the listener serves the purpose of enhancing message comprehension. This adaptive behavior contributes to the establishment of shared understanding, fostering successful communication (
Branigan et al. 2000;
Branigan 2006). In everyday interactions, individuals engage in conversations with various people, processing diverse linguistic inputs to achieve communicative goals (
Clark 1996). If individuals implicitly adopt linguistic patterns common among a diverse group, their behavior is more generalizable and their chances of aligning with the larger community increase. Consequently, individuals may implicitly incorporate prevalent linguistic behaviors within their social context. This adoption, of course, results from repeated exposure to specific linguistic structures, leading to an accumulation of experience (
Kaschak et al. 2011;
Hartsuiker and Westenberg 2000;
Kaschak and Borreggine 2008). However, the complex contagion theory emphasizes the role of multiple sources, not just repeated exposures. Consequently, syntactic priming effects may be more pronounced in multi-party conversations with several speakers compared to single-speaker scenarios, even when the exposure instances are similar.
To examine whether speaker variation amplifies syntactic priming, we employed an online picture description game where adult participants were primed to produce passive and double object (DO) syntactic uses. There were two conditions, as follows: in the single-speaker priming (SSP) condition, participants were primed with passive and DO forms by one confederate, while in the multi-speaker priming (MSP) condition, participants were primed by five different confederates. Participants’ production of dative and transitive structures without priming was assessed in a baseline session prior to the experimental priming session. We expected to find a priming effect in both SSP and MSP groups from baseline to the experimental session. Furthermore, in line with the proposed hypothesis, the MSP condition should promote a greater difference from baseline in primed grammatical structures compared to the SSP condition.
There have been relatively few studies on more complex dialogues; in particular, little empirical investigation has been carried out on the cognitive processes involved in multi-speaker dialogues. To our knowledge, no mechanistic account of syntactic priming explicitly incorporates speaker variation as a moderator of the priming effect. In models that interpret priming as implicit learning (
Chang et al. 2006), prior exposure to a specific structure alters the weights of connections within the language processing system, regardless of who says what. However, these models could accommodate the effects of speaker variation by introducing a threshold parameter for the degree of priming. For example, exposure to a structure from multiple people could trigger larger weight changes, resulting in a stronger tendency to re-use a linguistic structure compared to exposure from a single individual. In other words, participants’ syntactic procedures and representations could be influenced to a greater extent when they understand an utterance they hear from multiple people, as opposed to hearing it from a single person (see
Lou-Magnuson and Onnis 2018 for a computational simulation that explicitly incorporates such a threshold parameter to account for the spread of syntactic complexity in a community of agents).
3. Results
In the baseline session, the mean proportion of active construction use was 0.95, while the mean proportion of passive construction was 0.05. The PO constructions constituted a mean proportion of 0.74. The DO constructions, while present, were less prevalent, representing a mean proportion of 0.26. Inferential model comparison was adopted for the data analysis, fitting nested mixed-effects binary logistic regressions of different complexities with the
lme4 package (
Bates et al. 2015). To find the best parsimonious fit of the data, we compared the models with the
flexplot package (
Fife 2022) in R (
R Core Team 2021). The dependent variable was whether participants used the target alternative of each syntactic construction (coded as 1 for passive and DO forms) in the picture description task versus the non-target alternatives (coded as 0 for active and PO forms). Productions categorized as neither (see scoring criteria above) were excluded from the analysis (in total, 20% of data points were excluded). The fixed effects were session (levels: baseline vs. experimental), speaker_source (levels: SSP vs. MSP), and construction (transitive vs. dative).
model0: As suggested by
Barr et al. (
2013), a maximal random effects structure model with no fixed effects was first attempted. We incorporated random intercepts for both item and subject, alongside random slopes for subject and the interaction involving speaker_source, session, and construction within subject. The model was simplified step by step until it converged without singularity issues. First, random slopes for item were removed (
Segaert et al. 2016;
van Lieburg et al. 2023). Subsequently, random slopes for the interaction between speaker_source, session, and construction by subject were excluded from the model. The model converged without singularity issues after all random slopes were removed. Thus, the final random effects model included random intercepts for both item and subject.
model1: Main predictors of session, speaker_source, and construction were added as fixed effects to model0. Sum coding was used to interpret the output of the models in terms of the main effects, and main interactions in later models:
model2: To test the proposed central hypothesis that priming of target syntactic forms should increase in the multi-speaker priming condition relative to the baseline productions of those forms, an interaction term between speaker_source and session was added to model1:
model3: To test whether priming of target syntactic forms was larger for one construction relative to baseline, the interaction between construction, session, and speaker_source was added in a third model (
Figure 3). This specific prediction is more speculative than our central hypothesis, tested with model2. However, it is possible to conjecture that the effect of speaker_source (relative to baseline), if it exists, may be larger for constructions that are harder to prime. In such a case, the construction that has been demonstrated to be harder to prime should be more sensitive to the effect of speaker source (this specific hypothesis is exploratory and did not form part of the initial preregistration plan):
Model comparison is the process of selecting the best model among a set of competing models based on fit indices and hypothesis testing. Nested models are models that can be derived from each other by adding or removing parameters or constraints, such as the ones specified above. Nested models can be compared using the likelihood ratio chi-square difference test, which probes whether the difference in chi-square values between two models is significant or not. A significant chi-square difference indicates that the more complex model fits the data better than the simpler model. Nested models can also be compared using information criteria, such as AIC or BIC, which balance the fit and the complexity of the models. A lower information criterion indicates a better model. A further measure of the relative evidence for one model over the other is the Bayes factor (BF). While BF values are not interpreted as thresholds like p-values, values larger than around 10 are considered fairly strong evidence in favor of a model, while, conversely, values much smaller than 0 are considered strong evidence against a model.
In
Table 1 and
Table 2, we report chi-square test, BF, AIC, and BIC values in pairwise comparisons between model1 vs. model2, as well as model2 vs. model3. All indexes converge toward model1 being the best fitting and most parsimonious model to fit our data (all models were fit to the same data). First, using likelihood ratio tests, no significant difference exists between model1 and model2 (χ
2 = 0.0005,
p = 0.98), suggesting that adding an interaction term between session and speaker_source did not significantly improve the model. Similarly, the results revealed no significant difference between model2 and model3 (χ
2 = 5.2,
p = 0.16), indicating the addition of three-way interaction between session, speaker_source and construction did not significantly improve the model.
Second, the BF also strongly favored model1 over model2 (BF = 67.91). Additionally, model1 yielded lower AIC and BIC values compared to model2, indicating a better fit to the data. Further comparison between model2 and model3 indicates that model3 exhibited higher AIC and BIC values than model2 and the BF very strongly favored model2 over model3 (BF = 23,277.67). In summary, the data provided stronger support for the simpler model1 over the more complex model2, and model2 over model3. This is in line with the likelihood ratio tests, suggesting that including interaction terms did not significantly improve the model.
Table 3 reports a summary of model1 in an analysis of deviance with Type III Wald chi-square tests for each predictor in the model (obtained with the Anova function from the
car package). This revealed a significant effect of session (χ
2 = 33.65,
p < 0.001) on target responses, such that participants in the experimental session produced target (primed) responses in the passive and DO structures significantly more often than in the baseline session. Moreover, there was a significant effect of construction (χ
2 = 20.51,
p < 0.001), indicating that target structure use was higher for dative than transitive forms. Also, across sessions, there was no significant effect of speaker_source (χ
2 = 2.70,
p = 0.099). Note that this last effect is meaningless per se, as the baseline session did not include a manipulation of speaker_source and subjects were randomly assigned to each speaker_source condition.
In summary, as model2 and model3 are not better than model1, the conclusion one can derive from inferential model comparison is that there was a general priming effect from baseline to experimental session, but this was not modulated by the number of speakers who produced the priming sentences.
4. Discussion
We asked whether the implicit selection of syntactic structures in adult participants engaged in a multi-party dialogic task is influenced by the number of individuals who primed such structures. Two main hypotheses were put forth. Firstly, we expected all participants to exhibit a syntactic priming effect from the baseline session of no priming to the following experimental session. The inferential statistics models, including model1 which fits our data best, indicate that participants were primed from baseline to experimental session. This finding contributes to the existing literature demonstrating the priming effect in spoken production (for a review, see
Mahowald et al. 2016). Furthermore, our study adds to the literature on syntactic priming in an online setting (
Corley and Scheepers 2002;
van Lieburg et al. 2023). It demonstrates that the phenomenon of syntactic priming persists even in conditions where both the experimental environment and participant selection are relatively less controlled, compared to a typical laboratory-based study. Observing a priming effect in both groups was in line with our expectations. These results highlight the robust and highly replicable nature of syntactic priming.
Second, we expected to find a greater priming effect in MSP compared to SSP condition, under the hypothesis that the adoption of syntactic choices is a form of complex contagion from multiple speakers. Contrary to our expectations, there was no difference in the degree of priming between MSP and SSP conditions. Likelihood ratio tests, as well as AIC and BIC values, confirmed these results by showing that including interaction terms did not significantly improve the model. BF analysis, which quantifies the relative evidence for one model over another, also further supported the results by favoring model1 over the more complex model2 and model2 over model3. The inclusion of two-way interaction terms in model2 and a three-way interaction term in model3 did not improve the model’s fit based on the data.
The absence of an effect related to speaker variation in our models prompts several considerations about the specific task we used. One plausible factor could be attributed to the online nature of the study. Simulating an online social game that convincingly delivers the feeling of genuine social interaction for participants can be challenging. During real-life interactions, coordination of body postures, gaze patterns (
Shockley et al. 2007,
2009), and temporal coordination (
Verga and Kotz 2017) typically occur between conversational partners. In the current study, participants communicated by sending each other their responses in the form of video messages, which lacked the immediacy of real-time interaction. Therefore, certain key elements inherent to face-to-face conversations were absent in our study’s design. In addition, the physical presence of another person can affect the linguistic behavior of the person. A recent study found that the physical, but not virtual, presence of others potentiates implicit learning (
Sarasso et al. 2022). This finding would suggest that an effect of speaker variation on syntactic priming may be visible only during in-person contact. However, this finding would be at odds with the literature on complex social contagions of ideas, where most of the findings were obtained via online experiments, where participants did not even interact face-to-face (e.g.,
Centola 2010).
In addition to the missing elements of face-to-face communication, previous research emphasized the role of the task on syntactic priming.
Reitter et al. (
2006) investigated the effect of syntactic priming in a dialogue where interlocutors either aimed to engage in casual conversation or collaborated on a shared task. Syntactic persistence effects were more pronounced in task-oriented interactions compared to casual conversations. In the current study, the communicative goal was essential to examine the effect of multi-party group on syntactic choices. The aim was to create an environment in which participants felt they were working together to achieve a common goal within their group. However, participants might not feel that they were part of a group that was trying to reach a common goal. Instead, they may have viewed their participation as merely playing a game without a clear collective purpose. Alternatively, their experience might have resembled a series of dyadic interactions rather than collaborative group work, potentially leading to the dominance of dyadic turn-taking dynamics over the intended multi-party nature of the experiment. Such deviations from the intended group-oriented setup could compromise the study’s aim of investigating the influence of multi-person interactions on linguistic behavior, potentially explaining the lack of evidence for an effect of speaker variability.
We suggested that the syntactic choices of an individual act as a form of complex contagion, where social context is important for linguistic behavior and the number of sources would lead to a difference in linguistic behavior. Also, by basing our hypothesis on the implicit learning account (
Chang et al. 2006), we suggested that exposure to a syntactic structure from multiple individuals could reinforce more substantial cognitive adjustments, leading to an increased tendency to incorporate that structure into one’s own language use compared to exposure from a single individual. However, our results may imply that syntactic priming is a form of automatic behavior that is less sensitive to contextual and social situations (
Pickering and Garrod 2004;
Pickering and Garrod 2006), here whether the source of priming comes from one versus many different speakers. In that respect, it would suggest that this form of linguistic behavior might diffuse through a social network in a manner more like a simple form of contagion. It could be that the cognitive weighting of utterances is not affected by the number of speakers the individual interacts with.
Centola (
2019, p. 90) reported that behaviors can spread as a form of simple contagion, as well as complex contagion. He argued that low-cost behaviors spread more effectively through networks of simple contagions, as the barriers to participation in low-cost behaviors are primarily associated with access rather than resistance. Even minimal contact with an individual exhibiting a specific behavior can offer social incentives to induce behavioral change. In this case, syntactic adaptations in terms of active/passive voice or the choice of the alternating form of a dative could be low-cost behaviors for fluent adult speakers to induce priming by a single speaker. However, language is a highly complex communication system with multiple levels and can be affected by numerous interplaying factors. Several studies showed that social context affects language learning and linguistic behavior (e.g.,
De Felice et al. 2021;
Sarasso et al. 2022;
Schoot et al. 2019). Additionally, group size has been shown to influence people’s opinions about the discussed topics in that group, suggesting characteristics of the group affect communication (
Fay et al. 2000). It is evident that social context influences linguistic behavior. Further research is essential to gain a more comprehensive understanding before reaching a conclusion on whether syntactic behavior is sensitive to speaker variation.
Another finding was that the use of PO dative structures was more prevalent among our participants compared to the use of DO forms. While
Bock and Griffin (
2000) suggested that American English speakers have a 2:1 bias toward DO forms, different studies have shown a tendency for PO dative structures over DO datives (
Corley and Scheepers 2002;
Pickering and Branigan 1998;
van Lieburg et al. 2023). In the current experiment, 36 dative verbs were used, primarily taken from the study of
Scheepers et al. (
2017). In their study, participants were about twice as likely to produce PO datives rather than DO structures overall. A similar trend was mirrored in the current experiment, where speakers have a 3:1 bias toward the PO construction. Thus, the higher production of PO structure in our study aligns with the findings of
Scheepers et al. (
2017).
Preference for DO or PO structures may be related to the probabilistic nature of the verbs that were used in the study. Frequency-based properties of words and words within specific constructions have been demonstrated to be relevant to various linguistic and psycholinguistic issues and models. Certain verbs exhibit a tendency to occur in a specific structure (
Gries 2005). In the present experiment, the probabilistic properties of words may lead to a greater bias towards the use of PO construction. For instance, prior studies have identified ‘give’, ‘show’, and ‘offer’ as significant collexemes of the dative construction, while ‘sell’ and ‘hand’ are significant collexemes of PO datives. Moreover, ‘lend’ and ‘send’ did not show a significant preference for either construction (
Gries 2005). The verbs used in this experiment may be significant collexemes of PO datives. This is consistent with the similar proportions of PO and DO forms found in both our study and
Scheepers et al.’s (
2017) research, given that most of the verbs used in both studies were the same. Another factor that could influence participants’ preferences for DO and PO structures is the presented visual stimuli in the task. A previous study found that the left–right orientation of pictures has a significant effect on how participants began describing the pictures (
Gleitman et al. 2007). In this experiment, most of the pictures depicted the object positioned between the agent and the recipient, with the agent typically located on the left side, thus mirroring the left-to-right order of PO form. Consequently, the order of objects in the pictures may contribute to an increased likelihood of participants using PO structures over DO.