Beyond “I Didn’t Do It”: A Linguistic Analysis of Denial in US Legal Settings

Didoni, Elena; Combei, Claudia Roberta

doi:10.3390/languages9110351

Open AccessArticle

Beyond “I Didn’t Do It”: A Linguistic Analysis of Denial in US Legal Settings

by

Elena Didoni

¹ and

Claudia Roberta Combei

^2,*

¹

Laboratorio di Neurolinguistica e Pragmatica Sperimentale (NEP), Dipartimento di Scienze Umane e della Vita, Istituto Universitario di Studi Superiori (IUSS), 27100 Pavia, Italy

²

Dipartimento di Studi Umanistici, Università di Pavia, 27100 Pavia, Italy

^*

Author to whom correspondence should be addressed.

Languages 2024, 9(11), 351; https://doi.org/10.3390/languages9110351

Submission received: 31 July 2024 / Revised: 1 October 2024 / Accepted: 11 November 2024 / Published: 19 November 2024

(This article belongs to the Special Issue New Challenges in Forensic and Legal Linguistics)

Download

Browse Figures

Versions Notes

Abstract

:

This paper investigates the multimodal manifestations of denial in US legal contexts, in the English language, by analyzing police interviews and cross-examinations. The research uses a 13-h corpus of video recordings portraying five male suspects, aged 20–44, eventually charged and convicted of femicide. We deploy techniques from conversation analysis, multimodal analysis, and speech processing, using tools like ELAN, Praat, WebMAUS, and Python libraries to transcribe, annotate, and analyze audio–video data. This exploratory study identifies several recurring patterns in prosodic and gestural cues associated with denial. In particular, our results indicate a prototypical multimodal denial characterized by a predominant gestural component: head positioning (neutral or lowered) and head shaking. This gestural expression is frequently repeated and can also function independently as a nonverbal marker of denial. Denial is also often accompanied by open-hand gestures, sitting upright posture, and a certain degree of vagueness in speech. Furthermore, our findings suggest that the expression of denial often involves a reduction in pitch and intensity following the confession or indictment. The analysis of pauses before denial instances reveals that a greater number of pauses typically occurs after incrimination. Overall, this study shows that there is an interesting interplay between verbal and nonverbal features of denial in legal interactions, underscoring the need for further analysis.

Keywords:

denial; multimodality; forensic linguistics

1. Introduction

Negation represents a fundamental aspect of human language that is rooted in cognitive processes. The expression of negation begins in early childhood (Morris 2003) and, as Horn (2010) claims, it is an essential device of the communicative system, since it furnishes speakers with the tools for denial, contradiction, misrepresentation, deception, and irony. Abandoning the simplistic view of negation as a mere binary operator that assigns truth values, recent research has described negation as a complex cognitive, linguistic, and logical device displaying complex syntactic, semantic, and pragmatic properties and functions (Prieto and Espinal 2020).

Indeed, languages possess morphological, syntactic, and semantic mechanisms that allow speakers to express negation verbally. In face-to-face communication, verbal expressions of negation are frequently supplemented by nonverbal cues, such as prosody (e.g., intensity, pitch, etc.) and gestural behavior (e.g., hand gestures, shoulder shrugs, etc.). These nonverbal devices can also operate independently as, for instance, head shaking is associated with negation in certain cultural and linguistic contexts.

The evolution of negation spans from the basic act of refusal, a communicative behavior that is already present in early stages of language development and is shared with animals, to a sophisticated range of conceptually grounded uses exclusive to human beings. Actually, negation serves a variety of communicative purposes, including the expression of falsity, absence, non-existence, denial, rejection, and correction (Vandamme 1972). In this respect, Roitman (2017) refines the perspective by asserting that linguistic negation encompasses three primary meanings: non-existence, rejection, and denial.

The scope of this paper is to examine one of these three specific functions, namely ‘denial’. We will focus on the distinctive realizations of denial analyzed from a multimodal perspective (i.e., gestures and prosody). Pragmatic aspects, such as how suspects refer to victims during denial instances, are also briefly considered, though not with the same systematic approach we conducted for the analysis of prosody and gestures.

Before looking at denial through the lens of multimodality, it is necessary to situate it within the broader context of negation. Hummer et al.’s (1993) study indicates that, from a developmental point of view, denial emerges later than other functions of negation, as it needs the simultaneous representation of two mental models: one reflecting the true state of the world and one reflecting its false counterpart. Equally significant is the study by Ripley (2020) that asserts that denying a certain claim involves performing an act that introduces new information, namely that the claim is ruled out. More broadly, van der Sandt (1991) defines denial as a means of objecting to utterances produced by previous speakers. In this paper, we use the term ‘denial’ to refer to a speech act encompassing verbal and/or nonverbal elements employed by a speaker to object to or correct the form, content, presuppositions, and implicatures of an utterance (Combei 2023).

This operationalization of denial allows us to examine how denial is encoded multimodally—through prosody and bodily conduct—and how this has been investigated in the literature. First of all, an important contribution to the study of multimodal denial is Harrison’s (2018) monograph which argues that negation has clear grammatical and gestural manifestations and that there are regularities between the two elements in human communication. On a similar note, Bressem and Müller’s (2017) study on multimodal patterns of negation indicates that recurrent gestures display a fixed form–meaning pairing. It has also been mentioned that multimodality can influence the speech act of denial and their associated belief statuses (Combei 2023). Moreover, the review by Prieto and Espinal (2020) indicates that denial is expressed through various prosodic and gestural features across natural languages, mentioning, in particular, the use of high tones in tonal languages and pitch accentual prominence in intonational languages.

Equally interesting are the studies that explore denial as a deception mechanism from a multimodal perspective, including more recent attempts to automatically detect it. One of the first large-scale multimodal studies on deception is the work by Buller and Aune (1987). They investigate how deceivers manage nonverbal cues to convey nonimmediacy and create a positive image, while simultaneously revealing signs of arousal and negative affect. Buller and Aune’s (1987) research, involving 130 participants, claims that deceivers display nonimmediacy and arousal but fail to project a positive image. Additionally, the study indicates that deception cues are influenced by relational history and exhibit significant variability over time. Deceivers also appear to actively regulate their nonverbal behavior, attempting to suppress signs of arousal and negative affect.

A study by Vrij et al. (1996) explores how liars are often unaware that they reduce their movements during deception. Their research aims to determine how deceivers might respond if informed about this rigidity and how factors like tension, behavioral control, and cognitive effort relate to deception. In their experiment, subjects participated in two interviews: one truthful and one deceptive. In the information-present condition, participants were informed beforehand that deception typically involves decreased movement, while the information-absent condition provided no such insight. The findings show that, despite participants believing they increase their movements while lying, they actually exhibit a decrease. Interestingly, informing deceivers about deceptive behavior has no impact on their movements. The authors claim that the decrease is linked to efforts by deceivers to control their behavior and cognitive load, rather than the tension they feel.

Moving to NLP approaches, Soldner et al. (2019) note that deception frequently occurs in everyday conversations, yet conversational dialogues remain underexplored in the field of automatic deception detection. To fill this gap, their paper focuses on detecting multimodal deceptive cues in conversational settings. They introduce a multimodal dataset featuring deceptive conversations from the Box of Lies game on The Tonight Show Starring Jimmy Fallon, where participants attempt to discern whether their opponent’s object descriptions are truthful. The authors annotate various multimodal communication behaviors, including facial expressions and linguistic cues, and derive several features from these annotations. Initial classification experiments yield promising results, significantly outperforming both random and human baselines, with an accuracy of up to 69% in differentiating between deceptive and truthful behaviors.

Similarly, Jaiswal et al. (2016) present a data-driven approach for automatic deception detection using audio–video data from real-life trials in legal contexts, focusing, among other things, on visual and verbal cues of denial. They employ OpenFace for facial action unit recognition to analyze witnesses’ facial movements during questioning, and OpenSmile to study acoustic patterns. Additionally, the authors conduct a lexical analysis of the spoken words, focusing on pauses and breaks, and feed this data into a Support Vector Machine for deception prediction. They also explore a method that fuses visual and lexical cues through string-based matching. While human judgment accuracy ranged from 53% to 60%, their automated system achieved an average accuracy of 78.95%, with higher accuracy in truth videos (81.10%) than in deceptive ones (76.80%).

As the brief literature review above suggests, previous research has demonstrated that gestures play a significant role in shaping and emphasizing denials, functioning as complementary elements to verbal negation (Harrison 2009). To sum up the overview presented in this section, the multimodal characteristics associated with denials include, among others, head shaking, finger shaking, and palm-down hand gestures (Kendon 2002, 2004; Harrison 2010).

Building upon the research outlined above, this paper seeks to examine the multimodality of denial exhibited in English-language discourse within legal settings in the United States, with a specific focus on individuals accused of femicide (and eventually found guilty). We expect to identify distinctive and systematic patterns of prosodic and gestural features that characterize denials in these specific contexts. The findings of this exploratory analysis may contribute not only to improving our understanding of denial as a linguistic phenomenon, but also to uncovering how it is conveyed through a combination of verbal and nonverbal cues in legal contexts.

The rest of this paper is structured as follows: Section 2 presents the aims, motivations, and scope of the study; Section 3 explains the corpus and methodology; Section 4 outlines the results; and Section 5 provides a discussion of the findings, addresses the limitations of this study, and offers concluding remarks and future directions for our work.

2. Motivations and Aims

This paper presents a qualitative study that is part of a broader research endeavor exploring the multimodal dimension of denial within the legal sphere across the United States. A portion of this larger project, focusing on different data and excluding prosodic analysis, has already been published in Combei (2023). To validate and build upon the findings of the previous study, the present work examines gestural as well as prosodic discursive strategies used by femicide suspects to deny their involvement in crimes during post-crime interactions, such as police interrogations and cross-examinations in courtroom proceedings. The analysis of the suspects’ discourse may, in fact, uncover the complex ways in which gendered violence is implicated in denial. This section will explain the rationale of examining the linguistic phenomenon of denial in this specific context, the importance of adopting a multimodal approach in this investigation, and what we aim to achieve with this study.

We concentrate on a specific legal context in which denials of involvement in femicide are uttered, namely situations where the suspect is acquainted with the victim. For the purposes of this paper, femicide is understood as “the killing of women and girls because of their gender” (United Nations 2013, p. 2), as was defined on the International Day for the Elimination of Violence against Women and the Vienna Declaration on Femicide. It should be stressed that femicide differs from general homicide as it is characterized by a disproportionate prevalence of intimate partner violence, familial abuse, and power imbalances (e.g., at home, at work) between victims and perpetrators.

This research centers on the discourse of suspects of femicide precisely because of their close relationship with the victims. We focus on suspects that know the victim well because they may deploy denial strategies that reflect the complex nature of their relationship with the victim (e.g., not admitting or trivializing the severity of the crime, deflecting responsibility, and shifting blame). More generally, analyzing the discourses of this kind of suspect may enhance our understanding of the dynamics of gendered violence and the ways in which such crimes are contested or minimized.

As mentioned above, our study adopts a multimodal perspective on denial, an aspect typically overlooked in forensic linguistics. The term ‘multimodality’ is used here in accordance with its understanding within the field of conversation analysis and following Mondada’s (2016, p. 338) definition as “the various resources mobilized by participants for organizing their action—such as gesture, gaze, facial expressions, body postures, body movements, and also prosody, lexis, and grammar”.

As Wang (2024, p. 163) notes, research on legal discourse from a multimodal perspective remains limited, and while gesture studies are advancing in theory and methodology, empirical research in forensic linguistics is still scarce, especially in the area of examining stance in legal discourse through gestures. Some notable exceptions that consider multimodality in analyzing discourse within legal contexts are the studies by Gregory Matoesian. For example, Matoesian and Gilbert (2016) illustrate the importance of multimodal and material actions that accompany speech, showing how attorneys use hand movements, physical objects, and verbal communication to emphasize key pieces of evidence for the jury. The authors also provide a theoretical framework explaining how beat gestures and material objects align with speech to enhance rhythm and highlight points of evidential significance, while also evoking semantic imagery.

The scarcity of multimodal research on legal language is likely attributable to the complexity and time-consuming nature of such analyses, which add to the challenges inherent in investigating legal discourse and content in general. In particular, multimodal analysis of spoken legal language requires the transcription and annotation of a wide range of features, including overlaps, pauses, hesitations, and bodily conduct. In addition, each of these features must be categorized into various classes, each comprising multiple levels (see Section 3 for an example).

Even though we acknowledge the challenges inherent in multimodal analysis, we believe that a close examination of nonverbal features offers a more comprehensive understanding of denial within legal interactions, such as those between suspects and law enforcement. In this regard, we follow Matoesian (2010, p. 541), who asserts that verbal and nonverbal elements function as “co-expressive semiotic partners—as multimodal resources—in utterance construction and the production of meaning”. Indeed, the multimodal analysis of discourses produced within legal settings may be useful to better outline the suspects’ profiles. With this in mind, our study investigates denial, aiming to describe how suspects negotiate credibility through multimodal resources as well as verbal strategies, before and after a confession or indictment. To this end, the following research question guides our exploratory research: How do suspects of femicide deny allegations?

3. Data and Methods

3.1. Materials

Due to the inherent sensitivity of the data, forensic linguistics corpus collection, storage, access, and distribution are often restricted by privacy legislation (Larner 2019). The ease of access to police recording of custodial interrogations or court data varies from country to country, contingent on the stringency of the pertinent data protection laws. In general, however, building and storing a corpus of forensic data is challenging. For instance, the jurisdictions of Italy and Great Britain impose strict limitations on the accessibility of this type of data (Petyko et al. 2022). As regards the United States, the issue appears to some extent less complex, even though the audio–video recording process and data availability may vary in scope and by state (Bang et al. 2018). A large quantity of forensic multimedia data, such as police interviews, interrogations, cross-examinations, and trials, useful for linguistic analysis can be accessed via online platforms like YouTube. For these reasons it was decided to work with a corpus of multimedia data from North America, in English, and freely retrievable from online sources. The intent was to be able to retrieve an easily accessible dataset that would allow for a focused study of the gestures and prosody of denial.

The entire corpus comprises ten videos sourced from websites and open-access YouTube channels, including Fifth Estate, Red Circle Interrogations and Confessions, Law & Crime Trial Network, and Macon Telegraph Archive1. Five North American suspects, aged 20–44, and accused of femicide are portrayed in the videos; all of them were eventually deemed guilty and convicted. In each instance, the perpetrator was either a close family member or had a close and/or intimate relationship with the victim (husband, boyfriend, or son-in-law). In addition to the accusation of femicide, all the suspects denied the charges on several occasions, some even during and after the trial, appealing the jury verdicts. Four of the suspects were recorded during police interviews. In one case, a suspect was recorded during cross-examination while his trial was in progress.

Initially, the decision to examine denial in two distinct legal contexts (police interviews and cross-examinations in courtroom proceedings) was driven by the goal of conducting a comparative analysis. This comparison was intended to explore how denial functions under different questioning situations. However, as the study progressed, we encountered significant challenges in gathering data from courtroom proceedings, which are scarce, or are not available as high-quality recordings. Given the exploratory nature of our study and its qualitative focus, we adapted our approach. Despite the imbalance in the corpus, we chose to retain the available cross-examination data, recognizing their value in contributing to our understanding of the denial phenomenon, even with a smaller sample size.

The corpus comprises a total of 10,655 tokens, corresponding to a duration of over thirteen hours of audio–video material. The duration and number of tokens in each video were determined by data availability and are, thus, independent of the research design and methodologies implemented. The audio quality of the videos is satisfactory, generally allowing automatic speech processing and analysis of the data. In terms of image quality, some of the data are less satisfactory, and this was reflected in some results (see Section 4). Even if all the videos were recorded in color, in some cases the image resolution was insufficient for the analysis of certain parameters, such as the subtle and swift movement of the eyes and eyebrows. Moreover, although the videos are publicly accessible, all identifying information, including names, sensitive details, and geographic references, were redacted, anonymized, or renamed.

3.2. Methods

The corpus data were used to pursue the examination of gestural manifestations of denial and the analysis of prosody associated with it, before and after the incrimination or admission of guilt. Some aspects related to the pragmatics of referencing the victim and the crime were annotated as comments. The data were processed in accordance with these research directions, so the implementation of distinct procedures was needed. These steps are summarized below, and each of them is discussed in greater detail in the following paragraphs.

To obtain audio data useful for the analysis of prosodic cues, the .mp4 video files were converted to .wav files using VLC Media Player and Audacity2. The resulting audio files were divided into approximately 10-min samples to facilitate the forced alignment process, the .TextGrid creation, and the automatic annotation of pauses through an Automatic Speech Recognition (ASR) pipeline, provided by WebMAUS Services (Kisler et al. 2017).

The pipeline outputted ninety-nine .TextGrid files corresponding to each .wav file considered. Subsequently, a Python script was employed to automatically identify and extract the number and the duration of each pause. The Python script produced Excel spreadsheets, which were used to store the values related to the pauses. The manually extracted information from the .TextGrid files using Praat (Boersma 2001) regarded pitch and intensity. Manual pitch and intensity analysis was preferred in this case, due to the inherent error susceptibility of automated approaches, particularly when considering the quality of the data at hand. The output of the ASR allowed us to use the automatic transcription of the speech as a base for examining the verbal dimension of denial.

As concerns gestural resources, the .mp4 files were processed directly, having been previously annotated with ELAN software3. An annotation scheme was designed and implemented using ELAN, with multiple tiers allocated to distinct components of gestural manifestation. Each audio–video track was the object of complex annotation and analysis, with the focus on the conversational turns of the suspect (the process is detailed below).

3.3. Gestures: ELAN and the Annotation Scheme

This study employed ELAN for gestural annotation. A custom annotation scheme was developed to categorize bodily conduct across multiple tiers. Each tier corresponded to a distinct, predefined element created with the controlled vocabulary feature on ELAN. The MIT Boston Speech Communication Group’s ‘Gesture Coding Manual’4 was chosen as the annotation scheme for hand gestures. The other features were annotated using the annotation scheme detailed in Combei (2023). We also considered the Linguistic Annotation System for Gesture (LASG), proposed by Bressem et al. (2013) for the annotation of our data. Although well-structured and articulated, we decided not to adopt this annotation scheme because it was too refined, and it took into account some linguistic parameters that were outside the scope of this research (such as syntactic or semantic aspects). At the same time, to the best of our knowledge, LASG lacked annotation patterns for other bodily parameters considered in our study (e.g., head movement, eyebrows, etc.). However, we acknowledge the fact that it would be useful to use LASG for a different, more complex gestural annotation, both to verify the goodness of the scheme we adopted and to explore parameters of multimodality that could not be included in this research. Since this qualitative study relied on a single annotator, future work should involve multiple annotators to measure inter-annotator agreement.

For the purposes of this research, any movement, shape, or orientation expressed by the suspects when uttering a denial act was considered to be a relevant gesture. The types of gestures of interest were restricted to those of the hands (especially their shape and positioning), the head and its movements, the direction of the gaze, the micro-movements of the eyebrows (when analyzable), and the posture during interrogations with police officers or cross-examinations in court. The annotation scheme was designed with six main tiers, each of which was associated with a specific gesture parameter:

‘Suspect HG’ (hand gesture): describes the shape of the suspect’s hand gesture while he is uttering the speech act of denial;
‘Suspect Gaze’: describes the suspect’s gaze direction while he is uttering the speech act of denial;
‘Suspect Eyebrows’: describes the suspect’s movement of the eyebrows while he is uttering the speech act of denial;
‘Suspect Head’: describes the movement of the suspect’s head while he is uttering the speech act of denial;
‘Suspect Posture’: describes the suspect’s body position while uttering the speech act of denial;
‘Handedness’ (dependent tier of the parent tier ‘Suspect HG’): specifies whether one hand or both hands were used to execute the annotated gesture.

Diverging from the annotation scheme developed by Combei (2023), we did not include the ‘legs’ feature. Due to current resource constraints, we focused our efforts on the upper body and hand movements, ensuring a more in-depth examination of these areas for our research. In fact, we updated the ELAN controlled vocabulary for all of the intra- and inter-suspect recurring movements and positions that were not portrayed in the ‘Gesture Coding Manual’ (such as ‘arms crossing’, ‘counting’, ‘measurement’, ‘pinch’).

Furthermore, a tier named ‘Suspect’ was added for each video to collect the verbal transcript of the suspects’ speech (statements). This was used for transcribing their verbal expressions of denial. A tier for comments was also included on the annotation, which was used to highlight relevant elements or findings that went beyond the established labels and annotation scheme. Observations regarding the pragmatics of the suspects’ discourses (in particular the way victims and crimes were referenced by the suspects) were also indicated in the comments tier. In order to ensure consistency and facilitate comparison between suspects, the same annotation scheme was used for all videos.

3.4. Prosody

Regarding the prosody of denial, Praat was employed as a tool for speech processing and analysis. Praat functionalities for pitch and intensity analysis were exploited to extract statistic descriptors related to prosodic parameters inherent to the episodes of denial uttered by the suspects. First of all, the intensity was normalized across all videos. Then, minimum, maximum, and average values of pitch and intensity were manually extracted for each instance of denial. To extract these values, we defined the boundaries of each ‘denial’ instance based on the discursive unit of the suspect. In particular, we considered the discursive unit to be the utterance in which the denial—whether verbal and/or gestural—occurred, extending up to the next pause in the interaction. This approach guaranteed that each denial was analyzed within its immediate context, capturing the correct communicative intent of the suspect.

In terms of speech processing and annotation, Praat was also used to control the pipeline output and check the automatic annotation of pauses. The algorithm’s accuracy in identifying the start and end of each pause was evaluated qualitatively and manual intervention was used to correct segment boundaries when necessary. Two primary categories of errors were identified. In the first case, the algorithm failed to accurately identify the onset and conclusion of spoken sequences, resulting in the misclassification of longer segments as pauses. In the second case, the error was more nuanced, involving the inclusion of vowels within the pause segment because the phonation was not correctly captured. All these issues were corrected manually.

4. Results

The research findings will be organized as follows: Section 4.1. will provide a general overview of the analysis with some information regarding the multimodal annotation, the pauses, and some pragmatic observations. Then, Section 4.2 and Section 4.3 will be dedicated to gestural and prosodic analysis, respectively.

4.1. General Overview

Table 1 provides a summary of some general results derived from both automatic (i.e., pauses) and manual (i.e., verbal denial) feature annotation. The number of pauses reported for each video depends on the length of the file. The count includes pauses of all types: from those occurring within the same conversational turn to those occurring between the conversational turns of the suspect and the police officer/lawyer/judge. Thus, we considered both ‘pre’-response pauses occurring before the suspect’s reply to the official’s question and ‘post’-response pauses that occur while awaiting the next question or the completion of the suspect’s response. In the analyses presented in Section 4.3, we only considered pauses occurring before the suspect’s response to questions posed by police officers and lawyers.

Regarding the manual annotation, the fourth and the fifth columns are dedicated to general denials and femicide denials, respectively. This distinction was introduced to account for two-fold manually performed data processing. First, all denial cases encountered during video listening and viewing were annotated, regardless of their degree of relation to the events closely connected to femicide episodes. Subsequently, a manual verification was conducted on these annotations to identify denials expressed by the suspects specifically regarding accusations of committed murder, fictitious statements about the murder weapon, innocence in the matter, concealment of bodies, etc. The column labeled as ‘general denials’ was included in Table 1 for the purpose of comparison with the column ‘femicide denials’. The latter regards the number of denials associated with falsehoods identified in police interviews. This is because isolated denials strictly related to femicides, reported in the fifth column, all turn out to be fictitious denials, intended to distort the reality and avoid a guilty verdict.

‘Femicide denials’ were identified among the ‘general denials’ using the following categories as selection criteria: denials related to the timeline of events (for all the events related to the day of the murder itself), specific denials related to the murder weapon (e.g., gun, knife), and denials related to the harm done to the victim (e.g., physical assaults, body concealment). This differentiation between ‘femicide denials’ and ‘general denials’ has allowed us to distinguish more clearly between the general use of denial in the forensic context (e.g., the suspect’s response “No” to the officer’s question “Would you like a glass of water?”) and the use of denial for aspects strictly related to the femicides. Below are examples for each identified category of ‘femicide denials’ to provide insight into the observed data and how it is classified.

Timeline of events
- Lawyer: Were you in the office when the woman was killed?
  A.B.: No, I wasn’t in the office.
- Police officer: So why would you call her if you were in the same house. From ten o’clock on. We are not making it up.
  I.J.: No, I’m just saying I’m not recalling this you are talking about.
Murder weapon
- Police officer: Do you have a gun?
  K.L.: I’ve never touched a gun before.
- Police officer: Did you have other experience where you just wake up and you don’t know what happened? Like ‘I just woke up and here I am, there was a gun and there was a knife, and drugs and I don’t know what was going on’, you know, and I understand that.
  O.P.: I’ve never touched these knives. These knives they were just there. I’ve never—I’ve never touched them.
Harmed victim
- Police officer: You know man, this is stuff we need to know to figure out what’s going on.
  M.N.: I didn’t try to, I didn’t want to, I didn’t mean to at all!
- Police officer: So what really happened that day?
  O.P.: I didn’t do anything. I didn’t hurt anybody!

An exploratory review of the transcribed and annotated data revealed some interesting instances of pragmatic choices that align with findings in the work of Combei (2023), providing validation of both studies. In particular, there is an almost complete absence of direct references to the names of the victims in the discourses of the suspects. An educated guess for this vagueness is that the suspects strategically avoid naming the victim to deflect attention and minimize their emotional engagement as well as the consequences of the crime.

The victims’ names occur in only three cases throughout the entire 13-h corpus. In example (4), the victim’s name is uttered as a response to a direct and explicit question from the police officer, in example (5), the name appears only as an appellation used in reported speech, and finally in example (6), the name is uttered as a violent way to distance oneself from the accusations made by the police officer. In all other instances where the suspects mentioned the victim, they used anaphoric expressions, typically referring to the victims with third-person singular pronouns (i.e., she or her).

4.: Police officer: And what’s your wife’s name?
K.L.: Claire.
5.: Police officer: And what did you do next?
O.P.: We said like “Have you talked to Jo?” I was like “No, have you talked to Jo?”
6.: Police officer: O. you are under arrest for murder right now. The murder of Johanna.
O.P.: I didn’t murder Johanna! I don’t.

On the same note, another interesting pragmatic aspect is the lack of direct references to the act and the result of killing in the suspects’ discourses. In fact, in the annotated speech (Suspects tier), the word ‘murder’ appears only once (uttered by one suspect), while terms like ‘death’, ‘to kill’ (0), and ‘dead’ are entirely absent from the dataset. Instead, we frequently find generic pronouns, names, verbs, or other anaphoric expressions used to refer to the crimes and their consequences. For instance, terms such as ‘it’ (126), ‘that’ (117), and ‘anything’ (75) occur frequently, as expected, especially following explicit references to the crime made by police officers. In this case, the vagueness could also be interpreted as both a mitigation strategy (i.e., it lessens the weight of the crime) and a detachment strategy from the victim.

4.2. Gestural Analysis

Regarding the gestural manifestation of denial, as detailed above, we annotated all videos based on posture, hand gestures, gaze, eyebrows, and head movements. The output of the annotation process allowed us to extract the most frequent features for various bodily characteristics,5 namely ‘front’ for head position, ‘sitting erect’ for posture, ‘towards other speakers’ for gaze, ‘open’ for hand gestures, and ‘both’ for handedness. It should be mentioned that the frequency of occurrence of each type of feature is influenced by the different sizes of the five subcorpora; in particular, the disproportionate amount of data annotated for the O.P. suspect skews the final count of each entry. To address this issue and account for the specific distribution of the gestural features, information related to the subcorpus of each suspect considered is reported in Table A1 in Appendix A.

Given the frequency of gestures and the acknowledged imbalance in our data, the results, though interesting, should be interpreted with caution. That being said, our findings largely point to a prototypical multimodal expression of denial, characterized predominantly by a simple gestural apparatus involving the head either in a frontal position or lowered downwards (presumably to obscure the gaze). Head shaking occurs parallel to the downward head movement. This gesture occurs multiple times even autonomously, serving as a paraverbal signal of denial without any verbal expression. Following the head gestures, the gaze is engaged, primarily in the configuration ‘towards other speaker’ (mostly in concordance with the head in a frontal position) or avoiding, opposing the interlocutor’s gaze by maintaining a closed-eye configuration throughout the denial. Additionally, similar to the ‘avoiding disposition’ that characterizes closed eyes, there is the ‘down’ configuration, predominantly occurring with a ‘frontal’ head position.

The significant number of cases where annotation of eyebrow-related traits was not feasible (due to the video recording quality or the angles) makes this parameter challenging to assess. This highlights the importance of using high-quality multimedia material in multimodality studies. The most suitable situation for annotating eyebrows was found in the video of suspect A.B., filmed in the courtroom, where the high-quality close-up footage allowed precise observation of facial expressions, shapes, and movements. Despite technical issues, it is interesting to note that, in the cases where these features could be analyzed, denial did not manifest through eyebrow movements support (‘relaxed’ featured 179 occurrences). Nevertheless, paradoxically, within this dataset the feature ‘frowning’ often occurs, denoting a very specific movement typically associated with negative emotions.

The most common posture associated with denial is the suspect seated, often upright (erect), and facing the interviewer. However, this posture is prevalent across the entire corpus and is not exclusive to denial scenarios. An interesting pattern in the corpus involves suspect O.P., who frequently adopts a forward-leaning posture, particularly after the confession, conducting much of the interview hunched over. Since this posture is especially traceable in the second phase of the interviews, one plausible interpretation is that the suspect may feel increasingly pressured.

The situation concerning hand gestures is more complex. To begin with, we will focus on the less complex findings. Our observations indicate that the majority of manual gestures associated with denials are predominantly performed with both hands simultaneously. It would be interesting to investigate whether this aspect is specific to the multimodality of denials (and the forensic contexts) or if what is observed is a general trend in gestural expression. Next, moving to more complex aspects of hand gestures, we found that the most frequent categories for hand gestures are the forms ‘open’, ‘relaxed’, and ‘arms crossed’. We have intentionally excluded the ‘fist’ hand gesture from what we report as prototypical denial gestures, despite their frequency. In fact, the ‘fist’ label appears predominantly in O.P. videos and is therefore more indicative of individual expression of denial rather than representative of broader patterns of this phenomenon. The ‘handcuffed’ label is excluded from our analysis because the mere presence of restrained hands does not qualify as a gesture. Handcuffs represent a state of limitation rather than an intentional communicative action. During the final “verdict” we traced an open-hand gesture performed with both hands while keeping the head in a frontal position in relation to the interlocutor. This is accompanied by a gaze mostly directed towards the interlocutor, while seated in an upright position.

As shown in Table A1 in Appendix A the ‘not available’ category of gestures is very frequent as a result of A.B.’s framing in the video, which predominantly features close-up shots that obscure the visibility of arms and hands. It is important to mention that the ‘not available’ label applied to all types of gestures (e.g., posture, gaze, etc.) does not represent a real feature. However, we documented instances where gestures were indistinguishable due to video quality or subtle movement limitations in order to capture the impact of visibility constraints on our findings.

An important point to emphasize is the difference between the gestural multimodality of denials as it occurs before the confession or ‘turning point’6 during the police interview and the multimodality of denials as it occurs after these relevant moments. In our dataset there is a marked reduction in both verbal and nonverbal expression in the ‘post’-confession phases to varying degrees among all suspects. Below is an example illustrating both the verbal and multi-modal behavior of one suspect at two distinct moments: first, prior to learning about their partner’s death resulting from their aggression, and subsequently, following confirmation and subsequent charges of femicide.

As can be inferred from Figure 1, in the ‘pre’-phase, the suspect’s conversational turn is marked by heightened gestural dynamism, complemented by generally longer and more complex sentences. In Figure 2, however, greater heaviness and stillness is observed in the physicality and gestures of the suspects. In the ‘post’-phase the curtailment in verbal expression is total, as nothing is uttered verbally; head shaking is the only element through which the suspect conveys his denial. It is interesting that the open hand shape is clearly visible, suggesting, in this case, an attitude of non-acceptance of the facts.

4.3. Prosodic Analysis

The prosodic analysis involved extracting the values of minimum, mean, and maximum pitch and intensity for each speech act of denial. This manual extraction was complemented by automatic extraction of pauses. Given the significantly large amount of annotated data, we decided to work with the extracted values of ten sample denials selected for each suspect (the denials with their pitch and intensity values were first stored in an Excel spreadsheet and then randomly selected to avoid bias). All these ten samples were selected from the subset of femicide denials. For the denials selected for each suspect, a further internal subdivision of the total collected denials was made. Denials produced before the confession or turning point during police interviews were distinguished from denials produced after these moments. Of the ten denials selected for each suspect, five were randomly chosen from the ‘pre-confession’ denials, while the remaining five were selected from those produced by the suspect during ‘post-confession’. This choice is motivated by the interest in the variation of pitch, intensity, and the number of pauses between, before, and after the confirmation of the accusations or the suspect’s admission of guilt. This is aimed at observing a possible systematic difference in the parameters between before and after instances, motivated by emotional and circumstantial reasons stemming from the exposure of lies and/or the formal accusation of femicide. It was assumed that there could be variation due to the strong emotional impact that being caught lying and/or being accused of murder entails, and that this could be found in all cases under consideration. Even if this possibility is acknowledged, its quantification falls outside the aims of this study.

Table 2 shows the ‘pre’ and ‘post’ averages for pitch and intensity for each suspect. The results appear to provide some responses to the research question. In particular, variation is observed, within the constraints of an exploratory qualitative study, regarding the intensity and the pitch of denial expression. In two cases, it seems to be more contained regarding pitch (K.L. and O.P.). What should be noted is that this snapshot of observations does not seem to indicate a steady direction of variation for the parameters considered, particularly concerning pitch. For intensity, there is a tendency towards a decrease in decibels following the confession or turning point (and during the cross-examination phase) in four out of the five suspects. O.P. contradicts this trend with a significant deviation and an increase in intensity after the turning point of the interview. Regarding pitch, however, there is a tendency towards a decrease following the confession or during the cross-examination phase in the cases of A.B. and I.J., while for the remaining three, there is an increase in F0 within the same circumstances. It is certainly important to mention, however, that the only case showing a significant increase in pitch is M.N., while K.L. and O.P. present a more subtle variation in which the increase may be more due to randomness or idiosyncrasy.

Table 3 shows the average length of pauses calculated by ‘pre’ and ‘post’ phase, in addition to the analysis of pitch and intensity. Regarding the average length of pauses observed in individual suspects, it appears that there are longer pauses in the post-phase for A.B., I.J., and M.N. Even if K.L. and O.P. do not confirm this trend, the gap between the ‘pre’ and ‘post’ phase in these two suspects is smaller than the gap observed in the other three. The ‘pre’ and ‘post’ totals reflect the majority trend. Regardless of the length, the number of pauses is almost equivalent in the ‘pre’ and ‘post’ phases for all five suspects. However, this does not correspond with the distribution of pauses in the entire dataset. In general, the number of pauses is greater in the ‘post’ phases of each suspect.

5. Discussion, Limitations, and Conclusions

Although the exploratory nature of this qualitative study precludes the derivation of systematic generalizations, the results and the interpretation of the data described in Section 4 highlighted the prototypical nature of both generic and feminicide-specific multimodal expressions of denial. Here, we extend the above considerations by adding some comparisons with the relevant literature on gestures, particularly in relation to arms and hands movements.

Concerning arms, we interpret the feature ‘arms crossed’ as a gesture of closure and separation, typically suggestive of downplaying and avoidance (Gallace et al. 2011). Therefore, we can claim that this element functions as a mechanism to express detachment and diminish the perceived importance of the crime in question, which is rendered as unexpected. Thus, this could indicate an intentional effort to downplay involvement in crime-related events.

In Figure 3 and Figure 4 we see two examples of the ‘arms crossed’ gesture, which we interpret as a cue of detachment from the crime under discussion. In the first case (Figure 3), the position of the arms co-occurs with a fake statement of desperate and sad astonishment (“I don’t know why anybody would do that”), aimed at avoiding possible allegations of involvement. In the second example (Figure 4), the closed position with crossed arms is also used by the suspect to detach himself from the reality of the situation. In this particular instance, the selected image represents a frame within a ‘bump and grind’ phase of the police interview. During this phase, the suspect assumes and maintains a defensive position, responding to all questions posed by the police officer with a lie. More generally, at the corpus level, this gesture appears to be associated with a defensive stance.

Similarly, an ‘open’ hand gesture, in both configurations (palm up and palm down), falls under Kendon’s (2004) classification which analyzes this type of hand shape in relation to manual actions of stop, refusal, denial, or interruption accompanying verbal expression. It is significant from this perspective that this specific hand shape is the most recurring one, not only throughout the dataset of denials but also for denials specifically related to femicides.

Two particularly illustrative examples are provided below. The ‘open’ hand gesture, employed in both the palm-up and palm-down configurations, was used by the same suspect to explicitly disavow any involvement with the murder weapon. In Figure 5, the suspect’s hands, with palms facing upwards, accompany the declaration of innocence concerning the allegation of firearm usage. The use of the hands creates a particular sense of surrender and innocence that follows the sentence, which not only denies the use of a firearm but also its possession (“I’ve never had a gun”).

Figure 6 illustrates the progression of denial. After discrediting the initial assertion regarding contact with a firearm, the police officer asks about the location of the firearms at the crime scene. The suspect then denies having used them, stating, “Please, don’t take it the wrong way. I really never used it. Neither one of them.” In this instance, the suspect’s use of the phrase “Neither one of them” accompanied by the gesture of shaking his open hands with the palm down in front of him, serves to reinforces his denial and strengthens his assertion that he has no connection to the murder weapon.

Then, we observed the ‘relaxed’ gesture, which to some extent resembles the ‘open’ form, occurring mostly in conjunction with other gestures. It was indeed the hand shape most often adopted individually by the right or left hand and not in double configurations.

As previously stated, the examples confirm the assumption of the ‘relaxed’ hand shape within a gesture made with only one of the two hands. In the case of Figure 7, the relaxed right hand is accompanied by a gesture of nervousness (the act of scratching the back or face) that O.P. often performs when he is in a recumbent position in relation to his actions on the day of the crime. In Figure 8, on the other hand, the suspect’s right hand, in a relaxed position, is accompanied by the left hand that instead expresses denial in an open position with palm facing upward. The ‘open’ form is once more employed to convey innocence and detachment from the facts being verbally denied, as illustrated in Figure 5 and Figure 6.

Having presented and discussed our results, it is necessary to acknowledge that this study is preliminary in nature, and it has several limitations, leaving numerous areas for further exploration and refinement. Some of the limitations may, in fact, represent new opportunities for development, which could be implemented in the relatively near future.

First of all, for our findings to be supported quantitatively, the study would require a larger and more balanced corpus (e.g., the same amount of data for police interviews and cross-examinations in courtroom proceedings). At the same time, a greater amount of audio–visual material from suspects of femicide, as well as from people accused of other crimes, is needed to compare the features assessed in this study with those in cases involving different types of criminal suspects. This would provide a better picture of the pragmatic, multimodal, and prosodic behavior of suspects, providing a more representative and generalizable overview of how denial is expressed verbally and/or nonverbally.

From a multimodal perspective, while the annotation scheme used in this study was sufficiently rich and complex, we believe it could be further enhanced by adding a few new parameters. Particularly, it could be useful to add cues regarding the lower body as well as ‘shoulder shrugs’ as a gestural feature. As reported by previous research, lower body cues (e.g., feet and legs) are less studied compared to the upper body but they are still relevant for the organization of social interaction (Mondada 2014). Equally interesting are the features regarding the shoulders: including them into the annotation of denial could be useful because the action of shoulders shrugging has been reported to signal a detachment attempt since “[they] can work as markers of ‘dis-stance’ or disengagement, in which case they take on an epistemic-evidential dimension” (Debras 2017, p. 24).

Since our study did not investigate the detailed shape and execution of gestures, future research could benefit from decomposing these gestures into finer traits, configurations, and subtler movements. This would offer a more granular picture of the multimodality of denial and, in more general terms, it would represent a multi-level analysis of gestures, investigating not only the syntactic–semantic or pragmatic aspect but also the “morphological” composition of gestures. At this stage, as outlined in the methodological section, the inclusion of an additional annotator and calculating inter-annotator agreement would be necessary for validation purposes.

As regards prosody and focusing particularly on the fundamental frequency parameter, another possibility to enhance this kind of study is to carry out precise annotation of the intonational contour of denials, to assess the possible presence of denial-specific pitch characteristics. In this regard, Mertens’ (2014, 2020) Prosogram and Polytonia tools could be used to automatically obtain a stylization of pitch contour as well as an automatic labelling of pitch movements. Parallelly, following the line drawn by classical works of Beckman and Pierrehumbert (1986) as well as Ladd (2008) on the intonational aspects of the English language (particularly the study of pitch accents and pitch contours based on the autosegmental-metrical theory), specific configurations could be observed, which may also be useful for an analysis of the pragmatic use of intonational features, both in production and perception. Moreover, once the necessary conditions for the retrieval of the aforementioned useful data are satisfied, comparisons could be drawn with studies on intonational contours of denials in other languages, such as Italian and other Romance languages (D’Imperio 2002; Prieto et al. 2005). Additionally, considering both prosodic and pragmatic aspects together, it would be interesting to complete the data on the mean length of utterances, adding speech rate (e.g., in the form of a count of syllables produced per second by each suspect), both overall and separately in the ‘pre’ and ‘post’ turning point phases.

Despite the limitations discussed above, the results of our paper outline a recognizable profile for multimodal denial. Recurrent features include a predominant gestural component characterized by head positioning (either neutral or lowered) and head shaking. The head shaking feature is frequently repeated and can serve independently as a nonverbal marker of denial. We also observed that denial is often accompanied by open-hand gestures and a sitting (erect) posture; this posture is frequently observed in conjunction with denial but is also common in non-denial instances throughout the corpus. A certain degree of vagueness in speech patterns was reported as regards the way suspects refer to the victims (i.e., they are not named explicitly). As regards prosody, our findings indicate that expressions of denial frequently involve a reduction in pitch and intensity following a confession or indictment. Finally, the analysis of pauses reveals that a greater number of pauses typically occur after incrimination.

Overall, we believe that this research may contribute to future studies on the multimodality of denial in legal settings, and to the limited literature in forensic linguistics and the broader academic discourse on this topic.

Author Contributions

Conceptualization, E.D. and C.R.C.; Methodology, E.D. and C.R.C.; Software, E.D. and C.R.C.; Validation, C.R.C.; Formal analysis, E.D.; Investigation, E.D. and C.R.C.; Resources, E.D. and C.R.C.; Data curation, E.D.; Writing—original draft, E.D. and C.R.C.; Writing—review & editing, E.D. and C.R.C.; Visualization, E.D.; Supervision, C.R.C.; Project administration, C.R.C. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The audio–video material used in this study cannot be made available as the videos were retrieved from online sources and are not owned by the authors. The code used for the automatic feature extraction developed for this study is available under request, contacting the corresponding author.

Acknowledgments

This research was conducted while Claudia Roberta Combei was engaged as a researcher under the initiative PON Ricerca e Innovazione 2014–2020—Linea Innovazione (D.M. 1062/2021) and while Elena Didoni was an M.A. student at the University of Pavia, where this work formed the basis of her thesis. The authors would like to thank Lucia Busso and Ilaria Fiorentini, as well as the four anonymous reviewers, for their helpful comments and suggestions on this study.

Conflicts of Interest

The authors declare no conflicts of interest.

Appendix A

Table A1. Multimodality Occurrences (Denials Femicides).

Body Part	Annotation	Whole Corpus	I.J.	K.L.	M.N.	A.B.	O.P.
Head	Back	10	0	0	0	0	10
	Front	467	38	41	7	50	331
	Left	72	0	3	2	2	65
	Nod	0	0	0	0	0	0
	Right	50	0	0	2	0	48
	Shake	187	10	32	35	87	23
	Down	235	26	0	0	0	209
	Head bent right	34	2	0	1	0	31
	Head bent left	58	2	0	0	0	56
	Not available	7	2	0	0	0	5
Gaze	Blink	0	0	0	0	0	0
	Down	290	42	3	12	5	228
	Eyerolls	0	0	0	0	0	0
	Front	25	1	0	2	0	22
	Left	68	0	0	0	2	66
	Right	10	0	0	0	0	10
	Towards other speaker	376	35	72	25	130	114
	Up	3	0	1	0	2	0
	Wink	0	0	0	0	0	0
	Closed	381	0	0	4	0	377
	Not available	29	2	0	0	0	27
Eyebrows	Both eyebrows raising	85	0	11	2	63	9
	Frowning	170	0	1	3	11	155
	Left eyebrow raising	7	0	1	0	6	0
	Relaxed	179	0	62	23	34	60
	Right eyebrow raising	23	0	0	0	23	0
	Not available	643	75	0	15	0	553
Posture	Laying on the chair	156	23	1	10	0	122
	Leaning forward	333	18	1	3	22	289
	Moving left	4	0	0	0	0	4
	Moving right	20	1	0	0	1	18
	Retracting back	12	1	0	1	0	10
	Sitting erect	718	52	109	0	182	375
	Swivel chair	4	0	0	0	0	4
	Not available	6	2	0	0	0	4
Hand Gesture	Angled	4	0	2	0	0	2
	Arms crossed	197	24	0	0	21	152
	Ball	1	0	1	0	0	0
	Clasped	39	7	3	0	14	15
	Counting	4	0	1	0	0	3
	Cross	1	1	0	0	0	0
	Cup	8	1	2	0	0	5
	Deictic	18	2	0	0	1	15
	Fist	108	0	2	8	0	98
	Folded	8	7	0	0	1	0
	Gun	0	0	0	0	0	0
	Hands in pockets	83	26	54	0	0	3
	Hole	0	0	0	0	0	0
	Intertwined	19	0	0	0	1	18
	Jailed	0	0	0	0	0	0
	Knife	11	2	3	0	0	6
	Loose	30	0	0	0	1	29
	Measurement	2	0	2	0	0	0
	Okay	0	0	0	0	0	0
	Open	311	4	17	35	16	239
	Pursued	0	0	0	0	0	0
	Relaxed	151	3	10	13	11	114
	Scratch	79	2	1	0	0	76
	Star	5	0	0	0	0	5
	Steepled	1	0	0	0	0	1
	Tulip	0	0	0	0	0	0
	Two	0	0	0	0	0	0
	Wall	47	0	0	0	0	47
	Finger closed	24	0	0	0	0	24
	Pinch	26	0	0	0	0	26
	Fingers touching	8	0	0	0	0	8
	Drinking	1	0	0	0	0	1
	Handcuffed	116	0	0	0	0	116
	Writing	1	0	0	0	0	1
	Key	1	0	0	0	0	1
	Not available	100	5	0	0	73	22
Handedness	Both	792	70	56	37	56	573
	Left	258	5	22	10	3	218
	Right	276	8	22	10	7	229
	Not available	74	1	0	0	69	4

Notes

1	These channels are available at the following web pages (accessed 18 July 2024): https://www.cbc.ca/news/fifthestate; https://www.youtube.com/@Telegraph247; https://lawandcrime.com/; https://www.youtube.com/@redcircleinterrogationsand5722.
2	The software are available at the following web pages (accessed 18 July 2024): https://www.videolan.org/vlc/index.it.html; https://www.audacityteam.org/.
3	This software is available at this web page (accessed 18 July 2024): https://archive.mpi.nl/tla/elan.
4	The MIT Boston Speech Communication Group’s ‘Gesture Coding Manual’ is available at this web page (accessed 18 July 2024): https://speechcommunicationgroup.mit.edu/gesture/coding-manual.html.
5	With respect to eyebrow features, the ‘not available’ category was the most frequently assigned due to unsatisfactory image quality or recording angles.
6	By ‘turning point’ we mean the moment of the police interview when the suspect’s alibi is overtly challenged or debunked. At that point, the suspect must face the truth and decide whether to continue with his strategy or confess to the crimes.
7	In accordance with what had been stated in lines 184–186, it was resolved that the faces of suspects depicted in the video would be obscured in deference to the right to privacy, even though the multimedia material is freely accessible online. All figures in the paper were processed following this methodology. To view the multimedia data in full, interested parties may request the image from either of the paper’s authors.

References

Bang, Brandon L., Duane Stanton, Craig Hemmens, and Mary K. Stohr. 2018. Police recording of custodial interrogations: A state-by-state legal inquiry. International Journal of Police Science & Management 20: 3–18. [Google Scholar] [CrossRef]
Beckman, Mary Esther, and Janet Breckenridge Pierrehumbert. 1986. Intonational structure in Japanese and English. Phonology 3: 255–309. [Google Scholar] [CrossRef]
Boersma, Paul. 2001. Praat, a system for doing phonetics by computer. Glot International 5: 341–45. [Google Scholar]
Bressem, Jana, and Cornelia Müller. 2017. The “negative-assessment-construction”—A multimodal pattern based on a recurrent gesture? Linguistics Vanguard 3: 1–9. [Google Scholar] [CrossRef]
Bressem, Jana, Silva H. Ladewig, and Cornelia Müller. 2013. Linguistic annotation system for gestures (lasg). In Body—Language—Communication: An International Handbook on Multimodality in Human Interaction. Edited by Cornelia Müller, Alan Cienki, Ellen Fricke, Silva H. Ladewig, David McNeill and Jana Bressem. Berlin: De Gruyter Mouton, pp. 1098–124. [Google Scholar]
Buller, David B., and R. Kelly Aune. 1987. Nonverbal cues to deception among intimates, friends, and strangers. Journal of Nonverbal Behavior 11: 269–90. [Google Scholar] [CrossRef]
Combei, Claudia Roberta. 2023. The multimodal expression of denial: A case study on femicide suspects. In Linguistica Forense in Prospettiva Multidisciplinare. Edited by Sonia Cenceschi and Chiara Meluzzi. Milano: Officinaventuno, pp. 27–44. [Google Scholar]
Debras, Camille. 2017. The shrug. Gesture 16: 1–34. [Google Scholar] [CrossRef]
D’Imperio, Mariapaola. 2002. Italian intonation: An overview and some questions. Probus 14: 37–69. [Google Scholar] [CrossRef]
Gallace, Alberto, Diana M. E. Torta, G. Lorimer Moseley, and Gian Domenico Iannetti. 2011. The analgesic effect of crossing the arms. Pain 152: 1418–23. [Google Scholar] [CrossRef]
Harrison, Simon. 2009. The expression of negation through grammar and gesture. In Studies in Language and Cognition. Edited by Jordan Zlatev, Mats Andrén, Marlene Johansson Falck and Carita Lundmark. Cambridge: Cambridge Scholars Publishing, pp. 421–35. [Google Scholar]
Harrison, Simon. 2010. Evidence for node and scope of negation in co-verbal gesture. Gesture 10: 29–51. [Google Scholar] [CrossRef]
Harrison, Simon. 2018. The Impulse to Gesture: Where Language, Minds, and Bodies Intersect. Cambridge: Cambridge University Press. [Google Scholar]
Horn, Laurence Robert. 2010. The Expression of Cognitive Categories. Berlin: De Gruyter. [Google Scholar]
Hummer, Peter, Heinz Wimmer, and Gertraud Antes. 1993. On the origins of denial negation. Journal of Child Language 20: 607–18. [Google Scholar] [CrossRef]
Jaiswal, Mimansa, Sairam Tabibu, and Rajiv Bajpai. 2016. The truth and nothing but the truth: Multimodal analysis for deception detection. Paper presented at 2016 IEEE 16th International Conference on Data Mining Workshops (ICDMW), Barcelona, Spain, December 12–15; pp. 938–43. [Google Scholar]
Kendon, Adam. 2002. Some uses of the head shake. Gesture 2: 147–82. [Google Scholar] [CrossRef]
Kendon, Adam. 2004. Gesture: Visible Action as Utterance. Cambridge: Cambridge University Press. [Google Scholar]
Kisler, Thomas, Uwe Reichel, and Florian Schiel. 2017. Multilingual processing of speech via web services. Computer Speech & Language 45: 326–47. [Google Scholar] [CrossRef]
Ladd, Dwight Robert. 2008. Intonational Phonology. Cambridge: Cambridge University Press. [Google Scholar]
Larner, Samuel. 2019. Formulaic sequences as a potential marker of deception: A preliminary investigation. In The Palgrave Handbook of Deceptive Communication. Edited by Tony Docan-Morgan. London: Palgrave Macmillan, pp. 351–70. [Google Scholar]
Matoesian, Gregory M. 2010. Multimodal aspects of victim’s narrative in direct examination. In The Routledge Handbook of Forensic Linguistics. Edited by Malcolm Coulthard and Alison Johnson. New York and London: Routledge, pp. 541–57. [Google Scholar]
Matoesian, Gregory, and Kristin Gilbert. 2016. Multifunctionality of hand gestures and material conduct during closing argument. Gesture 15: 79–114. [Google Scholar] [CrossRef]
Mertens, Piet. 2014. Polytonia: A system for the automatic transcription of tonal aspects in speech corpora. Journal of Speech Sciences 4: 17–57. [Google Scholar] [CrossRef]
Mertens, Piet. 2020. The Prosogram model for pitch stylization and its applications in intonation transcription. In Prosodic Theory and Practice. Edited by Jonathan Barnes and Stefanie Shattuck-Hufnagel. Cambridge, MA: MIT Press, pp. 259–86. [Google Scholar]
Mondada, Lorenza. 2014. Bodies in action. Language and Dialogue 4: 357–403. [Google Scholar] [CrossRef]
Mondada, Lorenza. 2016. Challenges of multimodality: Language and the body in social interaction. Journal of Sociolinguistics 20: 336–66. [Google Scholar] [CrossRef]
Morris, Bradley J. 2003. Opposites attract: The role of predicate dimensionality in preschool children’s processing of negations. Journal of Child Language 30: 419–40. [Google Scholar] [CrossRef]
Petyko, Marton, Lucia Busso, Tim Grant, and Sarah Atkins. 2022. The Aston Forensic Linguistic Databank (FoLD). Language and Law/Linguagem e Direito 9: 9–24. [Google Scholar] [CrossRef]
Prieto, Pilar, and Maria Teresa Espinal. 2020. Negation, prosody, and gesture. In Oxford Handbook of Negation. Edited by Viviane Déprez and Maria Teresa Espinal. Oxford: Oxford University Press, pp. 1–20. [Google Scholar]
Prieto, Pilar, Mariapaola D’Imperio, and Barbara Gili Fivela. 2005. Pitch accent alignment in Romance: Primary and secondary associations with metrical structure. Language and Speech 48: 359–96. [Google Scholar] [CrossRef]
Ripley, David. 2020. Denial. In Oxford Handbook of Negation. Edited by Viviane Déprez and Maria Teresa Espinal. Oxford: Oxford University Press, pp. 1–13. [Google Scholar]
Roitman, Malin. 2017. Introduction. In The Pragmatics of Negation. Edited by Malin Roitman. Amsterdam: John Benjamins, pp. 1–14. [Google Scholar]
Soldner, Felix, Verónica Pérez-Rosas, and Rada Mihalcea. 2019. Box of lies: Multimodal deception detection in dialogues. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers). Edited by Jill Burstein, Christy Doran and Thamar Solorio. Minneapolis, MN: Association for Computational Linguistics, pp. 1768–77. [Google Scholar]
United Nations. 2013. Statement Submitted by the Academic Council on the United Nations System, a Non-Governmental Organization in Consultative Status with the Economic and Social Council. Vienna Declaration on Femicide. Available online: https://www.unodc.org/documents/commissions/CCPCJ/CCPCJ_Sessions/CCPCJ_22/_E-CN15-2013-NGO1/E-CN15-2013-NGO1_E.pdf (accessed on 20 July 2024).
van der Sandt, Rob. 1991. Denial. Chicago Linguistic Society 27: 331–44. [Google Scholar]
Vandamme, Fernand. 1972. On Negation: An Interdisciplinary study. Logique et Analyse 15: 39–101. [Google Scholar]
Vrij, Aldert, Gün Refik Semin, and Ray Bull. 1996. Insight into behavior displayed during deception. Human Communication Research 22: 544–62. [Google Scholar] [CrossRef]
Wang, Min. 2024. Stance-taking in American courtroom interaction from the dual perspectives of language and gesture: A case study of the trial of ‘The State of Minnesota v. Derek Michael Chauvin 2021’. The International Journal of Speech, Language and the Law 31: 162–72. [Google Scholar] [CrossRef]

Figure 1. M.N. Multimodal Denial (‘pre’)7.

Figure 2. M.N. Multimodal Denial (‘post’).

Figure 3. O.P.’s example of denial accompanied by the ‘arms crossed’ gesture.

Figure 4. I.J.’s example of denial accompanied by the ‘arms crossed’ gesture.

Figure 5. ‘Open’ hand gesture (palm up).

Figure 6. ‘Open’ hand gesture (palm down).

Figure 7. O.P.’s example of a ‘relaxed’ gesture.

Figure 8. M.N.’s example of a ‘relaxed’ gesture.

Table 1. General Results Statistics—Absolute Frequencies.

Suspect	Videos	Pauses	General Denials	Femicide Denials
A.B.	Video 1	3211	105	70
A.B.	Video 2	2369	113	69
I.J.	Video 1	786	106	75
K.L.	Video 1	689	112	75
M.N.	Video 1	896	55	49
O.P.	Video 1	2619	201	180
	Video 2	14,109	235	202
	Video 3	10,526	177	166
	Video 4	1621	88	68
	Video 5	6477	183	162
Total		46,710	1375	1116

Table 2. Mean of Pitch and Intensity.

Suspect	Average Pitch (‘Pre’)	Average Pitch (‘Post’)	Average Intensity (‘Pre’)	Average Intensity (‘Post’)
A.B.	123.39 Hz	119.01 Hz	69.11 dB	65.05 dB
I.J.	160.10 Hz	151.81 Hz	62.46 dB	59.57 dB
K.L.	220.70 Hz	221.53 Hz	62.69 dB	59.32 dB
M.N.	175.02 Hz	208.81 Hz	79.40 dB	62.81 dB
O.P.	174.80 Hz	176.26 Hz	69.66 dB	81.17 dB

Table 3. Average Length of Pauses.

Suspect	Average Pauses Length (‘Pre’)	Average Pauses Length (‘Post’)
A.B.	0.94 s	1.51 s
I.J.	0 s	0.69 s
K.L.	1.23 s	1.04 s
M.N.	0.56 s	3.41 s
O.P.	1.11 s	0.81 s
Total	0.77 s	1.49 s

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Didoni, E.; Combei, C.R. Beyond “I Didn’t Do It”: A Linguistic Analysis of Denial in US Legal Settings. Languages 2024, 9, 351. https://doi.org/10.3390/languages9110351

AMA Style

Didoni E, Combei CR. Beyond “I Didn’t Do It”: A Linguistic Analysis of Denial in US Legal Settings. Languages. 2024; 9(11):351. https://doi.org/10.3390/languages9110351

Chicago/Turabian Style

Didoni, Elena, and Claudia Roberta Combei. 2024. "Beyond “I Didn’t Do It”: A Linguistic Analysis of Denial in US Legal Settings" Languages 9, no. 11: 351. https://doi.org/10.3390/languages9110351

APA Style

Didoni, E., & Combei, C. R. (2024). Beyond “I Didn’t Do It”: A Linguistic Analysis of Denial in US Legal Settings. Languages, 9(11), 351. https://doi.org/10.3390/languages9110351

Article Menu

Beyond “I Didn’t Do It”: A Linguistic Analysis of Denial in US Legal Settings

Abstract

1. Introduction

2. Motivations and Aims

3. Data and Methods

3.1. Materials

3.2. Methods

3.3. Gestures: ELAN and the Annotation Scheme

3.4. Prosody

4. Results

4.1. General Overview

4.2. Gestural Analysis

4.3. Prosodic Analysis

5. Discussion, Limitations, and Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Acknowledgments

Conflicts of Interest

Appendix A

Notes

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI