1. Introduction
There are very large individual differences in human face identity recognition ability (e.g., [
1,
2]). Developmental prosopagnosics (DPs) inhabit the extreme low end of the ability spectrum, while so-called super-recognisers (SRs) occupy the top. Faces comprise different internal features (e.g., eyes, nose, mouth) which vary in their prominence. Indeed, previous research has shown that there is a “feature-salience hierarchy” in terms of the ease with which facial features are recognised and manipulations to those features are detected. Using unfamiliar face-matching paradigms, participants most easily detect changes that are made to the eyes, followed in turn by changes to the mouth and then the nose [
3,
4,
5,
6,
7,
8,
9]. Recently, Lai et al. (2014) demonstrated similar “feature-saliency hierarchy” effects using a sequential face feature change detection procedure [
10]. In the present study we aim to recruit super-recognisers, controls with ability in the typical range, and developmental prosopagnosics, to determine whether individual differences in face recognition ability might be associated with qualitative variations in the face feature-salience hierarchy, or instead by a general reduction or improvement in performance irrespective of the feature being tested.
Prosopagnosia is a condition which presents as marked impairments in the ability to recognise faces even if familiar (i.e., close acquaintances). While acquired prosopagnosia is typically caused by brain trauma, lesions, or strokes affecting occipitotemporal or anterior temporal regions (see [
11,
12] for reviews), developmental prosopagnosia (DP) is a lifelong neurodevelopmental impairment which occurs in the absence of brain damage or other cognitive impairments [
13]. Further, while acquired prosopagnosia is extremely rare, DP is estimated to affect approximately 2–3% of the population [
14,
15]. Studies have found that some DPs are selectively impaired in face identity as compared to facial expression recognition [
16,
17] and voice recognition [
18], which is line with a vast range of studies showing distinct, but overlapping, mechanisms and brain areas involved in facial recognition, emotion recognition, and voice recognition (i.e., [
19,
20]); however, other studies have found positive correlations between unfamiliar face and voice recognition abilities in both DPs and SRs [
21]. Many studies have investigated face perception in developmental prosopagnosia, yet few have measured the ability to recognise or detect changes to individual facial features.
One strategy that measures the ability to match separate face parts is the part–whole task, in which participants must match faces in which either the nose, mouth, or eyes change in isolation or in the context of a whole face. Using the procedure, in contrast to controls, DPs do not show an advantage in presenting feature changes in the context of a whole face [
22], suggesting an impairment in a holistic style of face perception which makes use of the context of a whole face in a single glance (e.g., [
8,
23]). A further study using the part-whole task found DPs had an overall drop in performance across features, but the pattern for detecting eye changes > mouth changes > nose changes was the same for DPs and controls [
24]. Taken together, these results suggest that, while differences exist between DPs and controls for holistic face processing, the feature-saliency pattern may remain the same, with an equal drop in performance across all conditions for DPs. Alternatively, other studies have suggested that DPs have an impairment in processing the eyes above and beyond other facial features; for example, an ERP study showed that contrast changes were particularly impaired when they occurred in the eye region compared to other parts of the face [
25]. Furthermore, an eye-tracking study showed that DPs paid less attention to the eyes than control participants [
15]. The current study aims to test these two competing hypotheses for face perception impairments in DP.
While DPs have impaired face-recognition abilities compared to the average population, others have face recognition abilities that are far above the average. These ‘super-recognisers’ (SRs [
26]) are able to identify and perceive familiar and unfamiliar faces across large variations in lighting, viewpoint, expressions, and age-changes with an unusually high degree of accuracy (e.g., [
26,
27,
28,
29]). Few studies have examined feature processing ability for separate features in super-recognisers (SRs). Using the part–whole task, Belanova and colleagues found that SRs were significantly better at detecting feature changes than controls, with a subgroup of SRs performing better on the nose condition [
30]. However, it is important to note that some SRs showed significant effects in the opposite direction, suggesting the underlying mechanisms supporting this skill may substantially differ, which may explain the variation in outcomes both within and between studies. This supports the finding by an eye-tracking study that showed half of SRs fixate more on the nose area than controls [
31]. The SR nose dominance effect may, however, be an optimal fixation point in the centre of the image from which to more easily take in the other facial features in a single glance. Indeed, a more recent study by Dunn and colleagues suggested that, unlike typical-ability controls who tend to fixate more on the eyes, SRs employ a flexible eye gaze mechanism which the authors likened to an adjustable spotlight [
32]. This mechanism appears to automatically fixate on the most salient or idiosyncratic aspect of an individual face that will support subsequent accurate identity recognition. As such, dependent on the face under observation, fixation could be on individual facial features, component parts of features, or even on the whole face. In studies involving multiple face displays, it is likely that the combined fixation data regress to the central regions of the face, explaining the nose advantage results of Bobak et al. (2017) [
31]. Overall, these findings suggest SRs may have either a general increase in face feature change detection ability, or alternatively they might have a qualitative difference in the way that they view faces in terms of being more likely to fixate in the centre of the image or on the nose, as compared to control participants who tend to fixate closer to the eyes or the nasion. However, SRs show greater variation in the direction of eyes for each face, which may explain idiosyncratic response patterns across SRs.
A recent study examining familiar faces found that, similar to controls, both SRs and DPs were more sensitive to changes in “critical” features (i.e., eyebrow thickness, eye shape, eye colour, lip thickness) than changes to “non-critical” features (face proportion, eye distance, mouth size, skin colour) [
32], adding weight to the idea that there are qualitative similarities in the way that these individuals perceive faces. However, it is unclear how the responses to the individual “critical” feature changes (i.e., eye changes versus nose changes) differed between DPs and SRs. We expand upon this here by directly comparing face feature change detection performance across DPs, typical-ability controls, and SRs for changes in individual features. The aim of this study is therefore to use a feature change detection task to examine the ability of members of these groups to identify changes in the whole face, the eyes, the nose, the mouth, or whether they are viewing the same face image [
10]. More specifically, we examine whether, compared to controls, DPs and SRs display either quantitative differences in face feature change detection ability across the spectrum of face-recognition ability, or whether there are more qualitative differences concerning the importance of the eye region for face recognition or the utility of fixating on the nose to take in more of the face image. In order to achieve these goals, we will collect accuracy and response-time data (and a combined balanced integration measure) across three age-matched groups of SRs, DPs, and controls likely to possess typical-range levels of face-recognition ability.
3. Results
Figure 2 shows the mean accuracy, response times, and balanced integration scores (BIS) from the three participant groups (controls, DPs and SRs). Based on accuracy and response times, all groups were better for the different and the same-face changes than detecting when individual features changed. While examining only accuracy scores, DPs and controls showed similar performance across conditions, while super-recognisers performed better on the nose condition than both groups. For reaction times, all groups were slightly faster at identifying nose changes than eye and mouth changes. As planned a priori, and due to the less accurate scores for the nose condition together with the faster responses suggesting speed-accuracy trade-offs, we analyse both the accuracy scores (as typically used in previous papers, and to allow readers to compare findings across our study and other studies) and the balanced integration scores. Complete statistical analysis of reaction times, including tables, are included as
Supplementary Materials, but a brief summary is included below.
3.1. Accuracy
A 3 (group; between subjects) × 5 (condition; within subjects) ANOVA revealed significant main effects of Group (F(2,96) = 4.65, p = 0.001), Condition (F(4,384) = 257.47, p < 0.001), and a significant interaction (F(8,384) = 3.19, p = 0.002). As we were interested in how the two groups (DPs and SRs) differed in their response to controls, we examined these differences by conducting 2 × 5 ANOVAs for SRs vs. controls, and another for DPs vs. controls.
3.2. Super-Recognisers vs. Controls
For the SRs vs. controls ANOVA, effects of Group (F(1,64) = 4.41,
p = 0.04), Condition (F(4,256) = 149.51,
p < 0.001) and the Interaction (F(4,256) = 5.01,
p < 0.001) were significant. To further investigate the significant Interaction, we conducted two one-way ANOVAs on the condition variable, separately for controls and SRs. The results were significant for controls (F(4,128) = 122.00,
p < 0.001) and SRs (F(4,128) = 46.10,
p < 0.001), although effect sizes were stronger in controls. Tukey’s post-hoc
t-tests (
Table 2) revealed significant differences between the five conditions for SRs (P’s < 0.001), with the exception of the same vs. different comparison, and the nose vs. mouth comparison (P’s = 0.99). The analyses with controls revealed identical outcomes to SRs, except the same-face and eyes comparison was also not significant. Controls showed a very similar pattern to SRs with significant differences between all conditions (P’s < 0.05), but no differences between the same and different conditions, or between the nose and mouth (P’s > 0.13); however, controls also showed no difference between the same-face and eye changes (
p = 0.12).
The Interaction between SRs and controls (
Table 3) was caused by a greater accuracy to the same condition for SRs than controls (
t(64) = 3.31,
p < 0.001) as well as the nose condition (
t(64) = 2.99,
p < 0.001). There were no other significant differences (P’s > 0.08).
In sum, these results therefore showed that SRs were more accurate on the same-face condition as well as the nose-change condition, suggesting qualitative differences between SRs and controls.
3.3. Developmental Prosopagnosics Compared to Controls
For DPs versus controls, there was an effect of Condition (F(4,256) = 239.80,
p < 0.001), but no effect of Group (F(1,64) = 0.74,
p = 0.39) or Interaction (F(4,256) = 0.21,
p = 0.93). The effect of Condition was explored using Tukey’s post-hoc
t-tests collapsed across groups (
Table 4) which showed significant differences between all conditions (P’s < 0.05) with the exception of no accuracy differences between the same and different conditions (
p = 0.99). In sum, controls and DPs showed similar patterns of accuracy.
3.4. Response Times
Complete statistical analysis of reaction times, complete with tables, is included in the
Supplementary Materials. However, we include a brief summary here. When comparing controls and SRs, there was no effect of Group, but an effect of Condition and Interaction. SRs had no significant differences in reaction times between the eyes, nose, and mouth, or between whole face changes or the same face, while controls only showed differences in reaction times between the same face and eye changes, and the same face and mouth changes. When comparing controls to DPs, there was an effect of Group and Condition, but no Interaction. Controls had overall faster reaction times than DPs. Both controls and DPs had faster reaction times for same face than for eye changes, nose changes, and mouth changes, as well as faster reaction times for different face than mouth changes.
3.5. Balanced Integration Analysis
Next, we examined the balanced integration scores (BSI) which take into account speed-accuracy trade-offs. A 3 (group; between subjects) × 5 (condition; within subjects) ANOVA revealed a main effect of Group (F(2,96) = 6.24, p = 0.003), Condition (F(4,384) = 127.40, p < 0.001), and an Interaction (F(8,384) = 2.26, p < 0.001). As we were interested in how the two groups (DPs and SRs) differed in their response to controls, we examined these differences by conducting 2 × 5 ANOVAs for super-recognisers versus controls, and another for prosopagnosics versus controls.
3.6. Super-Recognisers Compared to Controls
For SRs versus controls, there was no effect of Group (F(1,64) = 0.01,
p = 0.94), but an effect of Condition (F(4,256) = 125.94,
p < 0.001) and an Interaction (F(4,256) = 4.84,
p < 0.001). The effect of Condition was explored using Tukey’s post-hoc
t-tests (
Table 5) which showed significant differences between all conditions for both controls and SRs (P’s < 0.05), with the exception of no differences between the same and different conditions, or between the nose and mouth (P’s > 0.68). This replicated the findings by Lai et al. (2014), which found that eye changes were easier to detect than nose and mouth changes for both groups (P’s < 0.05), but no difference between nose changes and mouth changes (P’s > 0.84).
The Interaction between SRs and controls (
Table 6) was caused by a greater response to the same condition for SRs than controls (
t(64) = 2.41,
p = 0.02). There was also a trend for controls to be better than SRs for the eye condition (
t(64) = 1.63,
p = 0.06). There were no other significant differences (P’s > 0.15).
In sum, these results therefore showed no quantitative differences between SRs and controls with no overall differences in performance, but instead a qualitative difference with SRs better able to detect when the face was the same and marginally worse at detecting when the eyes changed.
3.7. Developmental Prosopagnosics Compared to Controls
For DPs versus controls, there was an effect of Group (F(1,64) = 12.7,
p < 0.001) and an effect of Condition (F(4,256) = 142.76,
p < 0.001) but no Interaction (F(4,256) = 0.79,
p = 0.53). The effect of Group with no Interaction was caused by an overall better performance for controls than prosopagnosics across all conditions; however, we have included these comparisons for reference in
Table 7.
The effect of Condition was explored using Tukey’s post-hoc
t-tests collapsed across groups (
Table 8), which showed significant differences between all conditions (P’s < 0.001), with the exception of no differences between the same and different conditions, or between the nose and mouth (P’s > 0.62).
In sum, these results confirmed the observed general reduction in performance for DPs compared to controls across all feature-change conditions. Separate analyses of accuracy and response times showed that these pervasive differences on the task were mainly accounted for by DPs’ slower response times.
3.8. Individual Differences
Figure 3 shows the individual results for each participant on the individual feature changes (eyes, nose, mouth) for accuracy and balanced integration scores. As can be seen in the figure, the controls and DPs showed similar patterns and were relatively consistent across participants. However, the SRs showed much larger variations in their responses—particularly for accuracy. While the average and most SRs showed better accuracy for eyes and similar performance for nose and mouth, some SRs showed greater accuracy for nose changes (this in line with previous findings; [
21]). When reaction times were accounted for in the balanced integration scores, the variability became less; this suggests that SRs had significant variability and heterogeneity across their speed–accuracy trade-offs, and that individual SRs may be using different strategies to identify feature changes.
3.9. Results Summary
DPs showed an overall drop in performance across conditions but no differences in the pattern of responses across conditions, and these differences were mainly accounted for by DPs slower response times. On the other hand, SRs showed no overall differences in performance but were specifically better able to detect when the face was the same and marginally worse at detecting when the eyes changed. Interestingly, SRs were also more accurate at detecting nose changes than control participants, but this increase in accuracy was accompanied by slower response times in this condition (as well as for the other feature-change conditions), suggesting a more complex pattern of behaviour. The only condition which yielded significant differences both between DPs and controls, and between SRs and controls, was the ability to identify when the same face was presented, with SRs being better than controls, and controls being better than DPs.
4. Discussion
The aim of this study was to examine whether developmental prosopagnosics and super-recognisers showed differences in their ability to detect face feature changes compared to controls. Specifically, we asked whether any differences in the ability to detect face feature changes were quantitative, meaning that face feature detection performance would be generally better or worse across all features tested, or qualitative, meaning that specific patterns emerged that distinguished between the different groups. For example, these could include feature-specific hypotheses involving a proposed eye-processing deficit in DP, or a “nose-superiority” effect for SRs. Overall, we found that prosopagnosics showed a quantitative difference with an overall drop in performance across conditions but no differences in the patterns of responses across conditions. On the other hand, super-recognisers showed qualitative differences, with greater accuracy for detecting nose changes than controls, and were also better able to detect when the face was the same. Furthermore, the only condition which reliably distinguished between the three groups was the ability to identify when the same face was presented, with SRs being better than controls, and controls being better than DPs. These findings will be discussed in detail below.
Our findings for our control group largely replicate those reported by Lai et al. (2014), who found better performance when the whole face changed as compared to individual feature changes, and better performance for eye changes compared to nose or mouth changes [
10]; however, in our participant sample, there was no difference between nose and mouth. One possible explanation for this difference is that our participants were older than the participants used in the Lai study (mean age = 42.6, compared to mean age = 22.9). Indeed, one study examining the effect of participants’ ages on face matching and face memory found that worse performance on a face-memory task could be explained by the reduced performance on the face-matching tasks [
50]; further, studies have reported specific deficits in face-feature processing with increasing age (e.g., [
50,
51]).
We found that DPs showed an overall drop in performance across all conditions compared to controls, but no difference in the pattern of response across conditions between the two groups. In other words, DPs showed the same feature effects as controls but were just overall worse at detecting face-feature changes (and repetitions). These differences primarily stemmed from slower response times than the control group, suggesting a lower level of confidence in the DP or that a more time-consuming strategy may have been employed. Our findings fit with those from a previous study which found that DPs had an overall drop in performance for feature changes, but the pattern for detecting eye, mouth, and nose changes was the same for DPs and controls [
24]. This was also found in another study examining the ability of DPs and controls to detect changes in “critical” features and “non-critical” features, where DPs showed similar patterns of response to controls [
32]. One study which used eye tracking to measure scan paths during a face memory task showed no difference in scan paths of faces in DPs and controls, with both groups showing a bias towards the eyes over the mouth [
52] (for different findings, see [
31]). Taken together with our results, this suggests that DPs may be impaired in detecting changes to all internal facial features rather than having a specific or pronounced deficit for the eye region.
Our results showed that SRs were not significantly better than controls on our task overall, but instead showed differences across conditions. Specifically, SRs were better able to detect when the face was the same than the other groups and showed a more complex pattern in terms of their accuracy and response times. While a previous study found SRs were overall better than controls at identifying eye, mouth, and nose changes than controls on the part–whole task, a large number of SRs showed a bias towards nose changes [
30]. We replicated this nose-superiority-effect and showed that SRs were generally more accurate than controls on the nose trials (see
Figure 2C). However, after taking into account SRs’ slower response times for these trials (using balanced integration scores), no significant differences for the nose condition were observed. Indeed, SRs appear to have slower response times for all individual feature change trials (but not for whole face changes or repetitions), suggesting a more careful and time-consuming appraisal of these more difficult single feature change trial types than the other groups. The trend towards worse performance on the eye-change condition accompanied by superior accuracy for the nose condition in SRs suggests that SRs tend to fixate more on the nose area than controls (in line with the findings of a previous study; [
31]). SRs’ numerically superior accuracy for the mouth trials is also consistent with this proposed difference in task strategy, though this apparent difference was a non-significant trend. The only condition in which SRs performed better in terms of both accuracy and response times was when identifying face repetitions. Further, examining the individual differences in accuracy and balanced integration scores, there was a larger variation in detecting individual feature changes in the SRs than in the controls and the DPs. This suggests that individual SRs may utilise different strategies on this task.
Post-hoc tests showed that the only condition that could distinguish between all three participant groups (DPs, controls, and SRs) was the face-repetition condition. Differences between controls and DPs for identifying face repetitions have previously been found by others (e.g., [
25]). In this study, DPs incorrectly reported identity changes when faces were repeated (24%), while controls showed remarkably few such errors (5%). This has also been observed in another study which found the DPs were impaired in face-repetition trials but not face-change trials [
44]. We suggest that DPs have difficulty with identifying when two faces are the same (i.e., “telling faces together”), even when these images are physically identical to one another (apart from a modest change in size). The finding that the face-repetition condition is the one condition in which SRs were overall reliably superior to the other groups (alongside the finding of higher accuracy for the nose-change condition than controls) also suggests that SRs have a disproportionate superiority in “telling faces together”. SRs are extremely good at identifying faces across large changes between comparison images [
24,
25,
26,
27], and therefore it is perhaps not surprising that they should find this condition easy—indeed, average accuracy for SRs on this condition was 97% and they had the fastest reaction times (534 ms). Taken together, it may be the case that the vast differences in face recognition ability observed between our three groups can most reliably be accounted for by the ability to tell that two images depict the same person rather than the ability to notice subtle differences between face images.
Interestingly, our data suggest that SRs’ superior accuracy in detecting certain feature changes may in part be strategic. SRs have superior accuracy for nose trials while also having slower response times for these single feature change trials. This pattern suggests a trade-off between speed and accuracy on these trials. Given that SRs also perform generally better on the same-face trials (both faster and more accurate), we suggest that this speed-accuracy trade-off is due to SRs’ specific awareness of these difficult trial types and is suggestive of increased diligence in these conditions. Interestingly, this increased diligence for nose and mouth changes appears to come at least in part at the expense of performance for the easier eye-change condition. If superior face-recognition ability is accompanied by strategic differences in the way in which face-processing tasks are completed, it opens the possibility that these strategies could be learned by others. Along with previous evidence for a “nose-superiority effect” in SRs [
30], we suggest (along with others, e.g., [
53]) that fixation at the centre of face images allows for more information to fall on the sensitive central retinal area which contains the highest visual acuity. Therefore, fixation around the nose may be an optimal strategy for completing face matching tasks [
53,
54] and might also allow for a more effective spread of visual attention across the whole face. Whether this optimal strategy can be learned via intervention, and whether those individuals with poor or normal face-recognition ability can benefit from this approach remains to be determined. Indeed, previous intervention studies have used the qualitative differences found in face perception between controls and individuals with DP in an attempt to improve face recognition, such as distances between features [
55,
56], and detecting subtle whole-face changes through the use of morphing techniques [
57,
58] (see [
59,
60] for reviews).
One limitation of this study is that it was not possible to measure response biases. It was not feasible to adequately employ signal-detection theory (i.e., d’prime) with these data, as, unlike other studies in which same/different decisions are made, “hits” can only occur in one condition in 20% of trials (“same face”) and “false alarms” can occur in the remaining four feature-change conditions. Therefore, it is not possible to clearly identify response biases in this study (i.e., would a response bias be deviation from 50/50 responding or 20/80 responding?). Further studies should examine this, and it would be interesting to compare any response biases between prosopagnosics and super-recognisers.