2. Reconstruction: Late Merge and the Lebeaux Effect
The introduction of the Copy Theory of Movement in the Minimalist Program [
1] paved the way for accounts of violations of grammatical principles involving copies, rather than the moved categories occupying positions at the head of their movement chains. The typical example of such a relation involves reconstruction of wh-movement—that is, interpretation of a copy left behind in the VP-internal position:
- (1)
a. *[Which claim [that John1 was asleep]] was he1 willing to discuss t?
b. [Which claim [that John1 made]] was he1 willing to discuss t?
This pair of examples comes from [
8,
9] and illustrates the so-called Lebeaux Effect (LE) or Late Merge. One would have expected that if the position of t in (1a,b) is filled with copies of the wh-phrase (the operator with the NP-restriction), so both (1a) and (1b) should be excluded as Condition C violation, as in both cases the pronominal subject c-commands a coindexed name embedded within the clause following
claim. However, Lebeaux points out that the two clauses following
claim differ in their status; while the clause in (1a) is a complement clause, the one in (1b) is a relative clause. He proposes treating the complement clause as an argument of the noun
claim, its complement, while the relative clause receives the treatment of an adjunct. This distinction has profound consequences for the timing of the merger of the two clauses with
claim. Refs. [
8,
9] argues for an early, cyclic merger of complements by invoking the Projection Principle of Chomsky (1981) [
13]:
- (2)
The Projection Principle: The subcategorization property of lexical items must be satisfied throughout the derivation.
The complement clause in (1a) is merged with claim cyclically, at the stage of creating the complex NP. Thus it finds itself in the c-command domain of the subject pronoun co-indexed with John, which triggers a violation of Condition C. If the wh-movement leaves copies behind, this violation persists. The relative clause is not subcategorised for by the noun claim, so it in fact it can be merged with it after the movement of which claim to [spec, CP]. However, if so, the copy of the wh-phrase (which claim) does not contain John in the c-command domain of the pronominal subject and Condition C is not violated.
Chomsky proposes accounting for the distinction between (1a,b) through the notion of the Preference Principle:
- (3)
The Preference Principle [
1], (p. 209): To account for the judgements [in (1a,b) above, J.W.] it is only necessary to add a preference principle for Reconstruction: do it when you can (i.e., try to minimize the restriction in the operator position).
The Preference Principle implies that in A-bar movement chains the bottom copy should be interpreted for binding relations, with the complement clause triggering Condition C violations but a relative clause absent from this position. The Principle is not inviolable and can be disobeyed when other relations relevant to the convergence of a given derivation should force its violation; see (11) below.
The empirical scope of a late merger is expanded further, beyond the argument/adjunct distinction pointed out by Lebeaux, to a wider context of syntactic movement. Takahashi and Hulsey (2009) [
14] developed the hypothesis of a wholesale late merger, which goes a long way toward explaining the difference between reconstruction in A/’A-chains, different types of A’-chains, and QR. For instance, they provided a principled account of the standard observation that, while A-bar movement obligatorily leaves a trace and tends to reconstruct, A-movement seems to leave a trace only optionally. They followed [
3,
4] and relied on the procedure of Trace Conversion. Leaving aside the details of the justification for their proposal, Hulsey and Takahashi claimed that their LF Interpretability approach predicts that a restrictor of an operator/determiner should be able to be merged with this operator/determiner countercyclically (a wholesale late merger) unless other properties of the grammar prohibit this option.
1 This is the crucial difference between a wholesale late merger and Lebeaux’s late merge in (1) above. In concrete terms, they provide for a late merger in the following manner:
- (4)
a. Merge as late in the derivation as possible not only adjuncts but also NP restrictions on quantifiers and determiners [
13] (pp. 396–397).
b. DPs as a whole demand a case because both determiners and nouns, which constitute DPs, must receive a case [
13] (p. 401).
An analysis of the following example serves as a brief illustration of their system:
- (5)
??*[which argument that John1 is a genius] did he1 believe t?
There are two possible derivations of this construction at early stages:
- (6)
a. [VP believe [which+case]]
b. *[DP which+case [NP argument-case that John1 is a genius]] did he1 [VP believe [DP which+case]]] Condition (4b) violated
c. *[vP he1 [VP believe [DP which+case [NP argument+case [that John1 is a genius]]]]]
Binding Condition C violated
In the first option, the late merger of the NP-restriction applies in (6a), so only the wh-operator is merged with the verb within VP. At a later stage, this wh-operator has its case valued by v-V. In the final part of this derivation in (6b), the NP-restriction to the wh-operator is late-merged but only in the position of [spec, CP]. Thus, there is no structural environment for a Condition C violation, but the NP restriction (argument) has no case (it was absent from the vP domain when the wh-operator had its case valued) and the derivation crashes. The second possibility involves an early merger of the NP-restriction with the wh-operator in the domain of the verb phrase in (6c), which provides for case valuation of the NP-restriction but involves a cost in the form of a Condition C violation, on the strength of the Projection Principle in (3).
In brief, [
13] predicted that wh-movement requires reconstruction of the NP-restriction in the VP for the reasons of case.
2 According to standard GB lore, case is assigned at the foot of the A-bar chain. Since case (for both the operator and its NP restriction) is valued at the head of the A-chain, no reconstruction is expected in this case:
- (7)
[every argument [that John1 is a genius]] seems to him1 t to be flawless.
- (8)
[TP T [VP [DP every+case [NP argument+case that John1 is a genius]] [VP seems [to him1] [TP [DP every-case] to be flawless.
So, the wholesale late merger, with its insistence on the assignment of case to both the wh-operator/determiner and the NP restriction, correctly captures the distinction between A-bar movement and A-movement. In the former, case is assigned to elements within the bottom copy of the movement chain, while in the latter, it is assigned to the copy at the landing site of movement. Thus, the presence of the NP restriction (and whatever further elements it contains) at the bottom position in the A-chain is redundant. Consequently, A-movement is not expected to bring about reconstruction effects.
Recently, the debate on reconstruction with wh-movement has reappeared on the research agenda in the context of the debate on the nature of syntactic displacement. Syntactic movement can be presented as either a sequence of copies of the moved element, the Copy Theory of Movement [
1], or a representation where a single element is dominated by multiple nodes (Multiple Domination (MD)) in (Gärtner 2002 [
16], Franks 2017 [
17], Citko and Gračanin-Yuksek 2021 [
18] and Citko 2021 [
19]).
- (9)
[CP [wh…] [TP … [VP V … {wh …}]]]
- (10)
Although the MTC stems from mainstream minimalist theorising, MD appears to be its attractive junior sibling, as it seems to avoid several problematic aspects of overt movement, specifically ones connected to successive cyclicity. As ref. [
16] argues, an MD approach (or multiattachment, as he prefers to call it) reduces the tension between overt and covert movement and addresses the problem of irrelevance of intermediate landing sites.
3 Ref. [
15] highlights the ‘resource problem’ caused by copies at the bottom of movement chains containing features that require checking/valuation at a later stage than the categories moved overtly (assuming there is a distinction between strong and weak features). Multiple attachment of one syntactic object to various addresses in the phrase marker, as shown schematically in (10), avoids these problems.
Larson (2016) [
20] brings up an argument in favour of the CTM based on the Lebeaux Effect: bleeding of the Principle C effect is detected when an adjunct (relative clause) is pied-piped with the moved wh-DP, but feeding of the said Principle C takes place when a complement is pied-piped. Assuming that copies cannot be interpreted in parts in scattered positions, Larson points out that LE can be captured by the CTM, where the copies may be distinct from one another in size as a result of late adjunction of the relative clause. As a result, the offending name embedded in the RC is absent from the bottom copy and Principle C is ameliorated. Larson points out that an MD-inspired account of movement faces a challenge in this case, as it provides for one and the same constituent, including the RC, to be attached to two distinct nodes.
Larson acknowledges the fact that both systems make similar predictions in the case of the satisfaction of Binding Condition A:
- (11)
John1 wonders [[which picture of himself1/2] Bill2 took [which picture of himself1/2]]
As is well known from [
1], the CTM uses different copies (terms) for the distinct LF readings of this construction: the bottom copy in the wh-chain in VP is interpreted when the picture shows
Bill and the meaning may be idiomatic (the components of the idiom
take the picture {of oneself} must be interpreted in an adjacent position at LF), while the copy in [spec,CP] is interpreted when the picture shows
John. In this case, the idiomatic reading is excluded. The general idea is that scattered interpretation of copies is unavailable. The MD can handle this correlation between interpretations and positions of attachment through the following postulate:
- (12)
If position 1 of element X is interpreted, then all elements Y that X dominates are also interpreted in position 1 unless elements Y displace to a yet higher position [
20] (p. 13).
However, when satisfaction of Binding Condition C comes into play, differences between the CTM and MD show up. In this context, ref. [
20] brings up an argument in favour of the CTM based on the Lebeaux Effect (LE):
- (13)
a. [which picture [RC that Bill1 hated]] did he1 say that Mary2 took {x picture}?
b. *[which picture of Bill1] did he1 say that Mary2 took {x picture of Bill1}
Assuming that copies cannot be interpreted in parts in scattered positions, Larson points out that LE can be captured by the CTM, where the copies may be distinct from one another in size as a result of late adjunction of the relative clause (RC). As a result, the offending name (Bill), embedded in the RC, is absent from the bottom copy in (13a) and Principle C is ameliorated. An MD-inspired account of movement faces a challenge in this case, as it provides for one and the same constituent, including the RC to be attached to two distinct nodes.
Ref. [
20] (pp. 13–14) observes that CTM accounts can rely on the late adjunction of the relative clause to capture these antireconstruction effects. The higher copy can be manipulated independently of the lower one and, if the relative clause is appended to the higher copy only, no Principle C effects are predicted. MD accounts can also resort to late adjunction, but the adjunct is adjoined late and low as well as high and the lack of Principle C effects is not predicted, simply because the name embedded in the RC in the position of low attachment in (10) ends up being c-commanded by the subject pronoun and postulate (12) prevents any splitting of wh-phrases in MD accounts.
3. The Lebeaux Effect Reconsidered
As MD fails to deal with reconstruction for Condition C, its proponents seek ways to undermine the LE and its predictive power. One option is to question the validity of a late, countercyclic merger in any shape or form.
Sportiche (2019) [
21] provides an argument that a late merger is an extremely unwelcome operation for narrow syntax, as it presupposes unlimited countercyclicity and may lead to serious overgeneration. He points out that, while a late merger is taken to provide the right solution in case of local movement, it can also apply across a large number of phrasal boundaries and derivational phases (Chomsky 2000 [
22], 2001 [
23]):
- (14)
Which villages near Picasso1′s estate did he1 visit t?
- (15)
[which villages {near Picasso1′s estate}] did he1 visit [which villages]
- (16)
[whose criticism of [Mary’s rendition of (…) the claim that you formulated (…) the hypothesis [that Henri [ visited the villages {near Picasso1′s estate}]]]]2 did he1 endorse t2?
In (14) and (15), showing its alleged LF form, a late merger seems manageable, but in (16) every element below the head noun criticism is a complement of a complement, so it must be merged cyclically. Only at the very bottom is there the PP adjunct ({PP near Picasso’s estate}). However, in order to access the lowest complement domain, the operation of the late merger of the adjunct needs to penetrate a number of phrases and derivational phases in an unbounded manner. As Sportiche points out, this is a very unwelcome effect in the context of current minimalism relying on phase-based operations.
Furthermore, Sportiche observes that, once taken seriously, a late merger should not be limited to an External Merger (as in the discussion initiated by Lebeaux) but also Internal Merge (Move), if both operations are said to be nearly identical and distinguished only by the source of the constituent merged in. So, the countercyclic nature of a late merger may lead to overgeneration and allow for structures resulting from illicit movement. Sportiche considers the following examples:
- (17)
[near Paris] John thinks … that you live t.
- (18)
*near Paris John said [ … that you live t] [… when a plane crashed t]
(17) shows the regular case of adjunct movement following its cyclic Internal Merge. The presence of the trace/copy is required for appropriate computation of the scope of modification for the adjunct (so [
PP near Paris] modifies the act of living rather than thinking). However, if the merger in Move is, to all intents and purposes, the same as the merger in External Merge in (16) above, and if it is practically unbounded, the door is open for licit derivations of illicit examples, as in (18). Here, the adjunct can first be merged in its overt position at the head of the syntactic object, and only then it can be late merged inside the two bracketed constituents. The lower copies then subsequently undergo trace conversion (as in Fox 2002) for the purpose of LF interpretation. Ultimately, (18) shows a nonexistent parasitic gap construction headed by a PP. An unlimited late merger certainly overgenerates in this case.
4Ref. [
21] concludes that any form of late merger should be avoided in the grammar and all operations should follow the derivational cycle.
5 If so, the CTM loses its edge over MD on this count.
Another option is to deny the difference between arguments and adjuncts in wh-reconstruction ([
10,
16]). Ref. [
16] points out empirical inadequacies of this distinction on several counts. Let me mention two of his arguments. First, he observes that in German a name embedded in the restriction of the wh-operator that is an argument does not violate Condition C, although it should and does so in the affirmative clause (19):
- (19)
[Wessen Behauptung [dass Maria1 krank sei]]2 hat sie1 (durch ihre1
whose claim that Mary ill be has she through her
Anwesenheit) t2 wiederlegt?
presence refuted
‘Whose claim that Mary was ill did she refute (through her presence).’
- (20)
*Sie1 widerlegte Peters Behauptung [dass Maria1 krank sei].
So, the argument/adjunct distinction appears to be heavily language-dependent.
6Second, he provides evidence that certain adjuncts cannot be merged late in the derivation. He mentions the classic ‘that trace effect’—the trace/copy of the wh-extracted subject cannot be linearly adjacent to the lexical complementiser:
- (21)
*Who1 do you think [that t1 will vote for Smith]
- (22)
Who1 do you think [that [after all things are considered] t1 will vote for Smith]
As (21) shows, the ‘that-trace’ effect disappears when an adjunct immediately follows the complementiser but precedes the (thus far illicit) trace. However, for the adjunct to perform its rescue mission vis-à-vis subject extraction, adjunct placement must be cyclic—that is, it must follow the formation of TP but precede wh-movement. This is necessary for both ECP-inspired accounts, such as [
13], Chomsky (1986) [
26], Aoun, Hornstein, Lightfoot and Weinberg (1987) [
27], and Rizzi (1990) [
28], and minimalist accounts like Pesetsky and Torrego (2001) [
29]. Thus, the conclusion that Gärtner draws is that the concatenation of both the adjunct and the argument with the syntactic object under construction must be cyclic, as per [
16] (p. 51):
- (23)
There are no countercyclic operations.
In the same vein, an even more radical approach to the argument/adjunct distinction in wh-movement reconstruction is taken in [
10]. To put it bluntly, the authors conclude that the Lebeaux Effect finds little empirical confirmation. Bruening and Al Khalaf conducted a series of experiments (online acceptability judgement studies) showing that there is little difference in acceptability between expected reconstruction with NPs within arguments and antireconstruction with NPs within adjuncts. At the outset, they noted that the contrast captured by the Lebeaux Effect was not as strong as expected and, even in light of the wh-reconstruction debate in the late 1990s, many authors pointed out that this contrast would not come out as strongly as expected; see [
10] (p. 250).
- (24)
Which witness’s attack on John1 did he1 try to get expunged t from the trial records?
- (25)
Which artist’s portrait of Nixon1 do you think he1 liked t best?
- (26)
Whose criticism of John1 did he choose to ignore t?
The most illustrative empirical argument against the Lebeaux Effect comes from an empirical study in which four similar examples were considered, two involving affirmative structures and two involving embedded interrogatives (one including an NP in the argument part of the wh-restriction and the other involving an NP in the adjunct part of the wh-restriction). The embedding of the question was necessary for controlling the readings accessed by the subjects:
- (27)
A female staffer told everyone which of the announcements that Hillary Clinton was running for president she had actually authorised.
- (28)
A female staffer told everyone which of the announcements that Hillary Clinton had tried to take back she had actually authorised.
- (29)
A female staffer told everyone that she had actually authorised one of the announcements that Hillary Clinton was running for president.
- (30)
A female staffer told everyone she had actually authorised one of the announcements that Hillary Clinton had tried to take back.
The answers relevant for the experiment can be presented in the following manner [
10] (p. 253):
- (31)
Empirical tests of the Lebeaux Effect
No Wh Arg | No Wh Adj | Wh Arg | Wh Adj |
4.7% | 2.7% | 42.7% | 56% |
Where there is no wh-movement, subjects chose the answer violating Condition C at a rate of 4.7 and 2.7%, respectively. In the examples with wh-movement, the subjects chose the Condition C-violating answers in nearly 50% of cases for both the argument- and adjunct-contained NP restrictions. If the Lebeaux Effect were genuine, a much lower percentage of the former and a much higher percentage of the latter should be expected. However, the results in (30) show that the Lebeaux Effect, on which so much theoretical literature has relied, does not have empirical confirmation. But the authors move further. The examples in (27–28) demonstrate wh-reconstruction/antireconstruction facts with the type of wh-constructions originally discussed by Lebeaux—that is, sizeable restrictions on wh-operators including clauses. Bruening and Al Khalaf also considered other types of frequently discussed constructions, involving PP restrictions postmodifying nouns. They used an analogous set-up of four closely corresponding examples and their results were very similar to the results of the first experiment: interpretations excluded by Condition C are marginal with nonmovement examples and account for 30% of the answers with both argument and adjunct PPs. So, again, the argument/adjunct status of the PP does not seem to make a difference. However, the difference that constantly shows up is between examples with no wh-movement and examples with wh-movement. The movement itself consistently ameliorates Condition C effects.
However, it would be spurious to expect that no effects of reconstruction with movement are present. The authors show that such effects do not show strongly only when the (wh-) fronted elements are DPs (with restrictions of differing seizes (CP, PP), but when the PP itself is fronted, it seems to reconstruct; see (32). Also, fronted AP predicates require reconstruction, as names embedded in them cannot be co-indexed with subject pronouns; see (33):
- (32)
*Near Dan1, he1 saw a snake t.
- (33)
?* How afraid of Margaret1 do you think she1 expects John to be t?
On the basis of their experiments, the authors formulated the following observation:
- (34)
Reconstruction for Binding Condition C:
Where a phrase XP with head X occupies the head of an A-bar chain:
if X is in category V, P, or A, X reconstructs along with the head Y of its complement YP;
if X is in category N, only X reconstructs; none of its arguments or adjuncts do.
The postulate in (34) is supposed to capture the effect of embedding, generally perceived as facilitating antireconstruction (bleeding of Condition C). Any NP embedded in a postmodifier of the NP restriction, as well as any NP embedded in a postmodifier of the complement of the AP/PP restriction, is supposed to bleed Condition C. For instance, when the offending name is embedded further in the AP, the acceptability of the example increases dramatically; see (35):
- (35)
[How pleased with the pictures [Pollock1 painted in his youth]] do you think he1 really was t?
Clause (34b) means that ‘bare’ NP restrictions of wh-operators reconstruct, while whatever is further merged with them, either as argument or adjunct, does not:
- (36)
*[which girl1] does she1 claim t has seen the unicorn?
- (37)
[which portrait of the countess1] does she1 consider t to be the most valuable?
In fact, they propose that dependents of the nominal head may have the semantics of either arguments or adjuncts, but, syntactically, they function as adjuncts. For instance, unlike verbal arguments, they are optional. Bruening and Al Khalaf referred to the findings of Grimshaw (1990) [
30], where the mixed argument/adjunct properties of nominal dependents are discussed at length. In general, Grimshaw argues that, unlike verbs, nominals do not assign thematic roles. Only complex event nominals behave like verbs in the sense that they have argument structure and require that their arguments be projected in syntax in line with (2). However, even complex event nominals do not assign thematic roles and require assistance from prepositions to do so. Simple event nominals and result nominals are different from complex event nominals in the sense that they do not have an argument structure and are semantically compatible with modifiers whose presence is not obligatory. Grimshaw notes that certain nouns may be ambiguous between the complex event and result readings, and the presence of the possessive subject somehow forces the complex event reading; see (38a,b). The clausal dependent of the nominal typically points to its result nominal status and it can be separated from it by the copula verb, which is taken to be a property of adjuncts (modifiers), such as (38c), rather than arguments, not only in [
30] but also in Stowell (1981) [
31] and Schutze (1995) [
32], see (38d,e):
- (38)
a. *the instructor’s deliberate examination.
b. The instructor’s deliberate examination of the patient
c. the red car/the car is red
d. [the explanation [that he was temporarily insane]]
e. Paul’s explanation was [that he was temporarily insane]
Significantly for the matter of reconstruction properties of nouns embedded in appositive clauses, [
31] (p. 200) formulates the following view:
- (39)
‘The derived nominals guess, claim and explanation do not refer to the action of guessing, claiming and explaining something, as is true for the verbs upon which they are based. A claim refers to the thing that is claimed, rather than to the act of claiming. The derived nominal heads actually refer to the same thing that their ‘complements’ do: the object argument of the verb. The relation between the derived nominal and its ‘complement’ is actually one of apposition, rather than theta role assignment.’
Against such a background of the debate on differences between verbal predicates and nominal heads in theta role assignment and argument licensing, Bruening and Al Khalaf conclude that nominals do not license arguments in syntax. The gist of their account and explanation for the findings of their experiments can be presented as three points:
- (40)
a. Dependents of nominal heads (both CPs and PPs) function like syntactic ad juncts.
b. The copy mechanism involved in the derivation is constrained by economy in such a way that it will copy as little as possible for convergence (Minimal Copy). Minimal Copy retains the complements but leaves out adjuncts.
c. The derivation runs from top to bottom or left to right, so the ‘pruned’ copies find themselves at the bottom of the syntactic object.
The prevailing lack of reconstruction effects evident from the experimental part of the study is accounted for by the mechanism detailed in (34) and (40). For example, in the following case ([
10] (p. 265)), Condition C is bled due to the application of Minimal Copy:
- (41)
[CP [DP which corner [PP of John1′s room]] was [TP he1 sitting in {which corner}]]
The pruned copy of the wh-restriction does not contain the PP dependent (treated as an adjunct) that contains the offending name. Because the derivation is said to proceed from left to right, just like the order of parsing, the original wh-phrase is placed at the left periphery of the clause, and the (pruned) copy is placed inside the VP, in the c-command domain of the subject. Significantly, the combination of clauses (40a) and (40b) brings about reverse effects to Chomsky’s Preference Principle in (3); the restriction of the wh-operator is minimised in the launch position of the movement chain (in terms of the mainstream bottom-to-top derivation) and is maximised in the position that the operator occupies in overt syntax.
Minimal Copy does not cause pruning of the copies of arguments/complements of non-NPs. Thus, AP contains them in full in the position c-commanded by the subject; hence the feeding of condition C in (33) above. However, as soon as the offending name is embedded further in an adjunct (a relative clause), as in (35), Minimal Copy sets in and ameliorates the expected Condition C effect.
Finally, Ref. [
10] (pp. 268–269) addressed the question of why, if there is no argument/adjunct distinction with reconstruction, the issue of movement and Condition C reconstruction has dominated the research agenda for so long, with many authors treating the Lebeaux Effect as a fact and repeating it in their own intuitive judgements. They submit that this may be due to a strong pragmatic bias against coreference between two referential DPs, which goes beyond sentence grammar and operates at the discourse level. As they show, coreference between the two names in the following example involving separate clauses is not acceptable:
- (42)
He1 came in. John1 sat down.
This ban on coreference can be lifted once the example is manipulated with and first a pronoun referring back to the name is introduced into the discourse and the name may be used again, but in a slightly different context (for instance, in a coordinate structure):
- (43)
Mary had been waiting for John1 at the back of the room. Finally, he1 came in. Then John1 and Mary sat down, but not together.
Bruening and Al Khalaf provide for the construction of the syntactic object (through External and Internal Merge) in the top-to-bottom, left-to-right fashion, which is far from minimalist mainstream views. However, multiple copies of different sizes are still necessary for this system to operate. Thus, their proposal retains, rather than weakens, the need for creating movement chains including multiple copies.
Ref. [
19] proposes a solution for the MD account that predicts antireconstruction with wh-movement and at the same time satisfies the structural requirement in (12). She reviews a number of options based on a parallel merge of some nodes in the nominal domain to both the selecting V inside its VP (to produce the effect of the copy at the bottom of the A-bar chain), and to further projections in the nominal domain that would eventually constitute the full wh-phrase. The wh-phrase, for a while built in a separate workspace, is ultimately linked to the CP area of the clause. Ultimately, Citko opts for the following structure:
- (44)
[which claim that Mary1 made] did she1 regret t?
- (45)
[VP regret [D0] [DP D0 [NP [NP claim] [CP that Mary made]]]
The part common to both phrase markers at this stage is the empty Determiner head D0 (marked in grey), merged in parallel with both V (regret) and its DP projection (while D0 is dominated by VP, DP is outside VP, in a separate workspace). The CP relative clause is adjoined to NP. Both workspaces teem with derivational activity and, at a later stage, the following objects are formed in them:
- (46)
a. [CP C [TP she [T’ T [vP she [v’ v [VP regret [D0]
b. [QP Q [DP which [D’ D0 [NP [NP claim] [CP that Mary made]]]]].
The object above VP projects to the full clause with the interrogative C at its head. The
D0 projects further, with the interrogative determiner merged in at [spec, DP] and a separate Q head merged on top of the DP. The two objects are still separate (except for the link at
D0) and become merged with QP, linked to CP at the final step of the derivation. Where does the antireconstruction effect come from? Ref. [
19] (p. 26) proposes the following notion of derivational c-command:
- (47)
Derivational c-command:
On this definition, the pronominal subject (she) does not c-command Mary embedded within the relative clause because the relative clause is merged with the main spine of the projection only at the CP level, as part of QP. To all intents and purposes, this produces an MD equivalent to a late merger. Certainly, the CP tier of the structure is beyond the c-command domain of the subject. The key component of this account that leads to avoiding Condition C is the merger of the empty head D0, which projects further in a parallel workspace to be concatenated with CP above the position of the subject. The D0 is silent, so the entire QP can be pronounced as a single category at the head of the clause, with D0 satisfying the selection requirements of the verb. This account shows that, in fact, it is possible to have one’s cake and eat it.
This account appears to be successful for the matter at hand but requires at least two caveats (and is subject to further comments in the section below). First, it makes the prediction that there should be no reconstruction effects for Condition C with nominal phrases at all; empty
D0 is not accompanied by any NP restriction in its VP-internal position. Yet, as (36) shows, the NP restriction itself (stripped of any dependents) does produce Condition C effects stemming from reconstruction (taken to be the interpretation of the VP-internal copy). One wonders how this fact can be captured by the derivational mechanics proposed in [
19]; as it stands, the silent Determiner account cannot predict the ungrammaticality of (36).
Second, this account, unlike an account based on multiple copies, cannot be extended to movement of other types of phrases. Ref. [
10] observed that A-bar movement of APs and PPs shows effects of reconstruction; see (32–33). In the parlance of the MD architecture proposed in [
19], this implies that these categories cannot be linked to some VP-internal position only through a category with a silent head that (for some derivational time) projects in a detached workspace. Probably in these cases, a more substantial part of the AP, PP, and VP is attached cyclically in the lower position.
The discussion above has shown that the foundations of the Lebeaux Effect have been shaken, if not undermined completely. However, wh-reconstruction is a complex phenomenon, and while the LE has been one of its pillars, there are other facts that require explanation, resting on the need for multiple copies.
4. Challenges to the Minimal Copy Approach
This section presents four phenomena concerning connectivity effects that pose problems for the proposal that copies in the VP-internal positions should be maximally minimal: reconstruction required by Condition A, the correlation between D-linking and reconstruction, the correlation between antireconstruction and obviation effects, and the contrast between reconstruction with respect to the subjects versus reconstruction of the object.
4.1. Reconstruction of Condition A Reconsidered
The first challenge comes from (11), repeated below for convenience:
- (48)
John1 wonders [[which picture of himself1/2] Bill2 took [which picture of himself1/2]]
My point of interest is the interpretation marked as 2, where Bill serves as the antecedent for himself and where the idiomatic interpretation of ‘take a picture’ is available. How can these interpretations be derived in the approach involving parallel merge and multidomination or Minimal Copy? If the derivation of (48) follows the same guidelines as the derivations in (41) or (46), these proposals predict that the subject of the embedded clause should c-command only the wh-operator and its NP restriction (which picture) or a silent Determiner:
- (49)
a. [TP Bill2 took [which picture]]
b. [TP Bill2 took [ D0]]
These representations do not include the lexical anaphor at the point in the derivation where it is c-commanded by its antecedent, so interpretation 2 is impossible to derive.
However, this is not an inescapable conclusion. Proponents of these analyses could claim that derivational mechanisms applied to cases of Condition A reconstruction and Condition C reconstruction should be distinct from each other. However, from the point of view of the satisfaction of Binding Conditions A and C, such derivational mechanisms should be radically distinct: while Condition A requires that the complement to the NP restriction (of himself) should be present at the derivational stage in shown in (49a), Condition C requires it to be absent. So, it appears that it would have to be a Minimal Copy proposal for the combination of wh-movement and Condition C cases and a ‘Maximal Copy’ approach for wh-movement and Condition A cases.
The Copy Theory of Movement, as proposed in [
1], treats both types of constructions in an equivalent manner; both involve copies of the NP-restriction on the wh-operator that need to be present in narrow syntax. Subsequently, the Preference Principle applies and feeds particular interpretations. The Preference Principle is far from perfect at accounting for connectivity phenomena in a precise manner (as the preceding sections show), but its use of multiple copies constitutes a common denominator for both (48) and (1). The novel proposals in [
10,
19] remove this common denominator.
4.2. Connectivity Effects and D-Linking
Another problematic issue concerns only Condition C and the correlation between the semantic type of the wh-phrase and Condition C effects. Heycock (1995) [
34] argues that wh-operators evoking specific or definite interpretations of their restrictions, typically
which, are interpreted with their restrictions in the position of wide scope (out of the vP domain). Such wh-phrases are D(iscorse)-linked in the sense of Pesetsky (1987) [
35].
7 The wh-operators evoking nonspecific interpretations of their restrictions, such as
how many in the cardinal reading, are interpreted with the restriction taking narrow scope (vP-internal). Now, this distinction is strongly correlated with connectivity effects, specifically in constructions with verbs of creation:
8- (50)
[Which allegations about John1]2 do you think he1 will deny t2?
- (51)
*[How many stories about Diana1]2 is she1 likely to invent t2?
- (52)
*[How many lies [aimed at exonerating Clifford1]]2 is he1 planning to come up with t2?
While an account of reconstruction based on Minimal Copy or parallel merge of a silent Determiner predicts antireconstruction effects for (50), where the wh-phrase is D-linked, takes wide scope, and is interpreted with its restriction outside vP, it does not predict the unacceptability of (51–52). For these examples to be struck out by Condition C, the copy in the vP-internal position, c-commanded by the pronominal subject, must be bigger than just the operator and the NP-restriction (how many stories/how many lies). In contrast, a syntactic representation of movement involving multiple full-size copies can serve as input to interpretive mechanisms that derive the differences between (50) and (51–52).
4.3. Antireconstruction, Distance, and Obviation Effects
The discussion of reconstruction has shown that antireconstruction effects often appear in structures showing obviation, where the subject of the embedded clause bears a different index from the pronominal subject of the main clause. For instance, ref. [
25] reports the following facts for Polish:
- (53)
*on1 odkurzył [zdjęcia Piotra1]
heNOM dusted off picturesACC PiotrGEN
intended: ‘He dusted off Piotr’s pictures.’
- (54)
*on1 chce [żeby Maria2 odkurzyła [zdjęcia Piotra1]
heNOM wants so-that MariaNOM dusted offPRT picturesACC PiotrGEN
intended ‘He wants Maria to dust off Piotr’s pictures.’
A pronominal subject coindexed with a nominal possessor placed in its c-command domain triggers the Condition C effect irrespective of the distance between them; the coindexed pronoun may be either a local subject, (53), or the subject of the superordinate clause, (54). Yet, corresponding examples involving wh-movement of the object do not receive equal acceptability judgements:
- (55)
*[które zdjęcia Piotra1] on1 odkurzył t?
which picturesACC PiotrGEN heNOM dusted off
intended: ‘which pictures of Piotr did he dust off?’
- (56)
?[które zdjęcia Piotra1] on1 chce [żeby Maria2 odkurzyła t]?
which picturesACC PiotrGEN heNOM wants so-that MariaNOM dusted offPRT?
‘Which pictures of Piotr does he want Maria to dust off?’
A plain Condition C effect shows in (55), assuming the copy of the NP-restriction on the wh-operator is interpreted in the position marked with t, in line with the Projection Principle. This effect is considerably ameliorated in (56), where the pronominal subject coindexed with name embedded in the NP-restriction is separated from the bottom copy of this restriction by another subject (Maria).
The effect of the distance between the name embedded in the bottom copy and the impact of obviation on the antireconstruction effects is captured neither by [
10] nor by [
19]. These sources do not take into account the distance of the wh-movement undergoing reconstruction. The operation of Minimal Copy in (36) and (37) or linking just a bare
D0 in the VP-internal position in (44–45) render it impossible to capture the obviation effects, for the simple reason that the content of the material at the bottom of the wh-dependency is radically impoverished. Their prediction is that (55) and (56) should receive equal treatment in syntax. However, it must be admitted that a standard account of reconstruction based on the CTM should not do much better, as it predicts (56) to be as incorrect as (55), with a name in the copy c-commanded by the pronoun. I return to this issue in the section below.
4.4. Antireconstruction Effects and the Subject/Object Asymmetry
As shown in ref. [
25], an interesting subject/object asymmetry shows with wh-reconstruction phenomena in Polish:
- (57)
??[które listy Piotra2 z Paryża] on2 oddał Marii1 po roku?
which letters PiotrGEN from Paris heNOM returned MariaAT after a year
Intended: ‘Which of Piotr’s letters from Paris did he return to Maria after a year?’
- (58)
[które listy Piotra2 z Paryża] Maria1 oddała mu2 po roku?
which letters PiotrGEN from Paris MariaNOM returned himDAT after a year
‘Which of Piotr’s letters from Paris did Maria return to him after a year?’
Although the pronoun in the subject position can force what looks like Condition C reconstruction effects under favourable circumstances, as shown in (57) above, there is little evidence for reconstruction of the wh-phrase to a position below one of the objects in the double object construction. However, in affirmative clauses the first object c-commands the second one and triggers ungrammaticality (Condition C):
- (59)
*Maria1 oddała mu2 po roku [tamte listy Piotra2 z Paryża].
MariaNOM returned himDAT after year those letters PiotrGEN from Paris
As soon as the object is wh-moved, this effect disappears entirely; see (58). Although the Minimal Copy approach may have little to say about the contrast in (57–58), it appears that the standard approach to reconstruction based on the CTM should be equally powerless. As (59) shows, the copy of the direct object in the VP-internal position is c-commanded by the indirect object, so a name embedded in the NP restriction of the operator c-commanded by a coindexed pronoun is expected to cause a Condition C effect in (58), contrary to fact.
However, an account of reconstruction based on the CTM can deal with data in (57) and (58), provided one adopts the hypothesis of Vehicle Change of [
11]. Ref. [
11] develops an analysis of reconstruction in wh-movement, which is inspired by an account of ellipsis presented in [
12]. These authors propose a mechanism of Vehicle Change (VC) through which a name (DP
i/NP
i) embedded in the ellipsis site may be represented as its pronominal correlate (pron
i). They support this premise with the following examples:
- (60)
*Lara loves Sol1 and he1 thinks that Sally loves Sol1 too.
- (61)
a. Lara loves Sol1 and he1 thinks that Sally does [e] too.
b. Lara loves Sol1 and he1 thinks that Sally does {love him1} too.
Clearly, a local Condition B seems to be involved in (61), rather than the unconstrained Condition C, as in the unelided (60). Otherwise, both (60) and (61) should be ungrammatical. Ref. [
11] applied VC to copies in the context of wh-movement. In the spirit of the day, he assumed the copy theory of movement and turned the CTM into a pivotal component of his account. He assumed that names in the copies left by movement can be replaced with their pronominal correlates (the b-examples represent intended copies at LF):
9- (62)
a. [that John1 was under surveillance] he1 never realised t.
b. [that John1 was under surveillance] he1 never realised {that he1 was under surveillance}
- (63)
a. Milt2, [Bill1′s pictures of whom2] he1 was ashamed of
b. Milt2, [Bill1′s pictures of whom2] he1 was ashamed of {his1 pictures of whom2}
Safir’ VC does not apply to every type of nominal phrase; it is constrained in the sense that Quantifier Phrases do not undergo VC, hence the contrast between (63) and (64):
- (64)
a. *Milt2, [everyone1′s pictures of whom2] he1 was ashamed of
b. *Milt2, [everyone1′s pictures of whom2] he1 was ashamed of {everyone1′s pic- tures of whom2}
An account of reconstruction including vehicle change makes a number of predictions; for instance, it predicts that the distance between the pronoun and the offending name (calculated through clausal embedding and obviation effects) affects the acceptability of such constructions. The role of the distance is understandable if the pronominal correlate within the copy is involved, rather than the name, as Binding Condition B has a local domain of application, while Binding Condition C does not. An account including VC explains the data in (56) above when the bottom copy includes the possessive pronoun:
- (65)
[które zdjęcia Piotra1] on1 chce [żeby Maria2 odkurzyła [x jego1 zdjęcia]]
which picturesACC PiotrGEN heNOM wants so-that MariaNOM dusted offPRT [x his picturesACC]
- (66)
On1 chce [żeby Maria2 odkurzyła [jego1 zdjęcia]]
heNOM wants so-that MariaNOM dusted offPRT [his picturesACC]
‘He wants Maria to dust off his pictures.’
If the copy at the bottom of the wh-chain includes the pronominal correlate for Piotr (jego ‘his’), then its grammatical status is as good as the status of the affirmative (66), where the possessive pronoun is coindexed with the main clause pronominal subject.
A brief digression on Polish morphosyntax is necessary before I go any further: local nominative subjects license reflexive possessives, while nonlocal subjects and objects license pronominal possessives.
10 With this digression in mind, the subject/object asymmetry in reconstruction can be addressed by adopting the VC approach. From this perspective, the copy left by movement includes a pronominal correlate to the name (Piotr = him) and its nonmovement equivalent is (68) rather than (59):
- (67)
[które listy Piotra2 z Paryża] Maria1 oddała mu2 po roku [x jego2 listy z Paryża]?
which letters PiotrGEN from Paris MariaNOM returned himDAT after a year [x his letters from Paris]
- (68)
Maria1 oddała mu2 po roku [ jego2 listy z Paryża].
MariaNOM returned himDAT after year those letters PiotrGEN from Paris
Now, it must be said again that the perfect acceptability of wh-constructions with double objects fully correlates with Binding Theory facts in their nonmovement equivalents. Although Polish (nominative) subjects act as strongly preferred antecedents to reflexive pronouns and reflexive possessives, Polish objects do not do so. Significantly, an object in Polish acts as an antecedent for a co-indexed pronominal possessive in the other object, while the local subject in Polish licenses a coindexed reflexive possessive swój ‘self’s’; see note 10:
- (69)
*on1 odkurzył [zdjęcia Piotra1]
heNOM dusted off picturesACC PiotrGEN
intended: ‘He dusted off Piotr’s picture.’
- (70)
*on1 odkurzył [jego1 zdjęcia]
heNOM dusted off [okself’s/*his picturesACC]
intended: ‘He dusted off his picture.’
- (71)
*[które zdjęcia Piotra1] on1 odkurzył t?
which picturesACC PiotrGEN heNOM dusted off
- (72)
*[które zdjęcia Piotra1] on1 odkurzył [x jego1 zdjęcia]
which picturesACC PiotrGEN heNOM dusted off [x his pictures]
Thus, vehicle change is not sufficient to satisfy Binding Theory in this case; although it ameliorates a Condition C effect, it leads to a violation of Condition B: a possessive pronoun cannot be coindexed with a local subject. However, VC returns a perfect result when a possessive pronominal is co-indexed with a c-commanding object.
A potential counterargument can be raised against the analysis above on the grounds of lax word order discipline between the objects in Polish; with most verbal ditransitive predicates, they can appear in either order. If so, the object undergoing wh-movement (wh-o1) could be moving from a position above the other object (NPo2) and thus be out of the scope of its c-command domain altogether (assuming, for instance, that the lowest leg of movement constitutes an A-chain, so wh-o1 is invisible to NPo2). It would then only reconstruct to the top position in its A-chain:
- (73)
[CP wh-o1 [TP NPs [vP wh-o1 [v’ NPs [v’ v [VP NPo2 [V’ V wh-o1]]]]]]]
However, the examples above contain a safeguard against such a possibility. In (67–68), the indirect object is a clitic/weak pronoun that must be adjacent to the verb in the surface representation in Polish, so it must serve as the first of the two objects in the linear order with, possibly, the one c-commanding the other.
Furthermore, ref. [
11] claims that VC does not apply to QPs. Now, it should be possible to construct an example where VC is arrested and an effect of Condition C caused by the c-commanding is evoked. The following set of data seems to confirm this proposal. (74) shows a Strong Crossover effect (Condition C) with a relevant co-indexation:
- (74)
Maria1 pokazała mu*2/3 wczoraj [pięć zdjęć każdego profesora2]
MariaNOM showed himDAT yesterday five pictures every professorGEN
‘Maria showed him five pictures of every professor yesterday.’
The ungrammaticality persists with the relevant co-indexation when the QP is embedded in a wh-phrase:
- (75)
[ile zdjęć każdego profesora2] Maria1 pokazała mu*2/3 wczoraj t?
how many picturesACC every professorGEN MariaNOM showed himDAT yesterday
‘How many pictures of every professor did Maria show him yesterday?’
- (76)
[ile zdjęć każdego profesora2] Maria1 pokazała mu*2/3 wczoraj [ile zdjęć każdego profesora2]?
how many picturesACC every professorGEN MariaNOM showed himDAT yesterday [x picturesACC every professorGEN]
The distributive reading of mu ‘him’ is unavailable in this example. Its unavailability can be credited to the same Crossover effect as in (74), because the QP is commanded by the pronoun coindexed with its restriction (within the copy) and the QP does not undergo vehicle change.
A lack of reconstruction effects with double object structures appears to evade the explanation in [
10] based on (41) and the explanation that ref. [
19] devises for the LE in (46) above, if my reading of these proposals is adequate. In both cases, the content of the VP-internal category is too limited to accommodate the facts above. Both proposals predict that there should be no distinction between reconstruction of the NP-restriction of the wh-phrase vs. the subject pronoun and vs. the object pronoun. However, the distinction is genuine, with some reconstruction effects vs. the subject pronoun but no reconstruction effects vs. the object pronoun. Take the MD-compatible solution proposed in (46). Crucially, the articulated QP is attached at the CP level, thus remaining outside the c-command domain of the subject pronoun in [spec, TP]. If an identical derivation were to be applied to ex. (57), with the possessive present only within the semi-attached QP, later to be attached to the CP area, the possessive should remain outside of the c-command domain of both the object and the subject, but this does not seem to be reflected in the speakers’ judgements.
One more point: Bruening and Al Khalaf credited connectivity effects to pragmatic factors such as reluctance to use two co-referential nominal phrases a short distance from each other within the same discourse fragment, irrespective of such sentence grammar notions as c-command or Condition C. However, the phenomena of obviation and the subject/object asymmetry mentioned above in
Section 4.3 and
Section 4.4 belong to a set of well-defined and robust factors in syntactic studies.