1. Introduction
The access time for memory is a critical element in computing. Memory access is composed of address decoding, word-line (WL), and bit-line (BL) delays, in addition to data output. Among these, WL and BL delays are limiting factors, especially for devices with large memory arrays, such as NAND flash and storage class memories. Pre-emphasis (PE) pulses are design techniques to reduce the access line delay of 3D NAND devices [
1] and large flat panel displays [
2]. By driving large RC delay lines with a pulse whose initial period is made with a voltage higher than the target voltage, the entire WL delay time can be reduced significantly, where the delay time is defined by the farthest point of WL. Circuit analyses have been discussed to design PE pulses for minimizing the WL delay time in [
3,
4,
5]. PE pulses are also effective for memory devices used for storage or 3D cross-point memory [
6,
7,
8,
9,
10], where the WL delay is determined by the point at the selected cell. The delay time strongly depends on the column address. Thus, an address-dependent optimum pulse width of the PE pulse can significantly reduce the delay time according to the position of the selected memory cell across a selected WL [
11]. All the aforementioned studies have focused on PE pulse design for reducing WL access delay. Another interesting factor is whether PE pulses can be also effective for BL access, which is the motivation of this paper. Unlike WL, with no direct current path, BL has two current states depending on the cell states. Two sensing operations are known, namely, shielded BL [
12] and all BL [
13] operations.
Figure 1 illustrates a simplified BL path of NAND flash. The value of parameter x represents the position of a selected cell along the BL. The nearest and farthest cells have x percentages of 0% and 100%, respectively. After one WL is selected with 1 V in this example, keeping all the remaining WLs grounded, the BLs are pre-charged with PASS high. Conventional BL path operations are performed with a step pulse for PASS in both shielded BL [
12] and all BL [
13] operations. This paper investigates BL delay with a PE pulse for PASS, as shown by the waveform in blue. An overdrive voltage during the PE pulse that is 20–30% higher than the target voltage is effective to reduce the voltage delay [
3,
4]. Therefore, the BL voltage at the input terminal of the sensing circuit is assumed to be 600 mV during the PE pulse and 500 mV after the PE pulse in this paper. BL voltages go up to lower and higher levels via source follower PASS transistors depending on the cell states. In this paper, data “1” and “0” correspond to higher and lower cell currents, respectively. Please note that BL voltages do not always rise monotonically depending on the PE pulse width and the position that the BL voltage is observed. These details are discussed in
Section 2. After the sense current (I
SENSE) is stabilized to be as high as the cell current, pre-charging transistors turn off with PREB high. The charges stored in the capacitors at sense nodes (C
SN) are discharged with I
SENSE to BLs. Because BL “1”, which is the BL connected with the selected cell whose data is “1”, has a higher current than BL “0”, the sense node voltage for BL “1” reduces much faster than that for BL “0”. After the voltage window at the sense nodes between “1” and “0” becomes sufficiently large, the sense signal SNS becomes high to output the data to a NAND controller. If the PE pulse can reduce the BL delay, one can start a sensing operation, which contributes to reduction in latency. Accordingly, the BL delay time decreases as the BL voltage during the PE pulse increases, whereas the energy associated with BL charging increases. Thus, there is trade-off to design the PE pulse. Energy increases by about 4% when an overdrive voltage of 20% is used for the PE pulse [
4]. This paper does not pay attention to energy thereafter.
In this paper, PE pulse design is discussed using the three different BL models as shown in
Figure 2. Single-BL read (SBL) operation can be analyzed with a single line model (SLM). Three-line model (TLM) is suitable for All-BL read (ABL) operation.
Figure 2c shows actual BL control with PASS transistors. The cell current model given by a liner resistor RC is omitted in (b) and (c), but each RC is connected with each BL in actual netlists for simulation. This paper is organized as follows. In
Section 2, a single-line model is used for a shielded BL read operation. The characteristics of the sensing current delay as a function of PE pulse width are compared with those of the BL voltage delay. The differences that arise afterwards are investigated in detail.
Section 3 discusses a three-line model to see how adjacent BLs affect the delay of the target BL in all-BL read operation. It is shown that the optimum PE pulse width for the sensing current delay is longer by a factor of two than that for the BL voltage delay. The effectiveness of the address-dependent PE pulse width on the average BL delay is shown in
Section 4.
2. Single Line Model (SLM)
Figure 2.
BL path models studied in this paper. (a) Single line model (SLM). (b) Three-line model (TLM) and (c) TLM with PASS transistors. The cell current model given by a liner resistor RC is omitted in (b,c), but each RC is connected with each BL in actual netlists for simulation.
Figure 2.
BL path models studied in this paper. (a) Single line model (SLM). (b) Three-line model (TLM) and (c) TLM with PASS transistors. The cell current model given by a liner resistor RC is omitted in (b,c), but each RC is connected with each BL in actual netlists for simulation.
Table 1 summarizes the device and design parameters used in this study. The memory cells are modeled by linear resistors for simplicity. Distributed RC elements are used for BL operations. The selected memory cells are located at x percentages of 25%, 33%, 50%, 66%, 75%, and 100%.
Figure 3a shows the BL voltage waveform with different PE pulse widths when the “0” cell at 25% of the BL is selected. At the transition in PASS from the PE voltage to the target one, the BL voltage reduces due to charge distribution in BL capacitance. After a certain amount of time, the BL voltage increases again. When the acceptable voltage window is as large as 10% of the target BL voltage, with Tpre = 1.0 μs, the BL voltage delay is about 1.6 μs (shown by “Tdly2”). With Tpre = 1.2–2.2 μs, the delay is about 0.7 us (shown by “Tdly1”). With Tpre = 2.4 μs, the delay is about 2.4 μs (shown by “Tdly3”). A BL voltage with too short of a Tpre time has a large drop after Tpre, which results in a long Tdly value. A BL voltage with too long of a Tpre values becomes higher than the target voltage, which also results in long Tdly value. Thus, a voltage window of 10% translates into a wide time window for Tpre. Thus, the BL voltage delay (Tdly) has a good shape with respect to Tpre, even though Tdly is longer when a farther cell is selected. The Tdly values for the cells between 25% and 100% are limited in a shady region of
Figure 3b. Please note that the impact of the cell current on Tdly depends on the position of the selected cell due to the difference in BL resistance between the sensing circuit and the selected cell. The cell at 25% of the BL has no significant variation, whereas that at 100% of the BL has significant variation due to the differences in data. One can find the worst case of Tdly to pick up the longest Tdly value at every Tpre value. Interestingly, a BLL of 100% does not always determine Tdly. BL 100% determines Tdly when Tpre is <2.2 μs or >3.2 μs, whereas BL25% exhibits Tdly at 2.4 μs < Tpre < 3.0 μs.
Figure 4a shows the sensing current delay at different BL positions. In comparison with the BL voltage delay shown in
Figure 3b, the current delay is more sensitive to Tpre. Note that the dependence of the position of the selected cell on Tdly for the sense current is quite different than that for the BL voltage. The current delay time in the worst case is determined in all the BL positions, resulting in
Figure 4b. The optimum Tpre to minimize the current delay is about the same as the BL voltage delay.
In order to understand the characteristics affecting the sensitivity of Tpre, the cell current (I
CELL) and the BL current (I
BLF) at the selected BL position were measured in addition to the sensing current (I
SNS), as shown in
Figure 5.
Figure 6 shows the BL voltage (upper) and current waveform (lower) with different Tpre (2.0 μs, 2.4 μs and 2.8 μs from left to right, respectively) values when the “1” cell at x = 25% is selected. The arrows in the upper graphs indicate the processing time to show how the charges at every BL node are redistributed after Tpre. The current delay is minimized when Tpre is 2.4 μs. I
BLF becomes very close to 0 nA as soon as PE pulsing is finished. Thus, the sense current becomes stable in a short time. One can notice that BL 66% does not change much over time. As concluded in [
4], the RC delay line has a position of 2/3 with no change in time in the case where an optimum Tpre is used to minimize the voltage delay time in theory. Thus, the sensed current delay can be also minimized with the optimum Tpre for BL voltage delay. With Tpre values of 2.0 μs or 2.8 μs, the sensed current delay becomes much longer. With a shorter Tpre of 2.0 μs, the BL voltages are insufficient such that I
BLF becomes positive after the PE pulse, which in turn affects I
SNS. Conversely, with a longer Tpre of 2.8 μs, the BL voltages are more than sufficient such that the I
BLF becomes negative after a PE pulse, which in turn affects I
SNS as well. BL 66% with a Tpre of 2.0 μs or 2.8 μs changes significantly over time, which indicates that it takes more time for all the BL nodes to become stable.
Figure 7 shows the BL voltage (upper) and current waveform (lower) with different Tpre (2.0 μs, 2.4 μs and 2.8 μs from left to right, respectively) values when the “1”-cell at x = 100% is selected. Even though the optimum Tpre is different than that for the case where the cell located at BL 25% is selected, a Tpre value of 2.0 μs allows the BL voltage at 2/3 to become very stable over time and features the minimum sense current delay as shown in the left bottom graph.
Figure 8 shows the BL voltage (upper) and current waveform (lower) with different Tpre (2.0 μs, 2.4 μs and 2.8 μs from left to right, respectively) values when “0”-cell at x = 100% is selected. Even though the optimum Tpre is different than that for the case where the “1” cell located at BL of 100% is selected, a Tpre value of 2.4 μs allows the BL voltage at 2/3 to become very stable over time and features the minimum sense current delay as shown in the center bottom graph. A Tpre value of 2.8 μs or longer can be optimum as well, because the excess pre-charging allows I
SNS to be <10 nA at any time after Tpre. If the value of I
SNS is <10 nA, the sensing circuit interprets the data as “0”.