



# **MRAM DTCO and Compact Models**

#### Jeehwan Song<sup>\*</sup>, Jian-Ping Wang, and <u>Chris H. Kim</u>

#### Dept. of ECE, University of Minnesota \*Now with TSMC, San Jose, CA

This work was supported in part by C-SPIN, one of the six SRC STARnet Centers, through MARCO and DARPA and in part by the NSF/SRC E2CDA Program.

# Spin Transfer Torque MRAM Basics



- Magnetic Tunnel Junction (MTJ) is the storage element of STT-MRAM
- Thermal stability factor ( $\Delta$ ) determines the retention time while anisotropy field (H<sub>K</sub>) determines the energetic preference of the magnetization vector
- Key features: Low operating voltage, good CMOS compatibility, high speed, high density (<20F<sup>2</sup>), zero static power, and high endurance.
- Applications: non-volatile memory (eflash replacement in 22nm), cache (SRAM replacement)

# **DTCO for STT-MRAM**



- Initial set of compact models and design rules for a preliminary standard cell library and entire optimization loop is repeated to obtain a satisfactory set of the device and design rule parameters
- MTJ compact model is a critical component of the overall MRAM DTCO flow.

# Outline

- Introduction: DTCO and STT-MRAM
- MTJ SPICE Models
- STT-MRAM Array Level Evaluation
- State-of-the-art STT-MRAM Circuits
- Conclusions

### **MTJ SPICE Model**



J. Sun, Nature 2003 (IBM)

- Landau-Lifshitz-Gilbert (LLG) equation solved using SPICE
- The MTJ SPICE models are available at <u>mtj.umn.edu</u>.



J. Kim, et. al., CICC, 2015.

### **MTJ SPICE Model Verification**



- Model parameters can be tuned to match experimental data.
- Variability effects of both the MTJ (W, L, t<sub>F</sub>, RA) and CMOS (W, L, V<sub>th</sub>, T<sub>ox</sub>) can be incorporated

## **Simulated Write and Read Delay Distributions**



- Includes realistic variation for both MTJ (i.e. W, L, t<sub>F</sub>, RA) and CMOS (i.e. transistor W, L, Vth, tox)
- Top: Write delay distribution is narrower for a higher write voltage due to faster switching.
- Bottom: TMR ratio (=(R<sub>AP</sub>-R<sub>P</sub>)/R<sub>P</sub>) impact on read sensing margin.

J. Kim, et. al., CICC 2015

## **Incorporating Stochastic Switching Behavior**



J. Sun, Nature 2003 (IBM)

- Thermal fluctuation causes switching time to vary each write operation
- Can be incorporated into the model using the initial angle parameter (e.g. Monte Carlo) and time-varying thermal field



$$hitial(\theta) = \frac{\pi}{\int_0^{\frac{\pi}{2}} \sin\theta \exp(-\Delta \sin^2\theta) d\theta}$$

- $\Delta$  = Thermal stability factor,
- $\theta$  = Initial angle between magnetization and Z-axis

I. Ahmed, et al., JxCDC 2017

# Outline

- Introduction: DTCO and STT-MRAM
- MTJ SPICE Models

#### STT-MRAM Array Level Evaluation

- State-of-the-art STT-MRAM Circuits
- Conclusions

#### "Dennard" Style Scaling Study for STT-MRAM Caches





K. Chun, et al., JSSC 2013\*based on MTJ macromodels, beforeMTJ SPICE models became available

 $\alpha$ : dimension scaling factor  $\beta$ : voltage scaling factor  $J_{c0}$ : Critical switching current density RA: Resistance area product

- First order scaling analysis based on MTJ macromodels
- $\Delta$  for a target retention time is set by adjusting free layer thickness and MTJ anisotropy.
- STT-MRAM's scalability based on a constant  $J_{C0}$  RA/V<sub>DD</sub> scaling scenario

#### "Dennard" Style Scaling Study for STT-MRAM Caches MTJ, CMOS



|                                          |                                                                                                                          | <b>†</b>                     |   | ₹                      |
|------------------------------------------|--------------------------------------------------------------------------------------------------------------------------|------------------------------|---|------------------------|
| Power supply noise                       | -10% to account for supply noise                                                                                         | Monte Carlo,<br>Evaluate PPA | - | MRAM<br>circuit design |
| Bit-cell                                 | Device mismatches                                                                                                        |                              |   | layout, read/write)    |
| Parasitic capacitance (C <sub>BL</sub> ) | $\sigma/\mu$ =5%: each $\mu$ are calculated based on sub-array size                                                      |                              |   |                        |
| Resistance area product                  | σ/μ=5%                                                                                                                   |                              |   |                        |
| Sense Amplifier (S/A)                    | I-applying and V-sensing method (AP direction read) +<br>Voltage S/A<br>: I <sub>REF</sub> σ/μ=2.5%, S/A pair mismatches |                              |   |                        |
| Reference cell                           | Reference cell averaging scheme with MTJ replica cells                                                                   |                              |   |                        |
| Write threshold current                  | σ/μ=5%                                                                                                                   |                              |   |                        |
|                                          |                                                                                                                          |                              |   |                        |

\* Mismatches are based on inverse square root relationship of devices' areas.

\* Based on historic data, we assume  $\sigma_{vt}/F$  is constant with tech. scaling

\*  $\mu(C_{BI})$  is assumed to be scaled proportional to scaling factor.

- Left: Sub-array architecture of STT-MRAM cache (~3 times smaller than 6T-SRAM cache)
- Right: simulation setup includes process variation in the memory cells and SA circuit as well as variation of wire resistances, capacitances, reference biases, and supply levels

# **6σ BL Sensing and Write Delay Trends**



- Sensing delay decreases with scaling, and with a higher TMR ratio
- Write delay becomes worse due to the lower drive current (in planar CMOS).
- The trends follow basic circuit intuition (i.e. read and write always have a conflict)

# Outline

- Introduction: DTCO and STT-MRAM
- MTJ SPICE Models
- STT-MRAM Array Level Evaluation
- State-of-the-art STT-MRAM Circuits
- Conclusions

## Write Verify Scheme



H. Noguchi, et al., ISSCC 2016 (Toshiba 65nm)

- MRAM cell is repetitively written until correct value is verified → significant write error rate reduction at the expense of high write energy and long write time
- More suitable for NVM applications than cache memory
- Write driver has programmable write strength

L. Wei, et al., ISSCC 2019 (Intel 22nm)

**Array with Shared SL Voltage** 



- Common source line for compared area and reduced parasitic resistance
  - Improves read and write margins with a modest layout area overhead.
  - Inhibit voltage is applied on the unselected BL
  - Negative voltage applied to unselected WLs to suppress the BL leakage current

Y.-D. Chih, et al., ISSCC 2020 (TSMC 22nm), C. Kim, et al., ISSCC 2015 (Samsung)



- Clamp transistor trimming circuit
  - Clamps BL voltage to prevent read disturbance
  - Trimming circuit to remove offset
- 1T4MTJ reference cell provides a stable  $(R_P+R_{AP})/2$  reference value without causing read disturbance
- Half-V<sub>DD</sub> detection circuit improves the sensing margin by extending the signal development time

**Readout Circuit** 

# Conclusions

- This invited paper covers various aspects of MRAM DTCO including device, circuit, and architecture considerations.
  - SPICE MTJ device model
  - Array level STT-MRAM PPA evaluation
  - Scalability and variability studies
  - Novel MRAM read/write circuit, array, and bit-cell layout design techniques can improve the yield of the large arrays and must be accounted for in DTCO flow

This work was supported in part by C-SPIN, one of the six SRC STARnet Centers, through MARCO and DARPA and in part by the NSF/SRC E2CDA Program.