Variable retention time

Variable retention time (also known as VRT) is a reliability issue in dynamic random-access memory (DRAM) characterized by unpredictable fluctuations in the retention time of memory cells, that is, the duration for which a cell can reliably store data without being refreshed.[1] If a cell's retention time becomes shorter than the refresh interval, it may lead to memory errors, potentially resulting in system crashes or Silent data corruption. VRT-affected bits that go undetected during product testing may pose a significant risk to device reliability. To mitigate the impact of VRT[2] and soft errors, DRAM manufacturers have implemented error-correcting code (ECC) mechanisms directly within the memory chips. This approach has become a standard feature in DDR5 SDRAM.[3]
Possible sources of VRT bits include high-voltage gate stress,[4] exposure to high-energy particles radiation[5] and high temperature stress.[6]
Background
[edit]In dynamic random-access memory (DRAM), each bit of data is stored in a memory cell composed of a capacitor and a transistor. The amount of electrical charge stored in the capacitor determines whether the cell represents a binary "1" or "0". These cells are densely packed into integrated circuits, accompanied by control logic that manages data access. Due to the inherent leakage of charge from capacitors over time, DRAM cells must be periodically refreshed to maintain data integrity this involves rewriting the contents of each cell at regular intervals to prevent data loss.[7]
Overview
[edit]
The amount of time a cell can reliably store data without being refreshed is called cell's retention time (). In the case of a constant leakage current (), can be approximated as
where is the storage node capacitance and is the amount of charge loss required in order to have a failure.[8] In modern devices, at operating temperatures, is dominated by generation current due to defects in the cell's access transistor. Variability in defect configuration is responsible for a wide spread of the value of leakage current, and therefore of , across different memory cells.[9]
Only a few cells actually have approaching the refresh interval.[10] To improve yield and reliability, DRAM chips include redundant rows or columns that can be used to replace faulty ones or single cells including those with retention times shorter than the refresh interval.[11] However, this technique is less effective against VRT cells, which may begin to fail only after faulty cell replacement has been performed, typically at the die level.[11]
Physics
[edit]At the microscopic level, defects located in the bulk or at the Si/SiO2 interface of the access transistor are believed to be the primary source of leakage responsible for the discharge of the storage capacitor.[9] According to Shockley–Read–Hall (SRH) theory, the generation rate depends on trap energy level (), free carrier concentration, and temperature.[12] In the case of defects located in the depleted region, where free carrier concentrations are typically negligible and the generation rate is maximized, the current can be approximated as:
where is the intrinsic Fermi energy, is the intrinsic carrier concentration in silicon, is the capture cross section which determines the probability of carrier capture and emission (assumed to be equal for electrons and holes for simplicity), is the thermal velocity of carriers.[10] Large electric fields() are known to enhance , resulting in increased generation current. Incorporating this effect, the total leakage current can be expressed as
where is the field enhancement factor, a positive quantity that becomes significant under strong electric fields.[13]
Generation current may fluctuate over time displaying a random telegraph noise (RTN) pattern,[14] with transition rates having an Arrhenius dependence on temperature. To explain the origin of these instabilities, two main theoretical models have been proposed. One model attributes VRT to structural modifications of the defect, which cause changes in the trap energy level.[15] The other model suggests that VRT arises from modulation of the local electric field, attributed to changes in the charge state of nearby defects, often located in the gate oxide.[16][17] Both models have been supported by experimental evidence,[18][15] suggesting that the VRT may originate from different physical phenomena.[19]
Mitigation
[edit]Considerable effort has been spent to mitigate the effects of VRT, including modifications to the fabrication process and the introduction of error correction mechanisms.[20][21]
Screening and in-DRAM ECC
[edit]There are no efficient mechanisms to screen VRT bits during production testing. Most manufacturers have been able to deal with it by increasing average retention time and by enforcing larger test screen margins, involving the replacement of possibly faulty cells with spare rows and columns.[8] However, starting from sub-20nm node it became increasingly costly to screen and manage the growing number of defective cells, due to the sharply increasing area overhead required to fit adequate redundant resources.[20]
In-DRAM ECC, coupled with traditional redundant sparing, was identified as the most effective solution,[20] and became a JEDEC standard for DDR5 SDRAM.[3] This technique involve dividing memory data into codewords and encode information adding extra parity bits, to enable the detection and correction of errors. This provides the ability to address faulty bits that were not identified as such during testing, such as VRT ones.[20]
The key difference with the more traditional ECC DRAM lies in where the extra bits are stored. In in-DRAM ECC, parity bits are stored in the same chip, and error correction occurs internally to the chip, making it transparent to the memory controller. In ECC DRAM an extra chip is added to the DIMM to store the extra bits information, providing detection and correction of data transfer errors.[22]
Physical treatments
[edit]Researchers have investigated passivation strategies to reduce the number of active defects in the silicon. Researchers have shown that hydrogen anneal at high temperature strongly reduces VRT,[6] as confirmed by later experiments that highlighted a correlation with the reduction of charge pumping current, a metric typically used to assess the defect density at the Si/SiO2 interface in MOSFETs.[23]
Fluorine implantation was reported to reduce VRT in older technologies.[15] This method was later observed to reduce the number of cells with gate-induced drain leakage, current that is a leakage mechanism induced by the presence of large electric fields at the gate drain overlap region of MOSFETs.[24] Samsung researchers found that the number of VRT errors can be reduced by changes in the process steps for the formation of the metal gate in a 1znm process.[21]
See also
[edit]- Soft error – A type of error involving changes to signals or data but no changes to the underlying device or circuit
- ECC – Scheme for controlling errors in data over noisy communication channels
- Row hammer – A computer exploit that takes advantage of unintended and undesirable side-effect in DRAM
References
[edit]- ^ Restle; Park; Lloyd (1992). "DRAM variable retention time". International Technical Digest on Electron Devices Meeting. pp. 807–810. doi:10.1109/IEDM.1992.307481. ISBN 0-7803-0817-4.
{{cite book}}
:|journal=
ignored (help) - ^ Yixin Luo and Hasan Hassan and Minesh Patel and Onur Mutlu (2023). "Retrospective: An Experimental Study of Data Retention Behavior in Modern DRAM Devices: Implications for Retention Time Profiling Mechanisms". arXiv:2306.16037 [cs.AR].
- ^ a b "JESD79-5: DDR5 SDRAM Standard". JEDEC Solid State Technology Association. July 2020. Retrieved 2025-07-07.
See page 243 for details on in-DRAM ECC
- ^ Kim, Heesang; Oh, Byoungchan; Kim, Kyungdo; Cha, Seon-Yong; Jeong, Jae-Goan; Hong, Sung-Joo; Lee, Jong-Ho; Park, Byung-Gook; Shin, Hyungcheol (2010-09-01). "Random Telegraph Signal-Like Fluctuation Created by Fowler–Nordheim Stress in Gate Induced Drain Leakage Current of the Saddle Type Dynamic Random Access Memory Cell Transistor". Japanese Journal of Applied Physics. 49 (9R): 094102. doi:10.1143/JJAP.49.094102. ISSN 0021-4922.
- ^ Goiffon, Vincent; Jay, Antoine; Paillet, Philippe; Bilba, Teddy; Deladerriere, Theo; Beaugendre, Guillaume; Le Roch, Alexandre; Dion, Arnaud; Virmontois, Cedric; Belloir, Jean-Marc; Gaillardin, Marc (2020). "Radiation-Induced Variable Retention Time in Dynamic Random Access Memories". IEEE Transactions on Nuclear Science. 67 (1): 234–244. doi:10.1109/TNS.2019.2956293. ISSN 0018-9499.
- ^ a b Kim, Y.I.; Yang, K.H.; Lee, W.S. (2004). "Thermal degradation of DRAM retention time: Characterization and improving techniques". 2004 IEEE International Reliability Physics Symposium. Proceedings. pp. 667–668. doi:10.1109/RELPHY.2004.1315442. ISBN 978-0-7803-8315-9.
- ^ "Understanding DRAM Operation" (PDF). IBM. December 1996. Archived from the original (PDF) on August 29, 2017. Retrieved March 10, 2015.
- ^ a b Kang, Uksong (2014). "Co-Architecting Controllers and DRAM to Enhance DRAM Process Scaling". The Memory Forum 2014.
- ^ a b Liu, Yong; Wang, Da; Ren, Pengpeng; Li, Jie; Qiao, Zheng; Wu, Maokun; Wen, Yichen; Zhou, Longda; Sun, Zixuan; Wang, Zirui; Han, Qinghua; Wu, Blacksmith; Cao, Kanyu; Wang, Runsheng; Ji, Zhigang (2024). "Understanding Retention Time Distribution in Buried-Channel-Array-Transistors (BCAT) Under Sub-20-nm DRAM Node—Part I: Defect-Based Statistical Compact Model". IEEE Transactions on Electron Devices. 71 (8): 4462–4468. doi:10.1109/TED.2024.3409510. ISSN 0018-9383.
- ^ a b Hiraiwa, A.; Ogasawara, M.; Natsuaki, N.; Itoh, Y.; Iwai, H. (1996-09-01). "Statistical modeling of dynamic random access memory data retention characteristics". Journal of Applied Physics. 80 (5): 3091–3099. doi:10.1063/1.363119. ISSN 0021-8979.
- ^ a b Cenker, R.P.; Clemons, D.G.; Huber, W.R.; Petrizzi, J.B.; Procyk, F.J.; Trout, G.M. (1979). "A fault-tolerant 64K dynamic random-access memory". IEEE Transactions on Electron Devices. 26 (6): 853–860. doi:10.1109/T-ED.1979.19509. ISSN 0018-9383.
- ^ Shockley, W.; Read, W. T. (1952-09-01). "Statistics of the Recombinations of Holes and Electrons". Physical Review. 87 (5): 835–842. doi:10.1103/PhysRev.87.835.
- ^ Hurkx, G.A.M.; Klaassen, D.B.M.; Knuvers, M.P.G. (1992). "A new recombination model for device simulation including tunneling". TED. 39 (2): 331–338. doi:10.1109/16.121690. ISSN 1557-9646.
- ^ Mori, Y.; Ohyu, K.; Okonogi, K.; Yamada, R.-i. (2005). "The origin of variable retention time in DRAM". IEEE International Electron Devices Meeting, 2005. IEDM Technical Digest. pp. 1034–1037. doi:10.1109/IEDM.2005.1609541. ISBN 978-0-7803-9268-7.
{{cite book}}
:|journal=
ignored (help) - ^ a b c Ohyu, K.; Umeda, T.; Okonogi, K.; Tsukada, S.; Hidaka, M.; Fujieda, S.; Mochizuki, Y. (2006). "Quantitative identification for the physical origin of variable retention time: A vacancy-oxygen complex defect model". 2006 International Electron Devices Meeting. pp. 1–4. doi:10.1109/IEDM.2006.346792. ISBN 1-4244-0438-X.
- ^ Oh, Byoungchan; Cho, Heung-Jae; Kim, Heesang; Son, Younghwan; Kang, Taewook; Park, Sunyoung; Jang, Seunghyun; Lee, Jong-Ho; Shin, Hyungcheol (2011). "Characterization of an Oxide Trap Leading to Random Telegraph Noise in Gate-Induced Drain Leakage Current of DRAM Cell Transistors". IEEE Transactions on Electron Devices. 58 (6): 1741–1747. doi:10.1109/TED.2011.2126046. ISSN 1557-9646.
- ^ Mori, Yuki; Yoshimoto, Hiroyuki; Takeda, Kenichi; Yamada, Ren-ichi (2012-05-29). "Mechanism of random telegraph noise in junction leakage current of metal-oxide-semiconductor field-effect transistor". Journal of Applied Physics. 111 (10): 104513. doi:10.1063/1.4721658. ISSN 0021-8979.
- ^ Mori, Yuki; Yoshimoto, Hiroyuki; Takeda, Kenichi; Yamada, Ren-ichi (2012-05-29). "Mechanism of random telegraph noise in junction leakage current of metal-oxide-semiconductor field-effect transistor". Journal of Applied Physics. 111 (10): 104513. doi:10.1063/1.4721658. ISSN 0021-8979.
- ^ Kim, Heesang; Oh, Byoungchan; Son, Younghwan; Kim, Kyungdo; Cha, Seon-Yong; Jeong, Jae-Goan; Hong, Sung-Joo; Shin, Hyungcheol (2011). "Study of Trap Models Related to the Variable Retention Time Phenomenon in DRAM". IEEE Ted. 58 (6): 1643–1648. doi:10.1109/TED.2011.2125964. ISSN 1557-9646.
- ^ a b c d Cha, Sanguhn; Seongil, O.; Shin, Hyunsung; Hwang, Sangjoon; Park, Kwangil; Jang, Seong Jin; Choi, Joo Sun; Jin, Gyo Young; Son, Young Hoon; Cho, Hyunyoon; Ahn, Jung Ho; Kim, Nam Sung (2017). "Defect Analysis and Cost-Effective Resilience Architecture for Future DRAM Devices". 2017 IEEE International Symposium on High Performance Computer Architecture (HPCA). IEEE. pp. 61–72. doi:10.1109/HPCA.2017.30. ISBN 978-1-5090-4985-1.
{{cite book}}
:|journal=
ignored (help) - ^ a b Jang, Dongkyu; Kim, Daekyum; Lee, Jieun; Lee, Inkyum; Ahn, Sang Bin; Hong, Yoonki; Kim, Shindeuk; Park, Taehoon; Ban, Hyodong (2024-06-01). "Improvement of VRT immunity using low 10-boron word line in recent DRAM". Microelectronics Reliability. 157 115396. doi:10.1016/j.microrel.2024.115396. ISSN 0026-2714.
- ^ "ECC Technical Details". memtest86. 10 July 2025. Retrieved 10 July 2025.
- ^ Ryu, Seong-Wan; Min, Kyungkyu; Shin, Jungho; Kwon, Heimi; Nam, Donghoon; Oh, Taekyung; Jang, Tae-Su; Yoo, Minsoo; Kim, Yongtaik; Hong, Sungjoo (2017). "Overcoming the reliability limitation in the ultimately scaled DRAM using silicon migration technique by hydrogen annealing". 2017 IEEE International Electron Devices Meeting (IEDM). IEEE. pp. 21.6.1–21.6.4. doi:10.1109/IEDM.2017.8268437. ISBN 978-1-5386-3559-9.
{{cite book}}
:|journal=
ignored (help) - ^ Weber, A.; Birner, A.; Krautschneider, W. (2007-11-01). "DRAM retention tail improvement by trap passivation". Solid-State Electronics. Special Issue: Papers Selected from the 36th European Solid-State Device Research Conference - ESSDERC'06. 51 (11): 1534–1539. doi:10.1016/j.sse.2007.09.023. ISSN 0038-1101.
External links
[edit]- Error correction Code (ECC) in DDR Memories, Oct 19, 2020, by Vadhiraj Sankaranarayanan
- ECC Technical Details
- DDR5: What is On-Die ECC?, Aug 7, 2023
- Gate Induced Drain Leakage – An Overview Jan 18, 2022, By siliconvlsi