Electromigration-Aware Architecture for Modern Microprocessors †
Abstract
:1. Introduction
- We offer solutions for modern microprocessor register files that exploit architectural characteristics to reduce the impact of RMS-EM.
- The proposed methods exploit register file characteristics such as toggle rate, hotspots, and resource allocation policies.
- The proposed architectural method for register files can be implemented in conjunction with physical-design-based solutions to complement and enhance current methods.
- The proposed solution incurs minimal cost in terms of power, performance, and silicon-area overhead.
- Our new approach for register files does not compromise reliability or IC lifetime.
- Our extensive experimental analysis combines architectural and EM physical simulations for register files, which both validate the proposed architectural solution on the physical level.
2. IC Reliability
2.1. Electromigration
- A write operation performed by a processor or control logic to a storage element (e.g., register) may manifest through the logical circuit to other nets. Read operation may also involve switching of wire states, but this usually happens on read ports of register files and memory elements and therefore is a smaller contributor to RMS-EM hotspots.
- Usage of logical resources for processing tasks may stimulate switching activity in its digital components (e.g., ALU being used for various computations).
2.2. Prior Work on Electromigration
2.2.1. Prior Work Based on Physical Design
2.2.2. Prior Work Based on Architecture
3. Distribution of EMS-EM Hotspots in Modern Microprocessors
3.1. Experimental Environment
3.2. Experimental Environment
3.2.1. ALUs
3.2.2. Register File
4. Microarchitectures for RMS-EM Avoidance
- We offer RMS-EM-aware architectural solutions dedicated to fundamental microprocessor building elements: register files and execution units, whereas prior studies made limited use of such information.
- The proposed solutions can be implemented in conjunction with physical-design-based flows and provide a complementary enhancement to such flows.
- We avoid the need to duplicate logic, reduce performance, or employ dedicated mechanisms to detect EM degradation through normal IC operation that were suggested by [16].
- The proposed solution eliminates the dynamic power overhead and the design complexity suggested by past studies such as [19].
- Our study is limited to digital circuits. Analog circuits are outside the scope of this study.
- Our solutions are highly effective when the switching probability is a dominant factor in inducing RMS-EM. The proposed techniques may offer a limited benefit for a system with a low activity rate.
- Our solutions rely on a nonuniform distribution of the switching probability that can be exploited to smooth RMS-EM hotspots. When the switching probability is evenly distributed, the effectiveness of our techniques is limited.
4.1. EM-Aware ALU Allocation
Algorithm 1 |
Input: k < N number of execution units to be allocated. Output: Vector E = (e0, e1, …, en−1), for every 0 ≤ i ≤ n − 1, only if ei = 1 execution unit i to be allocated; otherwise, not allocated. Initialization: Ex_counter[i] = 0 for every 0 ≤ i ≤ n − 1, Global_counter = 0 1. M = {0 ≤ i ≤ n − 1 | Ex_counter[i] = Global_counter} 2. if k < |M| then 3. let Q ⊂ M such that |Q| = k 4. ei = 1 for every i∈ Q, otherwise ei = 0 5. Ex_counter[i]++ for every i ∈ Q 6. end if 7. else // k ≥ |M| 8. let Q ⊆ U\M such that |Q| = k–|M| 9. ei = 1 for every i ∈ Q ∪ M, otherwise ei = 0 10. Ex_counter[i]++ for every i ∈ Q ∪ M 11. Global_counter++ 12. end else 13. return E |
4.2. EM-Aware Register Allocation
5. Experimental Analysis of RMS-EM-Aware Architecture
5.1. Toggle Rate-Based Experimental Analysis for RMS-EM MTF Improvement
5.2. Physical RMS-EM Simulations Based on Joule Heating Effect
6. Conclusions
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
References
- Tan, S.X.-D.; Tahoori, M.; Kim, T.; Wang, S.; Sun, Z.; Kiamehr, S. Long-Term Reliability of Nanometer VLSI Systems—Modeling, Simulation and Optimization; Springer Publisher: Berlin/Heidelberg, Germany, 2019; ISBN 978-3-030-26171-9. [Google Scholar] [CrossRef]
- Lienig, J.; Jerke, G. Embedded Tutorial: Electromigration-Aware Physical Design of Integrated Circuits. In Proceedings of the 18th International Conference on VLSI Design held jointly with 4th International Conference on Embedded Systems Design, Kolkata, India, 3–7 January 2005; IEEE Press: Piscataway, NJ, USA, 2005; pp. 77–82. [Google Scholar]
- Lienig, J. Introduction to electromigration-aware physical design. In Proceedings of the International Symposium on Physical Design (ISPD’06), San Jose, CA, USA, 9–12 April 2006; ACM: New York, NY, USA, 2006; pp. 39–46. [Google Scholar]
- Huang, X.; Kteyan, A.; Tan, S.X.-D.; Sukharev, V. Physics-Based Electromigration Models and Full-Chip Assessment for Power Grid Networks. IEEE Trans. Comput.-Aided Des. Integr. Circuits Syst. 2016, 35, 1848–1861. [Google Scholar] [CrossRef]
- Sun, Z.; Yu, S.; Zhou, H.; Liu, Y.; Tan, S.X.-D. EMSpice: Physics-Based Electromigration Check Using Coupled Electronic and Stress Simulation. IEEE Trans. Device Mater. Reliab. 2020, 20, 376–389. [Google Scholar] [CrossRef]
- Najm, F.N.; Sukharev, V. Efficient Simulation of Electromigration Damage in Large Chip Power Grids Using Accurate Physical Models (Invited Paper). In Proceedings of the 2019 IEEE International Reliability Physics Symposium (IRPS), Monterey, CA, USA, 31 March–4 April 2019; pp. 1–10. [Google Scholar] [CrossRef]
- Chatterjee, S.; Sukharev, V.; Najm, F.N. Power Grid Electromigration Checking Using Physics-Based Models. IEEE Trans. Comput.-Aided Des. Integr. Circuits Syst. 2018, 37, 1317–1330. [Google Scholar] [CrossRef]
- Maiz, J.A. Characterization of electromigration under bidirectional (BC) and pulsed unidirectional (PDC) currents. In Proceedings of the International Reliability Physics Symposium (IRPS), Phoenix, AZ, USA, 11–13 April 1989; pp. 220–228. [Google Scholar]
- Jonggook, K.; Tyree, V.C.; Crowell, C.R. Temperature gradient effects in electromigration using an extended transition probability model and temperature gradient free tests. In I. Transition probability model. In Proceedings of the IEEE International Integrated Reliability Workshop Final Report, Lake Tahoe, CA, USA, 18–21 October 1999; pp. 24–40. [Google Scholar] [CrossRef]
- Yu, X.; Weide, K. A study of the thermal-electrical- and mechanical influence on degradation in an aluminum-pad structure. Microelectron. Reliab. 1997, 37, 1545–1548. [Google Scholar] [CrossRef]
- Lienig, J. Electromigration and Its Impact on Physical Design in Future Technologies. In Proceedings of the 2013 ACM International Symposium on Physical Design, Stateline, NV, USA, 24–27 March 2013. [Google Scholar]
- Srinivasan, J.; Adve, S.V.; Bose, P.; Rivers, J.A. Lifetime Reliability: Toward an Architectural Solution. IEEE Micro 2005, 25, 70–80. [Google Scholar] [CrossRef]
- Srinivasan, J.; Adve, S.V.; Bose, P.; Rivers, J.A. Exploiting Structural Duplication for Lifetime Reliability Enhancement. In Proceedings of the 32nd International Symposium on Computer Architecture, Madison, WI, USA, 4–8 June 2005. [Google Scholar]
- Dasgupta, A.; Karri, R. Electromigration Reliability Enhancement Via Bus Activity Distribution. In Proceedings of the 33rd Annual Conference Design Automation (DAC 96), Las Vegas, NV, USA, 3–7 June 1996; ACM Press: New York, NY, USA, 1996; pp. 353–356. [Google Scholar]
- Abella, J.; Vera, X.; Ergin, S.U.O.; González, A.; Tschanz, J.W. Refueling: Preventing Wire Degradation due to Electromigration. IEEE Micro 2008, 28, 37–46. [Google Scholar] [CrossRef] [Green Version]
- Tao, J.; Chen, J.; Cheung, N.; Hu, C. Modeling and characterization of electromigration failures under bidirectional current stress. IEEE Trans. Electron Devices 1996, 43, 800–808. [Google Scholar]
- Abella, J.; Vera, X. Electromigration for Microarchitects. ACM Comput. Surv. 2010, 42, 9. [Google Scholar] [CrossRef]
- Gabbay, F.; Mendelson, A. Electromigration-Aware Instruction Execution for Modern Microprocessors. In Proceedings of the 2022 the 4th International Conference on Microelectronic Device and Technologies (MicDAT ‘2022), Corfu, Greece, 21–23 September 2022; pp. 60–66, ISBN 978-84-09-43856-3. [Google Scholar]
- Operating Temperature. Wikipedia. Available online: https://en.wikipedia.org/wiki/Operating_temperature (accessed on 8 January 2023).
- AEC-Q100-REV-G Standard; Failure Mechanism based Stress test Qualification for Integrated Circuit. Automotive Electronics Council, Component Technical Committee: Devon UK, 2014.
- Black, J.R. Electromigration—A brief survey and some recent results. IEEE Trans. Electron. Devices 1969, 16, 338–347. [Google Scholar] [CrossRef] [Green Version]
- Kahng, A.B.; Nath, S.; Rosing, T.S. On Potential Design Impacts of Electromigration Awareness. In Proceedings of the 2013 18th Asia and South Pacific Design Automation Conference (ASP-DAC), Yokohama, Japan, 22–25 January 2013. [Google Scholar]
- LEF DEF Reference. Available online: https://www.ispd.cc/contests/18/lefdefref.pdf (accessed on 8 January 2023).
- Scorzoni, A.; Neri, B.; Caprile, C.; Fantini, F. Electromigration in thin- film inter-connection lines: Models, methods and results. Mater. Sci. Rep. 1991, 7, 143–219. [Google Scholar] [CrossRef]
- Valero, A.; Miralaei, N.; Petit, S.; Sahuquillo, J.; Jones, T.M. On Microarchitectural Mechanisms for Cache Wearout Reduction. IEEE Trans. Very Large-Scale Integr. (VLSI) Syst. 2017, 25, 857–871. [Google Scholar] [CrossRef] [Green Version]
- Hau-Riege, C.S. An introduction to Cu electromigration. Microel. Reliab. 2004, 44, 195–205. [Google Scholar] [CrossRef]
- Wang, S.; Kim, T.; Sun, Z.; Tan, S.X.-D.; Tahoori, M. Recovery-aware proactive TSV repair for electromigration lifetime enhancement in 3D ICs. IEEE Trans. Very Large Scale Integr. Syst. 2018, 26, 531–543. [Google Scholar] [CrossRef]
- Srinivasan, J.; Adve, S.V.; Bose, P.; Rivers, J.A. The Case for Lifetime Reliability-Aware Microprocessors. In Proceedings of the 31st International Symposium on Computer Architecture (ISCA ’04), München, Germany, 19–23 June 2004. [Google Scholar]
- Swaminathan, K.; Chandramoorthy, N.; Cher, C.; Bertran, R.; Buyuktosunoglu, A.; Bose, P. BRAVO: Balanced Reliability-Aware Voltage Optimization. In Proceedings of the 2017 IEEE International Symposium on High Performance Computer Architecture (HPCA), Austin, TX, USA, 4–8 February 2017; pp. 97–108. [Google Scholar] [CrossRef]
- Carlson, T.E.; Heirman, W.; Eeckhout, L. Sniper: Exploring the level of abstraction for scalable and accurate parallel multi-core simulations. In Proceedings of the International Conference for High Performance Computing, Net- working, Storage and Analysis (SC), Seattle, WA, USA, 12–18 November 2011. [Google Scholar]
- Thomadakis, M.E. The Architecture of the Nehalem Processor and Nehalem-EP SMP Platforms. Technical Report, December 2010. Available online: https://d1wqtxts1xzle7.cloudfront.net/30311675/nehalem-libre.pdf?1390883858=&response-content-disposition=inline%3B+filename%3DThe_architecture_of_the_Nehalem_processo.pdf&Expires=1673405927&Signature=ZzDFsTSVLljHcDVxkE-D8ky1A7gj51jo2qCHxmcK-Wcje6wwP6RNXd8jRVXTMb4zXytw3S-cO8XecXMUWrqeuuHgUrTtuP0hX3C8x47NNq6cXwsxNb6M~lvk7O1BFLxeDszoMkAKWi45u~RQ7U69XkAzZHck-009CbGkkACtUTDL-zAJrnM6IvN4I7pot4UdL~yRPmi6pmP2hlMTIulNXQgI3ICUDSH9zv4qwTnA~GXrMND3LMKlTEIH4n4LFjfR9gT-U7ShkuKwX-cy8GRyVhX0hyjdWUwwnZ52A-o4UqSmnA5emOHrHZAEXO8D~fk0SxW24DivfwlOL-NWSMCQpQ__&Key-Pair-Id=APKAJLOHF5GGSLRBV4ZA (accessed on 8 January 2023).
- Limaye, A.; Adegbija, T. A workload characterization of the spec cpu2017 benchmark suite. In Proceedings of the 2018 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), Belfast, UK, 2–4 April 2018; pp. 149–158. [Google Scholar]
- Wu, Q.; Flolid, S.; Song, S.; Deng, J.; John, L.K. Hot Regions in SPEC CPU2017. In Proceedings of the 2018 IEEE International Symposium on Workload Characterization (IISWC), Raleigh, NC, USA, 30 September–2 October 2018. [Google Scholar]
- Zhang, R.; Liu, T.; Yang, K.; Milor, L. CacheEM: For Reliability Analysis on Cache Memory Aging Due to Electromigration. IEEE Trans. Comput.-Aided Des. Integr. Circuits Syst. 2022, 41, 3078–3091. [Google Scholar] [CrossRef]
- Zhang, R.; Yang, K.; Liu, T.; Milor, L. Modeling of FinFET SRAM array reliability degradation due to electromigration. Microelectron. Reliab. 2019, 100–101, 113485. [Google Scholar] [CrossRef]
- VoltusTM User Guide. Available online: http://www.cadence.com (accessed on 8 January 2023).
Core Model | |
---|---|
Frequency | 2.66 GHz |
Execution units [time] | 3 ALUs [1 cycle] 1 FP add/sub [3 cycles] 1 FP mul/div [5/6 cycles] 1 Branch [1 cycle] 1 Load unit [1 cycle] 1 Store unit [1 cycle] |
Pipeline | Dispatch width: 4 |
Instruction window | 128 |
Memory system model | |
Block size | 64 bytes |
L1-D cache | 32 KB, 8-way |
L1-I cache | 32 KB, 4-way |
L2 cache | 256 KB, 8-way |
L3 cache | 8 MB, 16-way |
DTLB | 64 entries, 4-way |
ITLB | 128 entries, 4-way |
STLB | 512 entries, 4-way (secondary TLB) |
Clock Cycle | Issued Instructions | Ex_counter [2:0] | Global Counter | Selected ALU(s) |
---|---|---|---|---|
0 | 0 | 0, 0, 0 | 0 | None |
1 | 2 | 0, 1, 1 | 0 | 0, 1 |
2 | 2 | 1, 1, 0 | 1 | 2, 0 |
3 | 3 | 0, 0, 1 | 0 | 1, 2, 0 |
Option | Original Area [um2] | Area Overhead [um2]/[%] | Original Power [uW] | Power Overhead [uW]/[%] | Timing Impact |
---|---|---|---|---|---|
1 | 200,613 | 316/0.15 | 641.79 | 0.031/0.004% | None (reg-to-reg delay < clock cycle time) |
2 | 200,613 | 85.9/0.04 | 641.79 | 0.026/0.004% | None (reg-to-reg delay < clock cycle time) |
Original Area [um2] | Area Overhead [um2]/[%] | Original Power [uW] | Power [uW] /[%] | Timing Impact |
---|---|---|---|---|
77,234 | 1973 / 2.5 | 20,162 | 0.282/0.001 | 50 ps delay added to access time |
Physical Simulation Environment Parameters | |
---|---|
Synthesis tool | Cadence® GenusTM version 19.11-s087_1 |
Place-and-route tool | Cadence® InnovusTM version 19.11-s128_1 |
EM tool | Cadence® VoltusTM version 19.11-s129_1 |
Process | 28 nm |
Clock frequency | 2.66 GHz |
Core voltage | 0.9 V |
Tj | 105 °C (self-heating is modeled by the VoltusTM simulation environment) |
Metal layers | Metal 1 to metal 9 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Gabbay, F.; Mendelson, A. Electromigration-Aware Architecture for Modern Microprocessors. J. Low Power Electron. Appl. 2023, 13, 7. https://doi.org/10.3390/jlpea13010007
Gabbay F, Mendelson A. Electromigration-Aware Architecture for Modern Microprocessors. Journal of Low Power Electronics and Applications. 2023; 13(1):7. https://doi.org/10.3390/jlpea13010007
Chicago/Turabian StyleGabbay, Freddy, and Avi Mendelson. 2023. "Electromigration-Aware Architecture for Modern Microprocessors" Journal of Low Power Electronics and Applications 13, no. 1: 7. https://doi.org/10.3390/jlpea13010007
APA StyleGabbay, F., & Mendelson, A. (2023). Electromigration-Aware Architecture for Modern Microprocessors. Journal of Low Power Electronics and Applications, 13(1), 7. https://doi.org/10.3390/jlpea13010007