Enhancing Security for Resource-Constrained Smart Cities IoT Applications: Optimizing Cryptographic Techniques with Effective Field Multipliers
Abstract
:1. Introduction
1.1. Literature Review
1.2. Paper Contribution
- It introduces an innovative computational systolic architecture specifically designed for a Dickson basis multiplier, aimed at achieving a more compact physical footprint and reduced energy consumption compared to existing architectural solutions. In contrast to prior scholarly contributions, which often employed ad hoc design methodologies and neglected the optimization of critical performance metrics such as latency, throughput, and energy dissipation, the proposed approach emphasizes a systematic framework for the intentional distribution of temporal and spatial data. This strategic allocation enables the development of an architecture tailored to meet specific application requirements effectively. Furthermore, a formalized analytical framework underpins a comprehensive evaluation of the multiplier’s configuration, facilitating the identification of operational enhancements. Through this rigorous analysis, the proposed design not only addresses the limitations of previous approaches but also provides a pathway for optimizing performance in diverse computational environments. This advancement signifies a meaningful contribution to the field, offering a refined methodology that aligns with contemporary demands for efficiency and effectiveness in multiplier architecture.
- The structural design of the multiplier, as delineated in this study, reveals a substantial reduction in hardware complexity compared to conventional two-dimensional architectures, exhibiting linear rather than quadratic growth in resource requirements. This streamlined architecture yields significant improvements in both physical footprint and energy consumption while maintaining competitive processing speeds comparable to traditional designs. The systolic implementation provides inherent timing-attack resistance through deterministic operation and enables future integration of error-detection mechanisms, combining efficiency with security advantages. Its modular configuration with direct PE-to-PE communication minimizes signal delays and optimizes data exchange, making it particularly suitable for VLSI implementations in resource-constrained IoT edge nodes. These combined attributes—reduced hardware complexity, maintained throughput, inherent security features, and VLSI-friendly design—position the multiplier as an optimal solution for secure encryption processors in smart city applications, where operational efficiency and resource optimization are paramount.
- The substantial hardware and energy efficiencies achieved through the proposed multiplier architecture position it as a highly effective solution for compact IoT edge devices within smart cities, which often function under significant resource limitations. These devices are typically required to operate within stringent power constraints while maintaining high performance levels for applications such as intelligent traffic management, environmental monitoring, and surveillance systems. By adeptly addressing the challenges associated with resource-constrained environments, this architecture facilitates the implementation of sophisticated features while ensuring optimal performance and energy efficiency.
- Furthermore, the efficacy of the proposed multiplier is crucial for enabling the implementation of cryptographic algorithms on resource-constrained IoT devices, thereby supporting secure identification and access control mechanisms. As these systems increasingly depend on cryptographic algorithms to protect sensitive data, the reduction in energy consumption and hardware complexity facilitates the deployment of robust security measures within IoT devices, ensuring the confidentiality and integrity of user information. Given the rising demand for inclusive and accessible technology, the development of such secure solutions is essential for creating environments that foster independence and accessibility while addressing the diverse needs of urban populations. By facilitating the seamless integration of efficient cryptographic solutions, the proposed design significantly enhances the resilience and security of smart city infrastructures, ultimately contributing to a safer and more reliable urban ecosystem.
- Within the context of smart city deployments, where a multitude of networked devices must function efficiently within stringent spatial and power constraints, the aforementioned design presents substantial benefits. By reducing the requisite silicon area, the proposed multiplier facilitates the creation of more compact integrated circuits, enabling seamless integration into diverse sensor and device deployments across urban landscapes. Furthermore, its diminished power consumption ensures prolonged operational lifespans for these devices, mitigating the need for frequent recharging or maintenance, a critical factor for the sustainability of smart city initiatives. Consequently, the proposed architecture not only enhances the computational performance of cryptographic functions but also contributes to the overall operational efficiency and extended lifespan of IoT systems, thereby establishing it as a vital element for the development of future smart urban infrastructure.
1.3. Paper Organization
2. Multiplication Using the Dickson Basis in GF()
3. Systolic Design Methodology
- Scheduling: Assigning each node in the DG to a specific clock cycle, ensuring that dependent operations execute in the correct order. This is controlled by a scheduling vector, which dictates the computation sequence.
- Projection: Mapping multiple nodes onto a single PE to optimize hardware reuse. A projection matrix determines how nodes are merged, balancing resource efficiency with performance.
4. Formulating Dependency Graphs
5. Exploring Dickson-Based Systolic Multiplier Framework
Retrieving the Systolic Multiplier Design
- 1.
- Input Signals Allocation:
- Input signals in the upper systolic array coincide with the input signals . Consequently, there is no necessity for an additional input port. Therefore, the input port is omitted from the PEs of the upper systolic array (). This simplification reduces the complexity of the PEs in the upper array compared to the regular PEs () in the middle and lower arrays, as illustrated in Figure 6 and Figure 7.
- 2.
- Sequential Input Signals:
- Input signals , where , are fed sequentially into the first processing element () of the upper and middle systolic arrays. These signals then propagate through all the PEs within their respective arrays.
- Similarly, the input signals , where , are sequentially input into the first processing element () of the lower systolic array and subsequently pass through all its regular PEs.
- 3.
- Intermediate Signal Handling:
- The control signal g is introduced at the second processing element () of the middle and lower systolic arrays. This signal is then pipelined through the regular PEs () of these arrays to activate the lower tri-state buffer, as depicted in Figure 7. This mechanism ensures the accurate and timely assignment of signals to the port , maintaining precise timing and control within the systolic array.
- 4.
- Parallel Output:
- The resulting coefficient bits , , and from the upper, middle, and lower systolic arrays, respectively, are accessible concurrently at the outputs of their respective PEs after q clock cycles.
- 5.
- Final Product Calculation:
- The final product bits , where , are obtained at clock cycle q by performing a bitwise XOR operation (using two-input XOR gates) on the corresponding bits of , , and , as illustrated in Figure 5. This final step completes the multiplication process, providing the desired product output.
- 1.
- Initialization Phase:
- Simultaneously, the control signal g is deactivated, setting it to a logical low (). This deactivation enables the input signals assigned to the port (, 0, and , where ) to pass unimpeded through the upper tri-state buffer, as illustrated in Figure 7. These signals are then correctly allocated to their respective PEs within the systolic array layout.
- Concurrently, the first bits of the input signals and () are sequentially fed into the first processing element () of each corresponding systolic array through the input port m, as shown in Figure 5. These signals, once received at , are directly broadcast to all subsequent PEs within each systolic array, ensuring uniform data distribution.
- 2.
- Computation Phase:
- Starting from the second clock cycle and extending through the clock cycle, the control signal g is activated, setting it to a logical high (). This activation enables the intermediate signals to be allocated to the regular processing elements () for the computation of the intermediate values of the signals designated for port n, as depicted in Figure 5. This process is central to the iterative computation of the multiplication result.
- During these clock cycles, the subsequent components of the input signals and () are sequentially introduced into the first processing element () of each corresponding systolic array through the input port m, as shown in Figure 5. As in the initialization phase, these signals are directly broadcast to all subsequent PEs within each systolic array, maintaining data flow and parallelism.
- 3.
- Parallel Output Phase:
- At the clock cycle, the final result n yields its ultimate parallel output coefficient bits, indicated as (where ). These bits are produced simultaneously from the last row of XOR gates, as illustrated in Figure 5. This simultaneous generation of output bits signifies the completion of the multiplication operation, delivering the final product in a parallel format.
6. Results Summary and Insights
6.1. Complexity Analysis
6.2. Implementation Findings
7. Security Analysis and Countermeasures
8. Key Findings and Conclusions
Author Contributions
Funding
Data Availability Statement
Acknowledgments
Conflicts of Interest
Abbreviations
IoT | Internet of things |
ADP | Area-delay product |
PDP | Power-delay product |
ASIC | Application-specific integrated circuit |
ECC | Elliptic curve cryptography |
DG | Dependency graph |
CPD | Critical path delay |
References
- Rao, P.M.; Pedada, S.; Jangirala, S.; Das, A.K.; Rodrigues, J.J. Role of IoT in the ages of digital to smart cities: Security challenges and countermeasures. IEEE Internet Things Mag. 2024, 7, 56–64. [Google Scholar] [CrossRef]
- Nassereddine, M.; Alex, K. Applications of Internet of Things (IoT) in smart cities. In Advanced IoT Technologies and Applications in the Industry 4.0 Digital Economy; CRC Press: Boca Raton, FL, USA, 2024; pp. 109–136. [Google Scholar]
- Vempati, S.; Nalini, N. Securing Smart Cities: A Cybersecurity Perspective on Integrating IoT, AI, and Machine Learning for Digital Twin Creation. J. Electr. Syst. 2024, 20, 1420–1429. [Google Scholar] [CrossRef]
- Chen, C.C.; Lee, C.Y.; Lu, E.H. Scalable and Systolic Montgomery Multipliers Over GF(2m). IEICE Trans. Fundam. 2008, E91-A, 1763–1771. [Google Scholar] [CrossRef]
- Chiou, C.W.; Lee, C.Y.; Deng, A.W.; Lin, J.M. Concurrent error detection in Montgomery multiplication over GF(2m). IEICE Trans. Fundam. Electron. Commun. Comput. Sci. 2006, E89-A, 566–574. [Google Scholar] [CrossRef]
- Huang, W.T.; Chang, C.; Chiou, C.; Chou, F. Concurrent error detection and correction in a polynomial basis multiplier over GF(2m). IET Inf. Secur. 2010, 4, 111–124. [Google Scholar] [CrossRef]
- Kim, K.W.; Jeon, J.C. Polynomial Basis Multiplier Using Cellular Systolic Architecture. IETE J. Res. 2014, 60, 194–199. [Google Scholar] [CrossRef]
- Choi, S.; Lee, K. Efficient systolic modular multiplier/squarer for fast exponentiation over GF(2m). IEICE Electron. Express 2015, 12, 1–6. [Google Scholar] [CrossRef]
- Reyhani-Masoleh, A. A new bit-serial architecture for field multiplication using polynomial bases. In Proceedings of the 7th International Workshop Cryptographic Hardware Embedded Systems (CHES 2008), Washington, DC, USA, 10–13 August 2008; pp. 300–314. [Google Scholar]
- Abdulrahman, E.A.H.; Reyhani-Masoleh, A. High-Speed Hybrid-Double Multiplication Architectures Using New Serial-Out Bit-Level Mastrovito Multipliers. IEEE Trans. Comput. 2016, 65, 1734–1747. [Google Scholar] [CrossRef]
- Kim, K.W.; Jeon, J.C. A semi-systolic Montgomery multiplier over GF(2m). IEICE Electron. Express 2015, 12, 1–6. [Google Scholar] [CrossRef]
- Ibrahim, A. Novel Bit-Serial Semi-Systolic Array Structure for Simultaneously Computing Field Multiplication and Squaring. IEICE Electron. Express 2019, 16, 20190600. [Google Scholar] [CrossRef]
- Kim, K.W.; Lee, J.D. Efficient unified semi-systolic arrays for multiplication and squaring over GF(2m). Electron. Express 2017, 14, 1–10. [Google Scholar]
- Kim, K.W.; Kim, S.H. Efficient bit-parallel systolic architecture for multiplication and squaring over GF(2m). IEICE Electron. Express 2018, 15, 1–6. [Google Scholar] [CrossRef]
- Ibrahim, A. Efficient Parallel and Serial Systolic Structures for Multiplication and Squaring Over GF (2m). Can. J. Electr. Comput. Eng. 2019, 42, 114–120. [Google Scholar] [CrossRef]
- Ibrahim, A.; Gebali, F. Enhancing Security and Efficiency in IoT Assistive Technologies: A Novel Hybrid Systolic Array Multiplier for Cryptographic Algorithms. Appl. Sci. 2025, 15, 2660. [Google Scholar] [CrossRef]
- Ibrahim, A.; Gebali, F. Optimizing Security of Radio Frequency Identification Systems in Assistive Devices: A Novel Unidirectional Systolic Design for Dickson-Based Field Multiplier. Systems 2025, 13, 154. [Google Scholar] [CrossRef]
- Hasan, A.; Negre, C. Low space complexity multiplication over binary fields with Dickson polynomial representation. IEEE Trans. Comput. 2010, 60, 602–607. [Google Scholar] [CrossRef]
- Chiou, C.W.; Lee, C.M.; Sun, Y.S.; Lee, C.Y.; Lin, J.M. High-throughput Dickson basis multiplier with a trinomial for lightweight cryptosystems. IET Comput. Digit. Tech. 2018, 12, 187–191. [Google Scholar] [CrossRef]
- Chiou, C.; Sun, Y.S.; Lee, C.M.; Liou, J.Y. Low-complexity unidirectional systolic Dickson basis multiplier for lightweight cryptosystems. Electron. Lett. 2019, 55, 28–30. [Google Scholar] [CrossRef]
- Pillutla, S.R.; Boppana, L. Area-efficient low-latency polynomial basis finite field GF(2m) systolic multiplier for a class of trinomials. Microelectron. J. 2020, 97, 104709. [Google Scholar] [CrossRef]
- Imana, J.L. LFSR-Based Bit-Serial GF(2m) Multipliers Using Irreducible Trinomials. IEEE Trans. Comput. 2020, 70, 156–162. [Google Scholar]
- Pillutla, S.R.; Boppana, L. Low-latency area-efficient systolic bit-parallel GF(2m) multiplier for a narrow class of trinomials. Microelectron. J. 2021, 117, 105275. [Google Scholar] [CrossRef]
- Li, Y.; Cui, X.; Zhang, Y. An Efficient CRT-based Bit-parallel Multiplier for Special Pentanomials. IEEE Trans. Comput. 2021, 71, 736–742. [Google Scholar] [CrossRef]
- Li, Y.; Zhang, Y.; He, W. Fast hybrid Karatsuba multiplier for type II pentanomials. IEEE Trans. Very Large Scale Integr. (VLSI) Syst. 2020, 28, 2459–2463. [Google Scholar] [CrossRef]
- Meher, P.K.; Lou, X. Low-Latency, Low-Area, and Scalable Systolic-Like Modular Multipliers for GF(2m) Based on Irreducible All-One Polynomials. IEEE Trans. Circuits Syst. I Regul. Pap. 2016, 64, 399–408. [Google Scholar] [CrossRef]
- Mohaghegh, S.; Yemiscoglu, G.; Muhtaroglu, A. Low-Power and Area-Efficient Finite Field Multiplier Architecture Based on Irreducible All-One Polynomials. In Proceedings of the 2020 IEEE International Symposium on Circuits and Systems (ISCAS), Seville, Spain, 12–14 October 2020; IEEE: Piscataway, NJ, USA, 2020; pp. 1–5. [Google Scholar]
- Zhang, Y.; Li, Y. Efficient Hybrid GF(2m) Multiplier for All-One Polynomial Using Varied Karatsuba Algorithm. IEICE Trans. Fundam. Electron. Comput. Sci. 2021, 104, 636–639. [Google Scholar] [CrossRef]
- Zhou, B.B. A New Bit Serial Systolic Multiplier over GF(2m). IEEE Trans. Comput. 1988, 37, 749–751. [Google Scholar] [CrossRef]
- Fenn, S.T.J.; Taylor, D.; Benaissa, M. A Dual Basis Bit Serial Systolic Multiplier for GF(2m). Integr. VLSI J. 1995, 18, 139–149. [Google Scholar] [CrossRef]
- Lee, C.Y.; Lu, E.H.; Lee, J.Y. Bit-Parallel Systolic Multipliers for GF(2m) Fields Defined by All-One and Equally-Spaced Polynomials. IEEE Trans. Comput. 2001, 50, 358–393. [Google Scholar]
- Lee, C.Y.; Lu, E.H.; Sun, L.F. Low-Complexity Bit-Parallel Systolic Architecture for Computing AB2+C in a Class of Finite Field GF(2m). IEEE Trans. Circuits Syst. II 2001, 50, 519–523. [Google Scholar]
- Lee, C.Y.; Chiou, C.W. Efficient Design of Low-Complexity Bit-Parallel Systolic Hankel Multipliers to Implement Multiplication in Normal and Dual Bases of GF(2m). IEICE Trans. Fundam. Electron. Commun. Comput. Sci. 2005, E88-A, 3169–3179. [Google Scholar] [CrossRef]
- Lee, C.Y. Low-latency bit-pararallel systolic multiplier for irreducible xm + xn + 1 with GCD(m,n) = 1. IEICE Trans. Fundam. Electron. Commun. Comput. Sci. 2008, 55, 828–837. [Google Scholar]
- Bayat-Sarmadi, S.; Farmani, M. High-Throughput Low-Complexity Systolic Montgomery Multiplication Over GF(2m) Based on Trinomials. IEEE Trans. Circuits Syst. II 2015, 62, 377–381. [Google Scholar]
- Mathe, S.E.; Boppana, L. Bit-parallel systolic multiplier over GF(2m) for irreducible trinomials with ASIC and FPGA implementations. IET Circuits Desvices Syst. 2018, 12, 315–325. [Google Scholar] [CrossRef]
- Lee, C.Y.; Chiou, C.W.; Lin, J.M. Concurrent error detection in a polynomial basis multiplier over GF (2m). J. Electron. Test. 2006, 22, 143–150. [Google Scholar] [CrossRef]
- Lee, K. Resource and Delay Efficient Polynomial Multiplier over Finite Fields GF(2m). J. Korea Soc. Digit. Ind. Inf. Manag. 2020, 16, 1–9. [Google Scholar]
- Lee, K. Low Complexity Systolic Montgomery Multiplication over Finite Fields GF(2m). J. Korea Soc. Digit. Ind. Inf. Manag. 2022, 18, 1–9. [Google Scholar]
- Mathe, S.E.; Boppana, L. Design and Implementation of a Sequential Polynomial Basis Multiplier over GF(2m). KSII Trans. Internet Inf. Syst. 2017, 11, 2680–2700. [Google Scholar]
- Gebali, F. Algorithms and Parallel Computers; John Wiley: New York, NY, USA, 2011. [Google Scholar]
- Chiou, C.W.; Lin, J.M.; Lee, C.Y.; Ma, C.T. Novel Mastrovito Multiplier over GF(2m) Using Trinomial. In Proceedings of the 2011 5th International Conference on Genetic and Evolutionary Computing (ICGEC), Kitakyushu, Japan, 29 August–1 September 2011; pp. 237–242. [Google Scholar]
- Ibrahim, A.; Gebali, F.; Bouteraa, Y.; Tariq, U.; Ahanger, T.; Alnowaiser, K. Compact Bit-Parallel Systolic Multiplier Over GF(2m). IEEE Can. J. Electr. Comput. Eng. 2021, 44, 199–205. [Google Scholar] [CrossRef]
Design | AND | XOR | MUX | Latch | Latency | CPD | Area Complexity | Time Complexity |
---|---|---|---|---|---|---|---|---|
Chiou-a [19] | 0 | |||||||
Chiou-b [20] | 0 | |||||||
Lee-a [38] | 0 | |||||||
Lee-b [39] | 0 | |||||||
Chiou-c [42] | q | |||||||
Proposed | 0 | q |
Multiplier | q | Latency | A (Kgates) | D (ns) | P (mW) | ADP | PDP(E) | E/bit | A Saving (%) | P Saving (%) | ADP Saving (%) | PDP Saving (%) |
---|---|---|---|---|---|---|---|---|---|---|---|---|
Chiou-a [19] | 163 | 164 | 3503.3 | 9.0 | 116.5 | 31,554.1 | 1049.2 | 6.44 | 99.8 | 96.9 | 99.9 | 98.8 |
Chiou-b [20] | 163 | 165 | 2463.0 | 5.6 | 97.7 | 13,840.5 | 549.1 | 3.37 | 99.7 | 96.2 | 99.8 | 97.7 |
Lee-a [38] | 163 | 85 | 1515.5 | 2.7 | 63.0 | 4,167.4 | 173.4 | 1.06 | 99.6 | 94.2 | 99.5 | 92.8 |
Lee-b [39] | 163 | 85 | 2172.1 | 2.7 | 80.9 | 5,973.1 | 222.4 | 1.36 | 99.7 | 95.5 | 99.6 | 94.3 |
Chiou-c [42] | 163 | 164 | 1885.5 | 7.4 | 75.2 | 13,901.5 | 554.5 | 3.40 | 99.7 | 95.1 | 99.8 | 97.7 |
Proposed | 163 | 163 | 6.2 | 3.4 | 3.7 | 21.2 | 12.6 | 0.08 | - | - | - |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Ibrahim, A.; Gebali, F. Enhancing Security for Resource-Constrained Smart Cities IoT Applications: Optimizing Cryptographic Techniques with Effective Field Multipliers. Cryptography 2025, 9, 37. https://doi.org/10.3390/cryptography9020037
Ibrahim A, Gebali F. Enhancing Security for Resource-Constrained Smart Cities IoT Applications: Optimizing Cryptographic Techniques with Effective Field Multipliers. Cryptography. 2025; 9(2):37. https://doi.org/10.3390/cryptography9020037
Chicago/Turabian StyleIbrahim, Atef, and Fayez Gebali. 2025. "Enhancing Security for Resource-Constrained Smart Cities IoT Applications: Optimizing Cryptographic Techniques with Effective Field Multipliers" Cryptography 9, no. 2: 37. https://doi.org/10.3390/cryptography9020037
APA StyleIbrahim, A., & Gebali, F. (2025). Enhancing Security for Resource-Constrained Smart Cities IoT Applications: Optimizing Cryptographic Techniques with Effective Field Multipliers. Cryptography, 9(2), 37. https://doi.org/10.3390/cryptography9020037