Comprehensive RTL-to-GDSII Workflow for Custom Embedded FPGA Architectures Using Open-Source Tools

Baungarten-Leon, Emilio Isaac; Ortega-Cisneros, Susana; Leyva, Gerardo; Muñoz Zapata, Héctor Emmanuel; Guzmán-Quezada, Erick; Alvarado-Rodríguez, Francisco J.; Raygoza-Panduro, Juan Jose

doi:10.3390/electronics14193866


by Emilio Isaac Baungarten-Leon ¹,², Susana Ortega-Cisneros ²,*, Gerardo Leyva ³, Héctor Emmanuel Muñoz Zapata ², Erick Guzmán-Quezada ¹,*, Francisco J. Alvarado-Rodríguez ¹ and Juan Jose Raygoza-Panduro ⁴

¹ Departamento de Electromecánica, Universidad Autónoma de Guadalajara, Zapopan 45129, Mexico
² Centro de Investigación y de Estudios Avanzados del Instituto Politécnico Nacional, Zapopan 45019, Mexico
³ Texas Instruments, Dallas, TX 75243, USA
⁴ Division of Technologies for Cyber Human Integration, Department of Electrophotonics, Universidad de Guadalajara, Guadalajara 44430, Mexico
* Authors to whom correspondence should be addressed.

Electronics 2025, 14(19), 3866; https://doi.org/10.3390/electronics14193866

Submission received: 27 August 2025 / Revised: 26 September 2025 / Accepted: 26 September 2025 / Published: 29 September 2025

(This article belongs to the Special Issue FPGAs and Reconfigurable Systems: Theory, Methods and Applications)


Abstract

The main objective of this work is to provide a comprehensive explanation of the Register Transfer Level (RTL) to Graphic Data System II (GDSII) flow for designing custom Field-Programmable Gate Array (FPGA) architectures at the 130 nm technology node using the SKY130 Process Design Kit (PDK). By leveraging open-source tools—specifically OpenLane and OpenFPGA—this study details the methodology and implementation steps required to generate a GDSII layout of a custom FPGA. OpenLane offers an integrated RTL-to-GDSII flow by combining multiple Electronic Design Automation (EDA) tools, while OpenFPGA enables the construction of flexible and customizable FPGA architectures. The article covers key aspects of the RTL-to-GDSII workflow, including RTL file configuration, the utilization of configuration variables for physical design, hierarchical chip design, macro and core implementation, chip-level integration, and gate-level simulation. Experimental results validate the proposed workflow, showcasing the successful transformation from RTL to GDSII. The findings of this research provide valuable insights for researchers and engineers in the FPGA design field, advancing the state of the art in FPGA architecture development.

Keywords:

FPGA architectures; Graphic Data System; OpenFPGA; OpenLane; Register Transfer Level; RTL-to-GDSII flow; SKY130 Process Design Kit

1. Introduction

Field-Programmable Gate Arrays (FPGAs) have emerged as a critical enabler for achieving application performance in a post-Moore’s Law era [1]. FPGAs provide inherent flexibility, enabling rapid prototyping, iterative design, and the ability to reconfigure the hardware after manufacturing. This flexibility is especially valuable in fields such as telecommunications, automotive systems, aerospace, signal processing, and embedded systems, where design requirements often change.

The reconfigurable nature of FPGAs also plays a crucial role in the domain of network infrastructure, where they are used in software-defined networking, network function virtualization, and edge computing. FPGAs provide the programmability and adaptability needed to handle evolving network protocols and traffic patterns, enhancing the performance, scalability, and security of modern network architectures, and they likewise accommodate evolving deep learning and machine learning algorithms [2].

The parallel processing capabilities of FPGAs with their ability to implement complex algorithms and customized hardware accelerators have made them indispensable in high-performance computing applications. From machine learning and artificial intelligence to data analytics and cryptography, FPGAs have demonstrated remarkable performance, energy efficiency, and real-time processing capabilities, enabling the acceleration of computationally intensive tasks [2,3,4].

Moreover, FPGAs have found extensive use in the development of digital system prototypes, allowing designers to perform pre-silicon validation in hardware before manufacturing expensive Application-Specific Integrated Circuits (ASICs). FPGAs enable faster time to market by facilitating rapid prototyping, system integration, and software-defined hardware platforms [5]. This flexibility also makes FPGAs an ideal choice for research and development projects, where frequent design iterations, experimentation, and customization are required.

The RTL-to-GDSII workflow, which converts a Register Transfer Level (RTL) description of a digital circuit into a finalized Graphic Data System II (GDSII) layout, is fundamental in FPGA design. It encompasses several key stages—synthesis, floorplanning, placement, routing, and verification—that together lead to the physical implementation of the circuit [6]. Despite the growing relevance of FPGAs and the availability of open-source toolchains such as OpenLane and OpenFPGA, there is no detailed methodology in the literature that demonstrates how to create, validate, and fabricate a custom embedded FPGA at the layout level.

Prior work on FPGA fabrics has made notable strides but still leaves gaps in the reproducibility and transparency of the physical design flow. Dao et al. [7] couple FABulous to a RISC-V CPU in a 180 nm commercial process using a proprietary PDK and Synopsys/Cadence toolchains; OpenFPGA [8] reports architectural resources and area metrics; and Moser et al. [9] tightly integrate FABulous with OpenLane 2 to automate the assembly of homogeneous fabrics via automatic macro placement. However, these studies do not comprehensively document layout configuration (e.g., floorplanning strategy, routing/clocking options, corner selection, runtime and peak-memory settings), which hinders faithful reproduction, especially for newcomers to open-source flows.

Notably, among these works, only [9] employs an open-source PDK; the others rely on proprietary PDKs typically restricted by non-disclosure agreements (NDAs), which further limits transparency and reusability. Moreover, these studies omit detailed documentation of the physical design phase, a gap that significantly undermines reproducibility—especially for newcomers who depend on open-source flows.

The motivation for this research arises from the growing importance of FPGAs; their flexibility makes them indispensable in a post-Moore’s Law era, where rapid design iterations and hardware adaptability are essential. Despite the availability of powerful open-source tools like OpenFPGA and OpenLane, the literature still lacks a standardized and validated methodology that connects architectural description with physical implementation, progressing seamlessly from RTL design to a fabricable GDSII layout. This absence of a comprehensive workflow hinders researchers and practitioners who aim to develop custom embedded FPGA architectures beyond functional simulation.

The existing problem is further complicated by the inherent complexity of the RTL-to-GDSII flow, which requires coordinating multiple interdependent stages such as synthesis, floorplanning, placement, routing, and verification. Current resources are fragmented and tool-specific, offering only partial guidance that fails to guarantee reproducibility or scalability. Designers are often left to navigate compatibility issues, steep learning curves, and unpredictable outcomes, making the process inefficient and error-prone. This work addresses these challenges by proposing an integrated, reproducible open-source methodology that leverages OpenFPGA for architectural flexibility and OpenLane for automated physical design, thereby bridging the gap between theory and fabrication.

The decision to utilize OpenFPGA alongside other tools is driven by its comprehensive coverage of several critical metrics. OpenFPGA offers features such as being open-source, supporting an architecture description language, netlist generation, bitstream generation, testbench generation, and Synopsys Design Constraint (SDC) file generation. Table 1 presents a comparison of these features with other tools [10].

The research problem addressed in this work is the absence of a documented and validated methodology for designing custom embedded FPGA architectures that progress beyond functional RTL simulation to a fully verified GDSII layout ready for fabrication. Without such a workflow, researchers and engineers face fragmented, tool-specific documentation that does not demonstrate scalability or reproducibility.

Currently, there are only three open-source, fabricable Process Design Kits (PDKs) with multiple layers: SKY130 [14], GF180MCU [15], and IHP-130 [16]. Designs targeting all three have been successfully manufactured using the OpenROAD EDA tool. However, OpenROAD requires a step-by-step development approach. For this reason, OpenLane was chosen: it automates the RTL-to-GDSII process by integrating OpenROAD with a series of custom scripts. Using OpenLane significantly reduces the development time required for physical design and streamlines the entire process.

The objective of this research is to close this methodological gap by proposing and validating a reproducible open-source RTL-to-GDSII workflow for embedded FPGA architectures. Unlike existing documentation, this study frames the process as a research problem—evaluating its feasibility, limitations, and opportunities for scalability.

Furthermore, this research addresses the absence of a standardized methodology for the physical design of custom embedded FPGA architectures. By leveraging open-source toolchains, the proposed workflow demonstrates how architectural description, RTL design, and physical implementation can be seamlessly integrated into a reproducible RTL-to-GDSII process. In doing so, this study highlights not only the practical feasibility of open-source approaches for FPGA layout design but also their broader role in enabling collaboration, innovation, and accessibility in the semiconductor research community.

The main contributions of this work are as follows:

Identification and formalization of the lack of a comprehensive methodology for embedded FPGA layout design using open-source tools.
Presentation and validation of a complete RTL-to-GDSII workflow that integrates OpenFPGA and OpenLane, demonstrated on a non-trivial embedded FPGA fabric.
Provision of performance, area, and design insights that highlight both the potential and the limitations of open-source toolchains for advancing custom FPGA architectures.

The remainder of this paper is organized as follows. Section 2 introduces the analysis of development tools, focusing on OpenLane as the open-source RTL-to-GDSII flow and OpenFPGA as the framework for architectural customization and bitstream generation. Section 3 details the proposed workflow, beginning with the creation of custom FPGA architectures in OpenFPGA, followed by pin planning, full testbench verification, and the transfer of RTL descriptions into OpenLane for physical implementation. Section 4 presents a case study in which a semi-customized FPGA fabric is designed, verified, and implemented, illustrating the practicality of the methodology from RTL to GDSII. Section 5 reports experimental results, including area, frequency, power consumption of hardened macros, core layout integration, and gate-level simulation outcomes validating the functional correctness of the design. Section 6 provides a discussion of the results, highlighting the strengths and limitations of the proposed methodology, comparing it to prior work, and outlining potential improvements and future research directions. Finally, Section 7 concludes the paper, emphasizing the feasibility of open-source tools for reproducible FPGA layout design, their educational and research impact, and their potential to democratize hardware innovation.

2. Analysis of Development Tools

2.1. Analysis of OpenLane in the RTL-to-GDSII Flow

OpenLane is an open-source tool that executes the RTL-to-GDSII flow for digital Integrated Circuit (IC) design. It is a comprehensive tool that automates the process of converting a digital design described in RTL to the final layout in GDSII format, which can be used for manufacturing.

OpenLane follows a modular and hierarchical design methodology, integrating various open-source EDA tools and scripts to streamline the RTL-to-GDSII design flow, illustrated in Figure 1. It provides a unified interface and automation infrastructure to handle different design stages, including synthesis, placement, routing, and physical verification.

The OpenLane workflow starts with specifying the design in RTL, which describes the functionality of the digital circuit using Verilog. OpenLane then performs synthesis, which translates the RTL into a gate-level netlist. After synthesis, OpenLane performs floorplanning and placement, where it determines the physical location of the gates on the chip, optimizing the placement to minimize wirelength and other metrics for improved performance and area utilization. Once the placement is complete, routing is performed to establish the interconnections at gate level while considering design constraints.

It is important to note that OpenLane does not currently support automatic Design-for-Testability (DFT) insertion. However, manual approaches are possible, as demonstrated in [18], where scan chain insertion techniques are applied to enable testability in open-source flows.

OpenLane also includes scripts and tools for performing various physical design optimizations, such as clock tree synthesis (CTS), power optimization, and timing analysis. These optimizations ensure that the final layout meets the design specifications and performance requirements. Throughout the entire process, OpenLane provides detailed reports and metrics that allow designers to analyze and optimize their designs. It also offers customization options and configuration variables to adapt the flow to specific design requirements and constraints.

The open-source nature of OpenLane promotes collaboration, transparency, and innovation in the circuit design community [19,20,21]. It allows designers to access and modify the source code, contributing improvements and sharing their knowledge. This accessibility and community-driven development make OpenLane a powerful tool for accelerating the development of customizable FPGA architectures and advancing the state of the art in digital IC design.

2.2. Analysis of OpenFPGA for Architecture Customization

OpenFPGA is an innovative project that revolutionizes the development of customizable FPGA architectures, placing a strong emphasis on making the process easy and fast. At the heart of OpenFPGA lies the powerful bitstream generator called FPGA-Bitstream, which is specifically designed to support any architecture described by Versatile Place and Route (VPR) [8].

In today’s computing landscape, FPGAs have gained immense importance due to their reconfigurable nature and distributed computing capabilities, which are highly sought after for evolving data processing algorithms [2,3,4]. Understanding the significance of FPGA development, OpenFPGA provides a comprehensive suite of tools and resources that streamline the entire process.

OpenFPGA is intended to facilitate the first steps of FPGA development, offering detailed tutorials and comprehensive documentation to empower users. The project encompasses various stages, including architecture modeling, FPGA-Verilog, and FPGA-Bitstream generation, among others. Users can effortlessly interact with these tools and perform various operations using the user-friendly command-line interface provided by the OpenFPGA shell [22].

OpenFPGA's commitment to simplicity and efficiency is evident throughout the project. Users finalize the architecture description in an XML file, as Figure 2 illustrates, after which FPGA-Bitstream is readily usable as the native EDA tool for FPGA customization. The project also supports multiple design flows and provides in-depth information on file formats, versioning, backward compatibility, regression tests, and the use of the Tcl C interface [8,22].

Beyond usability and design automation, OpenFPGA also incorporates mechanisms that address FPGA-specific security concerns. A key feature in this regard is the use of the fabric key, which establishes a secure link between the architecture and its bitstream. The fabric key ensures that only authorized bitstreams can correctly configure a given FPGA fabric, while any mismatch results in invalid or unusable configurations. This provides an effective safeguard against cloning and unauthorized replication of designs, reinforcing trust in reconfigurable devices. Moreover, because OpenFPGA leaves the ownership of the key entirely in the hands of the user, it prevents leakage of sensitive data and mitigates supply-chain risks such as hardware Trojans or malicious modifications. By integrating this security-centric mechanism into the design flow, OpenFPGA not only streamlines FPGA customization but also enhances the resilience and authenticity of FPGA deployments in sensitive computing environments.

OpenFPGA stands out as a versatile framework that enables rapid and flexible development of FPGA architectures, supported by a rich set of tools, tutorials, and documentation that facilitate architectural customization. Nevertheless, its official documentation does not address the complete RTL-to-GDSII flow, since this stage traditionally depends on commercial EDA solutions. This omission highlights a critical gap in the open-source FPGA design ecosystem. To address this gap, the present work provides a detailed methodology and step-by-step explanation of the RTL-to-GDSII flow for creating custom embedded FPGAs using fully open-source tools and publicly available Process Design Kits (PDKs).

3. Analysis of Workflow

The RTL-to-GDSII workflow is a fundamental process in FPGA development that transforms a high-level RTL description into a final GDSII file ready for manufacturing. This workflow involves several crucial steps, starting with the creation of the FPGA architecture using OpenFPGA.

To ensure a successful GDSII generation, an additional step is necessary during the FPGA architecture creation, which involves the creation of the FPGA pin planner. The pin planner plays a crucial role in defining the placement and routing of input and output pins within the FPGA, ensuring proper connectivity and functionality. It allows designers to strategically position the pins to optimize signal integrity and minimize potential timing and routing issues.

After defining the architecture and pin planner, the FPGA undergoes a full simulation testbench that verifies behavior under multiple input scenarios, allowing early detection and correction of design errors.

The Verilog files are then exported to OpenLane, where minor edits improve compatibility: defaulting signals to wire and adding vcc/vss ports for a stable Power Delivery Network (PDN). Each main module proceeds through the RTL-to-GDSII flow, beginning with configuration files that set timing, power, and placement constraints to meet design goals.

After the macro placement stage, the core configuration file is prepared. It contains the design parameters for the top level, including timing constraints, power targets, and other performance criteria. A final gate-level simulation validates timing and functionality, exposing issues with power, delay, or signal integrity before fabrication.

In summary, the RTL-to-GDSII workflow for FPGAs is a multi-step process that involves several tools and methodologies. It starts with the creation of the FPGA architecture and the pin planner in OpenFPGA, continues with the full simulation testbench and the Verilog file modifications, and then moves to OpenLane, where each main module undergoes the RTL-to-GDSII flow. Creating the placement configuration file and performing gate-level simulation are vital stages in ensuring a successful FPGA design that meets all required specifications and design constraints. Figure 3 illustrates the comprehensive workflow followed to obtain a successful GDSII file for an FPGA design.

3.1. Analysis of Custom FPGA Architecture Generation with OpenFPGA

To create an FPGA architecture with OpenFPGA, you can follow the steps outlined in the OpenFPGA documentation [23]. The creation of an FPGA architecture using OpenFPGA can be approached in three different categories, each offering varying levels of customization.

In the fully customized category, users have complete control over the FPGA design by defining every module using the XML file format. This approach allows for detailed customization of the structure, functionality, and physical characteristics of the FPGA. The XML file serves as a powerful tool for specifying the hierarchical composition of the architecture, enabling the creation of complex designs from simpler building blocks. Users can define custom circuit models, interconnect networks, and timing characteristics, tailoring the FPGA architecture to their specific requirements. An example of this customization is shown in Figure 2, which displays the XML description of the configurable logic blocks (CLB) and Look-Up Table (LUT) used in the architecture.
The semi-customized approach involves parameterization features, where users can modify design parameters to adapt the FPGA architecture to specific requirements. Parameters such as the number of CLBs, memory blocks, Digital Signal Processor (DSP) blocks, inputs and outputs per I/O block, and other general parameters can be easily adjusted. By utilizing the architecture modeling language and the XML file format, designers have the flexibility to precisely tailor their FPGA architectures according to application needs, performance objectives, and design constraints. Figure 4 provides a visual representation of the semi-customized process. The figure uses color coding to distinguish the components and the code that modifies them: gray indicates the number of blocks along the height (H) and width (W), yellow highlights the IO ports, black represents empty blocks reserved for routing, blue denotes the CLBs, green illustrates the DSP blocks, and purple identifies the memory blocks.
In the automatic category, a general FPGA architecture is chosen whose default characteristics match those of the semi-customized approach. These characteristics include the type of LUT, number of inputs per LUT, number of LUTs per CLB, usage of adder blocks, DSP blocks, memory blocks, and configuration protocol. OpenFPGA then automatically generates the minimum set of blocks needed to implement the target designs. This approach provides a convenient and efficient way to create FPGA architectures. The level of automation involved in this approach can be visualized in Figure 5.
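The semi-customized parameterization can be pictured with a short Python sketch. It is only an illustration: the layout rule (I/O blocks on the perimeter, empty corners reserved for routing, whole columns of DSP or memory blocks, CLBs elsewhere) and the names build_grid and count_blocks are assumptions made here and do not reproduce the OpenFPGA architecture file format.

```python
def build_grid(width, height, dsp_columns=(), memory_columns=()):
    """Return a height x width grid of block labels (hypothetical layout rule)."""
    grid = []
    for y in range(height):
        row = []
        for x in range(width):
            on_x_edge = x in (0, width - 1)
            on_y_edge = y in (0, height - 1)
            if on_x_edge and on_y_edge:
                row.append("EMPTY")  # corners left free for routing
            elif on_x_edge or on_y_edge:
                row.append("IO")     # perimeter holds the I/O blocks
            elif x in dsp_columns:
                row.append("DSP")
            elif x in memory_columns:
                row.append("MEM")
            else:
                row.append("CLB")
        grid.append(row)
    return grid


def count_blocks(grid):
    """Count how many blocks of each type the grid contains."""
    counts = {}
    for row in grid:
        for block in row:
            counts[block] = counts.get(block, 0) + 1
    return counts


if __name__ == "__main__":
    fabric = build_grid(width=6, height=6, dsp_columns=(2,), memory_columns=(4,))
    for row in fabric:
        print(" ".join(f"{block:5s}" for block in row))
    print(count_blocks(fabric))
```

Changing width, height, or the column tuples plays the role of the W, H, DSP, and memory parameters described for Figure 4.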

By offering these three categories of customization, OpenFPGA empowers designers to create FPGA architectures that range from fully customized to automatic, catering to different levels of expertise, design requirements, and time constraints.

3.2. Analysis of Pin Planning for FPGA Connectivity in OpenFPGA

In FPGA design, a crucial parameter for precise input and output assignment is the pin planner. It enables the designer to assign specific pins on the FPGA for their design’s inputs and outputs, preventing random assignment. OpenFPGA requires three essential files to create an effective pin planner.

The first file is the openfpga_io_map_file, which is an XML file containing the coordinates of each I/O pin throughout the FPGA. It specifies the location of each pin using a basic structure like io_pad="gfpga_pad_GPIO_PAD[168]" x="0" y="1" z="0". Here, gfpga_pad_GPIO_PAD[168] is the pad name and bit index, while the coordinates (x and y) indicate the tile’s position. The “z” parameter indicates the number of inputs/outputs in IO blocks.
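As a minimal sketch of how such an entry could be read programmatically, the Python snippet below parses one entry with the standard library. The enclosing element name io is an assumption (the text above only gives the attributes), and parse_io_entry is a hypothetical helper.

```python
import re
import xml.etree.ElementTree as ET

# Element name "io" is assumed; only the attributes come from the text above.
SAMPLE_ENTRY = '<io io_pad="gfpga_pad_GPIO_PAD[168]" x="0" y="1" z="0"/>'


def parse_io_entry(xml_text):
    """Split one I/O map entry into pad name, bit index, and coordinates."""
    elem = ET.fromstring(xml_text)
    match = re.fullmatch(r"(\w+)\[(\d+)\]", elem.attrib["io_pad"])
    if match is None:
        raise ValueError("unexpected pad name: " + elem.attrib["io_pad"])
    return {
        "pad_name": match.group(1),
        "bit": int(match.group(2)),
        "x": int(elem.attrib["x"]),
        "y": int(elem.attrib["y"]),
        "z": int(elem.attrib["z"]),
    }


if __name__ == "__main__":
    print(parse_io_entry(SAMPLE_ENTRY))
```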

The second file is the fpga_pin_table, which is a CSV file. It contains essential parameters such as orientation, port name, mapped pin name, and type. The structure of this file includes fields like orientation, row, col, pin_num_in_cell, port_name, mapped_pin, GPIO_type, Associated Clock, and Clock Edge. For example, a line in the file may look like TOP,,,,gfpga_pad_GPIO_PAD[0],pad_fpga_io[0],out,,.
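The field order listed above can be checked against the example line with a few lines of Python. This is a sketch under the assumption that the rows are read without a header line; parse_pin_table is a hypothetical helper, and the column keys simply mirror the field names given in the text.

```python
import csv
import io

# Column order as listed in the text above.
FIELDS = [
    "orientation", "row", "col", "pin_num_in_cell", "port_name",
    "mapped_pin", "GPIO_type", "Associated Clock", "Clock Edge",
]

SAMPLE_LINE = "TOP,,,,gfpga_pad_GPIO_PAD[0],pad_fpga_io[0],out,,"


def parse_pin_table(text):
    """Return one dict per CSV row, keyed by the field names above."""
    reader = csv.DictReader(io.StringIO(text), fieldnames=FIELDS)
    return list(reader)


if __name__ == "__main__":
    for entry in parse_pin_table(SAMPLE_LINE):
        print(entry["orientation"], entry["port_name"], "->",
              entry["mapped_pin"], entry["GPIO_type"])
```

Empty fields, such as row and col in the example, come back as empty strings.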

The third file is the Pin Constraints File (PCF) called fpga_pcf. This file establishes the pin binding between the implementation and the FPGA fabric, specifying the connections between specific pins and components in the design.

By utilizing these files, the pin planner in OpenFPGA allows designers to precisely assign and control the input and output pins of their FPGA designs, ensuring proper connectivity and functionality.

3.3. Analysis of Full Testbench for Functional Verification

OpenFPGA testbenches cover two phases: the configuration phase and the operating phase. In the configuration phase, the bitstream is loaded into the FPGA, which verifies that the configuration circuitry programs the fabric correctly. In the operating phase, random input vectors are automatically generated to drive the Devices Under Test (DUTs) within the FPGA. This phase allows users to validate the overall functionality and performance of the FPGA, including its customized circuits and the programmed fabric.

When both phases are used together, the result is referred to as a full testbench. By using the full testbench approach, users can thoroughly test and validate both the configuration circuits and the programming fabric of the FPGA, ensuring the correctness and reliability of their designs.
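The two phases can be illustrated with a toy Python model. This is not the Verilog testbench that OpenFPGA generates: the Lut class, its serial shift_in interface, and run_full_testbench are hypothetical stand-ins that only mimic the sequence of loading a bitstream and then applying random vectors against a reference function.

```python
import random


class Lut:
    """Toy k-input LUT whose truth table is loaded serially (illustrative only)."""

    def __init__(self, k):
        self.k = k
        self.config = [0] * (2 ** k)

    def shift_in(self, bit):
        # One configuration bit per step, as in a simple scan chain.
        self.config = [bit] + self.config[:-1]

    def evaluate(self, inputs):
        # inputs[0] is the least significant address bit.
        index = sum(bit << i for i, bit in enumerate(inputs))
        return self.config[index]


def make_bitstream(func, k):
    """Order the truth table so that config[i] == func(i) after shifting."""
    table = [func([(i >> j) & 1 for j in range(k)]) for i in range(2 ** k)]
    return list(reversed(table))


def run_full_testbench(func, k, num_vectors=100, seed=0):
    """Configuration phase followed by operating phase; returns mismatch count."""
    dut = Lut(k)
    for bit in make_bitstream(func, k):   # configuration phase
        dut.shift_in(bit)
    rng = random.Random(seed)
    mismatches = 0
    for _ in range(num_vectors):          # operating phase
        vector = [rng.randint(0, 1) for _ in range(k)]
        if dut.evaluate(vector) != func(vector):
            mismatches += 1
    return mismatches


if __name__ == "__main__":
    xor2 = lambda v: v[0] ^ v[1]
    print("mismatches:", run_full_testbench(xor2, k=2))
```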

3.4. Analysis of RTL Transfer from OpenFPGA to OpenLane

As mentioned, when working with OpenFPGA, certain modifications need to be made to the FPGA Verilog file. By default, OpenFPGA generates a hierarchy file that contains information about the hierarchy of the FPGA. The main modules that need to be modified are those following the FPGA_top module. By default, the main modules include IO blocks, CLBs, switching blocks, and connection blocks. The specific type and quantity of these modules will depend on the chosen FPGA architecture and its characteristics.

The Verilog files must be modified at both the macro level (instances) and the core level (top module). To connect the power grid, power and ground ports, usually called vcc and vss, are added inside an `ifdef USE_POWER_PINS block. Within this block, the <power_pin> and <ground_pin> declarations are placed. This ensures that the power and ground connections are properly defined within the RTL code, enabling effective power distribution within the macro and core designs.

Another measure that avoids syntax or routing errors with Yosys is to make wire the default net type. With wire as the default, any signal that is not explicitly declared is treated as a wire. This prevents unintended misinterpretation of signal types during synthesis and reduces the chance of errors, contributing to a smoother synthesis and routing process. Finally, after these changes are made, the Verilog files are exported to the OpenLane project.
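Both modifications can be scripted. The Python sketch below is one possible way to do it, not the procedure used by the authors: it assumes ANSI-style module headers with at least one port, declares the power pins as inout (an assumption), and expresses the default net type through the `default_nettype directive. The helper name prepare_for_openlane is hypothetical.

```python
import re

# Guarded power pins; the inout direction is an assumption of this sketch.
POWER_PORTS = (
    "`ifdef USE_POWER_PINS\n"
    "    inout vcc,\n"
    "    inout vss,\n"
    "`endif\n"
)


def prepare_for_openlane(verilog_src):
    """Set the default net type to wire and add guarded power pins."""
    # Replace any existing directive, or prepend one if none is present.
    src = re.sub(r"`default_nettype\s+\w+", "`default_nettype wire", verilog_src)
    if "`default_nettype" not in src:
        src = "`default_nettype wire\n" + src
    # Insert the power pins right after each module's opening parenthesis.
    return re.sub(
        r"(\bmodule\s+\w+\s*\()",
        lambda m: m.group(1) + "\n" + POWER_PORTS,
        src,
    )


SAMPLE = """`default_nettype none
module clb_tile (
    input clk,
    output out
);
endmodule
"""

if __name__ == "__main__":
    print(prepare_for_openlane(SAMPLE))
```

A module with an empty port list would end up with a trailing comma, so the output should still be reviewed by hand.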

3.5. Analysis of Macro Hardening in the RTL-to-GDSII Process

In order to obtain the GDSII file for each main module of the FPGA, a systematic process is followed within OpenLane. First, an OpenLane project is created for each module, with the number of modules depending on the specific FPGA architecture. Each module is then subjected to the RTL-to-GDSII workflow using OpenLane, treating them as individual macros. It is essential to ensure that each module adheres to the required basic configuration parameters, which are detailed in [19]. These basic parameters are DESIGN_NAME, VERILOG_FILES, CLOCK_PORT, CLOCK_PERIOD, FP_SIZING, DIE_AREA, FP_CORE_UTIL, FP_PDN_MULTILAYER, RT_MAX_LAYER, VDD_NETS, and GND_NETS.

To ensure the proper hardening of each module within the FPGA, it is essential to configure additional key parameters accurately. The specific parameters required depend on the macro module and the overall FPGA architecture. These parameters play a significant role in optimizing the performance and functionality of the FPGA design. Here are the key parameters and their significance:

GPL_CELL_PADDING: This parameter determines the spacing between cells in the placement process. It helps in avoiding any potential violations or timing issues caused by cell proximity.
FP_PDN_VPITCH, FP_PDN_HPITCH, and FP_PDN_VOFFSET: These parameters are associated with the PDN of the FPGA. They define the vertical and horizontal pitch, as well as the vertical offset, for the PDN grid.
DIODE_INSERTION_STRATEGY: This parameter controls the strategy for inserting antenna diodes. These diodes protect transistor gates from charge that accumulates on long wires during fabrication, so an appropriate strategy helps the design pass antenna checks and improves reliability.
SYNTH_READ_BLACKBOX_LIB: When set to true, it indicates that the RTL instantiates standard cells directly, so the synthesis tool reads the standard-cell library as black boxes and preserves those instances.
PL_TARGET_DENSITY, PL_RESIZER_DESIGN_OPTIMIZATIONS, and PL_RESIZER_TIMING_OPTIMIZATIONS: These parameters are related to the placement process of the FPGA. They control the target density of the design and enable design optimizations at the placement stage to enhance performance and meet timing requirements.
RUN_CTS: This parameter determines whether CTS is performed during the physical design flow. CTS is essential for proper clock distribution and minimizing clock skew in the design.
GLB_RESIZER_TIMING_OPTIMIZATIONS, GRT_ADJUSTMENT: These parameters are associated with global routing in the FPGA design. They enable timing optimizations and adjustments during the global routing stage to meet critical timing constraints.

By understanding and appropriately configuring these parameters based on the OpenLane documentation, designers can effectively harden each module of the FPGA, ensuring optimal performance, timing, and reliability.
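A macro configuration built from the parameters named above might be generated as in the following Python sketch. Only the keys come from the text; every value (die area, clock period, layer name, density, net names) is a placeholder chosen for illustration, and the helper macro_config and the macro name sb_0__0_ are hypothetical. The accepted value formats should be checked against the documentation of the installed OpenLane version.

```python
import json


def macro_config(design_name, verilog_files, die_area,
                 clock_port="clk", clock_period_ns=10.0):
    """Build a per-macro configuration dict; all values are placeholders."""
    x0, y0, x1, y1 = die_area
    return {
        # Basic parameters listed in the text.
        "DESIGN_NAME": design_name,
        "VERILOG_FILES": verilog_files,
        "CLOCK_PORT": clock_port,
        "CLOCK_PERIOD": clock_period_ns,
        "FP_SIZING": "absolute",
        "DIE_AREA": f"{x0} {y0} {x1} {y1}",
        "FP_PDN_MULTILAYER": False,
        "RT_MAX_LAYER": "met4",
        "VDD_NETS": ["vcc"],
        "GND_NETS": ["vss"],
        # A few of the additional hardening parameters.
        "PL_TARGET_DENSITY": 0.5,
        "RUN_CTS": True,
    }


if __name__ == "__main__":
    cfg = macro_config("sb_0__0_", "dir::src/*.v", (0, 0, 150, 150))
    print(json.dumps(cfg, indent=4))
```

Generating the file from a function keeps the per-module projects consistent when the fabric contains many macro types.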

3.6. Analysis of Placement Configuration for Macro Integration

The macro placement configuration file is a crucial document that defines the placement of each module within the FPGA architecture. It specifies the coordinates for various components such as switching blocks, x and y connection blocks, CLBs, I/O blocks, and any other relevant blocks. Each module is assigned its own unique coordinate, which determines its physical location within the FPGA fabric. It is essential to ensure that modules are properly spaced to avoid routing congestion issues. Additionally, modules should not overlap or encroach upon the space allocated for other modules to ensure proper functionality and prevent any potential conflicts during the placement process. By carefully defining the coordinates and allocating sufficient space for each module, designers can achieve optimized placement and mitigate routing congestion challenges in the FPGA design.

To streamline macro placement in the FPGA design, a Python 3.8 script was developed. The script takes as input the maximum sizes of the x and y connection blocks, CLBs, and switching blocks. From these sizes it generates the perimeter of a basic tile, as depicted in Figure 6, which serves as the boundary for placing the tiles of the FPGA architecture.

The dimensions of the basic tile perimeter are calculated, and the coordinates of each tile are adjusted accordingly, considering the position of the tile within the FPGA fabric. Additionally, the script includes parameters for specifying the spacing between cells in the x and y directions. However, it is important to note that while the script automates much of the floorplanning process, it still requires manual verification and adjustment to ensure optimal placement and address any specific floorplanning considerations or constraints.
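The tile-based coordinate calculation described above can be sketched as follows. This is not the authors' script: the function names, the tile sizes, and the gaps are assumptions chosen for illustration, all macros are placed with orientation N, and I/O blocks are omitted for brevity. The output lines follow the `instance x y orientation` layout commonly used in OpenLane macro placement files.

```python
def tile_geometry(sizes, gap_x, gap_y):
    """Derive column/row sizes and the tile pitch from the maximum block sizes."""
    col_w = max(sizes["sb"][0], sizes["cby"][0])    # column holding sb and cby
    row_h = max(sizes["sb"][1], sizes["cbx"][1])    # row holding sb and cbx
    body_w = max(sizes["clb"][0], sizes["cbx"][0])  # column holding clb and cbx
    body_h = max(sizes["clb"][1], sizes["cby"][1])  # row holding clb and cby
    pitch_x = col_w + gap_x + body_w + gap_x
    pitch_y = row_h + gap_y + body_h + gap_y
    return col_w, row_h, pitch_x, pitch_y


def macro_placement(n, sizes, gap_x, gap_y, origin=(0, 0)):
    """Return placement lines for an n x n CLB fabric (I/O blocks not included)."""
    col_w, row_h, pitch_x, pitch_y = tile_geometry(sizes, gap_x, gap_y)
    ox, oy = origin
    lines = []
    for i in range(n + 1):
        for j in range(n + 1):
            x_edge = ox + i * pitch_x
            y_edge = oy + j * pitch_y
            x_body = x_edge - pitch_x + col_w + gap_x  # only used when i >= 1
            y_body = y_edge - pitch_y + row_h + gap_y  # only used when j >= 1
            lines.append(f"sb_{i}__{j}_ {x_edge} {y_edge} N")
            if i >= 1:
                lines.append(f"cbx_{i}__{j}_ {x_body} {y_edge} N")
            if j >= 1:
                lines.append(f"cby_{i}__{j}_ {x_edge} {y_body} N")
            if i >= 1 and j >= 1:
                lines.append(f"grid_clb_{i}__{j}_ {x_body} {y_body} N")
    return lines


# Hypothetical maximum sizes (width, height) in microns and a 20 um gap.
SIZES = {"sb": (100, 100), "cbx": (170, 70), "cby": (70, 170), "clb": (170, 170)}
placement = macro_placement(3, SIZES, gap_x=20, gap_y=20, origin=(300, 241))
print("\n".join(placement[:4]))
```

Regenerating the file from a handful of parameters keeps every tile on the same pitch, but the result should still be inspected manually, as noted above.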

3.7. Analysis of Core-Level Configuration in FPGA Design

To initiate the design process for the core module (FPGA_top), the first step involves creating a new OpenLane project. It is essential to adhere to the OpenLane workflow and treat the FPGA_top module as a core entity. The project configuration requires specifying various basic parameters, which can be found in [19]. These configuration parameters include DESIGN_NAME, VERILOG_FILES, CLOCK_PORT, CLOCK_PERIOD, FP_PDN_MULTILAYER, EXTRA_LEFS, EXTRA_GDS_FILES, VERILOG_FILES_BLACKBOX, FP_SIZING, DIE_AREA, RT_MAX_LAYER, VDD_NETS, GND_NETS, and FP_PDN_MACRO_HOOKS.

After selecting the basic parameters for a successful RTL-to-GDSII conversion, it is recommended to configure additional parameters for optimal results. These configurations can help fine-tune the OpenLane flow and achieve desired outcomes. The following parameter settings are suggested:

FP_PDN_CHECK_NODES = 0: Disables checking nodes during floor planning PDN generation.
SYNTH_ELABORATE_ONLY = 1: Elaborates the design without performing logic mapping, which is appropriate when the top level is a structural netlist of pre-hardened macros.
PL_RANDOM_GLB_PLACEMENT = 1: Uses random global placement (GLB stands for global) instead of the default analytical global placement.
PL_RESIZER_DESIGN_OPTIMIZATIONS = 0: Disables design optimizations during placement.
PL_RESIZER_TIMING_OPTIMIZATIONS = 0: Disables timing optimizations during placement.
PL_RESIZER_BUFFER_INPUT_PORTS = 0: Disables the insertion of buffers on input ports during placement.
FP_PDN_ENABLE_RAILS = 0: Disables the creation of standard-cell power rails in the floorplan power distribution network.
DIODE_INSERTION_STRATEGY = 0: Selects strategy 0, in which no antenna diodes are inserted.
RUN_FILL_INSERTION = 0: Disables the fill insertion step.
RUN_TAP_DECAP_INSERTION = 0: Disables the tap and decap cell insertion step.
CLOCK_TREE_SYNTH = 0: Disables the clock tree synthesis step.
MAGIC_ZEROIZE_ORIGIN = 0: Disables moving the layout origin to (0, 0) in the views generated by Magic.
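The settings listed above can be collected into a configuration fragment programmatically. The sketch below only assembles a Python dictionary and prints it as JSON; the helper name `build_core_config` is hypothetical, and the basic parameters (VERILOG_FILES, DIE_AREA, and so on) are intentionally left out because their values depend on the design.

```python
import json

# Values copied from the list above; variable names may differ between
# OpenLane releases, so they should be checked against the version in use.
CORE_OVERRIDES = {
    "FP_PDN_CHECK_NODES": 0,
    "SYNTH_ELABORATE_ONLY": 1,
    "PL_RANDOM_GLB_PLACEMENT": 1,
    "PL_RESIZER_DESIGN_OPTIMIZATIONS": 0,
    "PL_RESIZER_TIMING_OPTIMIZATIONS": 0,
    "PL_RESIZER_BUFFER_INPUT_PORTS": 0,
    "FP_PDN_ENABLE_RAILS": 0,
    "DIODE_INSERTION_STRATEGY": 0,
    "RUN_FILL_INSERTION": 0,
    "RUN_TAP_DECAP_INSERTION": 0,
    "CLOCK_TREE_SYNTH": 0,
    "MAGIC_ZEROIZE_ORIGIN": 0,
}


def build_core_config(design_name, overrides=CORE_OVERRIDES):
    """Return a config dictionary with the design name plus the overrides."""
    config = {"DESIGN_NAME": design_name}
    config.update(overrides)
    return config


core_config = build_core_config("FPGA_top")
print(json.dumps(core_config, indent=4))
```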

These parameter configurations can be adjusted based on the specific requirements of the design. For macro hardening, OpenLane provides features to customize the hardening process. It involves optimizing and hardening individual macros or modules to improve their performance and reliability. The OpenLane documentation provides detailed explanations of each parameter and its impact on the macro hardening process. By carefully configuring these parameters, designers can achieve efficient macro hardening and fine-tune the FPGA design to meet their specific goals.

The remaining configurations are related to the PDN. To address PDN issues, it is recommended to examine the FP_PDN variables. These variables allow for fine-tuning and customization of the PDN generation process.

3.8. Analysis of Gate-Level Simulation for Functional Validation

After completing the RTL-to-GDSII flow, a variety of files will be generated. If the workflow is successful, you can locate the gate-level Verilog netlists in the results/final/verilog/gl directory of the OpenLane run. These netlists are expressed in terms of SKY130 logic cells. With them, you can run the same full testbench again, this time at the gate level. This allows for comprehensive testing and verification of the synthesized design using the cell models of the SKY130 PDK.
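As an illustration of how such a gate-level run might be launched, the sketch below builds a command line for Icarus Verilog. The choice of simulator, the file names, and the helper `iverilog_command` are assumptions made for this example; the cell-model paths follow the usual open_pdks installation layout for the sky130_fd_sc_hd library and may differ on other setups.

```python
from pathlib import Path


def iverilog_command(netlist, testbench, pdk_root, output="gl_sim.vvp"):
    """Build (but do not run) an iverilog command for a gate-level simulation."""
    cells = Path(pdk_root) / "sky130A" / "libs.ref" / "sky130_fd_sc_hd" / "verilog"
    return [
        "iverilog", "-g2012",
        "-o", output,
        "-DFUNCTIONAL",             # use the functional standard-cell models
        "-DUNIT_DELAY=#1",          # unit delay expected by those models
        str(cells / "primitives.v"),
        str(cells / "sky130_fd_sc_hd.v"),
        str(netlist),               # gate-level netlist written by OpenLane
        str(testbench),             # full testbench generated by OpenFPGA
    ]


# Hypothetical file names and PDK location.
cmd = iverilog_command("FPGA_top.v", "full_testbench.v", "/opt/pdk")
print(" ".join(cmd))
```

The resulting list can be passed to `subprocess.run`, and the compiled simulation is then executed with `vvp`.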

4. Case Study

This case study explores the development of a custom FPGA architecture capable of implementing basic logic gates and arithmetic operations. The primary goal is to design a replicable FPGA architecture that is accessible to a wide audience. To achieve this, the case study targets a small FPGA that can be reproduced with low-performance equipment and open-source tools, allowing anyone to follow the design flow and create their own custom FPGA design. While the approach is tailored for simplicity and accessibility, it is also scalable, allowing for the creation of larger and more complex FPGA architectures. However, scaling up may require more advanced resources and commercial simulation tools, as open-source simulators are currently limited to single-core execution, which can lead to long processing times for more intricate designs.

The case study uses OpenFPGA to customize the FPGA architecture following the semi-customized approach, tailoring it to the requirements of basic arithmetic operations. Through the use of XML files, the structure, functionality, and physical characteristics of the FPGA design are defined. After the architecture design, a full testbench validates the design by configuring the FPGA to implement logic gates as well as addition and subtraction.

The physical design process leverages OpenLane’s RTL-to-GDSII flow, which facilitates the synthesis, placement, and routing of the FPGA design down to the layout level. Following the recommended configurations and parameters, as outlined in the previous sections, ensures an optimized and reliable implementation.

This case study exemplifies the practical application of the open-source tools OpenFPGA and OpenLane in the development of custom FPGA architectures down to the layout level. The ultimate result of the RTL-to-GDSII flow is the creation of a finalized GDSII file, representing a design that is ready for manufacturing. This successful translation of the custom FPGA architecture into a tangible hardware implementation highlights the potential of open-source tools in democratizing FPGA development and enabling innovation in the field of hardware acceleration.

4.1. Case Study: Analysis of FPGA Architecture Customization with OpenFPGA

For the development of this FPGA, a semi-customized approach was utilized, building upon the k4_N4 architecture provided by OpenFPGA. This configuration incorporates an FPGA structure with I/O blocks positioned along the perimeter, encircling the CLBs. Each CLB is composed of four LUTs, each supporting four inputs. Modifications to this architecture include the adoption of a 3 × 3 CLB arrangement, the use of a single I/O per I/O block, and the incorporation of a tileable design that maximizes the reuse of submodules. Figure 7 illustrates this architecture.
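The logic capacity implied by these architectural parameters follows from simple arithmetic. The sketch below uses hypothetical helper names; it counts the LUTs in a CLB array and the truth-table bits of a k-input LUT, which stores 2^k entries. Routing and other configuration bits are not included.

```python
def lut_count(rows, cols, luts_per_clb):
    """Total number of LUTs in a rows x cols array of CLBs."""
    return rows * cols * luts_per_clb


def lut_truth_table_bits(lut_inputs):
    """A k-input LUT stores one output bit per input combination."""
    return 2 ** lut_inputs


case_study_luts = lut_count(3, 3, 4)                          # 3 x 3 CLBs, 4 LUTs each
case_study_bits = case_study_luts * lut_truth_table_bits(4)   # LUT contents only
larger_fabric_luts = lut_count(6, 9, 10)                      # k6_N10 fabric of Appendix B

print(case_study_luts, case_study_bits, larger_fabric_luts)   # 36 576 540
```

The last value matches the 540 six-input LUTs reported for the larger fabric in Appendix B.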

4.2. Case Study: Analysis of Functional Verification Using Full Testbench

One key feature of OpenFPGA is its ability to generate a complete testbench that combines both the configuration and operating phases, as discussed in previous sections. It is important to note that the generated testbench sets up the environment to send the bitstream and configure the FPGA. Once the FPGA is configured, a series of random values are sent to all of its inputs, which is suitable for simple arithmetic and logic operations. However, this approach may not be ideal for more complex systems or protocol communication. In such cases, it is recommended to modify the testbench provided by OpenFPGA to better suit specific requirements.

Figure 8a shows the configuration phase testbench of the FPGA when it is configured as an AND gate. The figure highlights five control signals: prog_clk, the clock used during FPGA configuration; ccff_head (also known as the Configuration Chain Flip-flop or ccff), which serves as the input to send the bitstream (one bit for this architecture); set and reset signals, used to initialize the FPGA state; and finally, ccff_tail, which acts as an output signal indicating when the FPGA has been successfully configured.
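The role of ccff_head and ccff_tail can be illustrated with a simplified behavioral model of a configuration chain as a serial shift register. This is a sketch under stated assumptions, not OpenFPGA's implementation: the function name is hypothetical, the chain starts cleared (as after reset), and the bit ordering of a real bitstream may differ.

```python
def shift_bitstream(bitstream, chain_length):
    """Shift bits into a chain of flip-flops, one bit per prog_clk edge.

    Returns the final chain contents (index 0 is the flip-flop next to
    ccff_head) and the value observed on ccff_tail after each clock edge.
    """
    chain = [0] * chain_length           # state after reset
    tail_trace = []
    for bit in bitstream:
        chain = [bit] + chain[:-1]       # every flip-flop takes its predecessor's value
        tail_trace.append(chain[-1])     # last flip-flop drives ccff_tail
    return chain, tail_trace


bits = [1, 0, 1, 1]
chain, tail = shift_bitstream(bits, chain_length=4)
print(chain)  # [1, 1, 0, 1]
print(tail)   # [0, 0, 0, 1]
```

In this model the first bit sent reaches ccff_tail only after as many clock edges as there are flip-flops in the chain, which is why activity on the tail can be used as an indication that the whole bitstream has been loaded.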

Figure 8b illustrates the operating phase, where GPIOs [0] and [1] are used as the inputs for the AND gate, and GPIO [6] serves as the output of the AND gate.

4.3. Case Study: Analysis of Macro-Level RTL-to-GDSII Implementation

Before starting the RTL-to-GDSII flow with OpenLane, it is important to organize the Verilog files into their respective main modules. To achieve this, refer to the fabric_hierarchy.txt file provided by OpenFPGA. For this case, the primary modules that compose the FPGA_top are

grid_io_top
grid_io_right
grid_io_bottom
grid_io_left
grid_clb
sb_0__0_
sb_0__1_
sb_0__3_
sb_1__0_
sb_1__1_
sb_1__3_
sb_3__0_
sb_3__1_
sb_3__3_
cbx_1__0_
cbx_1__1_
cbx_1__3_
cby_0__1_
cby_1__1_
cby_3__1_

Note that modules containing the word “io” in their name refer to the I/O blocks, “clb” refers to the CLBs, “sb” denotes the switching block modules, and “cb” represents the connection blocks for the “x” and “y” directions.

The next step is to modify the Verilog files of the previous modules by adding the <power_pin> and <ground_pin> ports; e.g., grid_io_top.v changes from Listing 1 to Listing 2.

Listing 1. Verilog code prior to power port modification, showing instantiation via positional association.

Listing 2. Verilog code after power port modification, showing instantiation via named association.

Now, we can import the FPGA Verilog files into OpenLane to obtain the layout of the main modules. Section 3.5 provides the essential configuration required to achieve a successful macro layout. For example, Listing 3 shows the config.json file that was used to generate the grid_io_top tile layout; recall that OpenLane uses config.json to drive the RTL-to-GDSII flow.

Listing 3. JSON configuration used in OpenLane to harden the grid_io_top macro.

Figure 9 illustrates the layout obtained after the OpenLane execution.

This process must be repeated for the primary modules that compose the FPGA_top.
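Because the same hardening step is repeated for every module, the per-macro configurations can be generated in a loop. The sketch below is illustrative only: the helper names, the `dir::src/` file layout, the use of prog_clk as the clock port, and the default 40 MHz target are assumptions, and only the parameters needed to show the pattern are included. The clock period is derived from a target frequency as 1000 / f (ns for f in MHz).

```python
import json

MACROS = [
    "grid_io_top", "grid_io_right", "grid_io_bottom", "grid_io_left", "grid_clb",
    "sb_0__0_", "sb_0__1_", "sb_0__3_", "sb_1__0_", "sb_1__1_",
    "sb_1__3_", "sb_3__0_", "sb_3__1_", "sb_3__3_",
    "cbx_1__0_", "cbx_1__1_", "cbx_1__3_", "cby_0__1_", "cby_1__1_", "cby_3__1_",
]

# Target frequencies in MHz; entries not listed fall back to an assumed default.
TARGET_MHZ = {"grid_clb": 35}
DEFAULT_MHZ = 40


def clock_period_ns(freq_mhz):
    """Convert a frequency in MHz to a clock period in ns."""
    return round(1000.0 / freq_mhz, 2)


def macro_config(name):
    """Return a minimal, hypothetical config dictionary for one macro."""
    return {
        "DESIGN_NAME": name,
        "VERILOG_FILES": f"dir::src/{name}.v",   # assumed file layout
        "CLOCK_PORT": "prog_clk",                # assumed clock port
        "CLOCK_PERIOD": clock_period_ns(TARGET_MHZ.get(name, DEFAULT_MHZ)),
    }


configs = {name: macro_config(name) for name in MACROS}
print(json.dumps(configs["grid_io_top"], indent=4))
```

Each dictionary would then be written to the config.json of the corresponding OpenLane project and completed with the remaining parameters from Section 3.5.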

4.4. Case Study: Analysis of Core-Level RTL-to-GDSII Integration

The initial step involves grouping the files generated by each macro, as these files are essential for producing the final layout. After file organization, we configure the config.json file with the parameters detailed in Section 3.6 and Section 3.7. Particular attention must be paid to the FP_PDN_MACRO_HOOKS and MACRO_PLACEMENT_CFG configurations.

The FP_PDN_MACRO_HOOKS parameter (Listing 4) establishes explicit connections for the voltage and ground pins that were incorporated in the Verilog files using USE_POWER_PINS. The asterisk (*) denotes that all modules with the specified name will be connected, as shown in the following configuration example:

Listing 4. JSON configuration of FP_PDN_MACRO_HOOKS in OpenLane, where * is used as a wildcard to apply power and ground connections to matching modules.

Conversely, the MACRO_PLACEMENT_CFG parameter (Listing 5) references a macro.cfg file containing coordinate strings that determine the placement of each macro instantiated in the top module. The configuration fragment below illustrates this:

Listing 5. Fragment of the macro.cfg file in OpenLane, specifying placement coordinates and orientations for top-level macros.

In this example, the macro sb_0__0_ is positioned at coordinates (x = 300, y = 241) within the area defined by the DIE_AREA parameter. Figure 10 displays a portion of the FPGA_top layout after the correct execution of the RTL-to-GDSII flow, demonstrating the placement of modules sb_0__0_, sb_1__0_, cbx_1__0_, grid_io_bottom_1__0_, and sb_0__1_ according to the macro.cfg specifications.
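The power-hook strings discussed above can also be generated rather than typed by hand. The sketch below assumes the five-field form `instance vdd_net gnd_net vdd_pin gnd_pin` documented for recent OpenLane 1 releases (older releases used fewer fields, so the documentation of the installed version should be consulted); the net and pin names vccd1/vssd1 and the family prefixes are illustrative assumptions, and the trailing asterisk follows the wildcard convention described in the text.

```python
def macro_hook(instance_pattern, vdd_net, gnd_net, vdd_pin, gnd_pin):
    """Format one macro power hook as a space-separated string."""
    return f"{instance_pattern} {vdd_net} {gnd_net} {vdd_pin} {gnd_pin}"


# One pattern per tile family; the asterisk is meant to match every instance.
FAMILIES = ["grid_io_top", "grid_io_right", "grid_io_bottom", "grid_io_left",
            "grid_clb", "sb", "cbx", "cby"]

hooks = [macro_hook(f"{family}*", "vccd1", "vssd1", "vccd1", "vssd1")
         for family in FAMILIES]
hook_string = ", ".join(hooks)   # comma-separated form for the config file
print(hook_string)
```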

The configuration files mentioned in Section 4.3 and Section 4.4 can be found in the [24] GitHub repository commit bb57e04.

5. Results

5.1. Analysis of Macro-Level Results: Area, Frequency, and Power

Following the physical design implementation of the principal modules comprising the FPGA_top architecture, 20 macros were generated in layout.

The characterization of the macros generated from the FPGA_top architecture provides both functional and physical insights into the design quality. Table 2 summarizes key structural properties, including area, operating frequency, dynamic power consumption, number of logic gates, flip-flops, and I/O ports. These values highlight the differences in resource utilization between the simpler I/O macros and the more complex switch boxes and configurable logic blocks, which naturally exhibit larger areas, higher gate counts, and greater power demands.

To complement these structural metrics, Table 3 presents post-layout timing and power integrity results, reporting the worst setup and hold slacks, total negative slack (TNS), and both average and worst-case IR-drop values. Collectively, these results confirm that the macros not only meet their functional requirements but also remain within acceptable timing margins and supply integrity levels, ensuring robustness of the FPGA fabric. The corresponding layout views of each macro are provided in Appendix A.
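For reference, the relation between per-endpoint slacks, the worst slack, and the TNS can be written out explicitly: the worst slack is the minimum over all endpoints, and the TNS is the sum of the negative slacks only. The slack values in the sketch below are invented for illustration and are not results from this work.

```python
def timing_summary(slacks_ns):
    """Return (worst slack, total negative slack) for a list of endpoint slacks."""
    worst = min(slacks_ns)
    tns = sum(s for s in slacks_ns if s < 0)
    return worst, tns


# Hypothetical endpoint slacks in ns.
worst_ok, tns_ok = timing_summary([1.2, 0.3, 0.8])
worst_bad, tns_bad = timing_summary([1.2, 0.3, -0.1, -0.4])
print(worst_ok, tns_ok)
print(worst_bad, tns_bad)
```

A design that meets timing has a non-negative worst slack and a TNS of zero.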

5.2. Analysis of Core-Level Layout and Integration Results

At the core level, OpenLane automatically routes all macros according to the structural Verilog netlist specified in FPGA_top.v and the settings in the config.json file. Figure 11 illustrates the resulting physical layout of the FPGA architecture, which corresponds to the design presented in Figure 7 with the addition of the connection and switching blocks shown in Figure 6.

Note that the large macros correspond to grid_clb and sb_0__0_, while the smaller macros distributed across the periphery represent the io macros.

5.3. Analysis of Gate-Level Simulation Results

The correct execution of the gate-level simulation requires the cell models of the SKY130 PDK, because at this stage the Verilog netlist originally generated by OpenFPGA has already been translated into thousands of standard cells and interconnections. A logic error at this level would present a significant challenge, as identifying its root cause would involve meticulously analyzing the intricate network of gates and wires.

Figure 12 presents the gate-level simulation of the implemented FPGA performing the logical operations AND (Figure 12a), OR (Figure 12b), and XOR (Figure 12c), while Figure 13 presents the gate-level simulation of the arithmetic operations (addition and subtraction). In both figures, the orange signals denote the FPGA input pins, while the purple signals indicate the FPGA output pins. These results confirm the accurate translation and correct functionality from RTL to gate-level implementation using the SKY130 PDK.

6. Discussion

This work demonstrates that a fully open-source toolchain (specifically OpenFPGA [8] and OpenLane [19]) can implement a semi-custom embedded FPGA from RTL to a manufacturable GDSII, with the resulting layout verified at the gate level in SKY130. An alternative option for implementing custom FPGA RTL is the FABulous framework [11], which has been successfully employed in prior works. However, in the present study, FABulous was not adopted, as it does not provide SDC file generation, a feature essential for seamless integration with back-end physical design flows. A comparative evaluation of FABulous and OpenFPGA would nevertheless be valuable in future work, as it could highlight differences in usability, reproducibility, and flow automation. On the other hand, frameworks such as PRGA [12] and Archipelago [25] were also not considered in this study, as they require significantly more manual effort—particularly in generating testbenches—thus complicating reproducibility and increasing the barrier to entry for new users.

In the presented workflow, OpenLane automatically routes the hardened macros within the 3 × 3 CLB fabric (Figure 11) by leveraging the defined floorplanning constraints, while gate-level simulations confirm correct logical (AND/OR/XOR) and arithmetic (add/sub) functionality after technology mapping to standard cells (Figure 12 and Figure 13). These system-level results show that the architectural intent specified in OpenFPGA is faithfully preserved through physical synthesis and sign-off in OpenLane.

A notable practical finding is the emphasis on reproducible macro hardening and the correct interconnection among the hardened blocks, as illustrated in Figure 6 and Figure 7. At the macro level, the principal components of the fabric—including I/Os, CLBs, and switch and connection blocks—were individually hardened; the complete set is enumerated in Appendix A. As reported in Section 5, a total of 20 macros were successfully realized at the layout stage, enabling floorplanning and routing of the top-level integration.

The size contrast between the larger tiles (e.g., grid_clb, sb_0__0_) and the smaller I/O tiles along the periphery helps explain the placement and routing pattern observed in Figure 11. In particular, the dimensions of the connection and switching blocks could not be reduced further without changing the RTL architecture, as their size is mainly determined by the number of I/Os required per tile and by the minimum spacing restrictions that must be respected in the layout between adjacent I/Os; a flat implementation (without macros) of connection and switching blocks often worsens routing, causing routing congestion. The size of the CLB, in turn, arises because OpenLane synthesizes memories as large groups of registers; since LUTs are inherently memory elements and each CLB contains several LUTs, a substantial portion of the CLB area is consumed by registers. While this could be alleviated through the use of small custom memory macros, such an approach would require advanced handling of mixed signals as well as specialized expertise in memory architecture and analog layout design.

Regarding the OpenLane constraints, two configurations were essential for clean integration: explicit macro power hooking and scripted macro placement. The use of FP_PDN_MACRO_HOOKS ensured consistent power/ground stitching across all major tile families, while a macro.cfg with Python-generated coordinates enforced spacing and orientation, preventing overlaps and reducing congestion. These measures enabled automated placement without manual adjustments. Parameter choices also influenced routability and verification effort: high placement density (PL_TARGET_DENSITY = 0.90) was feasible for small macros but problematic at larger scales. Finally, running the full gate-level testbench with SKY130 models was critical to detect RTL integration mismatches in the flow.

Relative to prior research [2,7,8,9,11], our contribution is to document a comprehensive RTL-to-GDSII workflow, from RTL design to layout configuration: floorplanning decisions, PDN hooks, placement files, routing, and gate-level simulation. Other embedded FPGA layout implementations [7,9] advanced automatic floorplanning but did not comprehensively expose routability/clocking options or runtime/memory considerations, which complicates replication by newcomers; additionally, much of the literature still relies on proprietary PDKs under NDA, curbing transparency. Our workflow is demonstrated entirely with an open PDK and includes concrete config fragments to lower the barrier to reproduction, enabling new researchers to incorporate and experiment with embedded FPGAs in their designs.

The present validation was conducted using SKY130, one of three currently fabricable open PDKs (alongside GF180MCU [15] and IHP-130 [16]). While portability to other nodes is plausible, it is not guaranteed, as differences in metal stacks, antenna rules, and routing resources may require re-tuning of floorplan dimensions, PDN pitches, and diode insertion strategies. Although OpenROAD-based [21] flows support multiple PDKs, OpenLane’s [19] higher level of automation and the use of our predefined templates were preferred to reduce development time. A further consideration is scalability: open-source simulators and verification tools can become runtime bottlenecks for larger fabrics (e.g., single-core execution in certain steps). This highlights the need for modular testbenches and partitioned gate-level checks to maintain efficiency as design complexity increases.

In addition to the 3 × 3 CLB case study, we have extended the validation with a larger implementation to illustrate scalability of the flow. Specifically, a layout of an FPGA fabric comprising 540 six-input LUTs has been generated (Figure A1), demonstrating that the OpenFPGA/OpenLane toolchain can handle designs of significantly greater size and complexity beyond the small-scale prototype. This larger case highlights runtime and resource trends consistent with expectations: placement density and routing congestion increase with fabric size, but the flow remains functional when aided by scripted macro placement and explicit PDN hooks. As discussed above, scalability is ultimately limited by simulator runtime and verification overhead; however, the successful integration of a fabric with hundreds of LUTs confirms the applicability of the methodology to more general and industrially relevant scenarios.

While the workflow demonstrates functional correctness and scalability, reliability remains a critical dimension for mission-critical applications such as aerospace, automotive, and secure communications. Open-source tools like OpenLane and OpenFPGA do not yet natively support advanced reliability features, including automatic DFT, built-in self-test, or fault-tolerance mechanisms against single event upsets. These aspects are particularly relevant when targeting safety-critical or radiation-prone environments, where robustness must be validated through fault injection campaigns, redundancy strategies, and rigorous verification using edge cases. Although some manual methods (e.g., scan chain insertion) have been reported, integrating them seamlessly into the open-source RTL-to-GDSII flow remains an open challenge. Addressing these gaps would strengthen the applicability of the proposed methodology to industrial sectors that demand not only performance but also resilience and long-term reliability.

Practical guidance distilled from this study includes the following: (i) add USE_POWER_PINS to all macro RTL and verify uniform net names before synthesis; (ii) keep a single source of truth (macro.cfg) for tile coordinates and orientations and regenerate it from parameters rather than editing by hand; (iii) begin with conservative density and progressively enable optimization passes as macros converge; and (iv) always replicate the “full testbench” at the gate level with the target PDK models before moving to tapeout collateral. Our artifact repository referenced in [24] contains the exact configuration files used for macro/core hardening and can serve as a starting point for replication or extension.

7. Conclusions

Open-source tools such as OpenLane and OpenFPGA have made it feasible for anyone to design and implement custom FPGA Integrated Circuits. This workflow provides a unique opportunity to gain hands-on experience with both the digital layout process and the architectural aspects of FPGA design. As a result, these tools are especially valuable for education and research, bridging the gap between high-level digital design and low-level physical implementation.

Beyond their educational value, these tools also offer practical advantages for industry. By enabling the development of application-specific, reconfigurable logic devices, OpenLane and OpenFPGA support innovation in fields that demand flexibility and customization. The ability to design custom FPGAs at low cost using open-source resources can accelerate prototyping, reduce dependency on commercial IP, and foster new design methodologies in both academia and industry.

Author Contributions

Conceptualization, E.I.B.-L. and E.G.-Q.; methodology, E.I.B.-L. and S.O.-C.; software, E.I.B.-L. and G.L.; validation, E.I.B.-L., S.O.-C., G.L. and H.E.M.Z.; formal analysis, E.I.B.-L. and E.G.-Q.; investigation, E.I.B.-L., H.E.M.Z. and J.J.R.-P.; resources, F.J.A.-R., J.J.R.-P. and S.O.-C.; data curation, E.I.B.-L., G.L. and H.E.M.Z.; writing—original draft preparation, E.I.B.-L. and S.O.-C.; writing—review and editing, S.O.-C., E.G.-Q., G.L. and F.J.A.-R.; visualization, E.I.B.-L., H.E.M.Z. and F.J.A.-R.; supervision, S.O.-C. and E.G.-Q.; project administration, S.O.-C. and E.G.-Q.; funding acquisition, E.G.-Q., S.O.-C. and G.L. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by Secretaría de Ciencia, Humanidades, Tecnología e Innovación (SECIHTI) grant number CBF-2025-G-1587, and the APC was funded by Universidad Autónoma de Guadalajara.

Data Availability Statement

The data and resources supporting the findings of this study are openly available. Custom FPGA design files and scripts used in this research are provided at https://github.com/Baungarten-CINVESTAV/CustomFPGA (accessed on 20 September 2025). The open-source tools employed include OpenLane (https://github.com/The-OpenROAD-Project/OpenLane (accessed on 20 September 2025)) and OpenFPGA (https://github.com/lnis-uofu/OpenFPGA (accessed on 20 September 2025)). The fabrication technology and process information were based on the open-access SKY130 PDK documentation (https://skywater-pdk.readthedocs.io/en/main/ (accessed on 20 September 2025)). Background and methodological details are further described in the book Tape-out Process with Open-Source Tools (https://link.springer.com/book/10.1007/978-3-031-92108-7 (accessed on 20 September 2025)). All datasets and materials referenced are publicly available without restriction.

Conflicts of Interest

The author Gerardo Leyva was employed by the company Texas Instruments. The authors declare no conflicts of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript; or in the decision to publish the results.

Appendix A

Appendix B

Figure A1 illustrates the k6_N10 FPGA architecture, in which each CLB integrates ten six-input LUTs to support flexible logic mapping. The overall fabric is arranged as a 6 × 9 array of CLBs, providing a structured balance between logic density and routing complexity. In addition, the device offers a total of 378 I/O pins, ensuring sufficient external connectivity for complex designs and test scenarios. This architecture was selected as a representative example because it combines moderate logic capacity with extensive I/O resources, making it well-suited for evaluating the flow of synthesis, placement, routing, and bitstream generation within OpenFPGA.

Figure A1. An FPGA layout with 540 six-input LUTs created with the flow shown in Figure 3.

References

1. Moore, G.E. Cramming more components onto integrated circuits. Proc. IEEE 1998, 86, 82–85.
2. Vaishnav, A.; Pham, K.D.; Koch, D. A survey on FPGA virtualization. In Proceedings of the 2018 28th International Conference on Field Programmable Logic and Applications (FPL), Dublin, Ireland, 27–31 August 2018; pp. 131–1317.
3. Zhang, C.; Li, P.; Sun, G.; Guan, Y.; Xiao, B.; Cong, J. Optimizing FPGA-based accelerator design for deep convolutional neural networks. In Proceedings of the 2015 ACM/SIGDA International Symposium on Field-Programmable Gate Arrays, Monterey, CA, USA, 22–24 February 2015; pp. 161–170.
4. Cong, J.; Fang, Z.; Huang, M.; Wang, L.; Wu, D. CPU-FPGA coscheduling for big data applications. IEEE Des. Test 2017, 35, 16–22.
5. Taraate, V. ASIC Design and Synthesis; Springer Nature: Berlin/Heidelberg, Germany, 2021.
6. Bhatnagar, H. Advanced ASIC chip synthesis: Using synopsys. In Design Compiler Physical Compiler and PrimeTime; Springer: Berlin/Heidelberg, Germany, 2007.
7. Dao, N.; Attwood, A.; Healy, B.; Koch, D. Flexbex: A risc-v with a reconfigurable instruction extension. In Proceedings of the 2020 International Conference on Field-Programmable Technology (ICFPT), Maui, HI, USA, 9–11 December 2020; pp. 190–195.
8. Tang, X.; Giacomin, E.; Alacchi, A.; Chauviere, B.; Gaillardon, P.E. OpenFPGA: An opensource framework enabling rapid prototyping of customizable FPGAs. In Proceedings of the 2019 29th International Conference on Field Programmable Logic and Applications (FPL), Barcelona, Spain, 8–12 September 2019; pp. 367–374.
9. Moser, L.; Kissich, M.; Scheipel, T.; Baunach, M. Stitching FPGA Fabrics with FABulous and OpenLane 2. In Proceedings of the 21st ACM International Conference on Computing Frontiers: Workshops and Special Sessions, Ischia, Italy, 7–9 May 2024; pp. 71–74.
10. Tu, K.; Tang, X.; Yu, C.; Josipović, L.; Chu, Z. Semi-custom EDA. In FPGA EDA: Design Principles and Implementation; Springer: Berlin/Heidelberg, Germany, 2024; pp. 85–109.
11. Koch, D.; Dao, N.; Healy, B.; Yu, J.; Attwood, A. FABulous: An embedded FPGA framework. In Proceedings of the 2021 ACM/SIGDA International Symposium on Field-Programmable Gate Arrays, Virtual Event, 28 February–2 March 2021; pp. 45–56.
12. Li, A.; Wentzlaff, D. PRGA: An open-source FPGA research and prototyping framework. In Proceedings of the 2021 ACM/SIGDA International Symposium on Field-Programmable Gate Arrays, Virtual Event, 28 February–2 March 2021; pp. 127–137.
13. Grady, B.; Anderson, J.H. Synthesizable heterogeneous FPGA fabrics. In Proceedings of the 2018 International Conference on Field-Programmable Technology (FPT), Okinawa, Japan, 10–14 December 2018; pp. 222–229.
14. Google; SkyWater Technology Foundry. SkyWater Open Source PDK Documentation. 2024. Available online: https://skywater-pdk.readthedocs.io/en/main/ (accessed on 11 June 2025).
15. Google; GlobalFoundries. GF180MCU Open Source PDK Documentation. 2024. Available online: https://gf180mcu-pdk.readthedocs.io/en/latest/ (accessed on 11 June 2025).
16. IHP GmbH. IHP Open PDK. 2023. Available online: https://github.com/IHP-GmbH/IHP-Open-PDK (accessed on 11 June 2025).
17. Cisneros, S.O.; Leon, E.I.B.; Alvarez, P.M. Integrated Circuit Design: Tape-out Process with Open-Source Tools; Springer Nature: Berlin/Heidelberg, Germany, 2025.
18. Chew, C.; Ooi, C.Y.; Chye, C.N. Integrating Design for Testability Technique into OpenLane with Skywater 130-Nanometer Process Design Kit. Semarak Eng. J. 2023, 3, 14–21.
19. Shalan, M.; Edwards, T. Building OpenLANE: A 130nm openroad-based tapeout-proven flow. In Proceedings of the 39th International Conference on Computer-Aided Design, San Diego, CA, USA, 2–5 November 2020; pp. 1–6.
20. Ghazy, A.; Shalan, M. Openlane: The open-source digital asic implementation flow. In Proceedings of the Workshop on Open-Source EDA Technology (WOSET), Virtual Event, 3 November 2020.
21. Ajayi, T.; Blaauw, D. OpenROAD: Toward a self-driving, open-source digital layout implementation tool chain. In Proceedings of the Government Microcircuit Applications and Critical Technology Conference, Albuquerque, NM, USA, 25–28 March 2019.
22. Tang, X.; Giacomin, E.; Chauviere, B.; Alacchi, A.; Gaillardon, P.E. LNIS-Uofu/OpenFPGA: An Open-Source FPGA IP Generator. 2023. Available online: https://github.com/lnis-uofu/OpenFPGA (accessed on 30 May 2025).
23. Tang, X. Welcome to OpenFPGA’s Documentation. 2023. Available online: https://openfpga.readthedocs.io/en/master/ (accessed on 9 June 2025).
24. Baungarten-CINVESTAV. CustomFPGA. 2025. Available online: https://github.com/Baungarten-CINVESTAV/CustomFPGA (accessed on 20 June 2025).
25. Liu, H.J. Archipelago—An Open Source FPGA with Toolflow Support; Report UCB/EECS-2014-43; Department of Electronic Engineering and Computer Science, UC Berkeley: Berkeley, CA, USA, 2014.

Figure 1. Overview of the RTL-to-GDSII flow, illustrating the transformation from design code to fabricated silicon (adapted from [17]).

Figure 2. Example of XML-based architecture description in OpenFPGA, illustrating how users can customize circuit-level details and architectural parameters within a flexible and structured framework [8,22].

Figure 3. Workflow for generating a successful GDSII file of an FPGA design with OpenFPGA and OpenLane.

Figure 4. Semi-customization of the FPGA architecture through parametrization, allowing selection of block size, type, and number of blocks.

Figure 5. Automatic generation of FPGA architecture blocks: in this category, the FPGA architecture is generated automatically from the requirements of a specific circuit.

Figure 6. Basic connection architecture of FPGAs generated with OpenFPGA.

Figure 7. Diagram of the FPGA architecture, with yellow I/O blocks along the perimeter and blue CLBs arranged in a 3 × 3 configuration.

Figure 8. (a) Configuration phase of the FPGA, showing the control signals used during configuration. (b) Operating phase of the FPGA.

Figure 9. GDSII view of the grid_io_top tile.

Figure 10. Partial view of FPGA_top layout showing macro placement.

Figure 11. Physical layout of the FPGA 3 × 3 CLB architecture.

Figure 12. Gate-level simulation of the implemented FPGA executing logical operations: (a) AND, (b) OR, and (c) XOR.

Figure 13. Gate-level simulation of the implemented FPGA executing arithmetic operations: (a) addition and (b) subtraction.

Table 1. Comparison of open-source EDA tools for FPGA design and architecture.

Tool	Open-Source	Architecture Language	Bitstream Generation	Testbench Generation	SDC	Netlist Generation
OpenFPGA [8]	✓	✓	✓	✓	✓	Automatic
FABulous [11]	✓	✓	✓	✓	✗	Automatic
PRGA [12]	✓	✓	✓	✗	✗	Automatic
Archipelago [13]	✓	✗	✓	✗	✗	Automatic

✓ Supported; ✗ Not supported.

Table 2. Macro characteristics: area, frequency, power consumption, number of cells, flip-flops, and I/O ports.

Macro	Area (mm²)	Frequency (MHz)	Power (µW)	Gates	DFF	IOs
`grid_io_top`	0.0018	40	3.7	103	1	6
`grid_io_right`	0.0018	40	3.7	102	1	6
`grid_io_bottom`	0.0018	40	3.7	102	1	6
`grid_io_left`	0.0018	40	3.7	102	1	6
`grid_clb`	0.0289	35	132.0	3780	136	21
`sb_0__0_`	0.0049	40	49.0	401	8	43
`sb_0__1_`	0.0100	40	140.0	478	31	63
`sb_0__3_`	0.0049	40	48.9	398	8	43
`sb_1__0_`	0.0100	40	152.0	562	30	63
`sb_1__1_`	0.0272	40	220.0	1989	48	83
`sb_1__3_`	0.0100	40	151.0	558	36	63
`sb_3__0_`	0.0049	40	49.0	400	8	43
`sb_3__1_`	0.0100	40	153.0	553	36	63
`sb_3__3_`	0.0049	40	49.0	401	8	43
`cbx_1__0_`	0.0036	40	54.0	314	8	42
`cbx_1__1_`	0.0036	40	64.9	344	12	44
`cbx_1__3_`	0.0036	40	59.3	331	10	43
`cby_0__1_`	0.0036	40	54.0	314	8	42
`cby_1__1_`	0.0036	40	64.9	344	12	44
`cby_3__1_`	0.0036	40	59.3	331	10	43
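
The figures in Table 2 can be compared more easily once they are normalized. The following is a minimal Python sketch, not part of the published flow, that derives the clock period and the power density for a few rows copied from the table. The function names and the choice of rows are illustrative assumptions.

```python
# Illustrative sketch: a few rows copied from Table 2.
# Tuple layout (an assumption of this example): (area in mm^2, frequency in MHz, power in uW).
MACROS = {
    "grid_io_top": (0.0018, 40, 3.7),
    "grid_clb": (0.0289, 35, 132.0),
    "sb_1__1_": (0.0272, 40, 220.0),
    "cbx_1__1_": (0.0036, 40, 64.9),
}


def clock_period_ns(freq_mhz: float) -> float:
    """Clock period in nanoseconds for a frequency given in MHz."""
    return 1000.0 / freq_mhz


def power_density_mw_per_mm2(area_mm2: float, power_uw: float) -> float:
    """Power density in mW/mm^2 from area in mm^2 and power in uW."""
    return (power_uw / 1000.0) / area_mm2


def summarize(macros: dict) -> dict:
    """Map each macro name to (clock period in ns, power density in mW/mm^2)."""
    return {
        name: (clock_period_ns(freq), power_density_mw_per_mm2(area, power))
        for name, (area, freq, power) in macros.items()
    }


if __name__ == "__main__":
    for name, (period, density) in summarize(MACROS).items():
        print(f"{name:12s} period = {period:6.2f} ns  density = {density:6.2f} mW/mm^2")
```

Normalizing by area makes it visible that the small connection blocks dissipate more power per unit area than the much larger `grid_clb` macro, even though their absolute power is lower.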

Table 3. Macro characteristics: worst setup slack, worst hold slack, total negative slack (TNS), average IR-drop, and worst-case IR-drop.

Macro	Worst Setup Slack (ns)	Worst Hold Slack (ns)	TNS (ns)	IR-Drop (Average)	IR-Drop (Worst Case)
`grid_io_top`	14.15	4.84	0	1.16 × 10⁻⁷ V	1.31 × 10⁻⁶ V
`grid_io_right`	14.15	4.84	0	1.10 × 10⁻⁷ V	1.38 × 10⁻⁶ V
`grid_io_bottom`	14.15	4.84	0	1.10 × 10⁻⁷ V	1.38 × 10⁻⁶ V
`grid_io_left`	14.15	4.84	0	1.10 × 10⁻⁷ V	1.38 × 10⁻⁶ V
`grid_clb`	9.8	0.99	0	1.99 × 10⁻⁶ V	2.10 × 10⁻⁵ V
`sb_0__0_`	13.94	0.60	0	1.57 × 10⁻⁶ V	1.32 × 10⁻⁵ V
`sb_0__1_`	13.52	0.59	0	3.50 × 10⁻⁶ V	2.44 × 10⁻⁵ V
`sb_0__3_`	13.94	0.60	0	1.52 × 10⁻⁶ V	1.30 × 10⁻⁵ V
`sb_1__0_`	13.72	0.59	0	3.26 × 10⁻⁶ V	2.38 × 10⁻⁵ V
`sb_1__1_`	13.52	0.59	0	3.34 × 10⁻⁶ V	3.55 × 10⁻⁵ V
`sb_1__3_`	13.72	0.60	0	3.55 × 10⁻⁶ V	2.09 × 10⁻⁵ V
`sb_3__0_`	13.94	0.60	0	1.57 × 10⁻⁶ V	1.32 × 10⁻⁵ V
`sb_3__1_`	13.73	0.60	0	3.61 × 10⁻⁶ V	2.18 × 10⁻⁵ V
`sb_3__3_`	13.94	0.60	0	1.57 × 10⁻⁶ V	1.32 × 10⁻⁵ V
`cbx_1__0_`	13.80	0.60	0	2.05 × 10⁻⁶ V	1.39 × 10⁻⁵ V
`cbx_1__1_`	13.80	0.59	0	2.52 × 10⁻⁶ V	1.26 × 10⁻⁵ V
`cbx_1__3_`	13.80	0.59	0	2.23 × 10⁻⁶ V	1.53 × 10⁻⁵ V
`cby_0__1_`	13.80	0.60	0	2.06 × 10⁻⁶ V	1.39 × 10⁻⁵ V
`cby_1__1_`	13.80	0.59	0	2.52 × 10⁻⁶ V	1.26 × 10⁻⁵ V
`cby_3__1_`	13.80	0.59	0	2.23 × 10⁻⁶ V	1.53 × 10⁻⁵ V
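
The positive setup slack in Table 3 indicates timing headroom. As a rough first-order illustration, the Python sketch below estimates the shortest usable clock period as the target period minus the worst setup slack. It assumes that the slack values were measured against the frequencies listed in Table 2, which this section does not state explicitly, and it ignores any additional margins a sign-off analysis would apply. The function names are hypothetical.

```python
def critical_path_ns(freq_mhz: float, setup_slack_ns: float) -> float:
    """Estimated minimum clock period: target period minus worst setup slack."""
    period_ns = 1000.0 / freq_mhz
    return period_ns - setup_slack_ns


def estimated_fmax_mhz(freq_mhz: float, setup_slack_ns: float) -> float:
    """First-order maximum frequency implied by the remaining setup slack."""
    delay_ns = critical_path_ns(freq_mhz, setup_slack_ns)
    if delay_ns <= 0:
        raise ValueError("setup slack must be smaller than the clock period")
    return 1000.0 / delay_ns


if __name__ == "__main__":
    # (target frequency in MHz from Table 2, worst setup slack in ns from Table 3)
    rows = {"grid_io_top": (40, 14.15), "grid_clb": (35, 9.8)}
    for name, (freq, slack) in rows.items():
        print(f"{name:12s} estimated fmax = {estimated_fmax_mhz(freq, slack):.1f} MHz")
```

Under these assumptions, `grid_clb` has the least headroom of the two rows shown, which is consistent with it being the macro with the lowest target frequency in Table 2.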

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

