1. Introduction
In modern scientific and industrial advancements, Time-to-Digital Converters (TDCs) are fundamental components in time-resolved systems, where the information is encoded on the basis of the occurrence time of events [1,2]. Owing to recent progress in performance (i.e., precision, rate, and resource consumption), TDCs have gained increasing significance in a wide array of fields. Notable applications include time-resolved spectroscopy [3,4] and Time-Correlated Single Photon Counting (TCSPC) [5], where timestamps are used to measure particle decay times for the extraction of key material properties. Time-of-Flight (ToF) imaging [6,7], including LIDAR [8], and ToF Positron Emission Tomography (ToF-PET) [9,10] further demonstrate the wide spectrum of fields in which TDC use is now consolidated.
In this context, Field-Programmable Gate Arrays (FPGAs) prove to be well-suited hosts for TDC implementations. Their reconfigurability makes them ideal for rapid prototyping in research and development applications, enabling fast adaptation to the changing demands of diverse research and industrial disciplines. FPGAs significantly reduce time-to-market, enabling rapid product development and responsiveness to market shifts and consumer preferences. Additionally, they help cut down non-recurring engineering costs, providing a cost-effective alternative to custom integrated circuit solutions.
As a result, FPGAs have become consolidated tools for implementing TDC architectures [11]. In particular, the most promising one, offering the best trade-off among the figures of merit of precision, rate, and resource consumption, is the Tapped Delay-Line TDC (TDL-TDC). It consists of a chain of N buffers (a.k.a. taps) connected in series, where each buffer output is sampled by a D-type flip-flop (FF), allowing, ideally, for a simple counting mechanism on the thermometric code at the output of the FFs to extract the delay between the signal injected into the TDL (i.e., START) and the clock of the FFs (i.e., STOP). This architecture is shown in Figure 1.
Ideally, assuming that each buffer has the same propagation delay $\tau$ and that the output of the FFs is correct (i.e., no errors introduced by skews, jitters, meta-stability, and setup and hold violations), a conversion circuit (a.k.a. decoder or encoder) can extract the number of ones $n$ in the $N$-bit thermometric code. Under these assumptions, the time difference $\Delta T$ between the start ($T_{START}$) and stop ($T_{STOP}$) events can then be computed as
$\Delta T = T_{STOP} - T_{START} = n \cdot \tau$ (1)
Referring to Equation (1), it is evident that the resolution (LSB) of the TDL-TDC is $LSB = \tau$, while the Full-Scale Range (FSR) is $FSR = N \cdot \tau$. The digitization process introduces a quantization error (i.e., $\sigma_q$) uniformly distributed between $-LSB/2$ and $+LSB/2$, with a variance of $LSB^2/12$ (i.e., $\sigma_q = LSB/\sqrt{12}$) [12,13]. Consequently, in the presence of jitter (i.e., $\sigma_j$) between the START and STOP signals caused by electronic system noise, the precision (i.e., $\sigma = \sqrt{\sigma_q^2 + \sigma_j^2}$) of the measured time interval is impacted [14].
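As a minimal illustration of Equation (1) and of the quantization figures just recalled, the following sketch (tap count, tap delay, and function names are assumptions of ours, not the actual firmware) computes the ideal timestamp and the ideal quantization error:

```python
# Minimal sketch of the ideal TDL-TDC conversion of Equation (1).
# TAU is an assumed identical propagation delay for every tap.
import math

TAU = 15e-12  # assumed buffer propagation delay [s]
N = 256       # number of taps

def ideal_timestamp(thermometric_code: str) -> float:
    """Ideal conversion: count the ones ('sum1s') and scale by the tap delay."""
    n_ones = thermometric_code.count("1")
    return n_ones * TAU

# Resolution, full-scale range, and quantization error of the ideal converter.
LSB = TAU
FSR = N * TAU
sigma_q = LSB / math.sqrt(12)   # std-dev of a uniform error in [-LSB/2, +LSB/2]

print(ideal_timestamp("1" * 10 + "0" * 246))  # 10 taps crossed -> 150 ps
print(f"LSB = {LSB*1e12:.1f} ps, FSR = {FSR*1e9:.2f} ns, sigma_q = {sigma_q*1e12:.2f} ps")
```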
In practice, however, the situation is rather different, particularly when FPGAs are involved. First of all, $\tau$ is not constant across the TDL due to process, voltage, and temperature (PVT) fluctuations and tolerances among devices, which makes a real-time calibration process mandatory [15,16,17] for estimating the propagation delay distribution through a Code Density Test (CDT) [11]. In this way, a Calibration Table (CT) is built in which each tap $k$ is characterized by its own propagation delay $\tau_k$. The integration of the CT generates a Characteristic Curve (CC), which links each output code $n$ (i.e., the number of ones in the thermometric code) to a specific timestamp (i.e., $CC[n] = \sum_{k=1}^{n} \tau_k$ with $n \in [1; N]$). While it surely adds overhead, this approach correctly accounts for and compensates all the non-linearities affecting the delay values; thus, the time measurement is no longer performed as described in Equation (1) but rather by indexing the CC,
$\Delta T = CC[n] = \sum_{k=1}^{n} \tau_k$ (2)
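A possible offline sketch of the CDT-based calibration described above (the histogram is synthetic and the helper names are ours; the real CT comes from measured data) is:

```python
# Sketch of CDT-based calibration: the hit count of each tap, collected with
# uncorrelated events, is proportional to that tap's propagation delay.
import numpy as np

T_CLK = 2.4e-9                         # measurement window covered by the TDL [s]
hits = np.random.poisson(1000, 256)    # illustrative CDT histogram (one bin per tap)

tau = hits / hits.sum() * T_CLK        # Calibration Table: per-tap delay tau_k
cc = np.cumsum(tau)                    # Characteristic Curve: code n -> timestamp

def calibrated_timestamp(code: str) -> float:
    """Convert a thermometric code by indexing the Characteristic Curve (Eq. (2))."""
    n = code.count("1")
    return cc[n - 1] if n > 0 else 0.0
```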
Under these conditions, the resolution (i.e., LSB) is better characterized by the distribution of the CT and approximately by the average propagation delay (i.e., $LSB \approx \bar{\tau}$), while the precision is defined by the so-called Equivalent LSB ($ELSB$) [13], whose mathematical expression is provided in Equation (4). Consequently, due to the distortions in the propagation delays, the quantization error is proportional to the $ELSB$ (i.e., $\sigma_q = ELSB/\sqrt{12}$) rather than to the resolution (i.e., $LSB/\sqrt{12}$).
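For the reader's convenience, a commonly used expression for the Equivalent LSB, consistent with the per-bin uniform quantization model recalled above (to be compared with the exact Equation (4) reported in [13]), is:

```latex
% Equivalent LSB of a non-uniform quantizer with bin widths tau_k: each bin
% contributes a uniform error of variance tau_k^2/12, weighted by the
% probability tau_k / FSR of falling into that bin (FSR = sum_k tau_k).
\[
\sigma_q^{2} \;=\; \sum_{k=1}^{N} \frac{\tau_k}{FSR}\cdot\frac{\tau_k^{2}}{12}
\;=\; \frac{ELSB^{2}}{12}
\quad\Longrightarrow\quad
ELSB \;=\; \sqrt{\frac{\sum_{k=1}^{N}\tau_k^{3}}{\sum_{k=1}^{N}\tau_k}}
\]
```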
The second concern is the correctness of the thermometric code, which is affected by Bubble Errors (BEs) [18], defined as spurious zeros in the TDL output; these are the focus of this study. An example of a BE-affected code is represented in Figure 2.
BEs arise from deterministic non-linearities in the TDL propagation (e.g., skews) [19] or from stochastic processes (e.g., sampling errors caused by setup and hold time violations) [19]. The presence of BEs significantly affects the performance of the thermometer-to-binary converter, reducing resolution, precision, and linearity. Moreover, the behavior of the pure binary output also depends on the decoder architecture, and the decoding technique employed heavily affects the relevance of BEs. This is evident when comparing a “sum1s” decoder [16,20], which counts the number of ones in the thermometric code and results in a “bubble compression”, with a “transition-based” decoder, also known as “Log2” [11] or “one-hot” [21], which looks for the last transition from 1 to 0 and stretches the code up to the BEs [11,22]. In this regard, considering the two codes without (a) and with (b) BEs in Figure 2 and assuming an 8-tap TDL-TDC, we obtain 7 for “11111110” and 6 for “11111010” using the sum1s decoder, whereas both cases result in 7 using a “transition-based” decoder.
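The difference between the two decoding strategies on the example codes above can be reproduced with a small sketch (function names are ours):

```python
# Two thermometer-to-binary decoders applied to the 8-tap example of Figure 2.
def sum1s(code: str) -> int:
    """Bubble-compressing decoder: simply count the ones."""
    return code.count("1")

def transition_based(code: str) -> int:
    """'Log2'/'one-hot'-style decoder: position of the last 1 -> 0 transition,
    which stretches the code up to the bubble."""
    return code.rfind("1") + 1

print(sum1s("11111110"), transition_based("11111110"))  # 7 7  (bubble-free)
print(sum1s("11111010"), transition_based("11111010"))  # 6 7  (one bubble)
```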
Discriminating between deterministic and stochastic causes of BEs is essential [18]. Deterministic factors primarily stem from routing mismatches (Figure 3), in particular skews (i.e., $t_{skew,CLB}$ and $t_{skew,CRC}$) on the clock line (i.e., STOP signal) and routing delays (i.e., $t_{routing}$) between the buffer outputs and the corresponding FF inputs [23]. Clock skew, defined as the variation in clock arrival times at different components, also significantly contributes to BEs [15]. Although FPGA clock networks are designed to minimize skew through hierarchical distribution within the clock regions (i.e., $t_{skew,CLB}$), non-negligible skew, up to tens or hundreds of picoseconds, may still arise when multiple clock regions are involved, through Clock Region Crossing (CRC) (i.e., $t_{skew,CRC}$). This can affect the generation of the thermometric code. If the skew and routing delays are small compared to the buffer propagation delay $\tau_k$, their impact is negligible, and BEs do not occur. However, when $\tau_k$ is shorter than these delays, the switching order of the outputs may change, introducing BEs into the thermometric code and causing decoder failures. In contrast to deterministic causes, stochastic BEs arise from metastability in the FFs due to violations of timing parameters (setup and hold times). This occurs when the asynchronous TDL signal is sampled improperly, causing the FFs to enter a metastable state [24]. In such cases, the outputs resolve randomly as 0 or 1, unpredictably introducing BEs into the thermometric sequence.
Let us consider the TDL-TDC in Figure 3. To prevent BEs at position $k$, the additional clock delay seen by the FF of tap $k$ (i.e., the combination of $t_{skew,CLB}$, $t_{skew,CRC}$, and $t_{routing}$) must not exceed the propagation delay of the taps involved, as formalized in Equation (5). Said another way, (5) entails that as long as the clock skew is sufficiently small or the propagation delay is suitably large, the occurrence of BEs can be completely mitigated.
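Although Equation (5) is not reproduced here, an illustrative form of such a condition, using the notation introduced above (a sketch of ours, not necessarily the exact expression of (5)), is:

```latex
% Illustrative BE-prevention condition at tap k (a sketch, not the paper's exact
% Eq. (5)): the extra clock delay seen by the FF of tap k with respect to the FF
% of the previous tap must not exceed the propagation delay of tap k.
\[
t_{skew,CLB}(k) \;+\; t_{skew,CRC}(k) \;+\; \Delta t_{routing}(k) \;<\; \tau_k
\]
```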
Although TDL-TDCs offer good precision and resolution, they face a trade-off between FSR and area occupation (i.e., doubling the FSR requires doubling the area) [25]. This limitation is particularly pronounced in modern technological nodes (e.g., 28 nm and below), where tap propagation delays are on the order of a few picoseconds. A common approach to overcoming this trade-off is to use Nutt Interpolation [26]. As illustrated in Figure 4, this technique divides the measurement for all channels (e.g., START and STOP) into two components: a coarse measurement performed by an $n_{CC}$-bit coarse counter clocked with a period $T_{CLK}$, and a fine measurement carried out by a TDL-TDC (i.e., $T_{fine,START}$ for the START and $T_{fine,STOP}$ for the STOP). Since the START signal may occur asynchronously with respect to the system clock, the measured time in this configuration is obtained by combining the coarse counter value with the two fine measurements. In this way, if the dynamic range of the TDL exceeds $T_{CLK}$, the FSR of the whole system is extended up to $2^{n_{CC}} \cdot T_{CLK}$.
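A typical form of the resulting combination, under the assumption that each TDL measures the interval from its hit to the following clock edge (notation ours), is:

```latex
% Typical Nutt-interpolation combination: N_coarse counts the clock periods
% between the clock edges that follow START and STOP, while the two TDLs
% measure the hit-to-clock-edge fine intervals.
\[
\Delta T \;=\; N_{coarse}\cdot T_{CLK} \;+\; T_{fine,START} \;-\; T_{fine,STOP}
\]
```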
To enhance the resolution of TDL-TDCs beyond what is offered by the average propagation delay of the FPGA technology node (i.e., $\bar{\tau}$), various sub-interpolation methods [27,28,29] have been explored in the literature. These methods either perform the measurement across multiple parallel TDLs (i.e., $M$ of them) [30] (known as spatial sub-interpolation) or repeat the measurement several times (i.e., $M$ times) on the same TDL using feedback (referred to as temporal sub-interpolation) [31], thereby creating a Virtual-TDL (VTDL) with shorter propagation delays that improve the resolution. A VTDL with a sub-interpolation order $M$ can ideally improve both the $LSB$ and the $ELSB$ by a factor $M$, though it also increases the jitter (i.e., $\sigma_j$) compared to TDLs without sub-interpolation [28]. One prominent example of sub-interpolation is the Wave Union A (WUA) technique. In WUA, generally two [32] edges are sent through the same TDL for each event; a special decoder identifies the position of the two edges on the TDL, thus obtaining two real taps that are then summed to form the so-called virtual tap. The calibration algorithm then works on the virtual tap to calculate the timestamp. In this way, the $LSB$ and the $ELSB$ computed using the virtual tap of the VTDL are halved with respect to the real ones coming from the TDL.
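As an illustration of the virtual-tap construction, a minimal sketch (edge detection is simplified with respect to a real WUA decoder) is:

```python
# Wave Union A sketch: two edges travel along the same TDL for each event; the
# positions of the two transitions are summed into a single "virtual tap",
# ideally halving the effective bin width.
def wua_virtual_tap(code: str) -> int:
    """Return the virtual tap as the sum of the positions of the two transitions."""
    edges = [i for i in range(len(code) - 1) if code[i] != code[i + 1]]
    assert len(edges) == 2, "WUA expects exactly two transitions in the sampled code"
    return sum(edges)

# Example with a 16-tap code carrying two transitions (at taps 4 and 11 -> virtual tap 15).
print(wua_virtual_tap("1111100000001111"))
```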
The aim of this paper is to detect, classify, and characterize BE artifacts caused by the clock signal skew on the CRC (i.e., $t_{skew,CRC}$), using the 28 nm Xilinx 7-Series FPGA as a case study. The goal is to demonstrate mathematically and verify experimentally that the presence of BEs, if properly managed, allows for a reduction in the quantization error of the TDL-TDC. In Section 2, we present our TDL-TDC, while in Section 3 we address the issue and provide a model for skew-generated BEs. In Section 4, a straightforward approach to suppress CRC skew-generated BEs is presented and analyzed. Experimental validation is reported in Section 5. Finally, Section 6 reports an overview of the current state of the art in terms of BE correction.
2. FPGA-Based TDL-TDC Architecture
Within 28 nm 7-Series Xilinx devices, as represented in Figure 5, the logical resources are systematically arranged within Configurable Logic Blocks (CLBs), each split into two slices (i.e., SLICEM and SLICEL). As represented in Figure 6, SLICEs contain diverse logic circuits, such as four Look-Up Tables (LUTs), eight FFs, and one carry logic primitive (CARRY4) [33]. Each CARRY4 primitive has four outputs (numbered 0 to 3), for each of which two distinct signals are available, CO and O, which for our purposes are the negation of each other. The CO and O outputs can be sampled, mutually exclusively, by the corresponding lettered FF (i.e., CO[0] and O[0] by AFF, CO[1] and O[1] by BFF, CO[2] and O[2] by CFF, CO[3] and O[3] by DFF). The CARRY4 thus functions as a four-tap TDL that propagates the input signal from the bottom to the top of the SLICE (i.e., from CO[0] and O[0] to CO[3] and O[3]). Notably, to facilitate clock routing, the FPGA is discretely partitioned into clock regions composed of a fixed number of CLBs [34] (Figure 7), wherein the clock skew within CLBs (i.e., $t_{skew,CLB}$) is negligible while the CRC one is not (i.e., $t_{skew,CRC}$).
The fundamental structure of the TDL-TDC taken as reference is instantiated in a Xilinx Artix 100T device by putting in series 64 CARRY4 primitives, used as taps, sampled by FFs (Figure 6). This results in a 256-tap long TDL, where the total number of taps $N$ is 4 times the number of carry primitives (i.e., $N = 4 \cdot 64 = 256$), complemented by real-time decoder and calibration mechanisms [11]. For practical use in real-world applications, the TDL-TDC, following the Nutt Interpolation architecture exposed in the Introduction, must be combined with a coarse counter; in our case, an 8-bit counter (i.e., $n_{CC} = 8$). Consequently, the total delay introduced by the TDL (i.e., $\sum_{k=1}^{N}\tau_k$) must be greater than or equal to the clock period of the coarse counter (i.e., $T_{CLK}$). Considering that the 28 nm technology node of the Xilinx 7-Series is characterized by an average propagation delay of approximately 15 ps (i.e., $\bar{\tau} \approx 15$ ps) for the elements composing the CARRY4 logic, and that the maximum clock frequencies for a complex firmware are around 400–500 MHz (i.e., $T_{CLK} \in$ [2 ns; 2.5 ns]) [35], it becomes necessary to use TDLs composed of, approximately, more than 168 taps (i.e., 2.5 ns/15 ps ≈ 167), corresponding to 42 CARRY4 primitives. However, the TDL must be sufficiently long to ensure an ample margin to accommodate variations in the average propagation delay due to process, voltage, and temperature (PVT).
Although faster clocks (i.e., shorter TDLs) may seem plausible, the system's fully integrated design confines us to a clock period of 2.4 ns and a 256-tap long TDL (i.e., $N = 256$); in this way, thanks to the Nutt Interpolation, the FSR is extended up to 614.4 ns (i.e., $2^{8} \times 2.4$ ns $= 614.4$ ns).
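The sizing just discussed can be summarized by the following worked numbers (our arithmetic, based on the figures quoted above):

```latex
% Worked numbers for the sizing described above: minimum TDL length for the
% slowest clock, and FSR with the 8-bit coarse counter at T_CLK = 2.4 ns.
\[
N_{min} \;\gtrsim\; \frac{T_{CLK}}{\bar{\tau}} \;=\; \frac{2.5\,\text{ns}}{15\,\text{ps}} \;\approx\; 167
\;\;\Rightarrow\;\; 42\ \text{CARRY4 (168 taps)},
\qquad
FSR \;=\; 2^{8}\cdot 2.4\,\text{ns} \;=\; 614.4\,\text{ns}
\]
```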
Figure 8 shows an example of the CT of the implemented TDL-TDC and the related probability density function of the delays $\tau_k$. From Figure 8, it can be observed that only 190 out of the 256 available taps are effectively utilized, resulting in an average propagation delay of 15.19 ps. Additionally, a significant dispersion of the propagation delays is evident, ranging from a minimum of 1 ps to a maximum of 63 ps. It is also noticeable that the probability density function of the delays does not follow a normal distribution, which is consistent with observations reported in other studies [36,37].
For experimental purposes, the above-described Nutt-Interpolated TDL-TDC was implemented in a specific module called FELIX [38], provided by TEDIEL S.r.l. [39], which hosts a Xilinx 28 nm 7-Series Artix-7 100T FPGA. The 256-bit long thermometer codes generated by the TDL and the contents of the 8-bit long coarse counter are sent to a Personal Computer (PC) via USB 2.0; the PC performs the sum1s decoding algorithm, the calibration process, the Nutt Interpolation, and also the BE correction.
3. Bubble Error Analysis
As stated in the Introduction, the only BEs that can be corrected are the deterministic ones due to the skew (i.e., $t_{skew,CLB}$ and $t_{skew,CRC}$) on the clock line (i.e., STOP signal) and to the routing delays (i.e., $t_{routing}$) between the buffer outputs and the corresponding FF inputs. This characterization was performed by experimental derivation and is expressed in terms of the so-called Bubble Length (BL), which is the number of taps involved in the artifact. In fact, the larger the BL, the greater the value of the skew and routing delays compared to the tap propagation time, and thus the greater the negative influence of the BEs on the time measurement.
First, we studied the contribution of the skew on the clock line (i.e., $t_{skew,CLB}$ and $t_{skew,CRC}$) by placing a 256-tap long TDL (i.e., $N = 256$) across two contiguous clock regions (CRs), so that the first 18 CARRY4 blocks (72 taps, from tap 0 to tap 71) were in one CR and the next 46 (184 taps, from tap 72 to tap 255) in the following one. We obtained the clock skew of each tap in the TDL using the Vivado post-implementation timing analysis tool in order to better examine how the clock propagates in that area.
Figure 9 shows the overall skew present at the clock input of the 256 FFs that sample the 256 taps, taking the clock input of tap 0 as zero. Within the CR hosting the second portion of the TDL (i.e., from tap 72 to tap 255), the skew ranges between 178 and 159 ps (a delta of 19 ps), with the minimum at tap 171, which is in the middle of the CR. In contrast, the skew present at the CRC (i.e., between tap 71 and tap 72) is an order of magnitude higher, amounting to 164 ps. This agrees with the information in [34], which states that the clock is propagated in a tree-like manner between CRs and injected in the middle of each of them. In this sense, it is possible to easily identify the magnitude of $t_{skew,CRC}$, which is 164 ps. Regarding $t_{skew,CLB}$, instead, we observe that over the 100 taps from tap 72 to tap 171 (i.e., half a CR, 25 CLBs with one CARRY4 each), a skew of only 19 ps is accumulated. This means that, if we consider $t_{skew,CLB}$ constant over $k$, $t_{skew,CLB}$ is approximately 0.19 ps per tap (i.e., 19 ps/100), negligible with respect to $t_{skew,CRC}$ and to the average propagation delay of the buffer (i.e., $\bar{\tau} = 15$ ps). Unlike $t_{skew,CLB}$ (i.e., 0.19 ps), $t_{skew,CRC}$ (i.e., 164 ps) is clearly much greater, and thus not negligible compared to $\bar{\tau}$ (i.e., 15 ps).
Regarding the routing delays (i.e., $t_{routing}$), Vivado’s post-implementation timing analysis reports an ideal delay of 0 ps. Although the actual value will be greater than the declared 0 ps, this information is sufficient to conclude that such delays can be neglected to a first approximation.
In this sense, we can easily deduce that the BEs with larger BL are mainly due to $t_{skew,CRC}$, because it is the biggest non-ideality; it causes deterministic BEs with a BL proportional to $t_{skew,CRC}$. This is due to the signal propagating asynchronously over the TDL after the CRC, being sampled with a delay of $t_{skew,CRC}$ with respect to the taps before the CRC. We call these BEs-CRC. While not classified as a form of BE, the results of this effect were already observed in [19]. In this regard, the taps that provide a total propagation delay equal to $t_{skew,CRC}$ before the CRC (orange in Figure 10) and the subsequent ones (blue in Figure 10) carry the same information. We therefore define the Crossing Point Tap (CPT) as the first tap after the CRC (e.g., tap $k$ in Figure 3, tap 72 in Figure 9 and Figure 11). Thus, before and after the CPT, we have two sections of the TDL with a temporal duration of approximately $t_{skew,CRC}$ that, due to the skew, carry the same information; these are referred to as Causal and Anti-Causal. Consequently, in terms of bins, we call these lengths Number of Taps Before the CRC (NTB) and Number of Taps After the CRC (NTA) for the Causal and Anti-Causal sections, respectively.
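To make the mechanism tangible, the following sketch (uniform tap delays and illustrative values are our assumptions; the real, non-uniform TDL of Figure 8 behaves differently) simulates how the clock-skew step at the CRC generates deterministic bubbles:

```python
# Illustrative sketch (not the paper's code): a clock-skew step at the CRC turns
# an ideal thermometric code into a bubble-affected one.
N_TAPS = 256          # TDL length
TAU = 15e-12          # assumed identical tap propagation delay [s]
CPT = 72              # first tap after the CRC (Crossing Point Tap)
T_SKEW_CRC = 164e-12  # clock-skew step at the CRC [s]

def sample_tdl(t_hit_to_clock: float) -> list[int]:
    """Code sampled by the FFs when the hit precedes the nominal clock edge
    by t_hit_to_clock seconds; FFs after the CPT sample T_SKEW_CRC later."""
    code = []
    for k in range(N_TAPS):
        arrival = (k + 1) * TAU                                      # hit reaches tap k
        clock = t_hit_to_clock + (T_SKEW_CRC if k >= CPT else 0.0)   # local clock edge
        code.append(1 if arrival <= clock else 0)
    return code

def bubble_length(code: list[int]) -> int:
    """Number of spurious zeros between the first and last '1' of the code."""
    if 1 not in code:
        return 0
    last_one = max(i for i, b in enumerate(code) if b == 1)
    return sum(1 for b in code[:last_one] if b == 0)

# Sweep the hit position around the crossing: bubbles appear only when the 1->0
# transition falls within ~T_SKEW_CRC before the CPT; under the uniform-delay
# assumption their length is bounded by roughly T_SKEW_CRC / TAU taps.
for tap_of_transition in (60, 66, 69, 71, 75):
    code = sample_tdl(tap_of_transition * TAU + TAU / 2)
    print(f"transition at tap {tap_of_transition:3d} -> bubble length = {bubble_length(code)}")
```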
All this was also demonstrated experimentally by acquiring several thermometric codes from the analyzed TDL, verifying that the BEs with the largest BL occur at the CRC. For demonstration purposes, Figure 11 shows the 11 thermometric codes with the largest BL (i.e., color-coded by BL: dark green, green, lime, and yellow) acquired from the TDL studied in the post-implementation stage, and it can be observed that these BEs are concentrated at the CRC. It can also be observed that these BEs extend, before the CRC, from tap 66 to tap 71 and, after the CRC, from tap 72 to tap 77, that is, for 6 taps before (i.e., NTB = 6) and 6 taps after (i.e., NTA = 6) the CRC, thus allowing for the easy identification of the redundant informational content portions around the CRC (i.e., the Causal and Anti-Causal sections of the TDL), highlighted in orange and blue in Figure 10. Furthermore, comparing Figure 10 with Figure 11, we can experimentally derive that NTB = NTA = 6.
Moreover, knowing that each tap provides an average delay of 15 ps (i.e., $\bar{\tau} = 15$ ps), we can estimate $t_{skew,CRC}$ as the product between NTB (or NTA) and $\bar{\tau}$, obtaining 90 ps (i.e., $6 \times 15$ ps $= 90$ ps), of the same order of magnitude as the 164 ps obtained from the post-implementation analysis.
Equation (5) also provides a theoretical explanation for the larger BEs-CRC shown in Figure 11. In accordance with the scientific literature [40], in our specific case we have a negligible $t_{skew,CLB}$ and $t_{routing}$ with respect to a $t_{skew,CRC}$ of a few hundred picoseconds (i.e., $t_{skew,CRC} = 164$ ps) (Figure 9), and a propagation delay (Figure 8) between a few picoseconds and some tens of picoseconds with an average value of 15 ps (i.e., $\bar{\tau} = 15$ ps). Considering that $t_{skew,CLB}$ and $t_{routing}$ are negligible and that $\tau_k \approx \bar{\tau} = 15$ ps, we can rewrite (5) as in Equation (8).
Moreover, starting from Equation (8), we derive Equation (9). By observing Equation (9), we deduce that CRC-BEs appear whenever $t_{skew,CRC}$ is bigger than the tap propagation delay; moreover, we can theoretically estimate the NTA and NTB lengths, obtaining a value in accordance with the experimental measurements reported in Figure 11. This estimation becomes increasingly accurate the larger $t_{skew,CRC}$ is compared to the average propagation time (i.e., $\bar{\tau}$) and the less dispersed the propagation times of the taps in the Causal and Anti-Causal sections of the TDL are.
4. Clock Region Crossing Bubble Error Correction
This section describes an offline technique that makes it possible to correct the BEs-CRC. This technique is executed on the PC over the acquired thermometric codes before performing the sum1s decoding and the calibration. By examining just the thermometer code sampled by the TDL-TDC, we may abstract the idea presented in Figure 10 into Figure 12, highlighting the TDL taps belonging to the Causal portion (i.e., from tap $B$ to tap $CPT - 1$, with $B = CPT - NTB$) and the Anti-Causal portion (i.e., from tap $CPT$ to tap $A$, with $A = CPT + NTA - 1$) of the TDL. Referring to the physical implementation of the TDL analyzed in Section 3, we recall that $CPT = 72$ and $NTB = NTA = 6$; thus, $B = 66$ and $A = 77$.
Algorithm 1 encodes the information about the Causal section (i.e., the number of zeros inside the NTB) and the Anti-Causal section (i.e., the number of ones inside the NTA) using the variables Zero Before Crossing (ZBC) and One After Crossing (OAC), respectively, in order to reject the BEs-CRC; thus, if a BE-CRC is asserted (i.e., $ZBC > 0$ and $OAC > 0$), the section of the TDL after the CPT (i.e., the Anti-Causal one) is forced to zero. In this way, only the timing information over the Causal part is used.
Algorithm 1 Detection of the BEs-CRC.
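Since the pseudocode of Algorithm 1 is not reproduced here, the following sketch captures its behavior as described in the text (the thresholds ZBC > 0 and OAC > 0 are our reading of the detection condition):

```python
# Sketch of Algorithm 1: detect a BE-CRC from the Causal/Anti-Causal windows and,
# if present, force the Anti-Causal section of the TDL to zero.
CPT, NTB, NTA = 72, 6, 6               # values recalled above for our TDL

def correct_bes_crc(code: list[int]) -> list[int]:
    causal = code[CPT - NTB:CPT]        # NTB taps right before the crossing
    anti_causal = code[CPT:CPT + NTA]   # NTA taps right after the crossing
    zbc = causal.count(0)               # Zero Before Crossing (ZBC)
    oac = anti_causal.count(1)          # One After Crossing (OAC)
    if zbc > 0 and oac > 0:             # BE-CRC detected
        corrected = list(code)
        corrected[CPT:CPT + NTA] = [0] * NTA  # keep only the Causal information
        return corrected
    return list(code)

# The (possibly corrected) code is then passed to the usual sum1s decoding and calibration.
```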
A functional example is provided in Figure 13, where only the first and the last thermometric codes are bubble-free. With the second and third codes, the algorithm detects the bubble and disables the last part of the TDL (labeled L in the figure). This leads to a certain number of missing taps, depending on NTA/NTB, in this case taps 8 to 11, which reduces the number of output codes from 16 to 12.
The effectiveness of the strategy implemented in Algorithm 1 has been checked offline, via software, by obtaining and then comparing the CTs with and without the correction of the BEs-CRC when reading out the TDL. The CT obtained without correction of the BEs-CRC using a sum1s decoder is shown in Figure 14a, while the corrected CT is represented in Figure 14c. Figure 14b,d show a zoom of the Causal section of the TDL.
When the correction is applied as specified by the algorithm, the Anti-Causal section of the TDL (i.e., from tap 72, the CPT, to tap 77, A, highlighted in red in Figure 14c) has propagation delays equal to zero. This is because these taps are no longer valid outputs for the TDL-TDC and are not considered by the calibration algorithm. On the other hand, if we observe the uncorrected CT (Figure 14a), we notice that it is identical to the corrected one except for the Causal part, which provides propagation delays that are on average a factor of 2 faster (i.e., ≈7 ps) (Figure 14b) than elsewhere (i.e., ≈15 ps).
An explanation for these findings points to an order-2 sub-interpolation effect [13,28] between the Causal and Anti-Causal sections of the TDL. Essentially, if we consider the Anti-Causal section as a delayed replica of the Causal one, the impact becomes more evident (Figure 15). Indeed, if an uncorrected BE-CRC occurs, it means that we are simultaneously decoding two pieces of information from the same measurement interval, one from the NTA and one from the NTB, which are sub-interpolated (i.e., added) with each other in the sum1s decoding, like in a WUA [28,32]. This gives rise to an unexpected order-2 intra-TDL sub-interpolation effect that helps to increase the resolution only if the sub-interpolated bins are properly mapped in the Anti-Causal section.
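If we model the Causal and Anti-Causal sections as an order-2 sub-interpolation (our simplification of the WUA analogy), the ideal effect on the bin widths can be summarized as:

```latex
% Idealized effect of an order-2 sub-interpolation between the Causal and
% Anti-Causal sections: the measurement interval is covered by two overlapped
% bin sets, halving the effective bin width and, ideally, the equivalent LSB.
\[
LSB_{virtual} \;\approx\; \frac{\bar{\tau}}{2},
\qquad
ELSB_{virtual} \;\approx\; \frac{ELSB}{2}
\]
```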
The results would have been quite different if, instead of a bubble compression-based decoder like sum1s, we had used a transition-based decoder like Log2. In fact, in that case, the Causal portion of the TDL would be completely masked by the Anti-Causal portion, effectively eliminating the order-2 intra-TDL sub-interpolation effect.
5. Experimental Results
In this section, the precision and the quantization error obtained with and without the correction of the BEs-CRC, considering different values of NTA = NTB (i.e., 1, 2, 6, and 8), are measured. The precision was evaluated as the standard deviation of the timestamps, in our case over a pool of repeated measurements, provided by the Nutt-Interpolated TDL-TDC when measuring a constant time delay defined between a START signal and a STOP signal. Meanwhile, the quantization error was calculated using the $ELSB$ provided by the TDL, independently of the time delay. It was decided to vary the values of NTA and NTB to better observe the behavior of Algorithm 1 for the BEs-CRC correction. To account for potential dependencies between the precision and the measured time interval, the precision of the TDL-TDC was evaluated as a function of the time interval between START and STOP.
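A minimal sketch of the offline evaluation just described (the data arrays are synthetic placeholders; the real ones come from the FELIX readout) is:

```python
# Precision: std-dev of repeated measurements of a constant START-STOP delay.
# Quantization error: ELSB / sqrt(12), computed from the calibration table.
import numpy as np

rng = np.random.default_rng(0)
timestamps = 5e-9 + rng.normal(0, 5e-12, 100_000)   # placeholder pool for one fixed delay [s]
tau = rng.uniform(1e-12, 30e-12, 190)               # placeholder calibration table [s]

precision = np.std(timestamps)                      # experimental precision
elsb = np.sqrt(np.sum(tau**3) / np.sum(tau))        # Equivalent LSB from the CT
sigma_q = elsb / np.sqrt(12)                        # theoretical quantization error
print(f"precision = {precision*1e12:.2f} ps, sigma_q = {sigma_q*1e12:.2f} ps")
```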
The START and STOP signals (i.e., the two coaxial cables in Figure 16) were generated using the ACTIVEAWG401X [41] (the black box with monitor in Figure 16) in delay sweep mode from −0.5 ns to 16.8 ns with a step of 50 ps. The 256-tap long TDL-TDC and the 8-bit long coarse counter presented in Section 2 have been implemented in a Xilinx 28 nm 7-Series Artix 100T FPGA (i.e., the FELIX module [38] provided by TEDIEL S.r.l. [39], the black printed circuit board in Figure 16), while Algorithm 1, together with the sum1s decoding, the calibration, and the Nutt Interpolation, is performed on a PC. Both the thermometer codes from the TDL and the coarse counter values needed for the Nutt Interpolation are transmitted to the PC via USB 2.0.
Referring to the Causal and Anti-Causal portions, we know from Section 4 that NTB = 6 and NTA = 6. Consequently, as Figure 17 shows, we have evaluated the precision as a function of the time delay between the START and the STOP using different values of NTA = NTB: in orange, the uncorrected solution, and in blue, the corrections with NTA = NTB = 6, as computed in Section 3, and with bigger (i.e., 8) and smaller (i.e., 1 and 2) values.
Furthermore, upon examining Figure 17, one can quickly observe the overall lower precision values (i.e., better precision) of the uncorrected scenario (orange), which underscores the precision gain provided by the sub-interpolation between the Causal (i.e., from tap 66 to tap 71) and Anti-Causal (i.e., from tap 72 to tap 77) portions of the TDL, as mentioned in Section 4.
Table 1 offers an overview of all the results and emphasizes the differences in the TDL-TDC maximum (i.e., Max $\sigma$), minimum (i.e., Min $\sigma$), and mean (i.e., Mean $\sigma$) precision with and without the BEs-CRC correction.
Table 2 illustrates a comparison of the theoretical precision (i.e., the quantization error $\sigma_q = ELSB/\sqrt{12}$) [13], computed from the $ELSB$ offered by the TDL-TDC with and without the BEs-CRC, together with their spread. We can observe that $\sigma_q$ is compatible with the experimental results reported in Table 1 (i.e., Mean $\sigma$).
In this way, we have demonstrated how the order-2 intra-TDL sub-interpolation effect, investigated in
Section 4, positively impacts the measurement precision by reducing the quantization error. By observing
Table 1 and Table 2, we can note that if we select arbitrary values for NTA and NTB, different from those delimiting the Causal and Anti-Causal portions of the TDL, we increase the quantization error, thereby reducing the precision.