Design of a Parallel Sampling Encoder for Analog to Information (A2I) Converters: Theory, Architecture and CMOS Implementation

We discuss the architecture and design of parallel sampling front ends for analog to information (A2I) converters. As a way of example, we detail the design of a custom 0.5 µm CMOS implementation of a mixed signal parallel sampling encoder architecture. The system consists of configurable parallel analog processing channels, whose output is sampled by traditional analog-to-digital converters (ADCs). The analog front-end modulates the signal of interest with a high-speed digital chipping sequence and integrates the result prior to sampling at a low rate. An FPGA is employed to generate the chipping sequences and process the digitized samples.


Introduction
Today, the vast majority of information processing takes place in the digital domain where computation on large scales has become increasingly cheap and convenient.Unfortunately, there are still many applications that struggle to take advantage of digital computation because the signals involved exceed the capabilities of current physical hardware approaches.Analog-to-digital converters (ADCs) are the traditional method for capturing real world signals and converting the information into digital samples.The two most critical measures of an ADC are its sampling speed, which limits the bandwidth of the signals it can accommodate, and the resolution, which limits the accuracy of the resulting samples.Conventional approaches process signals sequentially and different architectures trade sampling speed for resolution or vice versa.This relationship is illustrated in [1], where Walden analyzed the state-of-the-art in ADC technology in 1999, showing empirically that each additional bit in resolution decreases the sampling rate by a factor of two.Since then, these architectures have slowly increased in speed as fabrication processes have continued to improve according to Moore's law, but the basic trend endures.
More recently, the field of compressed sensing (CS) [2] has made a number of theoretical developments that leverage the structure of sparse signals into reduced sampling rates.One application where compressive sampling promises advances in the state-of-the-art is in the design of Analog to Information converters (A2I).
A typical A2I architecture consists of a front-end encoder and a back-end decoder with an analog to digital converter in between (see Figure 1).The purpose of the coder is to transform the input signal into a representation that efficiently preserves information relevant to the function of the system.To achieve this, coders may be designed to exploit prior knowledge of the signals to be processed.Exploiting such prior knowledge has been employed in the design of hardware architectures for non-uniform amplitude [3,4] and spectral quantization [5].Modern communication systems also do this when high frequency signals are down-converted to base-band prior to low-rate sampling [6].

Encoder
Quantizer Decoder Analog Digital

Analog to Information Converter
Although related sampling theories, such as non-uniform sampling [7], have also existed in the literature for many years, the CS framework has lead to the appearance of several novel sampling schemes.The modulated wideband converter (MWC) [8,9,10] and the random demodulator (RD) [11,12,13] are two of the earliest architectures.The compressive multiplexer (CMUX) [14] is another more recent concept for hardware implementation.Each of these architectures achieves compressive sampling through some combination of parallelism, random modulation or sampling, and prior knowledge about the signals of interest.However, the assumptions on signal models and the details of the sampling process are different.
The RD architecture, described in [13,15], modulates the input signal with a random chipping sequence, and integrates the modulator output before digitizing with a conventional ADC.The practical details of implementing a RD architecture are described in [12].The RD operates on analog signals composed of sparse linear combinations of known basis functions.The random demodulator concept was demonstrated using discrete components in [16].Since the Nyquist rate requirement is satisfied by the high frequency contents of the chipping sequence, the ADC is allowed to operate at sub-Nyquist rates, and only the generation of the chipping sequences and the modulator require high speed hardware components.From a hardware perspective, compressive sampling is desirable because the modulation of the input signal with a binary sequence is simpler and easier to implement at high sampling rates as compared with the sample and hold circuits necessary for interleaved architectures.A single chip sub-Nyquist sampling receiver architecture operating between 100 MHz and 2 GHz was recently reported in the literature [17,18].
The MWC architecture modulates the input signal with multiple chipping sequences in parallel.The modulator output is low-pass filtered before being digitized by conventional ADCs.The MWC operates on analog signals that comply with the multi-band signal model described in [8].Note that the system modeling is primarily done in the frequency domain (whereas the RD is primarily modeled in the time domain).To the best of our knowledge it is also the only full scale system that has been realized in hardware using discrete components.
In order to process information in applications that require data conversion rates above the practical limits of traditional hardware, we need to consider alternative architectures.Parallel analog to digital converter architectures enable the design of high performance hardware systems in state-of-the-art CMOS technologies that can go beyond the bandwidth and resolution possible in single channel systems.Breaking the data conversion problem into multiple parallel sub-tasks is not a new concept, as exemplified by interleaved ADCs [19] and other random sampling architectures [20,21], which have existed in the literature and in practice for many years.Interleaved ADCs operate multiple ADCs in parallel with offset sampling times such that the resulting low-rate samples can be multiplexed back together into a higher effective sampling rate [22].Although these architectures resolve the difficulty of designing fast ADCs, the high speed sample-and-hold circuits remain a challenge for high resolution Gsps ADCs.
No single ADC architecture can cover the whole design space.For instance, flash ADC architectures can produce a digitized sample with a non-iterative algorithm at high speeds.Unfortunately, they achieve this through massive parallelism that incurs an exponential cost in silicon real-estate, quickly limiting the achievable bits of resolution.On the other end of the spectrum, sigma-delta ADC architectures can produce high resolution samples using very little silicon area.However, it takes a relatively long time for the result to converge.Algorithmic ADC architectures often exhibit a more balanced tradeoff between bandwidth and resolution.Parallel data converter architectures look to harness the capabilities of multiple individual data converters to boost the overall system performance, generally by increasing the effective sampling rate while maintaining a resolution superior to that of the fastest flash architectures.
In this paper, we present a sampling architecture for compressed and Nyquist sampling [23] and provide experimental results from a fabricated system in 0.5 µm CMOS technology.Even though we aim for reasonably high speed and resolution, our goal is to explore hardware CS sampling architecture ideas and hence our system is not fabricated in a state-of-the-art CMOS process.The coder (front end) architecture presented in this paper is inspired by both the RD and MWC schemes.We employ the concept of chipping and integrating from the RD, and the concept of multiple parallel channels from the MWC.The time domain modeling is also derived from the RD.The hardware combines multiple resources in parallel and is flexible enough to leverage additional signal structure, allowing information to be captured at a low data rate.The system is highly configurable and illustrates situations that mirror the behavior of interleaved ADC architectures as well as compressed sensing architectures.

Sampling Architecture
In the analog to information converter framework, signals are first processed in the analog domain before being quantized by analog to digital converters (ADC) [20].Additional signal processing in the digital domain may be required to recover the original input signal in a usable form.Such a situation occurs when analog signal processing is performed to optimize the performance of the analog to digital conversion.
Note that conventional data acquisition systems already follow this general framework in that they almost invariably perform simple transformations of the input signal, such as gain and offset correction or non-linear amplitude compression.These transformations are reversible in the sense that they can be undone with only knowledge of the transformation that was applied.However, irreversible transformations are also performed, the most common being low-pass filtering to satisfy the Nyquist criterion for the ADC.
In the specific implementation of our sampling architecture described herein, the analog coding consists in modulating the input signal with a high-speed chipping sequence and filtering the modulator output.The filter output is then digitized by the Nyquist ADC at a relatively low rate.When multiple parallel channels are used, each channel will normally use a different chipping sequence, so that the ADC outputs are not duplicated, unless some explicit redundancy is desired.

The Analog Processing Channel
Our implementation of the analog processing channel is shown schematically in Figure 2. The input signal x(t) is multiplied with a digital chipping sequence c m (t) to produce a modulated signal p m (t), which in turn is integrated over a finite time period to produce the channel output signal s m (t).The channel output is sampled and digitized by the ADC, producing a digital sample y m [n].
For an analog front-end using M channels, the subscript m refers to the channel index and takes values from the set {1, . . ., M }.The integration period T s is also the sampling period of the ADC, so we define the discrete time index n such that t = nT s , and hence Since each chipping sequence c m (t) is a finite digital sequence, it is piecewise constant and can be described by a finite set of N bits.The integration period T s does not need to be the same as the chipping sequence length, but we do require that it be an integer multiple of T b , the duration of a chipping sequence bit.Additionally, since each integration period operates in essentially the same way, we can restrict our analysis to the first integration period (0 ≤ t < T s ) and let T s = N T b without loss of generality.Therefore, let takes values from the set {−1, +1}, and i = 1, . . ., N indexes those values.The modulator output is just the product of the input signal and the chipping sequence, so Therefore, the output of a single integration period is (As we are only considering the first integration period, we have dropped the discrete time index for simplicity.) Since c m (t) is piecewise constant, Equation 1 shows that the digitized samples can be broken down into a linear combination of the channel chipping sequence and short-term time integrations of the input signal x(t) over the bit period T b .If the duration of the bit is too long, any high-frequency components of the input signal will be lost or aliased.Therefore, the rate at which the chipping sequences alternate, 1 T b , determines the effective input signal bandwidth that can be resolved by the system.
To simplify the expression for y m , we define the auxiliary discrete time signal z[i], such that We also define the vector notation of the bit sequence and the set of samples y = (y 1 , . . ., y M ) T Now we can express the set of samples obtained from the front-end after the first integration period as: where C = (a 1 , . . ., a M ) T .If the duration of the bit is sufficiently short, then the input signal x(t) can be reasonably approximated by the discrete time estimate x[i], where Recovery of the digitized input signal is therefore a matter of solving Equation 3 for z.

System Configurations
The relationship between the output of a channel and the channel's chipping sequence expressed in Equation 1 leads immediately to several interesting and instructive system configurations.
Case 1: N = M.If C is the identity matrix, then only one channel would be sampling the input during each bit period.If the chipping sequences were composed of 1's and 0's, then it would be easy to observe that the output of the mth channel is simply the integral of the ith bit period.Moreover, because the number of channels (M) is the same as the number of bit periods (N), when the parallel samples from the channels are interleaved in the right order, the original signal can be estimated.This is essentially how an interleaved ADC arrangement operates.
To get around the fact that we have arbitrarily restricted the composition of the chipping sequences to contain only ±1, we can achieve the same effect by setting Applying these chipping sequences to Equation 2 yields where we have added and subtracted an additional z[m] term to complete the summation over N .As N = M in this case, the term z[m] corresponds to the +1 chipping coefficient, while all other z[i], for i = m, are modified with a −1 coefficient.Solving for z[m] yields, Hence, although each ADC is only sampled at a rate of f s = 1 Ts , the effective sampling rate is N f s .For other cases where N = M, it is sufficient that C be invertible, so that z can be recovered according to Under these conditions, the chipping sequences form a basis set.Although C −1 can be precomputed, so that the signal recovery requires only a matrix-vector multiplication, there is very little apparent advantage to this approach over the previous interleaving method.
Case 2: M > N. If the number of channels M is greater than the number of bits in the chipping sequence N, then Equation 3 is over-determined.A pseudo-inverse of C is then needed to recover z from y, but the pseudo-inverse can also be precomputed.The advantage of this approach is that the added redundancy may improve the robustness of the system.
Case 3: M < N. In general, this corresponds to an under-determined system and we would not be able to recover z uniquely.In this setting, z can still be recovered if additional information about the input signal is known.For example, if x(t) meets appropriate sparse structure assumptions, then we can use pseudo-random chipping sequences, and the situation reduces to the well studied L 1 optimization problem addressed in [2,12].

System Extensions
In practice, the chipping sequences c m (t) are periodic so that they can be stored in a finite amount of space in a digital controller.However, the chipping sequence period T c can be different from the integration period T s , so that C changes from one integration interval to the next.All that is required for reconstruction of the signal in a given integration interval is for the C of that integration interval to be known to the digital controller.
Note that both T s and T c must be integer multiples of the bit period T b , and that the chipping sequence is also periodic in 2T c , 3T c , etc. Therefore, without loss of generality, we can redefine the chipping sequence period such that T c = kT s for some integer k, where k is the number of distinct versions of C.
Thus far we have assumed that the signal in a given time interval is reconstructed solely from the ADC outputs of a single integration interval.However, for some applications, it may be advantageous to reconstruct the signal using the ADC outputs from several (e.g., k) integration intervals.For instance, suppose that the input signal is known to be periodic in T s .Then, the M chipping sequences can be applied by a single channel sequentially over M integration intervals, and the same set of M ADC outputs needed for reconstruction will be obtained but with a substantially reduced amount of hardware.
In general, we can reduce the number of channels by concatenating chipping sequences over several integration intervals and using the ADC outputs from these integration intervals in the reconstruction.However, the properties of the signal must not change substantially over these integration intervals, and the reconstruction algorithm must take into account the time shift between ADC outputs obtained from different integration intervals.This allows the reduction of the physical complexity of the sampling architecture at the cost of constraining how quickly the input signal characteristics are permitted to evolve in time.

Analog Hardware Implementation
A complete sampling system can be built with discrete components, using discrete ADCs and a field programmable gate array (FPGA) to implement the digital control and chipping sequence generation, as shown in Figure 3.The problem with this approach is that the analog front-end channels require many components to implement them with discrete parts.
To address this problem, we have designed and fabricated an analog front-end custom integrated circuit in a 0.5 µm CMOS process.The chip contains all the analog front-end components except for the ADCs (which we plan to integrate in future versions).The front-end was designed to operate from 20 kHz up to 200 MHz.The current design has 8 channels, but multiple chips can be operated in parallel for cases where M > 8.A micrograph of the fabricated chip is shown in Figure 4.The active chip area is 0.66 mm 2 (the remaining area is consumed by the I/O pads, wiring and fill).

Channel Pipeline
The proper operation of the front-end requires that the integration periods follow one another without interruption.In practice the integration time interval needs to be followed by a hold interval to allow the ADC's sampling circuit to acquire the integrator's output, and then a reset interval to reset the integrator to its starting value.The hold interval in particular can be quite long, because commercial ADCs normally include track-and-hold circuits whose performance is on par with the conversion circuits.
We therefore designed our front-end channel using two integrators operating in tandem.While one integrator is integrating the input signal, the other is holding its value for the external ADC.Each integrator is reset just prior to switching back to integration mode, so that the sum of the reset interval and the hold interval is equal to the integration interval.This allows the output of one of the integrators to be available to the external ADC for almost a full integration period.A two-input voltage follower is used to select which integrator output is sent to the ADC.This arrangement is shown schematically in Figure 5.

Mixer Integrator Two-Input Follower
Output Buffer s m (t)

Modulator Design
Since the input signal is analog, but the chipping sequence is binary, the modulator does not need to be implemented with a full analog multiplier (e.g., Gilbert multiplier).Rather, it is sufficient to invert x(t) and select +x(t) or −x(t) according to whether c m (t) is 1 or 0 (0 being the binary representation for a chipping sequence value of −1).
The schematic of the modulator is shown in Figure 6.The input signal x(t) is first converted from a single-ended voltage v x to a differential current i x − īx by the PMOS source-coupled pair (M 6 -M 7 ).
Because the process does not provide high value resistances having low parasitic capacitance, source resistors were not used in this version to extend the linear range.i x and īx are each fed to NMOS mirrors with switched outputs (M 8 -M 13 ).The switched currents are then combined into i p and īp such that i p − īp = i x − īx when the chipping sequence is 1, and i p − īp = īx − i x when the chipping sequence is 0. The currents i p and īp are then each fed to the input of a diode-connected PMOS transistor (M 4 -M 5 ), nodes v p and vp respectively, for distribution to the two integrators.
The digital control logic that manages the chipping sequence employs a set-reset latch configured so that the internal chipping signal C and its complement C are never overlapping in the high state.This ensures that the current in the diode-connected PMOS transistors (M 4 -M 5 ) never go to zero.
Two tail transistors were used for the PMOS source-coupled pair (M 6 -M 7 ) so that the average current density in each transistor in the modulator is the same.This simplifies circuit analysis by giving each transistor consistent current dependent characteristics, such as frequency response.
For high speed inputs, the bias current I 1 can be adjusted to ensure that the modulator is operating fast enough to keep up with the input signal.For lower speed inputs, it can be reduced to limit the power dissipation.

Integrator Design
The next stage in the analog front-end is the dual integrators, whose circuits are shown in Figure 7.Because we are operating two integrators in tandem, the integrator circuit is used twice and referred to as the left and right integrators.The modulator output is first scaled by applying v p and vp to a PMOS source-coupled pair (M 19 -M 20 , or M 21 -M 22 ).The resulting current i s is fed through a switch to the appropriate integration capacitor (C L or C R , respectively).
This results in two output signals, v s L and v s R , one from each integration capacitor.The signals controlling the switches are arranged to achieve a ping-pong integration scheme such that only one of the two integrators is integrating at any given moment.The other integrator is in the hold mode and can be reset by asserting the chip's reset input without affecting the operation of the first integrator.
Due to the inverting nature of the PMOS control switches (M 23 -M 26 ), the integrator digital control logic employs a set-reset latch configured so that the internal select signals S L and S R are never overlapping in the low state.This ensures that charge is being deposited on at least one capacitor at all times so that there are no interruptions in the sampling process.
Similar to the modulator circuit, two tail transistors were used for the integrator circuits so that the average current density in each transistor in the integrator is the same.The bias current I 2 is used to control the integration range of the front-end.For long integration times or large amplitude input signals, I 2 can be reduced to prevent overflowing the charge capacity of the integrators.For short integration times or small amplitude input signals, I 2 can be increased to maximize the integrators' output amplitude.The combined die area consumed by the mixer and integrator for each channel is 14,000 µm 2 .

Two-Input Follower and Output Buffer Design
The next stage in the signal chain is the two-input follower shown in Figure 8.This circuit is derived from the five transistor operational transconductance amplifier (OTA).When the output select input is high, v s R is copied to the output v sout , otherwise v s L is copied out. Figure 8. Two-input follower and output buffer circuit schematics.
The digital control logic for the follower is also configured so that the internal select signals O L and O R are never overlapping in the low state.These signals interface with a set of NMOS switches (M 36 -M 39 ), ensuring that only one capacitor node at a time is selected.The die area consumed by the follower for each channel is 7600 µm 2 .
In order to achieve a stable output and drive the integrated signal off-chip to be sampled by discrete ADCs, an operational amplifier (OPAMP) was used to buffer the signal.The second section of Figure 8 shows the circuit schematic for the OPAMP.The output of the analog channel is v sm and was designed to drive a load of approximately 50 pF at 50 MHz.This load is the combined capacitance of the output pad, the chip package and the input of an ADC (e.g., AD7276).In order to achieve this drive capability, each transistor in the first (input) stage of the OPAMP (M 42 -M 48 ) is composed of 12 individual transistors in parallel, and each transistor in the second (output) stage (M 49 -M 50 ) is composed of 600 individual transistors in parallel.The resulting capacitive load of the output stage is large enough that the OPAMP does not require any additional compensation.The OPAMP consumed 61,000 µm 2 of die area for each channel.As the capacitors begin their integration after being reset to ground, the OTA and OPAMP circuits were selected for their ability to accurately drive the channel output as close to ground as possible at the required operating frequency.

Testing
A circuit board with all of the components necessary to support and test the fabricated front-end chip at low speed was developed and is shown in Figure 9.This test board supports a maximum of two front-end chips for a total of sixteen analog channels.The board also contains 16 individual ADCs (located under the front-end chips), a single DAC, and an FPGA.Each ADC is a 12-bit 3 Msps converter (AD7276) and is dedicated to digitizing the output of a single analog channel.The DAC is a 14-bit 200 Msps (AD9744 with AD8041 OPAMP) converter that supplies a programmable analog input signal to the front-end chips.The FPGA (XC3S50AN) coordinates the operation of the front-end chips and the data converters.It is responsible for supplying chipping sequences to the front-end chips as well as the input signal in digital form to the DAC.It is also responsible for collecting the ADC samples and relaying them to the host computer via a USB interface.

Channel Calibration
Before applying actual analog input signals, we first calibrated both integrators (left and right) in each of the sixteen channels.These calibration steps were all performed using constant analog inputs and chipping sequences.

Bias Current Adjustment
The front-end chip uses four bias currents, I 1 through I 4 , to tune the operation of the mixer, integrators, follower and output buffer respectively.I 1 , I 3 and I 4 were left at their nominal value of 10 µA, while I 2 was tuned to the integration period T s of 1 µs.
To tune I 2 , analog inputs and chipping sequences must be applied in such a way as to obtain maximum integrator output.This was accomplished using a constant maximal DAC output (DAC set to +8191) with a constant chipping sequence of +1, as well as a constant minimal DAC output (DAC set to −8192) with a constant chipping sequence of −1.The bias current was then progressively reduced until the integrators were no longer saturating.The final value of I 2 was 0.1 µA.

. Input Dynamic Range
The front-end chip uses a source-coupled pair to convert the analog input to a current.Since the circuit does not make use of source resistors, the linear region is fairly narrow.The linear region was characterized by sequentially applying constant analog input signals with constant chipping sequences (both +1 and −1) for all 2 14 possible DAC codes (−8192 to +8191).
The result, reported in Figure 10, shows that the input saturates at the edges of the range −1024 to +1024.Therefore we chose the input range −512 to +512, which represents an input dynamic range of approximately ± 30 mV, and avoids the saturation region.

Mixer Symmetry
Ideally, the operation of the front-end should be symmetric, in that if an analog input of v is applied with a +1 chipping sequence, the resulting integrator output should be the same as for an analog input of −v and a chipping sequence of −1.However, we noted during the input dynamic range test that the operation of the front-end was not symmetric due to transistor mismatch in the mixer.Two causes for the asymmetry were identified.
The first cause is the mismatch between the two transistors forming the source-coupled pair.This introduces an input offset φ in , which can be compensated by adjusting the V ref for each channel.Since this was not an option for the current front-end implementation, we instead corrected for the input offset by adding a constant to the DAC values.This prevented us from operating multiple channels in parallel, and was not by itself sufficient to eliminate all the asymmetry in many channels.
The second cause is the mismatch between the transistors composing the current mirrors.These cause the current gain G +1 for a +1 chipping sequence to be different from the current gain G −1 for a −1 chipping sequence.Unfortunately, these mismatches cannot be trivially corrected.
The input offset and current mirror gains were estimated for each integrator by applying a constant maximal or minimal analog input and constant chipping sequence of +1 or −1.The four corresponding integrator outputs are given by where φ out is the output offset.From these relationships, we can extract the offsets and current mirror gains using In practice however, due to the non-linearity of the source-coupled pair, we first obtained the input offset φ in through an iterative process, in which we computed an input offset from the above equation, and then reapplied the test patterns with the new input offset.This process converged after three to four iterations.
The values shown in Table 1 are for channel 4L (the left capacitor of the fourth channel) after convergence of the input offset.For this integrator, G +1 = 1.36,G −1 = 1.37 and φ in = 36.Several other integrators exhibited near perfect symmetry, but many of the remaining integrators exhibited pronounced differences in the current mirror gains.The transfer characteristics for channel 4L (after input offset correction) is shown in Figure 10.The curve with the positive slope was measured with a constant chipping sequence of +1, while the curve with the negative slope was measured with a chipping sequence of −1.Close observation shows that the transfer characteristics are slightly non-linear so that the two curves do not cross at a DAC input of zero.

Chipping Sequence Sensitivity
With a constant analog input signal, the integrator output should be insensitive to the order of the chipping sequence bits.For instance, any chipping sequence with zero mean should produce the same integrator output.However, when we switched from using the two possible constant chipping sequences to other chipping sequences, we found that the front-end did not behave as expected.Figure 11 shows the output of channel 4L for two chipping sequences and a zero constant analog input signal (with input offset correction applied).The first chipping sequence is a square wave with period equal to the integration period (T s = 1 µs), and the second is a square wave with a period of one eighth of the integration period ( Ts 8 ).Each chipping sequence was circularly shifted by one sixteenth of the integration period ( Ts 16 = T b or one bit period of the fast square wave).

Circular Shift Sensitivity
For both chipping sequences, the integrator output is sensitive to the circular shift, even though the circular shift does not affect the proportion of +1 to −1 bits.This is probably caused by leakage from the integration capacitor.

Frequency Sensitivity
Although both chipping sequences have zero mean, the integrator output is always lower for the chipping sequence with fewer transitions.This is probably due to charge injection when chipping input toggles.

Output Offset Correction
The chipping sequence dependent behavior of the front-end chip is particularly problematic with respect to the output offset φ out .Normally, φ out can be measured directly by applying a zero constant analog input, and recording the integrator output for each chipping sequence of interest.The recorded output offset can then be subtracted from data subsequently collected with the analog input signals of interest.
Alternatively, the chipping sequence and its complement can be applied with the analog input signal of interest, and the two integrator outputs subtracted.Since the offset being corrected is a property of a given channel, the chipping sequence and the its complement sequence must be applied sequentially to the same channel (this is the k = 2 case described at the end of Section 2.3).Although this requires that the input signal be applied twice, in practice, this gave better results than using the recorded output offsets.

Signal Sampling and Reconstruction
Due to the channel dependent offsets, the system was emulated by sequentially applying the chipping sequences to channel 4L.To match the implemented system limitation of sixteen channels, the chipping sequence sets were constrained to at most 16 distinct chipping sequences of 16 bits each.Since the external ADCs only operated reliably at a maximum of 1 MHz, this resulted in a maximum chipping sequence frequency of 16 MHz even though the chips were successfully tested at frequencies up to 256 MHz.

Reconstruction with Complete Chipping Sequences
Figure 12 illustrates the reconstruction of a 1 MHz sine wave using three different sets of chipping sequences.Each of the three sets includes a full sixteen chipping sequences (and their complements) such that the matrix can be directly inverted to recover the input signal.The first set of chipping sequences C is the identity matrix in which the zero entries have been replaced with −1.The second C consists of square waves with periods of 0, T s , Ts 2 , Ts 4 and Ts 8 .All possible phase shifts are represented to form an orthogonal basis.The third C consists of sixteen random chipping sequences.Since each chipping sequence set in this case is invertible, the reconstructed input is fairly close to the original as shown in Figure 12, and these sequences all perform about the same.
The total reconstruction signal-to-noise ratio (SNR) is reported in the second column of Table 2 and was estimated by comparing the power of input signal to the power of the reconstructed signal in the time domain.However, the error is not entirely attributable to the front-end chip.For instance, a portion of the error is due to the approximation Equation 4. Another portion of the error is due to the piece-wise constant nature of the DAC output, which was operating at 16MHz.
To eliminate some of these extraneous sources of reconstruction error, we computed the ideal value of the ADC samples y given a quantized piece-wise constant input signal representing the DAC output.The simulation was parametrized by gain, offset and phase, and the maximum SNR obtained is reported in the third column of Table 2.The gain and offset parameters account for the frequency response effect shown in Figure 11, and the phase parameter accounts for the small time difference between when the DAC updates its output and when the chipping sequences change state.

Reconstruction with Reduced Chipping Sequences
To explore the ability of the sampling system to reconstruct sparse analog signals, we considered a basis set Ψ = {ψ i (t)} composed of both sine and cosine elements: The following results were generated using an input signal created from the sum of two randomly chosen basis elements, each given a random amplitude (N = 16).The resulting signal was programmed into the DAC (at 16 Msps) and the output of the sampling system was used to reconstruct the signal.Figure 13 shows the input signal and the resulting reconstruction achieved with the random chipping sequence set described previously.The reconstruction shown in Figure 14 was obtained from a reduced set of M = 10 chipping sequences.It is important to realize that the elements of the basis set are continuous functions of time, and therefore, so is x(t).However, because the structure of x(t) is constrained to a finite set of basis elements ψ i (t), x(t) exhibits sparsity with respect to the basis Ψ, which is key to recovery when M < N. A basis pursuit optimization algorithm, described in [24,25], was used to recover the original signal once the samples were transmitted to the host computer.While a conventional Nyquist rate ADC would require 16 samples to achieve similar results, our architecture demonstrates reconstruction of the signal with similar quality using as few as eight samples.Figure 15 reports the reconstruction SNR averaged over a hundred trials where the signal is reconstructed from 1 to 16 chipping sequences randomly selected from a full set of 16 sequences.

Discussion
Today, the vast majority of information processing takes place in the digital domain where computation on large scales has become increasingly cheap and convenient.Unfortunately, there are still many applications that struggle to take advantage of digital computation because the signals involved exceed the capabilities of current physical hardware approaches.
Our performance analysis of the analog front-end provides valuable insights for improvements to be made in the next generation of hardware.Integrating additional discrete components like the channel ADCs into the chip will solve a number of practical issues.Analog signals typically require far more power to drive off chip than digital signals.Replacing the large analog drivers in the existing chip with dedicated ADCs will greatly reduce the complexity of the overall system and provide a more controlled connection between a channel's integrating node and the ADC.This in turn will allow for smoother operation of the ping-pong sampling scheme.
Technology related practical issues that affect the performance of analog to information converters were discussed in previous papers on the implementation of integrated systems [26].Channel matching is clearly a major obstacle for the highly parallel architecture.Furthermore, our current system model assumes that the channel components are linear, limiting its ability to account for the nonlinear behavior exhibited by many of the fabricated components.Nonetheless, our work in this paper is the first experimental realization of an integrated sampler for an analog to information converter where fabrication and implementation related issues are evident in the experimental data.Further work and development of implementation models [27] for the non-idealities in the system needs to be done before these fabricated systems can achieve their theoretical limits.In future designs, it may be necessary to provide expanded calibration capabilities in addition to redesigning the integrators to provide a more uniform result.It may also be necessary to augment the model to account for some of these nonlinear effects in order to fully explain the reconstruction errors.Although the prototype system is limited in speed by the fabrication process used and its reliance on external ADCs and digital processing components, it provides an important first step towards a fully integrated system.All the hardware components were designed to allow for the possibility of building data converters in commercial processes with higher effective bandwidths.
Note that the relative simplicity of the circuit elements in the mixer and integrator was intentional and is an important practical consideration for this architecture.Indeed, in order for the architecture to retain its broad applicability, it must necessarily operate near the maximum speed of the fabrication process, independent of which fabrication process is used.This implies that complex circuit elements, such as OPAMPs, cannot be used in the mixer and integrator, because they would limit the speed to such an extent that Nyquist ADCs could be used instead to directly digitize the input signal.
Our D.C. coupled system is complementary to the radio frequency architectures recently reported in [17,18].We expect that further optimization of the circuit components combined with fabrication in a state-of-the-art process will allow our system to span both ranges of operating frequencies.In the future we envision such systems as solid platforms on which to develop a new generation of efficient data converters that improve upon the bandwidth and resolution constraints of conventional ADCs.

Conclusions
We have presented an integrated sampling sub-system fabricated in 0.5 µm CMOS that uses flexible D.C. coupled analog components and programmable digital components to create a highly versatile architecture capable of implementing a wide array of sampling schemes that can sample signals from a few kHz to many hundreds of megahertz.This architecture can be employed in a conventional ADC architecture or a compressive sampling ADC.
There are many applications involving signals that should be digitized but are not digitized, either because they are too fast to sample or because the application requires a higher resolution than existing hardware solutions can provide.The recent developments in the field of compressed sensing suggest that more efficient sampling schemes exist for many classes of signals, but the transition of these ideas to physical systems is still in a nascent state.

Figure 1 .
Figure 1.Analog to information converter architecture.

Figure 2 .
Figure 2. Block diagram of the channel.

Figure 9 .
Figure 9. Picture of the sampling system prototype.

Figure 12 . 1
Figure 12. 1 MHz sine wave input with output offset corrected using the full chipping sequence complement.

Figure 13 .
Figure 13.Reconstruction of input signal composed of two randomly chosen basis elements with random amplitudes.

Figure 14 .Figure 15 .
Figure 14.Compressive reconstruction of input signal composed of two randomly chosen basis elements with random amplitudes.
sequences in set (M) Average reconstruction SNR (dB)