hugo-site/content/publications/2017/a-0-5-v-time-domain-instrumentation-circuit-with-clocked-and-unclocked-rds-operation.md

22 KiB
Raw Permalink Blame History

title date draft toc math type tags
A 0.5 V time-domain instrumentation circuit with clocked and unclocked ΔΣ operation 2017-05-28T15:26:46+01:00 false true true posts
publication
instrumentation
CMOS
time-domain
circuit

Lieuwe B. Leene, Timothy G. Constandinou

Department of Electrical and Electronic Engineering, Imperial College London, SW7 2BT, UK

Centre for Bio-Inspired Technology, Institute of Biomedical Engineering, Imperial College London, SW7 2AZ, UK

1 Abstract

This paper presents a time-domain instrumentation circuit with exceptional noise efficiency directed at using nano metre CMOS for next generation neural interfaces. Current efforts to realize closed loop neuromodulation and high fidelity BMI prosthetics rely extensively on digital processing which is not well integrated with conventional analogue instrumentation. The proposed time-domain topology employs a differential ring oscillator that is put into feedback using a chopper stabilized low noise transconductor and capacitive feedback. This realization promises better digital integration by extensively using time encoded digital signals and seamlessly allows both clocked & unclocked \(\Delta\Sigma\) behavior which is useful on-chip characterization and interfacing with synchronous systems. A 0.5 V instrumentation system is implemented using a 65 nm TSMC technology to realize a highly compact footprint that is 0.006 mm² in size. Simulation results demonstrate an excess of 55 dB dynamic range with 3.5 \mmu Vrms input referred noise for the given 810 nW total system power budget corresponding to an NEF of 1.64.

2 Introduction

Recent efforts to realize brain machine interfaces (BMI) target fully integrated neural recording systems that use advanced CMOS technologies to enable real time diagnostics for hundreds of channels simultaneously. This emerging trend is predominately motivated by the extensive use of digital techniques applied to robust therapeutic feedback for closed-loop neuromodulation and signal compression/feature extraction in high channel count BMIs for prosthetic motor control1. However there are a growing number of challenges associated with integrating analogue instrumentation for these digital systems due to the loss in analogue transistor characteristics which has motivated the use of time domain (TD) analogue 2. TD systems encode information with respect to the timing intervals of asynchronous digital signals to perform mixed signal processing while extensively using standard logic and oscillators that do not suffer from analogue complications. Many recent publications will indicate the potential for exceptional dynamic range like in the recording system of 3 or highly compact instrumentation like the potentiostat in \cite{VCO_SENSE2}. We argue that a key advantage for these systems is that supply voltage is utilized more effectively from a fundamental aspect without being impeded by linearity. This allows an aggressive reduction in power dissipation. Furthermore realizing band limiting filters outside the instrumentation loop prevents KT/C relations from limiting the compactness of the system.

{{< figure src="/images/iscas2017/TDI.svg" title="Figure 1: TD system architecture for BMI systems where both the instrumentation and processing sub-blocks manipulate time-encoded signals to perform feature extraction from bio-signal recordings. This paper will present the instrumentation sub-system." width="500" >}}

This work presents a TD instrumentation topology for neural recording applications that targets ultra-low power operation and a highly compact implementation for miniaturized multichannel systems. Based on previous work in 4, which uses a third order feedback loop to digitize the signal asynchronously, this realization exhibits only one dominant pole and thus reduces the resources spent on achieving stability. While 4 can achieve a smaller footprint than the topology presented here the feed-forward structure cannot achieve high dynamic range as the high frequency quantization noise and out of band distortion from the counter cannot be suppressed without additional filtering. Our current efforts aim to realize the system architecture illustrated in Fig. 1. This approach aims to use the efficiency of TD processing techniques to extract features from neural recordings which promise an order of magnitude improvement over conventional implementations 5. In fact such a TD approach has already been applied to acquire wireless radio signals with a versatile activity dependent power dissipation 6. However there will be numerous scenarios where these asynchronous structures must interface with clocked systems to effectively perform calibration or other discrete time analysis. For this reason we specifically consider how the class of circuits presented here and in 5 can seamlessly realize a more traditional clocked \(\Delta\Sigma\) oversampling loops. This is relevant because time encoded signals have a very broad bandwidth of several GHz making it unreasonable to characterize or communicate a large number of them off chip. Finding effective means to convert such time-encoded signals to a clocked and quantized equivalent is crucial for fully integrated systems. This paper is organized as follows; First the basic structure of the proposed circuit is introduced in Sec. \ref{Sec:architecture} which motivates the general topology chosen here. Then in Sec. \ref{Sec:circuit} we shall detail the circuit level implementation and identify important design considerations. Finally Sec. \ref{Sec:simulation} presents the preliminary simulation results which leads to the conclusions in Sec. \ref{Sec:conclusion}.

3 Time-Domain Instrumentation Architecture

The proposed configuration is shown in Fig. 2 as a single ended equivalent of the fully differential system implemented here. The neural signals picked up on VIN include local field potentials of several millivolt and extracellular action potential from nearby neural tissue with 100 microvolt amplitudes where the total bandwidth of interest here is from near DC to 6 kHz1. Using a reference electrode these components are first chopped to a higher frequency fchp at 100 kHz. This is then passed into a capacitive network that allows the digital output signal Q to be directly feedback onto the input after using a XOR gate realize chopper modulation. The intermediate node VG is tapped off to a transconductive cell which advances/recedes the phase of a multi stage ring oscillator structure after the signal is chopped back to the base band. By referencing the phase of this oscillator to another oscillator we can realize the time-encoded digital signal that encodes the phase difference \pphi\dDelta as pulse width modulated information. Then we may either quantize this signal with a register in the time domain or simply buffer it onto Q to provide asynchronous feedback without time quantization. Now the near-DC aggressors such as off-set and flicker noise from the transconductor are neutralized using the feedback loop through Rp which presents a high pass response for signals below the chopper frequency on node VG. If we simply take the transconductor, oscillator, and digital logic to represent some kind of integrator then the high level instrumentation topology reduces to a relatively simple chopper stabilized circuit. Finally in the synchronous case where time is quantized, the output Q is passed into a second order cascaded integratorcomb (CIC) filter that decimates oversampled signals into discrete samples. Otherwise \pphi\dDelta is simply passed onto other TD processing structures which is outside the scope of this paper.

{{< figure src="/images/iscas2017/TDA.svg" title="Figure 2: The proposed chopper stabilized instrumentation system that uses a ring oscillator to realize TD based first order integration with respect to the output phase difference which is then quantized in time and decimated using a CIC filter. " width="500" >}}

We highlight the fact that the high pass feedback uses the output referred signal. This has a profound effect on how the noise profile of Rp appears at the chopper frequency when referred to the input. In fact if Rp was tied to a biasing voltage to set VG then resistor's noise will have a rms power of \(√{kT/C_{IN}}\) at the input that is up modulated and will corrupt the linearity of the integrator. This can be very substantial because CIN is reduced to 100 fF in order to boost the input impedance that depends on the chopping frequency as RIN=1/(fchpCIN). This feedback configuration reduces this noise component as a function of closed loop gain Acl which is maximized as we realize all the necessary signal gain in a single analogue processing stage. Moreover the chopper stabilization allows an aggressive reduction on input transistor size which is important for reducing parasitics on the node VG. Any parasitic loading here can prevent further reduction in CIN because it would degrade the noise performance.

The PWM signals \pphi\dDelta, Q, and X in Fig. 2 are illustrated in bold because they represent multiphase time-encoded signals by simultaneously taping off multiple phases inside the oscillator and using them in parallel. The capacitive feedback network sums these signals to acquire an analogue equivalent where the number of amplitude quantization levels is 1+N for N phases used in parallel. This is an important point because the error signal on VG is a function of the supply voltage as VDD/(N Acl) which must lie well within the linear range of the transconductor. Adding to the fact that supply noise is also inversely proportional to Acl reveals that although we can reduce power by reducing VDD, this parameter is tightly coupled to other performance requirements. In fact using the definition of noise efficiency factor NEF from 7 one can derive the following expression to predict the system power Psys for noise limited systems:

P_{sys} = V_{DD} \omega_{3dB} \frac{kT U_T}{e^2_{in}} NEF^2

Eq. 1 uses \(\omega\)3dB, kT, UT, and e²in as the signal bandwidth in radians, Boltzmann energy, thermal voltage, and input referred noise power. This expression ignores any constraints due to sampling noise or capacitor sizing that could suffer as the supply voltage decreases. However it clearly illustrates our motivation for reducing power through VDD which is allowed in this fashion only if no band limiting behavior is required from the instrumentation loop. In fact an important contribution here is that a near ideal NEF is achieved for the TD structure using the proposed implementation.

4 Circuit Implementation

{{< figure src="/images/iscas2017/TDS.svg" title="Figure 3: Transistor level implementation of a) the low noise amplifier and b) high pass filter circuit that removes flicker noise and off-set. A legend is presented in gray showing how a) & b) relate to Fig. 2 and the chopper in a) is omitted for clarity." width="500" >}}

The fully differential analogue part of this system is shown in Fig. 3. Here two analogue structures are shown, one for signal amplification that integrates on the differential phase \pphi\dDelta which we shall consider the amplifier. The second structure presents the pseudo resistor that rejects the low frequency aggressors that are being up modulated onto the output Q which we will refer to as the high pass filter. The amplifier is composed of a complementary transcondutor to boost the noise efficiency and is loaded by two ring oscillators that are biased with a current IX that is 16x smaller than IB such that the input referred noise from the oscillator to VG is greatly reduced. A XOR gate is used to compute the phase difference of the two oscillators because it does not have a discontinuity in its phase to PWM characteristic. The floating ground of both oscillators is tied together to reject common mode noise and minimize any coupling to the analogue power supplies.

Actually, the high pass filter not only performs feedback but also determines the common mode input voltage on VG. This mechanism is used to set the common mode on VX to VCM using the transistors M5-M8. This is important because the oscillator needs to run in the middle of the rail to allow more efficient conversion from the small internal voltage oscillation of 300 mV to a digital signal with full swing. The reason a current DAC is used to drive the pseudo resistors is because the linearity of these devices is important and Rp can only handle \(\pm\)100 mV if simple diode connected devices are used. This is why CL is introduced to provide some filtering and minimize distortion from the high frequency PWM feedback. Note that because all transistors will use subthreshold operation the threshold voltage VTH difference between the input PMOS pair M3-M4 and the high VTH devices M9-M10 will represent voltage headroom of the PMOS current bias which should at least be 100 mV.

Now there is quite a significant impact from using an oscillator as load for the amplifier structure in this particular fashion. It may be obvious that there are no high impedance analogue nodes in this configuration that could introduce undesirable poles. But more importantly we do not need to provide extra voltage headroom or a second gain stage to let our output signal vary with maximum amplitude. This proposed implementation allows both simplicity and high power efficiency. In this case the oscillator mostly reuses the VTH voltage headroom needed by the NMOS input transistors. This raises an interesting question; what limits the required voltage headroom for this circuit? Typically the complementary structure necessitates that the source drain voltage of the current bias transistors and differential pairs is sufficient to provide good channel resistance. However there is another component with regard to the noise generated by the oscillator that should be considered in terms of the oscillator voltage overhead VRO. Consider the noisy charge induced as sampling noise from one of the oscillators on each inverter gate capacitance Cgate before the up/down transition as residue from the previous cycle. This can be represented by an equivalent noisy current source i²smp using the oscillator frequency fosc as formulated in Eq 2.

i^2_{smp} = 2N f^2_{osc} kT C_{gate}

Referring this component to VG as input referred voltage noise equivalent v²smp requires fosc to be represented in terms of total charge dissipated each cycle fosc=IX/(2N VRO Cgate). Taking the transconductance of the amplifier approximately as Gm\(\approx\)2IB/(\(\eta\) UT) will resolve the expression as in Eq 3 using \(\eta\) and UT as slope factor and thermal voltage.

v^2_{smp} = \frac{i^2_{smp}}{Gm^2} = \frac{2 kT}{P_{osc}} \left( \eta U_T \frac{I_X}{2 I_B} \right)^2

This result may be surprising in some sense because indicates the oscillator should dissipate a strict amount of energy to avoid this noise component from being significant. The only way to do this is by increasing the VRO because increasing its bias current IX will in fact degrade the input referred noise. Also notice that the actual capacitive load of the oscillator does not impact the thermal noise floor although it is directly related to the bandwidth of operation. In this particular case we configured the oscillation frequency to be around 300 kHz after optimization which implies that the effective frequency is around 1.5 MHz given the 5 phases. For this reason when the CIC filter is enabled we use a 3 MHz system clock for time quantization and an over sampling ratio of 128. This implies that the instrumentation bandwidth will be limited to 11 kHz sampled at exactly 22 kS/s.

5 Simulation Results

The presented implementation was fabricated using the commercially available TSMC 65 nm CMOS LP MS RF technology (1P9M 6X1Z1U RDL). This system can operate at 0.5 V by extensively using the different VTH process options for the standard transistor. Fig. 4 shows the floor plan and final fabricated device and the MIM capacitors covering the active area. A compact configuration was achieved with a 0.006mm² footprint where future work could potentially share the biasing and digital filter resources. Our preliminary results consist of post layout noise simulations that should have a good degree of precision for predicting the expected bench top measurements. Fig 5 shows the ring oscillator output with 200 mV amplitude in the middle of the rail that is fed to the XOR gate to produce the full swing digital signal \pphi\dDelta. Fig 6 shows the output of the circuit in the frequency domain before & after decimation by the CIC filter. We can observe that linearity is quite easily achieved due to the exceptional loop gain from the TD integrator and because all analogue nodes exhibit a small signal swing. The performance summary in Table 1 compares this work to recently published instrumentation systems using the 65nm or 90nm technology node. Notice that a comparable noise performance is achieved for a similar if not better input dynamic range while the NEF & area characteristics are respectable given the extra capability that signal quantization is also performed. The input resistance due to chopping for this configuration is estimated to be 57 M\(\Omega\) which will be sufficient for most integrated neural recording electrodes.

{{< figure src="/images/iscas2017/LAY.svg" title="Figure 4: The floor plan and micro photograph of the fabricated TD instrumentation prototype. " width="500" >}}

{{< figure src="/images/iscas2017/DIG2.svg" title="Figure 5: Simulation results showing a) the two ring oscillator outputs & b) the respective phase difference signal due to a 5 mVpp sinusoidal signal at 2 kHz at the input of the instrumentation system." width="500" >}}

{{< figure src="/images/iscas2017/SPC2.svg" width="500" >}} {{< figure src="/images/iscas2017/SPC.svg" title="Figure 6: Spectral characteristics of a) the \(\Delta\Sigma\) modulator and b) the CIC filter output due to a 2 kHz 5 mVpp sinusoidal input showing a precision of 54.8 dB SINAD and no visible harmonics due to distortion." width="500" >}}

Table 1: Performance summary and comparison with state of the art.

Specification This Work 8 9 10
Modality Time Volt. Volt. Volt.
Technology 65nm 90nm 65nm 65nm
Supply [V] 0.5 1 1 1
Supply [A] 2.65\mmu (\dagger) 2.85\mmu 1n 3.28\mmu
Gain [dB] 46 59 32 52
Bandwidth [Hz] 11 k 10.5 k 370 8.2 k
SINAD [dB] 53 (>)40 57 (>)40
IRN [Vrms] 3.8\mmu 3.04\mmu 26\mmu 4.13\mmu
NEF 2.2 1.93 2.1 3.19
Area [mm²] 0.006(\star) 0.137 0.168(\star) 0.042
\(^\star\) Includes ADC area. \(^\dagger\) Includes axillary and biasing circuits.

6 Conclusion

This work realizes a chopper stabilized \(\Sigma\Delta\) modulator right at the sensor interface using a time-domain topology for ultra low voltage operation. The proposed oscillator based instrumentation circuit addresses a number of the challenges associated with instrumentation using nano metre CMOS technologies. This system has a power budget of 810 nW and a compact silicon foot print of 0.006 mm². Moreover this system can achieve a NEF of 1.64 while including the power dissipation due to quantizing the signal with 8.8 effective number of bits at 22 kS/s.

7 Acknowledgment

This work was supported by EPSRC grants EP/K015060/1 and EP/M020975/1.

Refernces:


  1. H.Kassiri etal., ''Battery-less tri-band-radio neuro-monitor and responsive neurostimulator for diagnostics and treatment of neurological disorders,'' IEEE J. Solid-State Circuits, vol.51, no.5, pp. 1274--1289, May 2016. [Online]: http://dx.doi.org/10.1109/JSSC.2016.2528999 ↩︎

  2. B.Vigraham, J.Kuppambatti, and P.R. Kinget, ''Switched-mode operational amplifiers and their application to continuous-time filters in nanoscale cmos,'' IEEE J. Solid-State Circuits, vol.49, no.12, pp. 2758--2772, December 2014. [Online]: http://dx.doi.org/10.1109/JSSC.2014.2354641 ↩︎

  3. W.Jiang etal., ''A ±50mv linear-input-range vco-based neural-recording front-end with digital nonlinearity correction,'' in IEEE Proc. ISSCC, January 2016, pp. 484--485. [Online]: http://dx.doi.org/10.1109/ISSCC.2016.7418118 ↩︎

  4. M.Elia, L.B. Leene, and T.G. Constandinou, ''Continuous-time micropower interface for neural recording applications,'' in IEEE Proc. ISCAS, May 2016. ↩︎

  5. L.B. Leene and T.G. Constandinou, ''A 0.45v continuous time-domain filter using asynchronous oscillator structures,'' in IEEE Proc. ICECS, December 2016, pp. 49--52. [Online]: http://dx.doi.org/10.1109/ICECS.2016.7841129 ↩︎

  6. M.Kurchuk etal., ''Event-driven ghz-range continuous-time digital signal processor with activity-dependent power dissipation,'' IEEE J. Solid-State Circuits, vol.47, no.9, pp. 2164--2173, September 2012. [Online]: http://dx.doi.org/10.1109/JSSC.2012.2203459 ↩︎

  7. M.S.J. Steyaert and W.M.C. Sansen, ''A micropower low-noise monolithic instrumentation amplifier for medical purposes,'' IEEE J. Solid-State Circuits, vol.22, no.6, pp. 1163--1168, December 1987. [Online]: http://dx.doi.org/10.1109/JSSC.1987.1052869 ↩︎

  8. T.Yang and J.Holleman, ''An ultralow-power low-noise cmos biopotential amplifier for neural recording,'' IEEE Trans. Circuits Syst. II, vol.62, no.10, pp. 927--931, Oct 2015. [Online]: http://dx.doi.org/10.1109/TCSII.2015.2457811 ↩︎

  9. P.Harpe etal., ''A 0.20$ $mm$^2$ 3$ $nw signal acquisition ic for miniature sensor nodes in 65 nm cmos,'' IEEE J. Solid-State Circuits, vol.51, no.1, pp. 240--248, Jan 2016. [Online]: http://dx.doi.org/10.1109/JSSC.2015.2487270 ↩︎

  10. K.A. Ng and Y.P. Xu, ''A low-power, high cmrr neural amplifier system employing cmos inverter-based otas with cmfb through supply rails,'' IEEE J. Solid-State Circuits, vol.51, no.3, pp. 724--737, March 2016. [Online]: http://dx.doi.org/10.1109/JSSC.2015.2512935 ↩︎