IBM Research, ZRL Analog RF CMOS and Optical … µm X 30 µm Speed 8 GHz Configuration: 16 phases...

30
IBM Research, ZRL I/O Link Technology | October 2004 | © 2004 IBM Corporation Analog RF CMOS and Optical Design Techniques for 10+ Gbps Datacom Trends, Challenges & Solutions Martin L. Schmatz Oct 24 th , 2004

Transcript of IBM Research, ZRL Analog RF CMOS and Optical … µm X 30 µm Speed 8 GHz Configuration: 16 phases...

Page 1: IBM Research, ZRL Analog RF CMOS and Optical … µm X 30 µm Speed 8 GHz Configuration: 16 phases to 1 phase output 6-to-1 configuration area: 30 ... Propagation length [cm] λ =

IBM Research, ZRL

I/O Link Technology | October 2004 | © 2004 IBM Corporation

Analog RF CMOS and Optical Design Techniques for 10+ Gbps DatacomTrends, Challenges & Solutions

Martin L. SchmatzOct 24th, 2004

Page 2: IBM Research, ZRL Analog RF CMOS and Optical … µm X 30 µm Speed 8 GHz Configuration: 16 phases to 1 phase output 6-to-1 configuration area: 30 ... Propagation length [cm] λ =

2

IBM Research

I/O Link Technology | © 2004 IBM Corporation

Agenda

• Introduction

• Electrical I/O

• Optical I/O

• Q&A

Page 3: IBM Research, ZRL Analog RF CMOS and Optical … µm X 30 µm Speed 8 GHz Configuration: 16 phases to 1 phase output 6-to-1 configuration area: 30 ... Propagation length [cm] λ =

3

IBM Research

I/O Link Technology | © 2004 IBM Corporation

Page 4: IBM Research, ZRL Analog RF CMOS and Optical … µm X 30 µm Speed 8 GHz Configuration: 16 phases to 1 phase output 6-to-1 configuration area: 30 ... Propagation length [cm] λ =

4

IBM Research

I/O Link Technology | © 2004 IBM Corporation

Page 5: IBM Research, ZRL Analog RF CMOS and Optical … µm X 30 µm Speed 8 GHz Configuration: 16 phases to 1 phase output 6-to-1 configuration area: 30 ... Propagation length [cm] λ =

5

IBM Research

I/O Link Technology | © 2004 IBM Corporation

System level considerationsFundamental 'game changes':

Bandwidth: I/O bandwidth is becoming the limiting factor to leverage CPU performance !

Size/Distance: The critical I/O link distance is between 2mm for on-MCM links and 30m between racks.

Power/area/C4's: Processor I/O power consumption, chip area and available numbers of C4 terminals are most limiting factors

For the future, several required I/O classes can be identifiedOn - MCMOn - Board (zero - one connector)Backplane (1m, two connectors)Shelf-to-shelf, rack-to-rack

Source: The Earth Simulator

# of package pins

Pentium 1 Vitals Summary TableIntroduction date: March 22, 1993Process: 0.8 micronTransistor Count: 3.1 millionClock speed at introduction: 60 and 66 MHzCache sizes: L1: 8K instruction, 8K data

Page 6: IBM Research, ZRL Analog RF CMOS and Optical … µm X 30 µm Speed 8 GHz Configuration: 16 phases to 1 phase output 6-to-1 configuration area: 30 ... Propagation length [cm] λ =

6

IBM Research

I/O Link Technology | © 2004 IBM Corporation

Electrical I/O

Page 7: IBM Research, ZRL Analog RF CMOS and Optical … µm X 30 µm Speed 8 GHz Configuration: 16 phases to 1 phase output 6-to-1 configuration area: 30 ... Propagation length [cm] λ =

7

IBM Research

I/O Link Technology | © 2004 IBM Corporation

12.5 GByte/s RX I/O Macro: 10 lanes @ 10 Gbps with scrambling and FEC12.5 GByte/s RX I/O Macro: 10 lanes @ 10 Gbps with scrambling and FEC

• 10 GByte/s on 5 mm2 area with < 1.5 W @ 1V power supply

• Layout with stackable footprint

RF PLL

Clock buffers

Data logic (ECC, de-skew, scrambling, training….)

Data output @ ¼ rate = 2.5 - 3.2 GHz bus clock

32+1 (40 raw)

BIST interface

CDR logic PR Control

RX frontend

Phase rotator

Quarter rate data from individual channels

CDR logicPR Control

RX frontend

Phase rotator

Quarter rate data from individual channels

Clock buffers

RF PLL

Clock buffers

Data logic (ECC, de-skew, scrambling, training….)

Data output @ ¼ rate = 2.5 - 3.2 GHz bus clock

32+1 (40 raw)

BIST interface

CDR logic PR Control

RX frontend

Phase rotator

Quarter rate data from individual channels

CDR logic PR Control

RX frontend

Phase rotator

Quarter rate data from individual channels

CDR logicPR Control

RX frontend

Phase rotator

Quarter rate data from individual channels

Clock buffers

10 Gbps inputs10 Gbps inputs

Page 8: IBM Research, ZRL Analog RF CMOS and Optical … µm X 30 µm Speed 8 GHz Configuration: 16 phases to 1 phase output 6-to-1 configuration area: 30 ... Propagation length [cm] λ =

8

IBM Research

I/O Link Technology | © 2004 IBM Corporation

12.5 GByte/s RX I/O Macro C4 “4 on 8” Footprint12.5 GByte/s RX I/O Macro C4 “4 on 8” Footprint

RF PLL

Clock buffers

Data logic (ECC, de-skew, scrambling, training….)

Data output @ ¼ rate = 2.5 - 3.2 GHz bus clock

32+1 (40 raw)

BIST interface

CDR logic PR Control

RX frontend

Phase rotator

Quarter rate data from individual channels

CDR logicPR Control

RX frontend

Phase rotator

Quarter rate data from individual channels

Clock buffers

SignalAnalog GNDAnalog VDDDigital GNDDigital VDDOther (Ref..)

2.8448mm

0.91

44m

m

One channel analog front-endShared RF PLLPhase rotators

CDR logic loopMacro level logic

10 GByte/s

Chip photo for 90nm CMOS SOI

Page 9: IBM Research, ZRL Analog RF CMOS and Optical … µm X 30 µm Speed 8 GHz Configuration: 16 phases to 1 phase output 6-to-1 configuration area: 30 ... Propagation length [cm] λ =

9

IBM Research

I/O Link Technology | © 2004 IBM Corporation

Game changers: I/O terminal to I/O device/macro wiringGame changers: I/O terminal to I/O device/macro wiringC4 to I/O wiring:• Up to 3 Gbps: Short on-chip data wiring is allowed but total I/O BW is limited

due to slow speed of the lanes• More BW is obtained by optimizing the speed (= data-rate) for each C4. • BUT: The on-chip wiring distance between C4 terminal and I/O circuits has to

be optimized for a jitter budget closure

C4

High-speed I/O macro owns C4 square(one channel; the higher the data-rate, the larger the area)

Non I/O function area

> 3~4 Gbps – 20 Gbps C4

I/O macro(many channels)

Non I/O function area

<= 1 Gbps

Long on-chip wire

C4

I/O macro(many channels)

2 – 3~4 Gbps

Short on-chip wire

Minimum on-chip wire

Consequences:• The I/O macro and the C4 I/O terminals have to be placed in close proximity• If the I/O has to own the area around the C4 terminals, the optimum data-rate

is such that the area is just filled

Page 10: IBM Research, ZRL Analog RF CMOS and Optical … µm X 30 µm Speed 8 GHz Configuration: 16 phases to 1 phase output 6-to-1 configuration area: 30 ... Propagation length [cm] λ =

10

IBM Research

I/O Link Technology | © 2004 IBM Corporation

Game changers: I/O area & speed vs C4 pitch”What if” question: Transition from “4 on 8” C4 pitch (200um) to “1 on 2” (50um)

Game changers: I/O area & speed vs C4 pitch”What if” question: Transition from “4 on 8” C4 pitch (200um) to “1 on 2” (50um)

I/O area:• Up to 3 Gbps: Minimum area ‘somewhere’ on chip (see previous chart)• Above 3 Gbps: I/O macro owns area beneath a C4 square

“1 on 2” C4

High-speed I/O macro(one channel, ~10-12.5 Gbps)

Non I/O function area

“1 on 2” C4: 4-16x more pins/area

Power/area optimized medium-speed I/O macro(one channel, ~5-6 Gbps)

C4

High-speed I/O macro(one channel)

Non I/O function area

Conventional C4:200 um pitch

>2x throughputat ½ data rate

Consequences:• C4 pitch and CMOS ft number plus target link distance determine the

optimum I/O data-rate.• Narrow C4 pitch plus medium-speed link design enhance aggregate

chip throughput

Page 11: IBM Research, ZRL Analog RF CMOS and Optical … µm X 30 µm Speed 8 GHz Configuration: 16 phases to 1 phase output 6-to-1 configuration area: 30 ... Propagation length [cm] λ =

11

IBM Research

I/O Link Technology | © 2004 IBM Corporation

Area/performance example: CMOS SOI 90nm Phase Rotator Layout Comparison

CML TypeActive area:76 µm X 61 µm

Speed 13 GHz

Configuration:6 phases input to 1 phase output

Mux3:1

Mux3:1

Interpolator

Weight 1..8

Weight 1..8

Clock OutputBuffer

Mux3:1

Mux3:1

Interpolator

Weight 1..8

Weight 1..8

Clock OutputBuffer

Preselector1 Preselector2 Preselector3 Preselector4

Second stage mux1

Second stage mux2

Interpolator Buff

er

CMOS TypeActive area:80 µm X 30 µm

Speed 8 GHz

Configuration:16 phases to 1 phase output

6-to-1 configuration area: 30 µm x 30 µm

0.2 x area@ 0.6 x speed⇒ 3x bandwidth per area improvement

Area for equivalent function

Page 12: IBM Research, ZRL Analog RF CMOS and Optical … µm X 30 µm Speed 8 GHz Configuration: 16 phases to 1 phase output 6-to-1 configuration area: 30 ... Propagation length [cm] λ =

12

IBM Research

I/O Link Technology | © 2004 IBM Corporation

Area shrinkage demonstrator: Measured 10 Gbps operationMeasurement setup:

delay line

mux (3x)1

1

1

data even @ 5Gbps

data odd @ 5Gbpsclock out

VDD

Control bits

10101010

11001100

1101100011011000

10101010

11001100

1101100011011000

data even @ 5 Gbps

data odd @ 5 Gbps

3

10 Gbps bit pattern

1101100011011000

bit patterngenerator

1010101010101010half rate clock

@ 5 GHz

data @ 10 Gbps

2

2

Measured outputs:

data even:

data odd:

clock out:

200ps

Page 13: IBM Research, ZRL Analog RF CMOS and Optical … µm X 30 µm Speed 8 GHz Configuration: 16 phases to 1 phase output 6-to-1 configuration area: 30 ... Propagation length [cm] λ =

13

IBM Research

I/O Link Technology | © 2004 IBM Corporation

Optical I/O

Page 14: IBM Research, ZRL Analog RF CMOS and Optical … µm X 30 µm Speed 8 GHz Configuration: 16 phases to 1 phase output 6-to-1 configuration area: 30 ... Propagation length [cm] λ =

14

IBM Research

I/O Link Technology | © 2004 IBM Corporation

Equalization

0 0.5 1 1.5 2 2.5 3 3.5 4-1

-0.8

-0.6

-0.4

-0.2

0

0.2

0.4

0.6

0.8

1

Time in UI(=80ps)

Nor

mal

ized

Vol

tage

Unequalized eye diagram

0 0.5 1 1.5 2 2.5 3 3.5 4-1

-0.8

-0.6

-0.4

-0.2

0

0.2

0.4

0.6

0.8

11 postcursor taps

UI

0.79127 -0.20873

No Equalization

H(z)=1

No Equalization

H(z)=1

1tap Equalization

H(z)=0.79 – 0.21*z-1

Remark: Only minor improvement in jitter for higher order FIR filters (for this channel)

1tap Equalization

H(z)=0.79 – 0.21*z-1

Remark: Only minor improvement in jitter for higher order FIR filters (for this channel)

8-inch FR4 Board (measured S-parameters)

HyperBGA Package (TX & RX)

700fF for ESD/C4(TX & RX)

~-10dB attenuation @ 5GHz

~-10dB attenuation @ 5GHz

Channel S21 response:

Page 15: IBM Research, ZRL Analog RF CMOS and Optical … µm X 30 µm Speed 8 GHz Configuration: 16 phases to 1 phase output 6-to-1 configuration area: 30 ... Propagation length [cm] λ =

15

IBM Research

I/O Link Technology | © 2004 IBM Corporation

I/O link distance enhancementShort electrical CMOS link design for optimum power & areaOptical extension for optimum distance

Link TX/RX pair

Page 16: IBM Research, ZRL Analog RF CMOS and Optical … µm X 30 µm Speed 8 GHz Configuration: 16 phases to 1 phase output 6-to-1 configuration area: 30 ... Propagation length [cm] λ =

16

IBM Research

I/O Link Technology | © 2004 IBM Corporation

Our Technology-Approach

Electronics: Cable Printed Circuit Optics: Fiber Integrated Waveguides

Page 17: IBM Research, ZRL Analog RF CMOS and Optical … µm X 30 µm Speed 8 GHz Configuration: 16 phases to 1 phase output 6-to-1 configuration area: 30 ... Propagation length [cm] λ =

17

IBM Research

I/O Link Technology | © 2004 IBM Corporation

FR4

Lowercladding

Corelayer

Uppercladding

Waveguide Manufacturing

Woven glassfiber bundles

Cu layer

Epoxyresin

FR4 substrateFR4 substrate

WG patterning

Page 18: IBM Research, ZRL Analog RF CMOS and Optical … µm X 30 µm Speed 8 GHz Configuration: 16 phases to 1 phase output 6-to-1 configuration area: 30 ... Propagation length [cm] λ =

18

IBM Research

I/O Link Technology | © 2004 IBM Corporation

Propagation losses

0 1 2 3 4 5 6 7 8 9 10 11 12 13

0123456789

10111213

λ = 850 nm

λ = 980 nm

λ = 1.55 µm

0.32 dB/cm

1.04 dB/cm

0.12 dB/cm

0.04 dB/cm

Prop

agat

ion

Loss

[dB

]

Propagation length [cm]

λ = 1.31 µm

70 µm 50 µm 30 µm

Experimental resultsConsistent losses for WG width of 30, 50, and 70 µm Clearly increased losses in the 2nd and 3rd telecom window

Increased losses come from 2nd and 3rd

overtones of hydrocarbon (C-H) bond vibrations (absorption peak @ 3.39 µm)

Possible solution: Fluorination, i.e.replacing C-H groups by C-F groups

0.04 dB/cm loss @ 850nm

Page 19: IBM Research, ZRL Analog RF CMOS and Optical … µm X 30 µm Speed 8 GHz Configuration: 16 phases to 1 phase output 6-to-1 configuration area: 30 ... Propagation length [cm] λ =

19

IBM Research

I/O Link Technology | © 2004 IBM Corporation

Bending LossesMeasurement results

R = 11 mm R = 11 mm

Mask layout

Photograph @ λ = 640 nm

Topview

5 10 15 20 250.30

0.35

0.40

0.45

0.50

0.55

0.60

0.65

0.70

0.75

0.80

0.85

0.90

0.95

1.00

Wav

egui

de S

pira

l

18mm

0.2 dB0.1 dB

0.5 dB

1.0 dB

2.0 dB

3.0 dB

Nor

m. T

rans

mis

sion

per

180

o Ben

ding

Bending Radius [mm]

TopView:

Micrograph of 50-µm-WG bends

0.1 dB loss per 180o-bending of radius R = 20 mm

Page 20: IBM Research, ZRL Analog RF CMOS and Optical … µm X 30 µm Speed 8 GHz Configuration: 16 phases to 1 phase output 6-to-1 configuration area: 30 ... Propagation length [cm] λ =

20

IBM Research

I/O Link Technology | © 2004 IBM Corporation

Crossing Losses Mask layout Purpose

Waveguide channel routing

0 1 2 3 4 5 6 7 80.000.010.020.030.040.050.060.070.080.090.100.110.120.130.140.150.16

Cro

ssin

g Lo

ss [

dB]

# of CrossingsTop view

Micrograph of 30-µm-crossing

Measurement results

WG width = 50 µm

Loss per 90o-crossing: 0.02 dB (@ 850 nm)(Example: 100 crossings add up to only 2 dB)

Page 21: IBM Research, ZRL Analog RF CMOS and Optical … µm X 30 µm Speed 8 GHz Configuration: 16 phases to 1 phase output 6-to-1 configuration area: 30 ... Propagation length [cm] λ =

21

IBM Research

I/O Link Technology | © 2004 IBM Corporation

Y-SplittersMicrograph of 50-µm-splitters

Required for non point-to-point links

PurposeTop view

50 µm

Light to beincoupled

Scre

enMeasurement scheme Experimental results

λ=640nm

Branch 1

Branch 2

0.10 dB excess loss for 50%:50% splitting (@ λ = 850 nm)

0.17 dB excess loss for use as combiner (@ λ = 850 nm)

Page 22: IBM Research, ZRL Analog RF CMOS and Optical … µm X 30 µm Speed 8 GHz Configuration: 16 phases to 1 phase output 6-to-1 configuration area: 30 ... Propagation length [cm] λ =

22

IBM Research

I/O Link Technology | © 2004 IBM Corporation

Waveguide Density

48 parallel WG channels

(“pseudo-standard”)

1 x

2.5 x

4 x

At 10 Gbps channel modulation, this deliversan aggregate data density of 1 TByte/s per inch

Increase in waveguide channel density

Page 23: IBM Research, ZRL Analog RF CMOS and Optical … µm X 30 µm Speed 8 GHz Configuration: 16 phases to 1 phase output 6-to-1 configuration area: 30 ... Propagation length [cm] λ =

23

IBM Research

I/O Link Technology | © 2004 IBM Corporation

Experiments: 12.5 Gbps Signal over WG Spiral

Open eye diagrams at 12.5 Gbps through 100 cm waveguide spiral modal dispersion & loss not critical

Page 24: IBM Research, ZRL Analog RF CMOS and Optical … µm X 30 µm Speed 8 GHz Configuration: 16 phases to 1 phase output 6-to-1 configuration area: 30 ... Propagation length [cm] λ =

24

IBM Research

I/O Link Technology | © 2004 IBM Corporation

Classification of coupling approachesAssumption: Electrical tracks and optical waveguides are both parallel to board surface.

OE-components emit/accept light perpendicular to the chip surface.

Consequence: A 90°-bend is required in this E-O path.

Question: Is this task moved to the optical domain or to the electrical domain?

• more "electronics-friendly" (standard package and orientation), but more complex optical part (especially for 2D)

• separation of active OE-component and passive board (repairability); board (w/o OE) is sealed; interface to optics is on surface (servicability)

• etc.

• more "optics-friendly" (effort in optical domain minimized), but more complex electrical part (flex)

• closer interlock between passive and active parts; board (w/o OE) has open slot; interface to optics is within board

• etc.

For an honest and realistic evaluation, a system-level view is crucial.

At this point in time, both approaches have to be considered in more detail.

Page 25: IBM Research, ZRL Analog RF CMOS and Optical … µm X 30 µm Speed 8 GHz Configuration: 16 phases to 1 phase output 6-to-1 configuration area: 30 ... Propagation length [cm] λ =

25

IBM Research

I/O Link Technology | © 2004 IBM Corporation

Waveguide based approach for I/O link extension

• Approach• Plug-in self-aligned coupler modules• Collimated beam coupling concept

• Features• Simple waveguide structures• Potentially cheap and mass-producible

2 m

m

without lens at 65 mm

with lens at 970 mmVCSEL

Page 26: IBM Research, ZRL Analog RF CMOS and Optical … µm X 30 µm Speed 8 GHz Configuration: 16 phases to 1 phase output 6-to-1 configuration area: 30 ... Propagation length [cm] λ =

26

IBM Research

I/O Link Technology | © 2004 IBM Corporation

I/O Link Technology: Recent progressOptical wave-guides on electrical FR4 test board 4 x 12 channels = 600 Gb/s aggregate data-rate @ 12.5 Gb/s channels

Loss <0.05 dB/cm

30 - 70 um square wave-guide cross-section

85um buried below surface for protection

Organic cavity-down wire-bond ball-grid array packagefor 12.5 Gbps E/O comp.

Wave-guides based on Acrylates(Dupont)

4 polymer stripes with 12 wave-guidesin each stripe

Page 27: IBM Research, ZRL Analog RF CMOS and Optical … µm X 30 µm Speed 8 GHz Configuration: 16 phases to 1 phase output 6-to-1 configuration area: 30 ... Propagation length [cm] λ =

27

IBM Research

I/O Link Technology | © 2004 IBM Corporation

Experiments: On-Board Link at 10 Gbps

Page 28: IBM Research, ZRL Analog RF CMOS and Optical … µm X 30 µm Speed 8 GHz Configuration: 16 phases to 1 phase output 6-to-1 configuration area: 30 ... Propagation length [cm] λ =

28

IBM Research

I/O Link Technology | © 2004 IBM Corporation

Video of Passive PositioningLive 10 Gbps link

Setup runs at 10 Gbps, using OE-module #4

Passive placement of OE-module

OE-module #4 is removed

OE-module #3 is inserted

As soon as #3 has electrical contact, the 10 Gbps eyescan be seen again

Zoomed views of setupScope screen with two 10 Gbps eyes; time axis detail

Pattern generator speed

Zoom to IR-camera view, showing waveguide betweenpackages, illuminated by some scattered light

Page 29: IBM Research, ZRL Analog RF CMOS and Optical … µm X 30 µm Speed 8 GHz Configuration: 16 phases to 1 phase output 6-to-1 configuration area: 30 ... Propagation length [cm] λ =

29

IBM Research

I/O Link Technology | © 2004 IBM Corporation

Summary & ConclusionsCMOS technology at 90 nm node (and follow-on nodes) has the inherent potential for >> 10 Gbps signaling rates.

Technology folks have done their job

First level of packaging (chip I/O terminals) is the most limiting factor for leveraging of the CMOS technology potential.

Packaging folks are on: Denser C4 pitch would help

Equalization concepts help increase signaling rates, but as speed goes up, complex equalization increases area significantly.

Distance enhancements via optical extension is one potential solution

Polymer waveguides have the potential to serve as the next generation wires.

Physicists & chemicists have done their job

Optical packaging and optical backplane connector are not finished yetPackaging folks are on: Cheap optical packages are required

Page 30: IBM Research, ZRL Analog RF CMOS and Optical … µm X 30 µm Speed 8 GHz Configuration: 16 phases to 1 phase output 6-to-1 configuration area: 30 ... Propagation length [cm] λ =

30

IBM Research

I/O Link Technology | © 2004 IBM Corporation

Acknowledgements

This presentation would not have been possible without the excellent inputs from…

various development groups in IBM’s STG division

the IBM Research division, in particular the Zurich…I/O Link Technology team Photonics team “Movie star” Chistoph Berger, ZRL who did most of the videos