

# ECE 122A VLSI Principles Lecture 10 

Prof. Kaustav Banerjee
Electrical and Computer Engineering University of California, Santa Barbara

E-mail: kaustav@ece.ucsb.edu

# Designing Combinational Logic Circuits 

## Static Vs Dynamic Circuits

## Static Circuits

At every point in time (except during the switching transients) each gate output is connected to either $V_{D D}$ or $V_{s s}$ via a low-resistive path.

The outputs of the gates assume at all times the value of the Boolean function, implemented by the circuit (ignoring, once again, the transient effects during switching periods).

This is in contrast to the dynamic circuit class, which relies on temporary storage of signal values on the capacitance of high impedance circuit nodes.

Resulting gate is simpler and faster, but increased sensitivity to noise....

## Static and Dynamic CMOS Circuit Families

- Static:
- Complementary CMOS
- (robustness, low power, large fan-in expensive in terms of area and performance)
- Ratioed Logic: pseudo-NMOS, differential cascode voltage switch logic (DCVSL)
- (simple and fast at the expense of reduced NM and static power)
- Pass-Transistor Logic (Transmission Gate)
- attractive for specific circuits: MUX, XOR-dominated logic such as Adders)
- Dynamic:
- good for fast and complex gates, design process is harder due to parasitic effects, leakage puts an upper limit on the operating frequency of the circuit
- Domino Logic
- np-CMOS

Which style is best?
......depends on ease of design, performance, power, area and robustness.

## Complementary CMOS Logic

- Full rail-to-rail swing; high noise margins
- Logic levels not dependent upon the relative device sizes; ratioless
- Always a path to Vdd or Gnd in steady state; low output impedance
- Extremely high input resistance; nearly zero steady-state input current
$\square$ No direct path steady state between power and ground; no static power dissipation
- Propagation delay function of load capacitance and resistance of transistors


## Static Complementary CMOS



PUN and PDN are dual logic networks

## Dual Networks

- Dual networks: parallel connection in PUN = series connection in PDN, vice-versa
- If CMOS gate implements logic function $F$ :
- PUN implements function $\bar{F}$
- PDN implements function


$$
G=F
$$

## Key Properties of Complementary CMOS Gates: Snapshot

High noise margins
$V_{O H}$ and $V_{O L}$ are at $V_{D D}$ and $G N D$, respectively.
No static power consumption
There never exists a direct path between $V_{D D}$ and $V_{S S}(G N D)$ in steady-state mode.

Comparable rise and fall times:
(under appropriate sizing conditions)

## Analysis of CMOS gates

- Represent "on" transistors as resistors

- Transistors in series $\rightarrow$ resistances in series
- Effective resistance $=2 \mathrm{R}$
- Effective width $=1 / 2 \mathrm{~W}$

Note: 1) As transistor width increases, its ON resistance ( $R$ ) decreases
2) If transistor channel length increases, $R$ increases

## Analysis of CMOS Gates, contd...

- Represent "on" transistors as resistors

- Transistors in parallel $\rightarrow$ resistances in parallel
- Effective resistance $=1 / 2 \mathrm{R}$
- Effective width $=2 \mathrm{~W}$


## Equivalent Inverter

- CMOS gates: many paths to Vcc and Gnd
- Multiple values for $\mathrm{V}_{\mathrm{M}}, \mathrm{V}_{\mathrm{IL}}, \mathrm{V}_{\mathrm{OL}}$, etc
- Different delays for each input combination
-Equivalent inverter
- Represent each gate as an inverter with appropriate device width
- Include only transistors which are ON or switching
- Calculate $\mathrm{V}_{\mathrm{M}}$, delays, etc using inverter equations


## CMOS gates: equivalent inverter

- Represent complex gate as inverter for delay estimation
- Use worst-case delays
- Example: NAND gate
- Worst-case (slowest) pull-up: only 1 PMOS "on"
- Pull-down: both NMOS "on"



## Equivalent Inverter: $V_{M}$

- Example: 2-input NAND gate threshold $\mathrm{V}_{\mathrm{M}}$ Three possibilities:
- A \& B switch together
- A switches alone
- B switches alone

Hint: the VTC curve is data dependent
$\square$ What is equivalent inverter for each case?

## Example: NOR gate

- Find threshold voltage

$V_{M}$ when both inputs switch simultaneously
-Two methods:
- Transistor equations
- Equivalent inverter


## Switch Delay Model



## Input Pattern Effects on Delay



2-input NAND
$\square$ Delay is dependent on the pattern of inputs

- Low to high transition
- both inputs go low
- delay is $0.69 R_{p} / 2 C_{L}$
- one input goes low
- delay is $0.69 R_{p} C_{L}$
- High to low transition
- both inputs go high
- delay is $0.692 R_{n} C_{L}$


## Delay Dependence on Input Patterns



- High to Low Delay depends on the initial state of the internal nodes
- Example: when both inputs transition from 0 to 1 , it is important to know the state of the internal node (between M1 and M2)
- Worst case: when internal node is initially charged up to $\mathrm{V}_{\mathrm{DD}}-\mathrm{V}_{\mathrm{tn}}$. This can be ensured by pulsing the A input from 1-(0-1), while B only makes the 0-1 transition.


## 2-input NAND

Bottom Line: Delay estimation can be a fairly complex affair and requires careful consideration of the internal node caps. and data patterns

## Delay Dependence on Input Patterns

Simulated Low to High delay for a 2-input NAND

time [ps]
Note: worst case L-H delay depends on which input (A or $B$ ) is driven low (due to internal node cap of PDN stack: source of M2)

| Input Data <br> Pattern | Delay <br> (psec) |
| :---: | :---: |
| $A=B=0 \rightarrow 1$ | 69 |
| $A=1, B=0 \rightarrow 1$ | 62 |
| $A=0 \rightarrow 1, B=1$ | 50 |
| $A=B=1 \rightarrow 0$ | 35 |
| $A=1, B=1 \rightarrow 0$ | 76 |
| $A=1 \rightarrow 0, B=1$ | 57 |

NMOS $=0.5 \mu \mathrm{~m} / 0.25 \mu \mathrm{~m}$
PMOS $=0.75 \mu \mathrm{~m} / 0.25 \mu \mathrm{~m}$
$C_{L}=100 \mathrm{fF}$
Kaustav Banerjee

## Revisit Transistor Sizing....

$\square$ Sizing for switching threshold

- All inputs switch together
$\square$ Sizing for delay
- Find worst-case input combination
$\square$ Find equivalent inverter, use inverter analysis to set device sizes


## Recall Transistor Sizing....

## What sizing will lead to a 2:1 equivalent inverter?

For effective $R_{p}=R_{n}$, we need $W_{p}=2 W_{n}$

Since two NMOS are in series, each should have $R_{n} / 2$, hence each NMOS size, $W_{n}=2$

Each PMOS should be such that $R_{p}=$ effective $R_{n}$ (which corresponds to $W_{n}$ =1)

Hence, $W_{p}=2$ (effective $\left.W_{n}\right)=2$

## 2-input NAND



2-input NOR

## Example: complex gate (1)

## Design CMOS gate for this truth table:

\(\left.\begin{array}{|c|c|c|c|}\hline \mathbf{A} \& \mathbf{B} \& \mathbf{C} \& \mathbf{F} <br>
\hline 0 \& 0 \& 0 \& 1 <br>
\hline 0 \& 0 \& 1 \& 1 <br>
\hline 0 \& 1 \& 0 \& 1 <br>
\hline 0 \& 1 \& 1 \& 1 <br>
\hline 1 \& 0 \& 0 \& 1 <br>
\hline 1 \& 0 \& 1 \& 0 <br>
\hline 1 \& 1 \& 0 \& 0 <br>
\hline 1 \& 1 \& 1 \& 0 <br>

\hline\end{array}\right]\)| $\mathrm{F} \cdot(\mathrm{B}+\mathrm{C})$ |
| :---: |

Lecture 10, ECE 122A, VLSI Principles

## Example: complex CMOS gate (1)

Design CMOS gate for this logic function:

$$
\mathrm{F}=\overline{\mathrm{A} \cdot(\mathrm{~B}+\mathrm{C})}
$$

1. Find NMOS pulldown network diagram:

$$
\mathrm{G}=\overline{\mathrm{F}}=\mathrm{A} \cdot(\mathrm{~B}+\mathrm{C})
$$



## Example: complex CMOS gate (1)

## Completed gate:



- What is worse-case pullup delay?

BC

- What is worse-case pulldown delay?

$$
A B \text { or } A C
$$

- Effective inverter for delay calculation:



## Example: complex CMOS gate (2)



Lecture 10, ECE 122A, VLSI Principles
Kaustav Banerjee

## Example: complex CMOS gate (2)



Find the equivalent inverter....

## Transistor Sizing a Complex CMOS Gate

## What sizing will lead to a 2:1 equivalent inverter?

Note: $A=3$ and $B=C=D=6$ will also give a 2:1 inverter but that will give higher output parasitic capacitance (due to larger D)

1. Start with a transistor in the PDN, that is (preferably) isolated (i.e., it can pull down the output node on its own)----D in this case
2. Assign a minimum size to this transistor (W=1 for D)
3. Now identify other paths in the PDN that can also discharge the output node: $A B$ and $A C$ in this case.
4. Size A, B, and C such that: each path will be electrically equivalent to the path through D (i.e., with same resistance), hence $A, B$ and $C$ should have $W=2$
5. Repeat the same for the PUN, keeping in mind that i) effective resistance of any path must equal the effective resistance of any path in the PDN, and ii) PMOS transistors will have to be sized (twice as large in this case) appropriately.

Note: here we ignored internal caps.

## CMOS gate design: summary

$\square$ Designing a CMOS gate:

- Find pulldown NMOS network from logic function or by inspection
- Find pullup PMOS network
- By inspection
- Using logic function
- Using dual network approach
- Size transistors using equivalent inverter
- Find worst-case pullup and pulldown paths
- Size to meet rise/fall or threshold requirements


## Fan-In Considerations



Distributed RC model (Elmore delay)
Assuming all NMOS are equal size $\mathrm{t}_{\mathrm{pHL}}=0.69 \mathrm{R}_{\text {eqn }}\left(\mathrm{C}_{1}+2 \mathrm{C}_{2}+3 \mathrm{C}_{3}+4 \mathrm{C}_{\mathrm{L}}\right)$

Propagation delay deteriorates rapidly as a function of fan-in quadratically in the worst case.

## $t_{p}$ as a Function of Fan-In


$t_{p L H}$ increases linearly with fan-in.....

## $t_{p}$ as a Function of Fan-Out



# All gates <br> have the same drive current 

Slope is a function of
"driving strength"

Note: i) these lines are not going through the origin, ii) NAND and NOR have same intrinsic delay....

## $t_{p}$ as a Function of Fan-In and Fan-Out

$\square$ Fan-in: quadratic due to increasing resistance and capacitance
-Fan-out: each additional fan-out gate adds two gate (N/P FETs) capacitances to $\mathrm{C}_{\mathrm{L}}$

$$
t_{p}=a_{1} F I+a_{2} F^{2}+a_{3} F O
$$

## Common Static CMOS Gate

## Topologies

$\square$ And-Or-Invert (AOI)

- Sum of products Boolean function
- Parallel branches of series connected NMOS
- Or-And-Invert (OAI)
- Product of sums Boolean function
- Series connection of sets of parallel NMOS


## Fast Complex Gates:

## Design Technique 1

- Transistor sizing
- as long as fan-out capacitance dominates
- Progressive sizing (not so simple to layout!)


Distributed RC line
$\mathrm{M} 1>\mathrm{M} 2>\mathrm{M} 3>\ldots>\mathrm{MN}$
(the transistor closest to the output is the smallest: has biggest $R$ )
Can reduce delay by more than $20 \%$; decreasing gains as technology shrinks

## Fast Complex Gates: <br> Design Technique 2

- Transistor ordering
delay determined by time to discharge $\mathrm{C}_{\mathrm{L}}, \mathrm{C}_{1}$ and $\mathrm{C}_{2}$


Critical path is determined by the slowest arriving signal: $\ln 1$
critical path

delay determined by time to discharge $\mathrm{C}_{\mathrm{L}}$

## Fast Complex Gates:

 Design Technique 3$\square$ Alternative logic structures

$\mathrm{F}=\mathrm{ABCDEFGH}$



Using De Morgan's Law...

## Fast Complex Gates: Design Technique 4

- Isolating fan-in from fan-out using buffer insertion



## Fast Complex Gates:

## Design Technique 5

$\square$ Reducing the voltage swing

$$
\begin{aligned}
\mathrm{t}_{\mathrm{PHL}} & =0.69\left(3 / 4\left(\mathrm{C}_{\mathrm{L}} \mathrm{~V}_{\mathrm{DD}}\right) / \mathrm{I}_{\mathrm{DSATn}}\right) \\
& =0.69\left(3 / 4\left(\mathrm{C}_{\mathrm{L}} \mathrm{~V}_{\text {swing }}\right) / \mathrm{I}_{\mathrm{DSATn}}\right)
\end{aligned}
$$

- linear reduction in delay
- also reduces power consumption
$\square$ But the following gate is much slower!
- Or requires use of "sense amplifiers" on the receiving end to restore the signal level (memory design)


## Revise CMOS gate layout....

- Goal: minimum area
- Method
- Minimize diffusion breaks (reduces capacitance on internal nodes)
- Align transistors with common gates above each other in layout (minimizes poly length)
- Group PMOS and NMOS transistors together
- Approach:
- Use Euler path method to find ordering of transistors in layout

