10-Bayle-Basis of Theory Behind FIDES - Reliability Division · MTBF ∑ = = = n i i MTBF 1 1 1 λ...

24
Basis of the theory behind FIDES Methodology ASTR Workshop, St. Paul 10 th to 12 th September 2014 Franck Bayle – Thales Avionics Chanthachith Souvanthong – ON Semiconductor

Transcript of 10-Bayle-Basis of Theory Behind FIDES - Reliability Division · MTBF ∑ = = = n i i MTBF 1 1 1 λ...

Basis of the theory behind FIDES Methodology

ASTR Workshop, St. Paul 10th to 12th September 2014

Franck Bayle – Thales Avionics Chanthachith Souvanthong – ON Semiconductor

Assumptions of reliability prediction

Reliability prediction is one step of the reliability process. Some assumptions must be checked during the “Design For Reliability” activities performed during the development phase as :

q  Product robustness to reduce induced failures (overstresses) §  Derating analysis §  Worst case analysis §  Highly Accelerated Stress (HALT)

q  Life limited component analysis to reject wear-out failure mechanisms after the product service life

q  Accelerated tests for specific components

q  Optimized screening (HASS)

q  Life profile (stress type, stress level, operational phases)

RELIABILITY

TECHNOLOGY

PROCESS USE

What are the goals of FIDES [REF06]: •  realistic prediction •  Usable for reliability assessment and building (DFR) •  Usable for all types of items, including COTS sub-assemblies

Life cycle Audit 170 Questions

components technologies

System Life Profile

Technologies • Electronic Component family Classification • Detailed technologies =>Recent technologies covered

Profile Description • Phases breakdown • Duration of each phase • Functioning condition (on/off) • Ambient temperature (constant) • Thermal cycling (duration, frequency, amplitude) • Relative Humidity • Vibrations • Chemical (pollution)

Process

Components

Origin cause of failure

Process failure distribution versus

life Cycle

Process impact

FIDES global modelisation

FIDES global modelisation

ΠΠλλ ProcessPMPhysical ⋅⋅=

•  Unexpected overstress

•  Operational use of the product

•  Quality of manufacturer

•  Quality of component

•  Experience of supplier

•  Processes management (Quality & Control)

inducedPhysical Π.λ ⋅⎥⎥

⎢⎢

⎟⎟⎟

⎜⎜⎜

⎛= ∑

∑ii

ii

i phaseTphaseTphase

λ

•  Influence of technology

•  Influence of mission profile

Sedyakin principle

Different type of failures

Defaults from production

Defaults from Design

Wear-out failures

Weak components

Overstress

Mavericks lots

ΠΠλλ ProcessPMPhysical ⋅⋅=

- Overstress - Default from production - Wrong spec, bug - Mavericks lots

- Weak components

- Weak components - Overstress(Pi induced) - Mission profile

Constant failure justification

q  Early failures due to flaws in manufacturing process are only observed on a small part of the population, these failures are becoming very low with a screening activity in place. Furthermore, a great quantity of failure mechanisms at equipment level can be observed: F  It is impossible to model these phenomena by Weibull distribution

q  Catalectic failures are usually modeled by the exponential distribution or Homogeneous

Poisson Process (HPP)

q  Premature wear-out failures are only observed on few components. Following Occam’s razor principle, as Early failures, HPP model is better than Power Low Process (PLP) model when the number of failures is low [REF04].

q  Wear-out mechanisms, during DFR activities, are released after the service life. Even, in

some cases, it is not possible, renewal process (AGAN maintenance) at component level leads to an approximately behavior of the failure rate [REF01].

q  Drenick [REF03] has proven that this assumption is correct for a series system with a large number of components and maintenance actions.

⇒ Constant failure rate model

Non-repairable system

Probability of mission success

∑=

=n

ii

1λλ

Reliability requirement

Reliability metric

Failure rate

Failure mechanism

Catalectic failure

( ) ( ) ⎟⎠⎞

⎜⎝⎛−= ∫

t

duutR0

.exp λ

block architecture

Series system

Reliability background

Constant failure rate

Independent events

Repairable system

MTBF

∑=

== n

ii

MTBF

1

11

λλ

Reliability requirement

Reliability metric

Failure intensity

Failure mechanism

Product maturity

( )( ) ( )( )

( ) ( )∫

=

−−=+∞

t

duutm

duumtmMTBF

0

0

.

.exp.exp

λ

whatever System architecture

Series system

Reliability background

Homogeneous Poisson Process

Poisson Process superposition

Maturity Deterioration Improvement

FIDES physical modelisation

q  Physical failure “rate / intensity” is based on Cox models defined by:

( ) ( ) ( )XAFtoXtPhysical ., λλ =

where λo is a base failure rate in reference conditions AF is an acceleration factor based on Physic Of Failure X is stress vector t is the calendar time

q  Constant failure “rate / Intensity”

( ) ( )XAFoXPhysical .λλ =

Base failure rate estimation

q Component reliability prediction is very difficult as:

§  Reliability improvement with technology change (Moore’s Law is still valid since 1964) .

§  Physic of failure mechanisms change with technology.

( ) ( ) inducedPhysical XAFoX Π= ..λλ

§  λ0 can be determined by using: §  Reliability tests take a long time and will be very expensive:

§  High pressure on development time and cost in combination with high level requirement on reliability for systems with increasing complexity.

§  Has to be done for all electronic components. §  Manufacturers reliability data report:

§  However for the same equivalent component the reliability can be different . §  In FIDES, λ0 was calculated from multiple semiconductor providers:

§  Need more data for statistical reasons. §  Multisource could be used or supplier could change during lifetime of

the equipment. §  Observation time of the λ0 is limited due to fast technology change.

Base failure rate estimation

q  Base failure rates are estimated from manufacturer test. This is the component intrinsic reliability and are based on manufacturer reliability tests as follows:

Sigle Signification C.I. Discrets PassifsHTOL High Temperature Operating Life X XHTS High Temperature Storage X XHTRB High Temperature Reverse Bias XHTGB High Temperature Gate Bias XHAST Humidity Accelerated Stess Test XTHB Temperature Humidity Bias X XH3TRB High Humidity, High Temperature

Reverse BiasX

PCT ou AC Pressure Cooker Test / Autoclave XTC / TS Temperature Cycle / Thermal Shock X X X

FIDES physical modelisation

q  Physic of Failure are based on modified GLL models [REF02] :

( ) ( )⎟⎠⎞⎜

⎝⎛ += ∑

=

s

iiii XgXL

10 .exp αα

where s is the number of stress α is a parameter vector of the corresponding POF law L is given quantile of the underlying distribution

q  Acceleration factors are so given by:

( ) ( )( )

( ) ( )[ ]⎥⎦⎤

⎢⎣⎡ −== ∑

=

s

iiiiii XgXog

XLXoLXAF

1.exp α

q  Acceleration factors are based on POF [REF05]:

Stress POF Law

Steady-state temperature Arrhenius Law

Thermal cycling Norris-Landzberg

Vibration Basquin

Humidity Hallberg - Peck

FIDES physical modelisation

POF in FIDES : example of Arrhenius law theory q Chemical reactions (not only for gas) are active by steady-state temperature. This reaction rate R(T) was proposed by Svante Arrhenius [REF07] as follows:

•  The Boltzmann constant Kb, which links the temperature T to the energy E ∼ Kb.T, gives a mechanical significance to the temperature notion. This fundamental constant hasn’t sense for a particle and is defined only in the limit of statistical physic i.e. the limit of a great number of particles (~ 10 23).

•  The constant C is specific to the reaction and is the fastest reaction that could ever proceed if the temperature is very large.

•  Arrhenius argued that molecules must acquire a minimum amount of energy, called “activation energy Ea” to get transformed into chemical products. He also argued that the proportion of molecules that have kinetic energy greater than Ea can be calculated from Maxwell Boltzmann distribution.

( ) ⎟⎟⎠

⎞⎜⎜⎝

⎛=

TKECTRb

a

.exp.

FIDES physical modelisation

High activation energy

Low activation energy

q  Arrhenius concept and reasoning have been widely accepted and widely applied not only to chemical reactions but also to various physical processes involving solids. The inventors of the transistor (Bardeen, Shockley & Brattain) were physicists and this is perhaps why Arrhenius law have been widely used in reliability physics problem during the last 50 years or so to assess the reliability for semiconductor materials and devices.

q  The reaction rate depends in fact of the proportion of particles having a minimal energy called “activation energy”. The activation energy is characteristic of the failure mechanisms meaning that failure mechanisms are more or less sensitive to temperature

If we take the example of the Peck law. We have 2 stress (RH & T), so s=2 è Set g1(RH)=ln(RH) and g2(T)=1/T . We obtain:

( ) ( ) ( )( ) ⎥⎦

⎤⎢⎣

⎡⎟⎠

⎞⎜⎝

⎛ −⋅+−⋅=TTo

RHRHoRHTAF 11lnlnexp, 21 αα

è ( ) ( ) ( )( )[ ] ⎥⎦

⎤⎢⎣

⎡⎟⎠

⎞⎜⎝

⎛ −⋅⋅−⋅=TTo

RHRHoRHTAF 11explnlnexp, 21 αα

è ( ) ⎥⎦

⎤⎢⎣

⎡⎟⎠

⎞⎜⎝

⎛ −⋅⋅⎟⎠

⎞⎜⎝

⎛=TToRHo

RHRHTAF 11exp, 2

1

αα

with KbEap =−= 21 and αα

FIDES physical modelisation

Acceleration factor estimation

§  Based on: §  physics of failures (JEDEC – “ Failure Mechanisms and Models for Semiconductor Devices “ - JEP 122G, nov 2011)

§  and Accelerated stress tests (Mechanical, Humidity, temperature cycling) were done over 16 boards to validate assumptions taken in FIDES on physics of failures.

( ) ( ) inducedPhysical XAFoX Π= ..λλ

1,00

5,00

10,00

50,00

90,00

99,00

1000,00 10000,00

Use Level Probability Weibull

Time

Cum

ulat

ive d

ensi

ty fu

nctio

n

Beta=2,4494, a=8,9462E+6, n=0,6729

Integrated Circuit Acceleration factor for Thermal Stress

( )⎥⎥⎦

⎢⎢⎣

+−××

− 273T1

29310.711604

componentje

Reciprocal of the Boltzmann’s constant.

Activation energy Ea is adapted to each type of component. 0.7 eV is for active components [3] (semiconductors). This value has been chosen after an analysis that shows that this was -for current technologies- the only acceptable value. An identified alternative was to make the activation energy variable with temperature, Ea = f(T°); this was a bit too complicated to calibrate (to improve in the future).

Arrhenius law

In an operating phase ΠThermal =

In a non-operating phase: ΠThermal = 0

In non operating phases, this factor is zero. The total failure rate will not be zero because of additive part of the model (the other contributors). The Pi_thermal is especially complemented by the Pi_RH (humidity) for non operating phases.

Integrated Circuit Acceleration factor for Thermal cycling

( )⎥⎥⎦

⎢⎢⎣

+−××

−×⎟⎟⎠

⎞⎜⎜⎝

⎛×⎟⎟

⎞⎜⎜⎝

⎛×⎟⎟⎠

⎞⎜⎜⎝

⎛ × 273T1

31310,12116041,9

cycling31

cy

annual

cy-annual cyclingmaxe20

ΔT2,2)min(θ

tN12

( )⎥⎥⎦

⎢⎢⎣

+−××

−×⎟⎟⎠

⎞⎜⎜⎝

⎛×⎟⎟⎠

⎞⎜⎜⎝

⎛ × 273T1

313112,0116044

cycling

annual

cy-annual cyclingmaxe20

ΔTtN12

Norris Landzberg

ΠTcy for soldier joints :

313K is to compare to the maximum temperature in the cycle. As the "reference cycle" chosen for FIDES is of 20°C and the reference ambient temperature of 20°C, this make a reference maximum cycle temperature of 273+20+20=313K.

Effect of temperature amplitude

Effect of maximum

temperature

Effect of constraints relaxation

Conversion cycles to failure rate

ΠTcy for case :

Integrated Circuit Acceleration factor for Vibration

1.5RMS

0.5G

⎟⎠

⎞⎜⎝

The value of 1.5 is at the bottom of the range of values usually encountered in Basquin's law uses. This is because the failure mechanisms accelerated by vibrations are numerous and not restricted to fatigue (for example: move of particles). And even only for fatigue, the materials to cover are also numerous. This leads to this low exponent. The value of 1.5 is relevant for reliability prediction, not to accelerate testing. There is a specific warning on this topic in the FIDES guide 2009 page 43

Inverse Power Law

ΠMech =

Integrated Circuit Acceleration factor for Humidity

Peck Law

Π RH = ( )⎥⎦⎤

⎢⎣

+−××

×⎟⎠

⎞⎜⎝

⎛ 273T1

29310.9116044,4

ambient ambient-boarde70

RH

Indeed the most common values seen for this exponent in Peck's law are between 2.7 and 3, but several trials have shown that a power of 4.4 is more realistic for non-operating conditions. As this law is mainly used for non-operating phases in FIDES, the value of 4.4 has been taken.

.

The reference relative humidity is taken at 70% (average world humidity). It is true that in a building the RH is lower is often lower because of heating or air-conditioning (see also the FIDES Guide page 40-42) In some way, the humidity factor replaces the thermo-electric factor for non-operating phases.

Effect of temperature Effect of

humidity

Acceleration factor determination

( ) ( ) inducedPhysical XAFoX Π= ..λλ

Discrepancies between FIDES and tests results due to the fact that tests were done on supplied boards (ΠRH not null)

Tests results include bonding and solder joints failures.

Tests results are globally matching FIDES parameters and validate its approach.

Induced factor estimation

§  Reliability tests cannot reveal all possible failures:

§  Tests are not done in the final environment and cannot reflect the complexity of the whole system e.g. interaction between equipments.

§  Experience from field returns shows that main failures are catalectic and due to EOS, humans errors,…Those failures cannot always be recreated during the tests.

§  Those types of failures cannot have a Cox representation because there is no physics of failure behind them.

§  Therefore, an audit with qualitative questions is done. The result is a coefficient between 1 to 100.

§  Because of the additive form of the failure rate, these modelisations cannot be put directly in equation because the audit result is not a failure rate.

=> A multiplicative form has been chosen

( ) ( ) inducedPhysical XAFoX Π= ..λλ

TERMINOLOGY

AGAN : As Good As New COTS : Component On The Shelf DFR : Design For Reliability GLL : General Log Linear HASS : Highly Accelerated Step Stress HAT : Highly Accelerated Test HPP : Homogeneous Poisson Process MTBF : Mean Time Between Failure PLP : Power Law Process

Document reference

[REF01] : “Statistical Methods for the reliability of repairable systems” Steven E.RIGDON & Asit P.BASU Wiley 2000

[REF02] : “Modeling & Analysis for Multiple Stress-Type Accelerated Life Data

Adamantios METTAS 2000 PROCEEDINGS Annual RELIABILITY and MAINTAINABILITY Symposium [REF03] : « The failure law of complex equipment »

R.F.DRENICK J.Soc.Indust.Appl.Math.1960 Vol 8 N°4 [REF04] : “ The effect of assuming a HPP when the true process is a PLP”

Steven E.RIGDON & Asit P.BASU Journal of Quality technology 1990 [REF05] : JEDEC Standards

JEP122G [REF06] : FIDES Guide 2009 [REF07] : «Über die Reaktion-geschwindigkeit bei der Inversion vor Rhorzücker durch Sauren “

S.ARRHENIUS Z. Phys.-Ch., 1889.