
Presented by:

Mark E. Sims, Reliability S&T Engineer

Aviation and Missile Research, Development and Engineering Center

UNCLASSIFIED

Intro Reliability Growth

"Approved for public release; distribution unlimited. Review completed by the AMRDEC Public Affairs Office 11 Oct 2013; PR0073."

MIL-HDBK-189 Definition

Reliability Growth: The positive improvement in a reliability parameter over a period of time due to changes in product design or the manufacturing process.

MIL-HDBK-189 is a Department of Army Handbook for Reliability Growth Management

Beginnings

J.T. Duane was an engineer at the Aerospace Electronics Department of the General Electric Company.

He published a paper in 1964 that applied a “learning curve approach” to reliability monitoring.

He observed that the cumulative MTBF versus cumulative operating time followed a straight line when plotted on log-log paper.

The learning (i.e., growing) is accomplished through a “test, analyze, and fix” (TAAF) process.

[Figure: TAAF cycle of Design, Test, Failure Analysis, and Identified Deficiencies.]

Graphs

[Figure: Reliability Growth Chart shown twice, once with log-log paper graphing and once with normal graphing. Each chart plots the Cumulative Duane and Instantaneous Duane curves against Test Hours.]

Duane Postulate: The cumulative MTBF versus cumulative operating time is a straight line on log-log paper.

Continuous Growth

Continuous means time.

[Figure: Reliability Growth Chart plotted against Test Hours.]

You can plot failure rate or MTBF against the total test hours.

Discrete Growth

Discrete means trials.

[Figure: Reliability Chart plotted against Trials.]

Reliability Growth follows a Learning Curve approach.

Note: Growth is most rapid early in the process, then flattens out!

Why Reliability Growth?

[Figure: grid of trial numbers 1 through 177.]

Example

A system has 18 failures in 177 trials. The failures are listed in the table below (failures 3 through 18 shown).

Failure   Trial   Trials Between Failures
3         14      7
4         16      2
5         26      10
6         30      4
7         38      8
8         39      1
9         51      12
10        55      4
11        64      9
12        71      7
13        79      8
14        98      19
15        108     10
16        129     21
17        145     16
18        148     3

There appears to be reliability growth: fewer trials between failures early on, and more trials between failures later.
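The trend in the example tables can be checked with a short sketch. The split into an early half and a late half is an illustrative choice made here, not a method from the slides, and the variable names are hypothetical.

```python
# (failure number, trial of failure, trials since previous failure),
# copied from the example tables for failures 3 through 18.
rows = [
    (3, 14, 7), (4, 16, 2), (5, 26, 10), (6, 30, 4),
    (7, 38, 8), (8, 39, 1), (9, 51, 12), (10, 55, 4),
    (11, 64, 9), (12, 71, 7), (13, 79, 8), (14, 98, 19),
    (15, 108, 10), (16, 129, 21), (17, 145, 16), (18, 148, 3),
]

# Sanity check: each gap equals the difference between consecutive failure trials.
for (_, prev_trial, _), (_, trial, gap) in zip(rows, rows[1:]):
    assert trial - prev_trial == gap

gaps = [gap for _, _, gap in rows]
half = len(gaps) // 2  # illustrative split: first 8 gaps versus last 8 gaps
early_mean = sum(gaps[:half]) / half
late_mean = sum(gaps[half:]) / (len(gaps) - half)

print(f"mean trials between failures, failures 3-10:  {early_mean:.2f}")
print(f"mean trials between failures, failures 11-18: {late_mean:.2f}")
```

A larger late mean is only an informal indication of growth; it is not a statistical trend test.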

Example

Applying Reliability Growth Methodology, we get the following curve:

[Figure: Reliability versus Trials; the value 0.9254 is marked on the growth curve.]

Note: Reliability without applying growth is 1 – (18 / 177) = 0.8983
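As a quick arithmetic check of the no-growth figure in this example, the sketch below computes the simple point estimate. The value 0.9254 is copied from the slide's curve, not computed here, and the variable names are hypothetical.

```python
failures = 18
trials = 177

# Point estimate that ignores growth: successes divided by trials.
point_estimate = 1 - failures / trials
print(f"reliability without growth: {point_estimate:.4f}")

# Value marked on the slide's growth curve (taken as given, not derived here).
growth_estimate = 0.9254
print(f"credit from applying growth: {growth_estimate - point_estimate:.4f}")
```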

Why Reliability Growth?

• Saves Assets
• Reduces Test Time
• Saves $$$$$$$

Duane Model

Power Law Formulation for reliability growth

Duane Postulate

During reliability growth, graphing the log of time (or trials) against the corresponding log of cumulative MTBF gives a straight line with slope α.

MTBFCum = K * t^α

MTBFCum = Cumulative Mean-Time-Between-Failure
t = Time (or Trial)
K = Constant for Power Law Equation
α = Growth parameter

Taking logs of both sides gives a linear relationship, y = αx + b:

Ln(MTBFCum) = α * Ln(t) + Ln(K)

[Figure: the points (Ln(t1), Ln(M1)), (Ln(t2), Ln(M2)), and (Ln(t3), Ln(M3)) fall on a straight line with slope α.]

MTBFCum has a linear log-log relationship with time!

Calculating α (the growth rate)

We will determine α from these two readings. First calculate the cumulative MTBF for each reading, then take logs of the readings.

                Time (hrs)   Total Failures   MTBF   Ln(Time)           Ln(MTBF)
First reading   500          5                100    Ln(500) = 6.215    Ln(100) = 4.605
Last reading    4000         20               200    Ln(4000) = 8.294   Ln(200) = 5.298

Plot the logs of the readings. The slope of the line through ( 6.215 , 4.605 ) and ( 8.294 , 5.298 ) is α = 0.33.

Growth is indicated when 0 < α < 1
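The two-reading calculation can be reproduced with a minimal sketch. The function name duane_alpha is hypothetical; the arithmetic is simply the slope between the two log points.

```python
import math

def duane_alpha(t1, failures1, t2, failures2):
    """Slope of Ln(cumulative MTBF) versus Ln(time) between two readings."""
    mtbf1 = t1 / failures1  # cumulative MTBF at the first reading
    mtbf2 = t2 / failures2  # cumulative MTBF at the last reading
    return (math.log(mtbf2) - math.log(mtbf1)) / (math.log(t2) - math.log(t1))

alpha = duane_alpha(500, 5, 4000, 20)
print(f"first point: ({math.log(500):.3f}, {math.log(100):.3f})")
print(f"last point:  ({math.log(4000):.3f}, {math.log(200):.3f})")
print(f"alpha = {alpha:.2f}")
```

With only two readings the slope is exact for those points; with more readings a least-squares fit of the log points would be the natural extension.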

Duane Parameters

α = Growth parameter
TI = Initial test time
MI = Initial MTBF
MF = Final MTBF
Ttotal = Total time

• These parameters go into the Duane equation.

• If you know 4 of the parameters, you can calculate the other.

Sensitivity of α

What is the Total Test time if we are given these 4 parameters?

α        .40
TI       100
MI       50
MF       150
Ttotal   435

How does changing the growth parameter α affect the total test time?

α        .40   .27    .46   .64
TI       100   100    100   100
MI       50    50     50    50
MF       150   150    150   150
Ttotal   435   1823   285   113

The Total Time is very sensitive to α!
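The Ttotal row can be approximated with a short sketch. It assumes the Duane instantaneous MTBF form M(t) = MI / (1 - α) * (t / TI)^α, which is not printed on the slide but lands on the tabulated values to within rounding. The function name duane_total_time is hypothetical.

```python
def duane_total_time(alpha, t_initial, m_initial, m_final):
    """Test time at which the instantaneous MTBF reaches m_final.

    Assumption: M(t) = m_initial / (1 - alpha) * (t / t_initial) ** alpha,
    solved here for t.
    """
    ratio = m_final * (1 - alpha) / m_initial
    return t_initial * ratio ** (1 / alpha)

for alpha in (0.40, 0.27, 0.46, 0.64):
    t_total = duane_total_time(alpha, t_initial=100, m_initial=50, m_final=150)
    print(f"alpha = {alpha:.2f} -> Ttotal is about {t_total:.0f} hours")
```

Because α appears in the exponent 1 / α, small changes in the assumed growth rate move the required test time by large factors.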

Instantaneous vs. Cumulative

Duane MTBF Equation

Finding the true estimate of a system’s MTBF using reliability growth.

Inst vs. Cum MTBF

What is the true estimate of the MTBF at 250 hours?

Failure Number   Failure Time   MTBFCum   Time Between Failures   MTBFInst
1                10             10        10                      31
2                40             20        30                      43
3                90             30        50                      52
4                160            40        70                      59
5                250            50        90                      66

Is the MTBF 50 at time 250? Or would you say the MTBF is 90 at 250 hours?

Applying a Reliability Growth Tracking Model from AMSAA or ReliaSoft’s RGA software tool will give the MTBFInst numbers.

So, 66 is the true MTBF at 250 operating hours, if reliability growth is occurring.
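The slides do not say which estimator or settings the tracking tool used. As a rough sketch, the Crow-AMSAA power law model with maximum likelihood estimates, assuming the test ends at the fifth failure, gives instantaneous MTBF values within about one hour of the MTBFInst column. The variable names are hypothetical.

```python
import math

failure_times = [10, 40, 90, 160, 250]
n = len(failure_times)
t_end = failure_times[-1]  # assumption: test stops at the last failure

# Cumulative MTBF: elapsed time divided by failures observed so far.
mtbf_cum = [t / (i + 1) for i, t in enumerate(failure_times)]

# Power law maximum likelihood estimates for a failure-terminated test.
beta = n / sum(math.log(t_end / t) for t in failure_times[:-1])
lam = n / t_end ** beta

def mtbf_inst(t):
    """Reciprocal of the failure intensity lam * beta * t ** (beta - 1)."""
    return 1 / (lam * beta * t ** (beta - 1))

for t, cum in zip(failure_times, mtbf_cum):
    print(f"t = {t:>3}  MTBFCum = {cum:.0f}  MTBFInst is about {mtbf_inst(t):.1f}")
```

A shape estimate below 1 means the failure intensity is falling, which is the same statement as reliability growth.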

Inst vs. Cum MTBF

[Figure: MTBFInst and MTBFCum versus Time (or Test), t, on log-log graph paper; time axis from 10 to 10,000.]

This is how the graphs look in standard Cartesian coordinates:

[Figure: MTBFInst and MTBFCum versus Time (or Test), t; time axis from 500 to 2000.]

Exercise

10 system failures occurred after 500 hours of reliability growth testing, with a calculated growth parameter of 0.40.

What is the system’s instantaneous MTBF?
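One way to work this exercise is sketched below. It assumes the Duane conversion between instantaneous and cumulative MTBF, MTBFInst = MTBFCum / (1 - α), which the slides do not print explicitly.

```python
failures = 10
test_hours = 500
alpha = 0.40

# Cumulative MTBF is total test time divided by total failures.
mtbf_cum = test_hours / failures

# Assumed Duane conversion from cumulative to instantaneous MTBF.
mtbf_inst = mtbf_cum / (1 - alpha)

print(f"cumulative MTBF    = {mtbf_cum:.1f} hours")
print(f"instantaneous MTBF = {mtbf_inst:.1f} hours")
```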

Reliability Growth Formulas

• Failure Rate
• MTBF
• Reliability

M(t) = 1 / r(t)

MTBF is the reciprocal of the failure rate.

Failure Rate Formula

Initial Conditions
rI = Initial failure rate
tI = Initial time corresponding to rI
α = Growth rate parameter

MTBF Formula

Initial Conditions
MI = Initial MTBF
tI = Initial time corresponding to MI

Reliability (Discrete)

Initial Conditions
RI = Initial Reliability
NI = Initial number of trials corresponding to RI

Deriving r(t) Formula

r(t) is sometimes called the Hazard Rate.

K = Constant for Power Law Equation
tI = Initial test time
MI = Initial MTBF at time tI

1. First, start with the Duane Postulate.
2. Insert initial conditions MI at tI, and solve for K.
3. Now substitute for K.
4. The failure rate, r, is the inverse of the MTBF, so r(t) = 1 / M(t).
5. Now we will simplify and take the derivative.

Deriving M(t) Formula

Recall MTBF = 1/r, so take the inverse of r(t).
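The slide equations for this derivation did not survive, so the sketch below is a reconstruction under stated assumptions: cumulative MTBF is MI * (t / tI)^α after substituting for K, expected failures are t divided by cumulative MTBF, and r(t) is the time derivative of expected failures. It compares a numerical derivative with the closed form (1 - α) / MI * (t / tI)^(-α). All function names are hypothetical.

```python
def mtbf_cum(t, m_i, t_i, alpha):
    """Cumulative MTBF after substituting K = m_i / t_i ** alpha."""
    return m_i * (t / t_i) ** alpha

def expected_failures(t, m_i, t_i, alpha):
    """Expected cumulative failures: time divided by cumulative MTBF."""
    return t / mtbf_cum(t, m_i, t_i, alpha)

def failure_rate(t, m_i, t_i, alpha):
    """Assumed closed form of the derivative of expected_failures."""
    return (1 - alpha) / m_i * (t / t_i) ** (-alpha)

m_i, t_i, alpha, t = 50.0, 100.0, 0.40, 435.0
h = 1e-3  # step for the central difference
numeric = (expected_failures(t + h, m_i, t_i, alpha)
           - expected_failures(t - h, m_i, t_i, alpha)) / (2 * h)
closed = failure_rate(t, m_i, t_i, alpha)

print(f"numeric r(t)     = {numeric:.6f}")
print(f"closed form r(t) = {closed:.6f}")
print(f"M(t) = 1 / r(t)  = {1 / closed:.1f}")
```

Under these assumptions the instantaneous MTBF at 435 hours comes out near 150, matching the MF and Ttotal pairing used in the sensitivity tables.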

The Sensitivity of Duane’s Initial Conditions TI and MI on the Total Test Time.

Sensitivity of Initial Time

What if we increase the initial time for a planning curve?

TI       100   150   200   250
α        .40   .40   .40   .40
MI       50    50    50    50
MF       150   150   150   150
Ttotal   435   652   869   1087

A higher initial time significantly increases Ttotal!

[Figure: planning curves for TI = 100 (Ttotal = 435) and TI = 250 (Ttotal = 1087); time axis from 250 to 1000.]

Growth is more rapid the smaller TI is!

Sensitivity of Initial MTBF

What if we change the initial MTBF for a planning curve?

MI       50    25     70    85
α        .40   .40    .40   .40
TI       100   100    100   100
MF       150   150    150   150
Ttotal   435   2459   187   115

A higher initial MTBF significantly decreases Ttotal!
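Both initial-condition sweeps can be reproduced with one sketch. It again assumes the Duane instantaneous MTBF form M(t) = MI / (1 - α) * (t / TI)^α solved for t; the function name is hypothetical.

```python
def duane_total_time(alpha, t_initial, m_initial, m_final):
    """Assumed Duane form solved for the time at which MTBF reaches m_final."""
    ratio = m_final * (1 - alpha) / m_initial
    return t_initial * ratio ** (1 / alpha)

# Sweep the initial test time TI with MI fixed at 50.
for t_i in (100, 150, 200, 250):
    total = duane_total_time(0.40, t_i, 50, 150)
    print(f"TI = {t_i:>3} -> Ttotal is about {total:.0f}")

# Sweep the initial MTBF MI with TI fixed at 100.
for m_i in (50, 25, 70, 85):
    total = duane_total_time(0.40, 100, m_i, 150)
    print(f"MI = {m_i:>3} -> Ttotal is about {total:.0f}")
```

Ttotal scales linearly with TI, while MI enters through a power of 1 / α, which is why halving MI from 50 to 25 multiplies the test time by more than five.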

Deriving Reliability Formula

Rcum = Cumulative Reliability
F = Number of Failures
N = Number of Trials
NI = Initial number of trials corresponding to RI

r is the failure rate.

1. Recall failure rate formula.
2. Subtract from 1.
3. Make substitutions.

Exercise

Initially, System A has 3 failures after 100 firings.

If you expect a growth rate of 0.25, what would be the expected reliability after 1000 flight tests?

Inst / Cum Conversions

AMSAA-Crow Model

Projection Method for reliability growth planning

Discrete PM2 Growth Plan Example

[Figure: PM2-Discrete Reliability Growth Planning Curve, reliability versus Trials (0 to 300). Series: Idealized Curve, DT1, DT2, DT3, LUT, IOT, Requirement. Marked values: RDT1 = 0.8987, RDT2 = 0.9260, RDT3 = 0.9455, RLUT = 0.9568, RG = 0.9639, RG Potential RGP = 0.9747, RR = 0.9200.]

041712-Sims-Reliability Growth (TE Class)

Continuous PM2 Growth Plan Example

[Figure: PM2 Continuous Reliability Growth Planning Curve, MTBF versus Test Time (hours). Series include the Idealized Curve, CAP5 through CAP10, Hypothetical Last Step, LUT, IOT, and Requirement. Marked values: MI = 190, MR = 200, MG,OT = 523, MG,DT = 581, MGP = 782.]


Continuous Curve Equation

Continuous curve is plotted using this equation.

MTBF(T) = System Mean-Time-Between-Failures at time T
MTBFI = Initial MTBF
MS = Management Strategy
µ = Average Fix Effectiveness Factor (FEF)
β = Shape parameter
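The continuous curve equation itself did not survive in the slide text. The sketch below assumes the commonly published PM2 continuous form, MTBF(T) = MTBFI / (1 - MS * µ * βT / (1 + βT)), with β chosen so the curve reaches a goal MTBF at the end of test. The MI, MS, and µ values and the 581 goal appear elsewhere in the slides; the 5000-hour total test time and all function names are hypothetical.

```python
def pm2_beta(total_time, m_initial, m_goal, ms, mu):
    """Shape parameter that makes the assumed curve hit m_goal at total_time."""
    gain = 1 - m_initial / m_goal
    return gain / (total_time * (ms * mu - gain))

def pm2_mtbf(t, m_initial, ms, mu, beta):
    """Assumed PM2 continuous planning curve."""
    return m_initial / (1 - ms * mu * (beta * t) / (1 + beta * t))

m_initial, ms, mu = 190.0, 0.95, 0.80
m_goal = 581.0
total_time = 5000.0  # hypothetical test length, for illustration only

beta = pm2_beta(total_time, m_initial, m_goal, ms, mu)
for t in (0.0, 1000.0, 2500.0, total_time):
    print(f"T = {t:>6.0f} h -> MTBF is about {pm2_mtbf(t, m_initial, ms, mu, beta):.0f}")
```

The curve starts at the initial MTBF and flattens toward MTBFI / (1 - MS * µ) as test time grows, so a goal above that limit has no valid β.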

Discrete Curve Equation

Discrete curve is plotted using this equation.

R(N) = System Reliability at trial N.
RA = The portion of the system reliability not impacted by the corrective action effort
RB = The portion of the system reliability addressed by the corrective action effort
MS = Management Strategy
µ = Average Fix Effectiveness Factor (FEF)
n = Shape parameter of the beta distribution representing pseudo trials

Management Strategy Factor

Management Strategy (MS) is the fraction of the overall system failure rate to be addressed by the corrective action plan.

MS = λB / (λA + λB)

λ = Failure rate.
λA = Failure rate of A-modes
λB = Failure rate of B-modes
λA + λB = Overall system failure rate

For various reasons (prohibitive cost, improbability of recurrence), some failure modes will not have a corrective action.

A-Mode: Failures that are not fixed.
B-Mode: Failures that will have a fix.

A “fix” means a reliability improvement corrective action, not just a remove and replace of the same component.

Example: What is the MS here?

Failure mode    Failure mode rate   Mode Type
1               0.027               B
2               0.015               B
3               0.033               B
4               0.001               A
5               0.013               B
Total B-modes   0.088
Total System    0.089
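The example's question can be answered directly from the definition of MS as the B-mode share of the overall failure rate. A minimal sketch, with hypothetical variable names:

```python
# (failure mode, failure mode rate, mode type) from the example table.
modes = [
    (1, 0.027, "B"),
    (2, 0.015, "B"),
    (3, 0.033, "B"),
    (4, 0.001, "A"),
    (5, 0.013, "B"),
]

lambda_b = sum(rate for _, rate, kind in modes if kind == "B")
lambda_a = sum(rate for _, rate, kind in modes if kind == "A")

# Management Strategy: fraction of the system failure rate in B-modes.
ms = lambda_b / (lambda_a + lambda_b)

print(f"total B-modes = {lambda_b:.3f}")
print(f"total system  = {lambda_a + lambda_b:.3f}")
print(f"MS = {ms:.3f}")
```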

μ, Fix Effectiveness Factor

MIL-HDBK-189 Definition:

Fix Effectiveness Factor, μ = A fraction representing the reduction in an individual initial mode failure rate due to implementation of a corrective action.

Essentially, Fix Effectiveness Factors discount failures. A couple of examples follow.

Number of tests = 20
Successful tests = 18
Failures: one Hardware Failure and one Software Failure

What is the reliability?

Corrective actions are then applied with μ1 = 100% and μ2 = 75%.

[Figure: the Hardware failure receives a 100% fix and is crossed out; the Software failure receives a 75% fix.]

What is the updated reliability?
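A sketch of the discounting idea for this example follows. It assumes each observed failure is reduced to a fractional failure of (1 - μ) after its fix, which is one common reading of "discounting"; the slides do not show the arithmetic.

```python
tests = 20
fefs = [1.00, 0.75]  # fix effectiveness factor for each of the two failures

# Reliability before any fixes: 18 successes out of 20 tests.
reliability = 1 - len(fefs) / tests

# Each failure counts as (1 - mu) of a failure after its corrective action.
discounted_failures = sum(1 - mu for mu in fefs)
updated_reliability = 1 - discounted_failures / tests

print(f"reliability         = {reliability:.4f}")
print(f"discounted failures = {discounted_failures:.2f}")
print(f"updated reliability = {updated_reliability:.4f}")
```

The result does not depend on which failure receives which factor, only on the sum of the remaining fractions.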

Another Example: Say the average μ is 0.75 (or 75%). What is the updated System Failure Rate?

Failure mode   Failure mode rate   Mode Type
1              0.027               B
2              0.015               B
3              0.033               B
4              0.001               A
5              0.013               B

Original
λA = 0.001
λB = 0.088
λSystem = 0.089

Updated
λA = 0.001
λB = 0.088 * (1 - 0.75) = 0.022
λSystem = 0.023
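The update above is a one-line calculation; a short sketch makes the effect on MTBF visible as well. The MTBF figures are simply reciprocals of the failure rates and are not stated on the slide.

```python
lambda_a = 0.001  # A-modes: not fixed, rate unchanged
lambda_b = 0.088  # B-modes: reduced by the average fix effectiveness factor
mu = 0.75

original_system = lambda_a + lambda_b
updated_b = lambda_b * (1 - mu)
updated_system = lambda_a + updated_b

print(f"original system failure rate = {original_system:.3f}")
print(f"updated B-mode failure rate  = {updated_b:.3f}")
print(f"updated system failure rate  = {updated_system:.3f}")
print(f"MTBF before = {1 / original_system:.1f}, after = {1 / updated_system:.1f}")
```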

Shape Parameter, β

β = Shape parameter
TT = Total Test Time
MG = MTBF Goal
MGP = MTBF Growth Potential
MI = Initial MTBF

η = Shape parameter of the beta distribution representing pseudo trials
NT = Total Number of Trials
RG = Reliability Goal
RGP = Reliability Growth Potential

Growth Potential

MGP = MTBF Growth Potential: the theoretical upper limit on MTBF.

For example: MS = 0.95, μ = 0.80, MI = 190
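The slide's formula and result for this example did not survive. A sketch under the commonly published PM2 assumption, MGP = MI / (1 - MS * μ), gives the following; the function name is hypothetical.

```python
def mtbf_growth_potential(m_initial, ms, mu):
    """Assumed PM2 growth potential: every B-mode surfaced and fixed at rate mu."""
    return m_initial / (1 - ms * mu)

mgp = mtbf_growth_potential(m_initial=190, ms=0.95, mu=0.80)
print(f"MGP is about {mgp:.1f}")
```

Under this assumption the growth potential depends only on the initial MTBF and the product MS * μ, not on test time.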

PM2 Curve Equation

RA = The portion of the system reliability not impacted by the corrective action effort
RB = The portion of the system reliability addressed by the corrective action effort
MS = Management Strategy. Fraction of failures to be addressed by corrective action. Medium Risk Range 0.90 – 0.96.
n = Shape parameter of the beta distribution representing pseudo trials

Management Strategy Factor

Fraction of failures to be addressed by the corrective action plan.

A-Mode: Failures that are not fixed.
B-Mode: Failures that will have a fix.
λ = Failure rate.

PM2 Growth Plan

RGP = Reliability Growth Potential
RG = Reliability Goal (to meet requirement)
NT = Total trials before going into IOT phase

Reliability Growth Potential

RGP = Reliability Growth Potential: the theoretical upper limit on system reliability.

For example: MS = 0.95, μ = 0.80, MI = 190

Summary

• Reliability Growth applies a “Learning Curve” Approach

• System must undergo Test-Analyze-And-Fix for reliability to grow.

• A growth plan is sensitive to its Initial Conditions.

AMSAA-Crow/Duane Equations

1. Single Shot Systems Expected Failures:

2. Continuously Operating Systems Expected Failures:
