by: swetha kendyala ske009@latech
DESCRIPTION
Software Rejuvenation. By: Swetha Kendyala [email protected]. Introduction. - PowerPoint PPT PresentationTRANSCRIPT
![Page 2: By: Swetha Kendyala ske009@latech](https://reader035.vdocument.in/reader035/viewer/2022062800/56814194550346895dad7b90/html5/thumbnails/2.jpg)
• When software applications execute continuously for long periods of time, the processes corresponding to the software in execution age or slowly degrades with respect to the effective usage of their system resources. Process aging will affect the performance and eventually cause the application to fail.
Introduction
![Page 3: By: Swetha Kendyala ske009@latech](https://reader035.vdocument.in/reader035/viewer/2022062800/56814194550346895dad7b90/html5/thumbnails/3.jpg)
What is Software Rejuvenation?
• The act of gracefully terminating an application and immediately restarting
• Goal: Prevents unexpected error termination by terminating the program before it suffers an error
![Page 4: By: Swetha Kendyala ske009@latech](https://reader035.vdocument.in/reader035/viewer/2022062800/56814194550346895dad7b90/html5/thumbnails/4.jpg)
Intended Use• Software rejuvenation is primarily indicated for
servers where applications are intended to run indefinitely without failure
![Page 5: By: Swetha Kendyala ske009@latech](https://reader035.vdocument.in/reader035/viewer/2022062800/56814194550346895dad7b90/html5/thumbnails/5.jpg)
Why do applications fail?• Process Aging: gradual degradation of application
performance, over time, that may lead to premature program termination
![Page 6: By: Swetha Kendyala ske009@latech](https://reader035.vdocument.in/reader035/viewer/2022062800/56814194550346895dad7b90/html5/thumbnails/6.jpg)
Causes• Memory leaks• Unreleased file locks• File descriptor leaking• Etc.
![Page 7: By: Swetha Kendyala ske009@latech](https://reader035.vdocument.in/reader035/viewer/2022062800/56814194550346895dad7b90/html5/thumbnails/7.jpg)
Software Rejuvenation• Periodic preemptive rollback of continuously
running applications to prevent failures in the future
![Page 8: By: Swetha Kendyala ske009@latech](https://reader035.vdocument.in/reader035/viewer/2022062800/56814194550346895dad7b90/html5/thumbnails/8.jpg)
Transition Model For SW withoutRejuvenation
Transition Model For SW with Rejuvenation
![Page 9: By: Swetha Kendyala ske009@latech](https://reader035.vdocument.in/reader035/viewer/2022062800/56814194550346895dad7b90/html5/thumbnails/9.jpg)
Downtime and cost without rejuvenation
• Pf =
• Downtimew/o r(L) = Pf * L
• Costw/o r(L) = Pf * L * cf
2111
1
rrr
![Page 10: By: Swetha Kendyala ske009@latech](https://reader035.vdocument.in/reader035/viewer/2022062800/56814194550346895dad7b90/html5/thumbnails/10.jpg)
Downtime and cost with rejuvenation
• Pp =
• Pf =
• Pr =
• P0 =
• Downtimew r(L) = (Pf + Pr) * L
• Costw r(L) = (Pf * cf + Pr * cr) * L
24
34
111
rr
rr
r
Pr P*1
Prr P*3
4
Prr P*2
4
![Page 11: By: Swetha Kendyala ske009@latech](https://reader035.vdocument.in/reader035/viewer/2022062800/56814194550346895dad7b90/html5/thumbnails/11.jpg)
Thresholds - Goal
Goal is to stay in S0 for the longest amount of time
![Page 12: By: Swetha Kendyala ske009@latech](https://reader035.vdocument.in/reader035/viewer/2022062800/56814194550346895dad7b90/html5/thumbnails/12.jpg)
Thresholds cont.• To see how r4 affects downtime and cost, lets
differentiate the previous equations with respect to r4
![Page 13: By: Swetha Kendyala ske009@latech](https://reader035.vdocument.in/reader035/viewer/2022062800/56814194550346895dad7b90/html5/thumbnails/13.jpg)
Thresholds cont.• Downtime:
• If r3 is dominant, the derivative becomes negative and downtime decreases when r4 increases thus rejuvenate at state Sp
• If r3 is small, slow recovery from SR, downtime increases as r4 increases
]3)1(1[ 2 rr r
![Page 14: By: Swetha Kendyala ske009@latech](https://reader035.vdocument.in/reader035/viewer/2022062800/56814194550346895dad7b90/html5/thumbnails/14.jpg)
Thresholds cont. • Cost =
• When cr is dominant, cost increases as r4 increases, implies no rejuvenation benefit
• When cr is small, cost decreases as r4 increases
][ 21)21()32(rrrr
rrr fcc
![Page 15: By: Swetha Kendyala ske009@latech](https://reader035.vdocument.in/reader035/viewer/2022062800/56814194550346895dad7b90/html5/thumbnails/15.jpg)
Thresholds cont. • Overall, costs need to be calculated for individual
programs
• For best results: perform rejuvenation at state SP (r4 = ∞) or don’t perform rejuvenation (r4 = 0)
![Page 16: By: Swetha Kendyala ske009@latech](https://reader035.vdocument.in/reader035/viewer/2022062800/56814194550346895dad7b90/html5/thumbnails/16.jpg)
Example 1• MTBF = 12 months; = 1/(12*30*24)• Takes 30 min to recover from unexpected error; r1 = 2• Base Longevity is seven days; r2 =1/(7*24)• If rejuvenation is performed, mean repair time after
rejuvenation is 20 minutes; r3 = 3• Ave. Cost of unscheduled downtime due to failure, cf,
is $1,000/hour• Ave. Cost of scheduled downtime during rejuvenation,
cr, is $40/hour
![Page 17: By: Swetha Kendyala ske009@latech](https://reader035.vdocument.in/reader035/viewer/2022062800/56814194550346895dad7b90/html5/thumbnails/17.jpg)
Software Rejuvenation
No rejuvenation
(r4 = 0)
Once Every three Week
r4 = 1/(2*7*24)
Once Every Two Weeks
r4 =1/(1*7*24)
Hours of Downtime 0.490 5.965 8.727
Cost of Downtime
490 554 586
![Page 18: By: Swetha Kendyala ske009@latech](https://reader035.vdocument.in/reader035/viewer/2022062800/56814194550346895dad7b90/html5/thumbnails/18.jpg)
Software Rejuvenation
No rejuvenation
(r4 = 0)
Once Every month
r4 = 1/(20*24)
Once Every Two Weeks
r4 =1/(4*24)
Hours of Downtime 7.19 6.83 6.36
Cost of Downtime
3.6k 2.48k 1.11k
![Page 19: By: Swetha Kendyala ske009@latech](https://reader035.vdocument.in/reader035/viewer/2022062800/56814194550346895dad7b90/html5/thumbnails/19.jpg)
Example 2• MTBF = 3 months; = 1/(3*30*24)• Takes 30 min to recover from unexpected error; r1 = 2• Base Longevity is three days; r2 =1/(3*24)• If rejuvenation is performed, mean repair time after
rejuvenation is 10 minutes; r3 = 6• Ave. Cost of unscheduled downtime due to failure, cf,
is $5,000/hour• Ave. Cost of scheduled downtime during rejuvenation,
cr, is $5/hour
![Page 20: By: Swetha Kendyala ske009@latech](https://reader035.vdocument.in/reader035/viewer/2022062800/56814194550346895dad7b90/html5/thumbnails/20.jpg)
Software Rejuvenation
No rejuvenation
(r4 = 0)
Once Every three Week
r4 = 1/(11*24)
Once Every Two Weeks
r4 =1/(4*24)
Hours of Downtime 1.94 5.70 9.52
Cost of Downtime
9675.25 7672.43 5643.31
![Page 21: By: Swetha Kendyala ske009@latech](https://reader035.vdocument.in/reader035/viewer/2022062800/56814194550346895dad7b90/html5/thumbnails/21.jpg)
Example 3• MTBF = 3 months; = 1/(3*30*24)• Takes 2 min to recover from unexpected error; r1 =
0.5• Base Longevity is 10 days; r2 =1/(10*24)• If rejuvenation is performed, mean repair time after
rejuvenation is 10 minutes; r3 = 6• Ave. Cost of unscheduled downtime due to failure,
cf, is $5,000/hour• Ave. Cost of scheduled downtime during
rejuvenation, cr, is $5/hour
![Page 22: By: Swetha Kendyala ske009@latech](https://reader035.vdocument.in/reader035/viewer/2022062800/56814194550346895dad7b90/html5/thumbnails/22.jpg)
Implementation• Implementation of Software Rejuvenation is fairly
easy. • Cron Jobs can be set to restart the application at
various intervals• watchd can be used to detect if applications have
failed and restart them
![Page 23: By: Swetha Kendyala ske009@latech](https://reader035.vdocument.in/reader035/viewer/2022062800/56814194550346895dad7b90/html5/thumbnails/23.jpg)
Real World Examples• BILL-DATS II Collector
– Billing collection system used by AT&T long-distance network
– Set to rejuvenate after 1 week– Hasn’t prematurely failed after several year
![Page 24: By: Swetha Kendyala ske009@latech](https://reader035.vdocument.in/reader035/viewer/2022062800/56814194550346895dad7b90/html5/thumbnails/24.jpg)
• “S” Scientific Speech synthesis system• Long running scientific application• Used to process several hundred sentences over the
course of many days• Found to fail after 100 sentences• Rejuvenates after 15
![Page 25: By: Swetha Kendyala ske009@latech](https://reader035.vdocument.in/reader035/viewer/2022062800/56814194550346895dad7b90/html5/thumbnails/25.jpg)
Conclusions:• Decision to use Software Rejuvenation depends on
predetermined failure rates and associated costs.
• r4 = 0 , No rejuvenation
• r4 = ∞ , Rejuvenation
![Page 26: By: Swetha Kendyala ske009@latech](https://reader035.vdocument.in/reader035/viewer/2022062800/56814194550346895dad7b90/html5/thumbnails/26.jpg)
Questions???