Skip to main content

Reliability MTBF/MTTR Availability

Calculate system availability and reliability metrics. Enter values for instant results with step-by-step formulas.

Share this calculator

Formula

Availability = MTBF / (MTBF + MTTR); Nines = -log(1-A)

Worked Examples

Example 1: Web Server Reliability

Problem: Server: MTBF = 2,000 hours, MTTR = 2 hours. Calculate availability and annual downtime.

Solution: Availability calculation:\nAvailability = MTBF / (MTBF + MTTR)\nAvailability = 2,000 / (2,000 + 2)\nAvailability = 2,000 / 2,002\nAvailability = 99.90%\n\nNines: 3 nines (99.9%)\n\nAnnual downtime:\nFailures per year: 8,760 / 2,000 = 4.38 failures\nDowntime per failure: 2 hours\nUnplanned downtime: 4.38 × 2 = 8.76 hours/year\n\nThis matches the \"three nines\" benchmark of ~8.76 hours/year.\n\nTo reach four nines (99.99%):\nNeeded MTBF with 2hr MTTR:\n99.99 = MTBF / (MTBF + 2)\nMTBF ≈ 20,000 hours\n\nOR reduce MTTR:\n99.99 = 2,000 / (2,000 + MTTR)\nMTTR ≈ 0.2 hours (12 minutes)

Result: 99.90% availability | 3 nines | 8.76 hrs/year downtime

Example 2: Manufacturing Equipment

Problem: CNC machine: fails 6 times/year average, each repair takes 8 hours. Plus 40 hours planned maintenance. Calculate availability.

Solution: MTBF calculation:\nHours per year: 8,760\nFailures: 6\nMTBF = 8,760 / 6 = 1,460 hours\n\nMTTR: 8 hours\n\nTheoretical availability:\n1,460 / (1,460 + 8) = 99.45%\n\nActual annual downtime:\nUnplanned: 6 failures × 8 hours = 48 hours\nPlanned: 40 hours\nTotal: 88 hours\n\nActual availability:\n(8,760 - 88) / 8,760 = 99.0%\n\nNines: 2 nines (99%)\n\nFor manufacturing, 99% may be acceptable but:\n88 hours = 11 work days lost per year\nAt $500/hour lost production = $44,000/year\n\nImprovement options:\n- Reduce MTTR to 4 hours: saves 24 hours/year\n- Preventive maintenance to reduce failures to 4/year: saves 16 hours/year

Result: 99.0% actual availability | 88 hrs/year downtime | $44K/year impact at $500/hr

Example 3: Cloud Service SLA

Problem: SaaS product targets 99.95% availability SLA. Current: MTBF 500 hours, MTTR 30 minutes. Will they meet SLA?

Solution: Current availability:\nMTBF: 500 hours\nMTTR: 0.5 hours\n\nAvailability = 500 / (500 + 0.5) = 99.90%\n\nThis is below 99.95% target!\n\nAnnual downtime:\nFailures: 8,760 / 500 = 17.5 per year\nDowntime: 17.5 × 0.5 = 8.75 hours/year\n\n99.95% allows: 8,760 × 0.0005 = 4.38 hours/year\n\nThey're at 2× the allowed downtime.\n\nTo meet 99.95%:\nOption 1: Improve MTBF\n99.95 = MTBF / (MTBF + 0.5)\nMTBF = 1,000 hours (need 2× improvement)\n\nOption 2: Reduce MTTR\n99.95 = 500 / (500 + MTTR)\nMTTR = 0.25 hours = 15 minutes\n\nOption 3: Add redundancy\nTwo systems: 99.90% each\nCombined: 1 - (0.001)² = 99.9999%\n\nRecommendation: Redundancy provides biggest improvement.

Result: Current 99.90% < 99.95% target | Need 2× MTBF or 50% MTTR reduction | Redundancy recommended

Frequently Asked Questions

What is MTBF?

Mean Time Between Failures (MTBF) is the average time a repairable system operates before failure. MTBF = Total Operating Time / Number of Failures. Higher MTBF = more reliable. Example: if system runs 10,000 hours with 10 failures, MTBF = 1,000 hours. Used for systems expected to be repaired and continue operating.

What is MTTR?

Mean Time To Repair (MTTR) is average time to restore a failed system. MTTR = Total Repair Time / Number of Repairs. Lower MTTR = faster recovery. Includes: diagnosis, parts acquisition, repair, testing. MTTR is as important as MTBF for availability—a system that fails rarely but takes days to repair may be less available than one that fails more but recovers quickly.

How do you calculate availability?

Availability = MTBF / (MTBF + MTTR). This represents the probability system is operational at any given time. Also: Uptime / (Uptime + Downtime). Expressed as percentage or 'nines' (99.9% = 'three nines'). Accounts for both failure frequency (MTBF) and recovery speed (MTTR).

What's the difference between MTBF and MTTF?

MTBF applies to repairable systems—time between failures that are repaired. MTTF (Mean Time To Failure) applies to non-repairable items—time until permanent failure (e.g., light bulbs, hard drives). For components that are replaced not repaired, use MTTF. For systems that are repaired and continue, use MTBF.

What is 'five nines' availability?

Five nines (99.999%) means at most 5.26 minutes downtime per year. Scale: 99% = 3.65 days/year downtime, 99.9% = 8.76 hours/year, 99.99% = 52.6 minutes/year, 99.999% = 5.26 minutes/year. Each additional nine is 10x harder to achieve. Most web services target 99.9-99.99%.

How do I improve availability?

Two approaches: 1) Increase MTBF (reduce failure frequency) through: quality components, redundancy, preventive maintenance, environmental controls. 2) Reduce MTTR (faster recovery) through: monitoring, automation, spare parts, trained staff, documented procedures. Often reducing MTTR is easier than increasing MTBF.

References