Free · Open · Practitioner-Built

Free CMRP
Certification
Prep.

CMRPstudy is a complete CMRP exam prep resource — study guide, working calculators, and glossary. Built for practitioners. Free, always.

SMRP Domains

Reliability Pillars

Working Calculators

100+

Glossary Terms

What's Inside

Everything you need
to pass the CMRP exam.

No fluff. Built from the SMRP Body of Knowledge and real field experience, covering what actually appears on the exam.

📚

Complete Study Guide

All 5 SMRP domains plus the 5 Pillars of Reliability. Deep coverage of Domain 3 (45% of exam) — vibration, lubrication, alignment, NDT, Weibull, RCA, and motor failure modes.

Open Guide →

🧮

Reliability Calculators

MTBF/MTTR/Availability, OEE breakdown, Weibull β interpreter, Series/Parallel reliability, COUR estimator, and Storeroom EOQ — all with instant interpretation against world-class benchmarks.

Open Calculators →

📖

Searchable Glossary

100+ reliability and maintenance terms defined clearly — P-F interval, FMEA RPN, Weibull eta, ISO 4406, NLGI grades, and more. Instant search. No clicking through pages.

Search Terms →

🏛️

5 Pillars Framework

Leadership & Culture, Work Management, Proactive Maintenance, Skills & Training, Reliability Engineering — fully explained with Kotter's 8 steps, Palmer's 6 principles, and TPM's 8 pillars.

Explore Pillars →

📳

Domain 3 Deep Dive

45% of the exam. Vibration fault frequencies, bearing damage detection, thermography physics, oil analysis categories, ultrasound steam trap testing, alignment tolerances, balancing G-grades.

Go to Domain 3 →

📊

Weibull & Statistics

β shape parameter interpretation, η characteristic life, the 6 failure patterns, R(MTBF) = 36.8% trap, series/parallel reliability math, and FTA gate logic — all exam-tested content.

Study Weibull →

SMRP Benchmarks

World-class targets
you need to memorize.

OEE

≥85%

Overall Equipment Effectiveness

Wrench Time

55–65%

Hands-on productive time

Schedule Compliance

≥90%

Scheduled hours completed

PM Compliance

≥90%

PMs completed on time

MC / RAV

2–3%

Maintenance cost vs. asset value

Planned Work

>85%

Planned vs. total work hours

Emergency Work

<10%

Emergency vs. total hours

Ready Backlog

2–4 wks

Craft hours ready to schedule

Storeroom Fill Rate

≥95%

Parts filled on first request

Inventory Accuracy

≥98%

Physical count vs. system

Availability

≥95%

Asset uptime fraction

R(MTBF)

36.8%

Survival at MTBF — e⁻¹, NOT 50%

About This Resource

Built by someone
who does this work.

CMRPstudy was built by a practicing Maintenance Planner & Scheduler actively pursuing CMRP certification. The content reflects real-world application in a manufacturing environment — not just what textbooks say.

Everything here is grounded in the SMRP Body of Knowledge, Nowlan & Heap's failure pattern research, Doc Palmer's planning principles, and years of hands-on reliability improvement work. No advertising. No paid placements. No course upsells.

The goal: give every maintenance professional access to the same quality of study material regardless of whether they can afford a prep course.

CMRP Candidate SMRP Member Manufacturing Industry Practitioner-Written

Exam Fast Facts

Questions	110 (≈10 unscored)
Time Limit	2.5 hours
Format	Computer-based, Prometric
Scoring	200–800 scaled score
Passing Score	≈520 (not a raw %)
Recertification	Every 3 years
Largest Domain	D3 Equipment (45%)

Domain Weights

5 Pillars

🏛️ Framework Overview

P1 · Leadership & Culture

P2 · Work Management

P3 · Proactive Maintenance

P4 · Skills & Training

P5 · Reliability Engineering

SMRP Domains

D1 · Business & Mgmt 15%

D2 · Process Reliability 15%

D3a · Vibration Analysis 45%

D3b · Thermography

D3c · Oil Analysis

D3d · Lubrication

D3e · Alignment & Balance

D3f · RCA & Failure Analysis

D4 · Organization 10%

D5 · Work Management 15%

Advanced

📊 Weibull Analysis

🔬 NDT Methods

⚡ Motor Failure Modes

🏭 Shutdown Planning

Foundation Framework · SMRP Body of Knowledge

The 5 Pillars of Reliability

The 5 Pillars describe the organizational capabilities that enable world-class reliability. They underpin all five exam domains. Understanding the framework — and the interdependencies between pillars — is essential for answering situational questions correctly.

LeadershipWork ManagementProactive MaintSkills & TrainingReliability Engineering

🔑

Why this matters for the exam: SMRP exam questions frequently describe an organizational scenario and ask you to identify the best approach. Understanding which pillar is being tested — and knowing how pillars interact — lets you eliminate wrong answers quickly. A question about "why the PM program isn't reducing failures" is often a Pillar 2 (Work Management) or Pillar 4 (Skills) question, not a Pillar 3 question.

🔗 How the 5 Pillars Interconnect The interdependency logic

▾

Pillar 1 (Leadership) enables all others. Without leadership commitment and cultural alignment, no reliability initiative survives the first production crisis. Leaders who revert to measuring only output — not reliability performance — create the pressure that collapses every other pillar.

Pillar 2 (Work Management) is the delivery mechanism. The best maintenance strategies (Pillar 3) and most rigorous failure analysis (Pillar 5) produce zero value if the work management system can't deliver them to the field efficiently. An organization with great RCM output and poor scheduling has done expensive analysis for nothing.

Pillar 3 (Proactive Maintenance) generates the operating data. Condition monitoring findings feed Pillar 5 analysis. PM compliance rates feed Pillar 1 KPI dashboards. Without Pillar 3 execution, the other pillars have no field data to act on.

Pillar 4 (Skills) is the human enabler. Technically sophisticated PdM programs fail when the workforce lacks the skills to execute correctly. Misapplied vibration analysis — or an operator who can't recognize abnormal equipment sounds — is worse than no program at all.

Pillar 5 (Reliability Engineering) drives continuous improvement. Engineering analysis converts field experience into strategic decisions: which assets to prioritize, which failure modes to target, how to optimize maintenance strategy over time.

The Most Common Failure Pattern

Organizations implement technical tools (Pillars 3 and 5) without the organizational foundation (Pillar 1) or work management infrastructure (Pillar 2) to sustain them. The tools sit unused because there's no cultural mandate to use them and no scheduling system to deliver the resulting work. This is the single most tested scenario on the CMRP exam.

📊 Reliability Culture Maturity Model

▾

Stage	Characteristics	Typical MC/RAV
1 — Reactive	Run-to-failure dominant. Fire-fighting culture. No PM program. Emergency work >50%.	8–15%+
2 — Preventive	PM program exists but mostly time-based. Low schedule compliance. Planning inconsistent.	6–9%
3 — Proactive	PdM program active. Planning & scheduling mature. RCA practiced. Leading indicators tracked.	3–6%
4 — Predictive/CBM	Condition-based decisions dominant. FMEA-driven strategy. Failure modes well understood.	2–4%
5 — Optimized	AI-assisted. Real-time condition integration. Continuous improvement loops embedded.	<2%

✎

When a question describes an organization's current state, identify the maturity stage. The best answer typically describes the next logical step up — not a leap from Stage 1 to Stage 5.

Pillar 1 of 5 · Foundation

Leadership & Culture

The most critical and most neglected pillar. Technical tools fail when leadership culture doesn't support them. Covers reliability vision, accountability systems, change management, and organizational maturity.

🔄 Kotter's 8-Step Change Model Most referenced change framework on the exam

▾

Create Urgency — Show why change is necessary NOW. Use data: downtime losses, MC/RAV vs. competitors, COUR. The "burning platform." Without urgency, change initiatives stall immediately.
Build a Guiding Coalition — Form a team of influential leaders combining formal authority AND credibility. A coalition of only managers without craft respect will fail. One with only craft without management authority will be ignored.
Form a Strategic Vision — Clear, compelling, simple. People need to see where they're going. "Reduce unplanned downtime 40% in 18 months" is a vision. "Be excellent" is not.
Communicate the Vision — Relentlessly. Every meeting, every shift change, every KPI review. Leaders must model the behaviors they're asking for — walking the floor, asking about reliability metrics, attending P&S meetings.
Remove Obstacles — Identify and eliminate organizational barriers: systems, structures, and people who actively block the change. A supervisor who constantly pulls planned work for reactive emergencies is an obstacle.
Generate Short-Term Wins — Plan and create visible wins early. Quick wins (first 90 days) build credibility, silence skeptics, and maintain momentum. Pick a bad actor asset with a known solution and fix it visibly.
Sustain Acceleration — Use early wins to drive deeper change. Don't declare victory prematurely. Organizations that declare victory after 6 months almost always regress.
Anchor in Culture — Embed new behaviors in hiring criteria, training programs, performance reviews, and recognition systems. Culture is the LAST thing to change, not the first.

📋 Reliability Accountability Systems

▾

Accountability requires: visible KPIs (posted, reviewed weekly), individual ownership (each metric has a named owner), consequence (performance affects something — recognition, discussion, planning resources), and root cause response (when KPIs miss target, the response is to analyze why, not blame).

Common Leadership Failures in Reliability

Measuring only production output — reliability metrics are absent or ignored
Verbal commitment to proactive maintenance while resource decisions consistently favor production over PM windows
Celebrating "heroes" who fix breakdowns overnight — creating perverse incentive for reactive culture
Launching reliability initiatives without addressing the supervisory behaviors that drive reactive work

Pillar 2 of 5 · Process

Work Management

The operational engine of the maintenance organization. Work management converts reliability strategy into executed, documented field work. Palmer's 6 Principles are the most heavily tested work management content on the exam.

📋 Palmer's 6 Planning Principles All 6, fully explained

▾

Principle 1 — Planners Form a Separate Department

Planners must be organizationally separate from craft supervision. A planner who also supervises technicians will always prioritize the urgent (today's work) over the important (next week's preparation). The planner works FOR the supervisors by preparing work packages — not alongside them in the field.

Principle 2 — Focus on Future Work

The planner's time horizon is work not yet started. Once a job enters execution, it belongs to the supervisor. Planners diverted to manage active jobs degrade planning quality for future work — the cost compounds every week.

Principle 3 — Component-Level Files (Technical Library)

For each significant asset, maintain a file containing: historical job plans, parts lists with vendor info, OEM manuals, special tools required, isolation/clearance procedures, and lessons learned from past repairs. This file transforms individual technician experience into organizational knowledge available to everyone.

Principle 4 — Estimates Based on Craft Expertise

Estimates should reflect what a skilled craftsperson can accomplish — not the average performer, not padded for worst-case. Accurate estimates enable accurate schedule loading and meaningful schedule compliance measurement.

Principle 5 — Recognize the Craft (Feedback Loop)

Craft technicians are the source of field knowledge that improves job plans. A feedback loop must exist: technicians complete job feedback noting actual time, parts deviations, problems, and improvements. The planner incorporates this into the job plan file — individual experience becomes organizational memory.

Principle 6 — Measure Performance with Wrench Time

Wrench time (hands-on productive time as % of available work time) is the fundamental measure of planning and scheduling effectiveness. Improving from the industry average of 25–35% to world-class 55–65% roughly doubles craft productivity without adding headcount.

💡

What destroys wrench time: Traveling to get unstaged parts, waiting for equipment clearance, missing job information, hunting for tools, rework from wrong parts, waiting for permits. Planning and scheduling eliminate every one of these delays.

📅 The Weekly Work Cycle & Schedule Loading

▾

Timing	Activity	Owner
Mon–Thu	Planner prepares next week's work packages; acquires parts; confirms equipment availability with operations	Planner
Thursday	Weekly coordination meeting: planner, scheduler, supervisor, operations representative review following week's plan	All stakeholders
Friday by noon	Schedule published; work packages distributed to supervisors	Scheduler/Planner
Daily	Supervisor assigns work, manages execution, updates system with completions and deviations	Supervisor
End of week	Schedule compliance calculated, posted, and debriefed	All

Why 100% Schedule Loading?

Palmer's principle is to load the schedule to 100% of available craft hours. The intuitive approach — loading 85% to "leave room for emergencies" — is wrong. The scheduler's job is to plan the best use of available time. Emergency work is the supervisor's problem to manage in real time. Pre-compensating at 85% trains the organization to expect 85% execution and guarantees the remaining 15% is wasted.

Pillar 3 of 5 · Technical

Proactive Maintenance

Covers all forms of proactive maintenance — time-based PM, condition-based PdM, TPM, and operator-driven reliability. The goal: address potential failures before they become functional failures.

⚙️ PM Task Types & Selection Logic Which task for which failure mode

▾

Task Type	Use When	Examples
Time-based (scheduled)	Known age-related deterioration; component reliability decreases predictably with time/use (Weibull β > 3)	Oil changes, filter replacements, belt changes, seal replacements
On-condition (CBM/PdM)	Detectable P-F interval exists; condition monitoring can find the failure before it occurs	Vibration routes, thermography surveys, oil sampling, ultrasound routes
Failure-finding	Hidden function — failure is not self-announcing and only matters when the protected function is demanded	Testing relief valve setpoints, exercising emergency shutdown valves, testing standby pump auto-starts
Run-to-failure (RTF)	Failure consequence is acceptable AND no cost-effective proactive task exists	Light bulbs, fuses, low-value redundant non-critical components

The P-F Interval — Governing Concept for All PdM

The P-F interval is the time between when a potential failure (P) can first be detected and when functional failure (F) occurs. It governs how PdM programs are designed:

Monitoring frequency must be less than half the P-F interval
Response time after detection must also fit within the remaining P-F interval
Technologies with longer P-F intervals allow less frequent monitoring

🏭 TPM — All 8 Pillars

▾

Autonomous Maintenance — Operators perform basic care (clean, inspect, lubricate, tighten). 7-step progression from initial clean/inspect to full autonomous management.
Focused Improvement (Kobetsu Kaizen) — Cross-functional teams eliminate losses using structured problem-solving. Targets the 16 major losses.
Planned Maintenance — Maintenance department's shift from reactive to planned, condition-based. Supports Autonomous Maintenance by handling complex tasks.
Quality Maintenance — Zero defects through perfect equipment condition. Identifies equipment conditions that cause quality defects; sets and maintains zero-defect conditions.
Early Equipment Management — Apply reliability knowledge during design and commissioning. Most cost-effective time to address reliability is before equipment is purchased.
Training & Education — Multi-skilling for operators; maintenance skills upgrade; leadership development.
Safety, Health & Environment — Zero accidents, zero health issues, zero environmental incidents. Integrated into every TPM activity.
Office TPM — Applying TPM principles to administrative functions: eliminate waste in ordering, scheduling, and information flow.

OEE & The 6 Big Losses

OEE Component	Loss	Definition
Availability	Breakdowns	Unplanned equipment failures causing production stoppage
Availability	Setup & Adjustment	Time lost during changeovers, startups, adjustments
Performance	Minor Stoppages / Idling	Brief stops not recorded as breakdowns
Performance	Reduced Speed	Equipment running below designed capacity
Quality	Process Defects / Scrap	Products not meeting specification during steady-state
Quality	Startup / Yield Losses	Defects produced during startup before stable conditions

Pillar 4 of 5 · People

Skills & Training

Technical programs fail when the workforce lacks the skills to execute correctly. Covers competency frameworks, PdM certification levels, training delivery, and knowledge management.

🎓 Competency-Based Training & PdM Certification Levels

▾

Competency-based training focuses on demonstrated ability — not attendance. The test is not "did they take the class?" but "can they perform the task correctly in the field?"

PdM Technology Certification Levels

Level	Capabilities
Category I	Data collection, route-based monitoring, basic anomaly identification, escalation to higher level
Category II	Analysis and diagnosis, report generation, maintenance task recommendations
Category III	Advanced analysis, program design, calibration, training of lower levels
Category IV	Expert-level, method development, technical authority, standards development

Knowledge Management

When experienced technicians retire, they take institutional knowledge with them unless it's systematically captured. Strategies: document job plans with lessons learned (Palmer Principle 5), require CMMS failure code notes, formal mentoring programs, accessible technical libraries with OEM manuals and RCA reports.

Pillar 5 of 5 · Engineering

Reliability Engineering

The analytical engine of the reliability program. Converts failure data into optimized maintenance strategies and failure prevention. Covers RCM, FMEA, FRACAS, and failure statistics.

📐 RCM — The 7 Questions (SAE JA1011)

▾

Functions & Performance Standards — What is the asset supposed to do, and how well? A pump circulating 300 GPM when 500 GPM is required is in functional failure even if it's running.
Functional Failures — In what ways can it fail to fulfill its functions? Not failure modes yet — the inability to meet the performance standard.
Failure Modes — What causes each functional failure? The specific physical event. "Impeller erosion reduces flow below 500 GPM" is a failure mode.
Failure Effects — What happens when each failure mode occurs? Factual description only — evidence the failure is occurring, physical damage, safety/environmental/production impact.
Failure Consequences — Why does each failure matter? RCM hierarchy: Safety/Environmental → Hidden → Operational → Non-operational. Consequence classification drives task selection.
Proactive Tasks — What should be done to predict or prevent each failure? Task selection hierarchy: On-condition → Scheduled restoration → Scheduled discard → Failure-finding. Task must be technically feasible AND worth doing.
Default Actions — What if no suitable proactive task exists? Options: RTF (if consequences acceptable), redesign, accept and document the risk.

✎

RCM vs. FMEA distinction: FMEA is a bottom-up analysis identifying failure modes and effects. RCM is a top-down decision process that uses FMEA as input to select maintenance strategies. RCM answers "what should we do?" — FMEA answers "what can fail and what happens?"

🔄 FRACAS — Closed-Loop Failure System

▾

Failure Reporting, Analysis & Corrective Action System. Without FRACAS, failures are repaired but not learned from. The cycle:

Report — Every failure documented in CMMS with failure code, downtime, and technician observations
Analyze — Bad actors escalated for formal RCA; all failures coded for trending
Corrective Action — Root causes addressed with specific tracked actions (not just repairs)
Verify — Corrective actions confirmed effective; recurrence rate tracked
Feed Back — Lessons update job plans, PM program, spare parts strategy, and engineering standards

Domain 1 · 15% of Exam · ~16–17 Questions

Business & Management

Financial justification, KPIs, benchmarking, ISO 55000, and strategic alignment of the maintenance program to organizational objectives.

OEE · MTBF · MTTRMC/RAV · COURNPV · LCC · ROIISO 55000

💰 Financial Justification Tools

▾

OEE

OEE = Availability × Performance × Quality

World-class ≥ 85% · Typical industry average 50–65%

Availability (Asset)

A = MTBF / (MTBF + MTTR)

MC/RAV

MC/RAV (%) = (Annual Maint Cost / Replacement Asset Value) × 100

World-class 2–3% · Reactive 8–15%+

Life Cycle Cost

LCC = Acquisition + Installation + Training + Operating + Maint + Downtime + Disposal

Cost of Unreliability (COUR) — 6 Categories

Category	Components
Production losses	Lost throughput × margin per unit; missed customer orders, late fees
Emergency maint premium	Overtime labor, expedited freight, emergency contractor rates (3–10× planned cost)
Quality defects	Scrap, rework, customer returns — equipment in degraded condition causes quality problems before outright failure
Safety incidents	Medical costs, investigation, regulatory fines, legal liability
Environmental incidents	Remediation, fines, permit violations
Customer/market impact	Lost contracts, relationship damage — often the largest long-term cost

📈 KPIs — Leading vs. Lagging

▾

💡

Leading indicators predict future performance (PM compliance predicts future MTBF). Lagging indicators report past performance (MTBF reports historical reliability). A balanced scorecard uses both — lagging to confirm outcomes, leading to intervene before outcomes deteriorate.

KPI	Formula	Target	Type
PM Compliance	(PMs done on time / PMs scheduled) × 100	≥90%	Leading
Schedule Compliance	(Sched hrs completed / Sched hrs) × 100	≥90%	Leading
Wrench Time	Hands-on time / Total available time	55–65%	Leading
Planned Work %	Planned hrs / Total work hrs	>85%	Leading
Emergency Work %	Emergency hrs / Total hrs	<10%	Lagging
MTBF	Total uptime / # failures	Trending up	Lagging
MTTR	Total repair time / # repairs	Trending down	Lagging
MC/RAV	(Annual maint cost / RAV) × 100	2–3%	Lagging

Domain 2 · 15% of Exam · ~16–17 Questions

Manufacturing Process Reliability

System reliability, failure theory, reliability statistics, FMEA, RCM, fault trees, and the 6 failure patterns from Nowlan & Heap.

📉 The 6 Failure Patterns (Nowlan & Heap 1978)

▾

⚠️

Critical insight: 68% of industrial equipment follows Pattern F — infant mortality dominant. This means most failures are caused by installation errors and quality defects, NOT age-related wear. Time-based PM is therefore ineffective for the majority of equipment.

Pattern	Failure Rate	% of Equipment	Best Strategy
A — Bathtub	High → constant → increasing	4%	CBM for useful life; time-based for wear-out
B — Wear-out	Steadily increasing	2%	Time-based scheduled replacement
C — Gradual degradation	Slowly increasing	5%	CBM or wide-interval time-based
D — Late increase	Low then increasing	7%	Condition-based monitoring
E — Random	Constant throughout life	14%	CBM; RTF if consequence allows
F — Infant mortality	High decreasing → low constant	68%	Improve installation quality; CBM

🔗 Reliability Mathematics

▾

Failure Rate

λ = Number of Failures / Total Operating Time = 1 / MTBF

Reliability Function (Exponential / Constant Failure Rate)

R(t) = e^(−λt) = e^(−t/MTBF)

⚠️ At t=MTBF: R = e^(−1) = 0.368 (36.8% survive — NOT 50%)

Series System

R_s = R₁ × R₂ × R₃ × … × Rₙ

Any component failure = system failure. Always less than lowest individual reliability.

Parallel System (2 components)

R_p = 1 − (1 − R₁)(1 − R₂)

All components must fail for system failure. Always greater than highest individual reliability.

FMEA Risk Priority Number

RPN = Severity × Occurrence × Detectability (each 1–10, max = 1,000)

Always address Severity = 9–10 regardless of RPN. High severity + low probability = low RPN but catastrophic if it occurs.

Domain 3 · 45% of Exam · Part A

Vibration Analysis

The most tested PdM technology on the exam. Know fault frequencies, measurement parameter selection, bearing fault signatures, ISO severity zones, and the distinction between spectrum analysis and high-frequency techniques.

📳 FFT Fault Frequency Reference

▾

Frequency	Primary Cause	Direction
1× RPM	Rotor unbalance (dominant radial); bent shaft; also misalignment	Radial (H & V)
2× RPM	Angular misalignment (especially axial); mechanical looseness; cracked shaft	Radial + Axial
Sub-harmonic (0.5×, 0.33×)	Fluid instability — oil whirl/whip; internal rub	Radial
BPFO	Outer race bearing defect; non-synchronous	Radial (load zone)
BPFI	Inner race defect; ±1× sidebands (inner race rotates)	Radial
BSF	Ball/roller defect; sub-harmonic	Radial
FTF	Cage defect; very low freq (0.35–0.48 × RPM)	Radial
Gear Mesh Freq (GMF)	# teeth × RPM; sidebands = wear/eccentricity	Radial
2× Line Freq (120 Hz)	Electrical fault in motor (USA); stator eccentricity	Radial

Measurement Parameter Selection

Displacement (mils p-p): Low frequency (<10 Hz) — measures how far the shaft moves
Velocity (in/sec or mm/sec RMS): Mid frequency (10–1000 Hz) — ISO 10816 uses velocity for machine severity
Acceleration (g): High frequency (>1000 Hz) — bearing damage detection via Spike Energy, kurtosis

✎

ISO 10816 zones: A (new acceptable) → B (long-term acceptable) → C (plan maintenance) → D (immediate action). Boundaries depend on machine class. The exam tests understanding of the zone system, not specific numbers.

Domain 3 · 45% of Exam · Part B

Infrared Thermography

Thermography measures surface temperature via infrared radiation. Critical concept: emissivity. Applications span electrical, mechanical, process, and refractory systems.

🌡️ Physics, Emissivity & Applications

▾

Emissivity is the ratio of radiation emitted by a surface vs. a perfect blackbody (ε = 1.0). Shiny metals have very low emissivity (0.03–0.15) — they primarily reflect surrounding temperatures rather than emitting their own. This causes false readings unless corrected. Apply matte paint or tape, or use emissivity-corrected settings.

Applications by Category

Category	What Thermography Finds
Electrical	Loose connections (resistance heating: P = I²R), overloaded circuits, phase imbalance, failing switches/breakers, transformer problems
Mechanical	Bearing overheating, motor winding hotspots through vent slots, misaligned couplings, seized conveyor rollers
Process / Refractory	Furnace refractory degradation (hot spots on shell), heat exchanger fouling (cold spots), failed-open steam traps (hotter than cycling traps)
Building / Insulation	Moisture infiltration, insulation gaps, roof anomalies

Domain 3 · 45% of Exam · Part C

Oil Analysis

Three test categories: wear debris analysis, oil condition testing, and contamination analysis. Understand what each detects and when each is used.

🛢️ Three Test Categories — Complete Coverage

▾

1. Wear Debris Analysis

ICP Spectrometry: Measures dissolved metals ≤8–10 µm. Key metals: Fe (steel wear), Cu (bearings/bushings), Pb (babbitt), Cr (ring/liner), Al (piston), Si (dirt contamination)
Particle Count (ISO 4406): Counts particles by size; measures total contamination load
Analytical Ferrography: Most detailed method. Magnetically deposits particles on slide for microscopic examination — identifies particle type, shape, size, and failure mode

2. Oil Condition Testing

Viscosity: Most critical property. ±15% of new oil = warning; ±25% = change required
TAN (Total Acid Number): Rising TAN = oil oxidation/degradation
TBN (Total Base Number): Alkaline reserve in engine oils. When TBN drops to 50% of new value, change required
RPVOT: Measures remaining antioxidant life in turbine/hydraulic oils

3. Contamination Analysis

Water (Karl Fischer): Even 100 ppm can reduce bearing life 50%. Alert: >500 ppm for most hydraulics
Fuel dilution (FTIR): Thins viscosity in engine oils
Glycol/coolant: Causes sludge and severe oxidation; detected by glycol test or elevated potassium
High Si without elevated Al: Soil/dirt ingestion (breather failure, seal failure)

Domain 3 · 45% of Exam · Part D

Lubrication Management

40–50% of bearing failures trace to lubrication problems. Viscosity, NLGI grades, ISO 4406 cleanliness codes, contamination control, and filtration beta ratios.

🔬 Viscosity, NLGI Grades & ISO 4406

▾

Viscosity is the single most important lubricant property. The lubricant must maintain a hydrodynamic film separating metal surfaces under all operating conditions. Too low → film collapse, metal contact. Too high → excessive heat from shear and churning.

ISO VG Grades

The ISO VG number = nominal kinematic viscosity in cSt at 40°C, ±10%. Common grades: 32, 46, 68, 100, 150, 220, 320, 460, 680. Each grade is approximately double the previous in viscosity.

NLGI Grease Grades

NLGI	Consistency	Applications
000–00	Semi-fluid	Enclosed gearboxes, centralized systems
0–1	Very soft	Centralized systems, cold climate
2	Medium (peanut butter)	~80% of bearing applications — most common
3	Firm	High-speed bearings, vertical orientation
4–6	Hard to block	Open gears, wire ropes, extreme pressure

ISO 4406 Cleanliness Code

Format: X/Y/Z — particle count range codes at ≥4µm, ≥6µm, ≥14µm per mL. Each number increment doubles the particle count. Typical targets: servo hydraulics 15/13/10; gearboxes 17/15/12.

Filtration Beta Ratio (βₓ): Upstream count ÷ downstream count at particle size x. β₁₀ = 200 means 99.5% efficiency at 10µm.

Domain 3 · 45% of Exam · Part E

Precision Alignment & Balancing

🎯 Shaft Alignment — Types, Methods, Soft Foot

▾

Misalignment is the leading cause of premature bearing and seal failure in rotating equipment.

Types

Parallel (offset): Shaft centerlines parallel but offset. Dominant 2× RPM radial vibration.
Angular: Centerlines meet at an angle. Strong 1× and 2× RPM axial vibration.
Combined: Both — most real-world cases.

Soft Foot — Correct BEFORE Alignment Begins

Parallel: One foot shorter → solid shims
Angular: Canted foot → tapered shims
Induced: Pipe/conduit strain on frame → fix the piping, not the shimming
Squishy: Corrosion or debris under foot → clean and reshim

Alignment Tolerances

Speed (RPM)	Offset (mils)	Angularity (mils/inch)
<1,800	±3.0	±0.7
1,800–3,600	±2.0	±0.5
>3,600	±1.0	±0.3

ISO 1940 Balance Grades

G Grade	Application
G 0.4	Gyroscopes, precision spindles
G 2.5	Gas/steam turbines, centrifuges
G 6.3	Industrial fans, pumps, motors — most common
G 16	Agricultural machinery, large crankshafts
G 40	Car/truck wheels

Domain 3 · 45% of Exam · Part F

Root Cause Analysis

🔍 Three Root Cause Levels

▾

Level	Definition	Example
Physical	The material or mechanical event that caused the failure	Rolling element bearing failed from subsurface fatigue spalling
Human	The human decision or omission that enabled the physical cause	Bearing over-greased — excess pressure forced grease past seals
Latent/Systemic	The organizational weakness that enabled the human cause	No written greasing procedure exists; no quantity/interval specification; no competency verification for lubrication tasks

RCA Methods — Selection Guide

5-Why: Simple linear failure chains; quick team facilitation. Limitation: can reach different root causes depending on which "why" path is followed.
Fishbone/Ishikawa: Brainstorming with the 6Ms (Man, Machine, Method, Material, Measurement, Mother Nature). Good for ensuring no cause category is overlooked.
Fault Tree Analysis (FTA): Top-down, AND/OR gate logic. Best for complex multi-cause failures, safety events, and when probability quantification is needed.
Apollo RCA: Cause-and-effect charting with evidence requirements at each causal link. Defensible and auditable.
TapRooT®: Structured investigation with specific root cause categories. Most rigorous — used for serious safety incidents.

Domain 4 · 10% of Exam · ~11 Questions

Organization & Leadership

Culture, TPM, team development, change management. Questions are typically situational — choose the approach that builds commitment, not compliance.

👥 TPM, OEE & Organizational Design

▾

See Pillar 3 for full TPM 8-pillar detail. Domain 4 exam focus areas:

TPM requires operator ownership — not just maintenance executing more PMs. The transfer of basic care is essential.
Autonomous Maintenance 7-step progression must be understood sequentially
OEE is the scorecard for TPM effectiveness: ≥85% world-class
The Maintenance-Operations interface is the most critical organizational relationship for reliability
Kotter's 8 steps (see Pillar 1) are the primary change management framework tested

Domain 5 · 15% of Exam · ~16–17 Questions

Work Management

Planning, scheduling, CMMS, MRO/stores, contractor management, shutdown planning. See Pillar 2 for full Palmer's Principles coverage.

📦 MRO Storeroom — Key Formulas & KPIs

▾

Reorder Point

ROP = (Average Daily Usage × Lead Time in Days) + Safety Stock

Economic Order Quantity (EOQ)

EOQ = √(2DS / H)

D = annual demand · S = ordering cost per order · H = annual holding cost per unit

Inventory Turns

Turns = Annual Issues ($ value) / Average Inventory Value ($)

World-Class Storeroom KPIs

Fill rate: ≥95% requests filled from stock on first request
Inventory accuracy: ≥98% physical count matches system at line-item level
Stockout rate: <5% of requests result in emergency procurement
ABC classification: A items (top 10–20% of SKUs, 70–80% of value) get tight controls; C items use simple min/max

Advanced Topic · Domains 2 & 5 · Pillar 5

Weibull Analysis

The primary statistical tool for analyzing failure time data. The exam tests understanding of β (shape parameter) and η (characteristic life) — and what each tells you about maintenance strategy.

📊 β Shape Parameter — Maintenance Strategy Implications

▾

Weibull Reliability Function

R(t) = e^[−(t/η)^β]

β = shape parameter · η = characteristic life (time at which 63.2% have failed) · t = time

β Value	Failure Pattern	Failure Rate	Maintenance Strategy
β < 1	Infant mortality	Decreasing	Improve installation quality & commissioning. Time-based PM makes it WORSE.
β = 1	Random (exponential)	Constant	CBM or RTF. Time-based replacement does NOT reduce failure frequency.
1 < β < 3	Early wear-out onset	Slowly increasing	CBM preferred; time-based at wide interval acceptable
β ≈ 3.44	Normal distribution approx	Symmetric around mean	Time-based PM; set at 70–80% of η
β > 3	Wear-out	Rapidly increasing	Time-based PM IS effective. Set interval at 70–80% of η (characteristic life).

⚠️

η (eta) trap: η is NOT the MTBF. η is the time at which exactly 63.2% of the population has failed — regardless of β. MTBF can be calculated from β and η but they are not the same value.

Advanced Topic · Domain 3 · Equipment Reliability

Non-Destructive Testing (NDT)

Know which NDT method is appropriate for which application. Key decision variables: surface vs. subsurface, ferromagnetic vs. non-ferromagnetic, radiation constraints, speed requirements.

🔬 All NDT Methods — Comparison Table

▾

Method	How It Works	Best For	Key Limitation
VT (Visual)	Direct or aided visual examination	Surface defects; first step in any inspection	Surface only; requires access
PT (Liquid Penetrant)	Dye drawn into surface-breaking cracks by capillary action	Surface-breaking defects in any non-porous material	Surface-breaking only; rough surfaces reduce sensitivity
MT (Magnetic Particle)	Magnetic flux leakage at defects attracts particles	Surface & near-surface defects	Ferromagnetic materials only (carbon/alloy steel)
UT (Ultrasonic)	Sound waves reflect from internal defects	Internal defects, wall thickness measurement	Requires couplant; operator skill critical
PAUT	Multi-element UT with electronic steering; produces cross-section image	Complex geometry welds, code-compliant inspection	Higher cost; specialized operator
RT (Radiography)	X-ray/gamma ray differential absorption	Internal defects; permanent record; complex geometry	Radiation safety required; two-sided access needed
ET (Eddy Current)	Induced currents; defects disturb current flow	Heat exchanger tube inspection (fast, no contact); surface cracks	Conductive materials only; limited depth
AE (Acoustic Emission)	Passive — detects stress waves emitted by active defect growth	Monitoring pressure vessels during pressurization; active cracking	Noise discrimination; multi-sensor required for location

✎

NDT selection logic: (1) Surface or subsurface? → PT/MT for surface; UT/RT for subsurface. (2) Ferromagnetic? → If no (stainless, aluminum), eliminate MT. (3) Thickness measurement? → UT. (4) Radiation constraint? → UT instead of RT. (5) Fast heat exchanger tube inspection? → Eddy Current.

Advanced Topic · Domain 3 · Equipment Reliability

Motor Failure Modes

⚡ AC Motor Failure Modes — Complete Table

▾

Failure Mode	Root Cause	Detection	Prevention
Bearing failure	Over/under lubrication, contamination, VFD fluting, misalignment	Vibration, ultrasound, temperature	Lube program, ultrasound-guided greasing, shaft grounding rings
Stator winding failure	Thermal cycling, voltage spikes, partial discharge, over-temperature	Surge test, partial discharge, Megger/PI trending	Thermal monitoring, VFD voltage spike filters
Rotor bar failure	Thermal cycling fatigue, casting defects, overloading	MCSA (sidebands at 1 ± 2s × line frequency)	Avoid frequent starts; VFD soft-starting
VFD bearing fluting	High-freq common-mode voltage → capacitive coupling → discharge current → raceway craters	Visual (washboard pattern), vibration, MCSA	Shaft grounding rings; insulated bearings on NDE
Overheating	Overloading, blocked ventilation, high ambient, frequent starts, voltage unbalance	Thermal monitoring, current monitoring, thermography	Proper motor sizing, clear ventilation, voltage balancing

Motor Insulation Testing

Test	What It Measures	Pass Criteria
Megger (IR Test)	Winding-to-ground insulation resistance	Varies by voltage class; trending over time is more valuable than a single reading
Polarization Index (PI)	IR(10 min) / IR(1 min)	>2.0 = good; <1.0 = contaminated/damaged
Surge/Impulse Test	Turn-to-turn insulation	Detects inter-turn shorts other tests miss
Hi-Pot	Dielectric withstand (proof test)	Used for acceptance testing; destructive if overused

Advanced Topic · Domain 5 · Work Management

Shutdown & Turnaround Management

🏭 Turnaround Planning, CPM & Scope Control

▾

Planning Timeline

Phase	Timeline	Key Activities
Scope Development	12–18 months prior	Inspection list, regulatory requirements, deferred maintenance, process improvements. Freeze scope 3 months before execution.
Engineering & Procurement	6–12 months	Detailed work packages, long-lead materials, contractor bids
Detailed Planning	3–6 months	Job plans, resource leveling, critical path, permit coordination
Execution	Shutdown period	Daily progress meetings, critical path monitoring, MOC for scope additions
Closeout	Post-shutdown	Cost vs. budget, schedule vs. plan, lessons learned

Critical Path Method (CPM)

Critical path: Longest chain of dependent activities — determines minimum turnaround duration
Float: Time an activity can be delayed without affecting the end date. Critical path activities have zero float.
Total Float = Late Finish − Early Start − Duration

⚠️

Scope creep flag: >10% scope addition after freeze = poor initial scope development or ineffective freeze enforcement. Leading cause of turnaround overruns. All post-freeze additions require formal MOC with documented cost and schedule impact.

Working Calculators

Reliability Engineering
Calculators

Six calculators covering the most commonly used reliability and maintenance formulas. Every result is benchmarked against SMRP world-class targets.

⏱️

MTBF / MTTR / Availability

Calculate Mean Time Between Failures, Mean Time to Repair, and asset availability from raw operating data.

Total Uptime (hrs)

Total Downtime (hrs)

Number of Failures

—

MTBF (hrs)

—

MTTR (hrs)

—

Availability

🏭

OEE Breakdown

Calculate Overall Equipment Effectiveness and identify which of the 6 Big Losses is your primary constraint.

Planned Production Time (hrs)

Downtime Losses (hrs)

Ideal Cycle Time (sec/unit)

Total Units Produced

Reject Units

—

Availability

—

Performance

—

Quality

—

OEE

📈

Reliability R(t) Calculator

Calculate the probability of survival to time t for an asset with a known MTBF (constant failure rate / exponential distribution).

MTBF (hours)

Mission Time t (hours)

—

R(t) — Survival Prob

—

F(t) — Failure Prob

—

λ (failures/1000hr)

🔗

Series & Parallel Reliability

Calculate system reliability for components configured in series or parallel. Enter up to 5 component reliabilities.

Configuration

R₁ (0–1)

R₂ (0–1)

R₃ (0–1 or blank)

R₄ (0–1 or blank)

R₅ (0–1 or blank)

—

System Reliability

—

vs. Lowest Component

📊

Weibull β Interpreter

Enter your Weibull β (shape) and η (characteristic life) parameters — get the failure pattern, recommended maintenance strategy, and PM interval if applicable.

β (Shape Parameter)

η (Characteristic Life, hrs)

—

Failure Pattern

📦

Storeroom: EOQ & Reorder Point

Calculate the Economic Order Quantity and Reorder Point for a spare part. Optimize between ordering frequency and carrying costs.

Annual Demand (units/yr)

Order Cost ($ per order)

Unit Cost ($)

Holding Cost (% of value/yr)

Lead Time (days)

Safety Stock (units)

—

EOQ (units/order)

—

Reorder Point (units)

—

Orders per Year

—

Total Annual Cost

Reference

Reliability Glossary

🔍

CMRPstudy · Practice Tools

Flashcards &
Practice Exam

200 flashcards and 100 exam-style questions covering all 5 SMRP domains and 5 Pillars of Reliability. Track your score, identify weak domains, and study to world-class benchmarks.

200

Flashcards

100

Practice Questions

SMRP Domains

~520

Passing Score Target

📇

Flashcard Deck

200 cards spanning all 5 domains, 5 Pillars, Weibull analysis, NDT, motor failure modes, lubrication, alignment, and work management. Filter by category, mark known/unknown, track your progress.

200 cards 8 categories Spaced repetition

Open Flashcards →

📝

Practice Exam

100 multiple-choice questions with full explanations. Choose all domains or focus on one. Each answer reveals a detailed rationale explaining why it's correct and why the wrong answers are wrong.

100 questions Full explanations Domain breakdown

Start Practice Exam →

Domain Weight Guide — Study Priority

200 Cards · All Domains

Flashcard Deck

Click any card to flip. Mark Known or Unknown to track mastery. Filter by category to focus your study session.

200 cards

Tap to reveal definition

Definition

Click card to flip · Use arrow keys to navigate

1 / 200

Known

Study More

200

Remaining

Mastery

100 Questions · Full Explanations

Practice Exam

Exam-style questions with detailed rationale. Select domains to focus or run all 100 for a full mock exam. Aim for 85%+ before sitting the real exam.

Select domains to include:

100 questions selected

CMRPstudy

Get in Touch

Found an error? Have a recommendation? Want to suggest a practice question, a flashcard, or a missing topic? Every message goes directly to Chad.

⚙

Chad Zahner — CMRPstudy

Reads every message personally

CMRPstudy

Hey — thanks for using the site. Found an error, have a recommendation, want to suggest a question or flashcard, or anything else? Drop it here.

Messages go to cpzahner@gmail.com · Responses within 48 hours

Free CMRP Certification Prep.

Everything you needto pass the CMRP exam.

World-class targetsyou need to memorize.

Built by someonewho does this work.

The Most Common Failure Pattern

Common Leadership Failures in Reliability

Principle 1 — Planners Form a Separate Department

Principle 2 — Focus on Future Work

Principle 3 — Component-Level Files (Technical Library)

Principle 4 — Estimates Based on Craft Expertise

Principle 5 — Recognize the Craft (Feedback Loop)

Principle 6 — Measure Performance with Wrench Time

Why 100% Schedule Loading?

The P-F Interval — Governing Concept for All PdM

OEE & The 6 Big Losses

PdM Technology Certification Levels

Knowledge Management

Cost of Unreliability (COUR) — 6 Categories

Measurement Parameter Selection

Applications by Category

1. Wear Debris Analysis

2. Oil Condition Testing

3. Contamination Analysis

ISO VG Grades

NLGI Grease Grades

ISO 4406 Cleanliness Code

Types

Soft Foot — Correct BEFORE Alignment Begins

Alignment Tolerances

ISO 1940 Balance Grades

RCA Methods — Selection Guide

World-Class Storeroom KPIs

Motor Insulation Testing

Planning Timeline

Critical Path Method (CPM)

Reliability EngineeringCalculators

Reliability Glossary

Flashcards &Practice Exam

Flashcard Deck

Practice Exam

Get in Touch

Free CMRP
Certification
Prep.

Everything you need
to pass the CMRP exam.

World-class targets
you need to memorize.

Built by someone
who does this work.

Reliability Engineering
Calculators

Flashcards &
Practice Exam