Free · Open · Practitioner-Built

Free CMRP
Certification
Prep.

CMRPstudy is a complete CMRP exam prep resource — study guide, working calculators, and glossary. Built for practitioners. Free, always.

5
SMRP Domains
5
Reliability Pillars
6
Working Calculators
100+
Glossary Terms
What's Inside

Everything you need
to pass the CMRP exam.

No fluff. Built from the SMRP Body of Knowledge and real field experience, covering what actually appears on the exam.

📚
Complete Study Guide
All 5 SMRP domains plus the 5 Pillars of Reliability. Deep coverage of Domain 3 (45% of exam) — vibration, lubrication, alignment, NDT, Weibull, RCA, and motor failure modes.
Open Guide →
🧮
Reliability Calculators
MTBF/MTTR/Availability, OEE breakdown, Weibull β interpreter, Series/Parallel reliability, COUR estimator, and Storeroom EOQ — all with instant interpretation against world-class benchmarks.
Open Calculators →
📖
Searchable Glossary
100+ reliability and maintenance terms defined clearly — P-F interval, FMEA RPN, Weibull eta, ISO 4406, NLGI grades, and more. Instant search. No clicking through pages.
Search Terms →
🏛️
5 Pillars Framework
Leadership & Culture, Work Management, Proactive Maintenance, Skills & Training, Reliability Engineering — fully explained with Kotter's 8 steps, Palmer's 6 principles, and TPM's 8 pillars.
Explore Pillars →
📳
Domain 3 Deep Dive
45% of the exam. Vibration fault frequencies, bearing damage detection, thermography physics, oil analysis categories, ultrasound steam trap testing, alignment tolerances, balancing G-grades.
Go to Domain 3 →
📊
Weibull & Statistics
β shape parameter interpretation, η characteristic life, the 6 failure patterns, R(MTBF) = 36.8% trap, series/parallel reliability math, and FTA gate logic — all exam-tested content.
Study Weibull →
SMRP Benchmarks

World-class targets
you need to memorize.

OEE
≥85%
Overall Equipment Effectiveness
Wrench Time
55–65%
Hands-on productive time
Schedule Compliance
≥90%
Scheduled hours completed
PM Compliance
≥90%
PMs completed on time
MC / RAV
2–3%
Maintenance cost vs. asset value
Planned Work
>85%
Planned vs. total work hours
Emergency Work
<10%
Emergency vs. total hours
Ready Backlog
2–4 wks
Craft hours ready to schedule
Storeroom Fill Rate
≥95%
Parts filled on first request
Inventory Accuracy
≥98%
Physical count vs. system
Availability
≥95%
Asset uptime fraction
R(MTBF)
36.8%
Survival at MTBF — e⁻¹, NOT 50%
About This Resource

Built by someone
who does this work.

CMRPstudy was built by a practicing Maintenance Planner & Scheduler actively pursuing CMRP certification. The content reflects real-world application in a manufacturing environment — not just what textbooks say.

Everything here is grounded in the SMRP Body of Knowledge, Nowlan & Heap's failure pattern research, Doc Palmer's planning principles, and years of hands-on reliability improvement work. No advertising. No paid placements. No course upsells.

The goal: give every maintenance professional access to the same quality of study material regardless of whether they can afford a prep course.

CMRP Candidate SMRP Member Manufacturing Industry Practitioner-Written
Exam Fast Facts
Questions110 (≈10 unscored)
Time Limit2.5 hours
FormatComputer-based, Prometric
Scoring200–800 scaled score
Passing Score≈520 (not a raw %)
RecertificationEvery 3 years
Largest DomainD3 Equipment (45%)
Domain Weights
Foundation Framework · SMRP Body of Knowledge
The 5 Pillars of Reliability
The 5 Pillars describe the organizational capabilities that enable world-class reliability. They underpin all five exam domains. Understanding the framework — and the interdependencies between pillars — is essential for answering situational questions correctly.
LeadershipWork ManagementProactive MaintSkills & TrainingReliability Engineering
🔑
Why this matters for the exam: SMRP exam questions frequently describe an organizational scenario and ask you to identify the best approach. Understanding which pillar is being tested — and knowing how pillars interact — lets you eliminate wrong answers quickly. A question about "why the PM program isn't reducing failures" is often a Pillar 2 (Work Management) or Pillar 4 (Skills) question, not a Pillar 3 question.
🔗 How the 5 Pillars Interconnect The interdependency logic

Pillar 1 (Leadership) enables all others. Without leadership commitment and cultural alignment, no reliability initiative survives the first production crisis. Leaders who revert to measuring only output — not reliability performance — create the pressure that collapses every other pillar.

Pillar 2 (Work Management) is the delivery mechanism. The best maintenance strategies (Pillar 3) and most rigorous failure analysis (Pillar 5) produce zero value if the work management system can't deliver them to the field efficiently. An organization with great RCM output and poor scheduling has done expensive analysis for nothing.

Pillar 3 (Proactive Maintenance) generates the operating data. Condition monitoring findings feed Pillar 5 analysis. PM compliance rates feed Pillar 1 KPI dashboards. Without Pillar 3 execution, the other pillars have no field data to act on.

Pillar 4 (Skills) is the human enabler. Technically sophisticated PdM programs fail when the workforce lacks the skills to execute correctly. Misapplied vibration analysis — or an operator who can't recognize abnormal equipment sounds — is worse than no program at all.

Pillar 5 (Reliability Engineering) drives continuous improvement. Engineering analysis converts field experience into strategic decisions: which assets to prioritize, which failure modes to target, how to optimize maintenance strategy over time.

The Most Common Failure Pattern

Organizations implement technical tools (Pillars 3 and 5) without the organizational foundation (Pillar 1) or work management infrastructure (Pillar 2) to sustain them. The tools sit unused because there's no cultural mandate to use them and no scheduling system to deliver the resulting work. This is the single most tested scenario on the CMRP exam.

📊 Reliability Culture Maturity Model
StageCharacteristicsTypical MC/RAV
1 — ReactiveRun-to-failure dominant. Fire-fighting culture. No PM program. Emergency work >50%.8–15%+
2 — PreventivePM program exists but mostly time-based. Low schedule compliance. Planning inconsistent.6–9%
3 — ProactivePdM program active. Planning & scheduling mature. RCA practiced. Leading indicators tracked.3–6%
4 — Predictive/CBMCondition-based decisions dominant. FMEA-driven strategy. Failure modes well understood.2–4%
5 — OptimizedAI-assisted. Real-time condition integration. Continuous improvement loops embedded.<2%
When a question describes an organization's current state, identify the maturity stage. The best answer typically describes the next logical step up — not a leap from Stage 1 to Stage 5.
Pillar 1 of 5 · Foundation
Leadership & Culture
The most critical and most neglected pillar. Technical tools fail when leadership culture doesn't support them. Covers reliability vision, accountability systems, change management, and organizational maturity.
🔄 Kotter's 8-Step Change Model Most referenced change framework on the exam
  1. Create Urgency — Show why change is necessary NOW. Use data: downtime losses, MC/RAV vs. competitors, COUR. The "burning platform." Without urgency, change initiatives stall immediately.
  2. Build a Guiding Coalition — Form a team of influential leaders combining formal authority AND credibility. A coalition of only managers without craft respect will fail. One with only craft without management authority will be ignored.
  3. Form a Strategic Vision — Clear, compelling, simple. People need to see where they're going. "Reduce unplanned downtime 40% in 18 months" is a vision. "Be excellent" is not.
  4. Communicate the Vision — Relentlessly. Every meeting, every shift change, every KPI review. Leaders must model the behaviors they're asking for — walking the floor, asking about reliability metrics, attending P&S meetings.
  5. Remove Obstacles — Identify and eliminate organizational barriers: systems, structures, and people who actively block the change. A supervisor who constantly pulls planned work for reactive emergencies is an obstacle.
  6. Generate Short-Term Wins — Plan and create visible wins early. Quick wins (first 90 days) build credibility, silence skeptics, and maintain momentum. Pick a bad actor asset with a known solution and fix it visibly.
  7. Sustain Acceleration — Use early wins to drive deeper change. Don't declare victory prematurely. Organizations that declare victory after 6 months almost always regress.
  8. Anchor in Culture — Embed new behaviors in hiring criteria, training programs, performance reviews, and recognition systems. Culture is the LAST thing to change, not the first.
📋 Reliability Accountability Systems

Accountability requires: visible KPIs (posted, reviewed weekly), individual ownership (each metric has a named owner), consequence (performance affects something — recognition, discussion, planning resources), and root cause response (when KPIs miss target, the response is to analyze why, not blame).

Common Leadership Failures in Reliability

  • Measuring only production output — reliability metrics are absent or ignored
  • Verbal commitment to proactive maintenance while resource decisions consistently favor production over PM windows
  • Celebrating "heroes" who fix breakdowns overnight — creating perverse incentive for reactive culture
  • Launching reliability initiatives without addressing the supervisory behaviors that drive reactive work
Pillar 2 of 5 · Process
Work Management
The operational engine of the maintenance organization. Work management converts reliability strategy into executed, documented field work. Palmer's 6 Principles are the most heavily tested work management content on the exam.
📋 Palmer's 6 Planning Principles All 6, fully explained

Principle 1 — Planners Form a Separate Department

Planners must be organizationally separate from craft supervision. A planner who also supervises technicians will always prioritize the urgent (today's work) over the important (next week's preparation). The planner works FOR the supervisors by preparing work packages — not alongside them in the field.

Principle 2 — Focus on Future Work

The planner's time horizon is work not yet started. Once a job enters execution, it belongs to the supervisor. Planners diverted to manage active jobs degrade planning quality for future work — the cost compounds every week.

Principle 3 — Component-Level Files (Technical Library)

For each significant asset, maintain a file containing: historical job plans, parts lists with vendor info, OEM manuals, special tools required, isolation/clearance procedures, and lessons learned from past repairs. This file transforms individual technician experience into organizational knowledge available to everyone.

Principle 4 — Estimates Based on Craft Expertise

Estimates should reflect what a skilled craftsperson can accomplish — not the average performer, not padded for worst-case. Accurate estimates enable accurate schedule loading and meaningful schedule compliance measurement.

Principle 5 — Recognize the Craft (Feedback Loop)

Craft technicians are the source of field knowledge that improves job plans. A feedback loop must exist: technicians complete job feedback noting actual time, parts deviations, problems, and improvements. The planner incorporates this into the job plan file — individual experience becomes organizational memory.

Principle 6 — Measure Performance with Wrench Time

Wrench time (hands-on productive time as % of available work time) is the fundamental measure of planning and scheduling effectiveness. Improving from the industry average of 25–35% to world-class 55–65% roughly doubles craft productivity without adding headcount.

💡
What destroys wrench time: Traveling to get unstaged parts, waiting for equipment clearance, missing job information, hunting for tools, rework from wrong parts, waiting for permits. Planning and scheduling eliminate every one of these delays.
📅 The Weekly Work Cycle & Schedule Loading
TimingActivityOwner
Mon–ThuPlanner prepares next week's work packages; acquires parts; confirms equipment availability with operationsPlanner
ThursdayWeekly coordination meeting: planner, scheduler, supervisor, operations representative review following week's planAll stakeholders
Friday by noonSchedule published; work packages distributed to supervisorsScheduler/Planner
DailySupervisor assigns work, manages execution, updates system with completions and deviationsSupervisor
End of weekSchedule compliance calculated, posted, and debriefedAll

Why 100% Schedule Loading?

Palmer's principle is to load the schedule to 100% of available craft hours. The intuitive approach — loading 85% to "leave room for emergencies" — is wrong. The scheduler's job is to plan the best use of available time. Emergency work is the supervisor's problem to manage in real time. Pre-compensating at 85% trains the organization to expect 85% execution and guarantees the remaining 15% is wasted.

Pillar 3 of 5 · Technical
Proactive Maintenance
Covers all forms of proactive maintenance — time-based PM, condition-based PdM, TPM, and operator-driven reliability. The goal: address potential failures before they become functional failures.
⚙️ PM Task Types & Selection Logic Which task for which failure mode
Task TypeUse WhenExamples
Time-based (scheduled)Known age-related deterioration; component reliability decreases predictably with time/use (Weibull β > 3)Oil changes, filter replacements, belt changes, seal replacements
On-condition (CBM/PdM)Detectable P-F interval exists; condition monitoring can find the failure before it occursVibration routes, thermography surveys, oil sampling, ultrasound routes
Failure-findingHidden function — failure is not self-announcing and only matters when the protected function is demandedTesting relief valve setpoints, exercising emergency shutdown valves, testing standby pump auto-starts
Run-to-failure (RTF)Failure consequence is acceptable AND no cost-effective proactive task existsLight bulbs, fuses, low-value redundant non-critical components

The P-F Interval — Governing Concept for All PdM

The P-F interval is the time between when a potential failure (P) can first be detected and when functional failure (F) occurs. It governs how PdM programs are designed:

  • Monitoring frequency must be less than half the P-F interval
  • Response time after detection must also fit within the remaining P-F interval
  • Technologies with longer P-F intervals allow less frequent monitoring
🏭 TPM — All 8 Pillars
  1. Autonomous Maintenance — Operators perform basic care (clean, inspect, lubricate, tighten). 7-step progression from initial clean/inspect to full autonomous management.
  2. Focused Improvement (Kobetsu Kaizen) — Cross-functional teams eliminate losses using structured problem-solving. Targets the 16 major losses.
  3. Planned Maintenance — Maintenance department's shift from reactive to planned, condition-based. Supports Autonomous Maintenance by handling complex tasks.
  4. Quality Maintenance — Zero defects through perfect equipment condition. Identifies equipment conditions that cause quality defects; sets and maintains zero-defect conditions.
  5. Early Equipment Management — Apply reliability knowledge during design and commissioning. Most cost-effective time to address reliability is before equipment is purchased.
  6. Training & Education — Multi-skilling for operators; maintenance skills upgrade; leadership development.
  7. Safety, Health & Environment — Zero accidents, zero health issues, zero environmental incidents. Integrated into every TPM activity.
  8. Office TPM — Applying TPM principles to administrative functions: eliminate waste in ordering, scheduling, and information flow.

OEE & The 6 Big Losses

OEE ComponentLossDefinition
AvailabilityBreakdownsUnplanned equipment failures causing production stoppage
Setup & AdjustmentTime lost during changeovers, startups, adjustments
PerformanceMinor Stoppages / IdlingBrief stops not recorded as breakdowns
Reduced SpeedEquipment running below designed capacity
QualityProcess Defects / ScrapProducts not meeting specification during steady-state
Startup / Yield LossesDefects produced during startup before stable conditions
Pillar 4 of 5 · People
Skills & Training
Technical programs fail when the workforce lacks the skills to execute correctly. Covers competency frameworks, PdM certification levels, training delivery, and knowledge management.
🎓 Competency-Based Training & PdM Certification Levels

Competency-based training focuses on demonstrated ability — not attendance. The test is not "did they take the class?" but "can they perform the task correctly in the field?"

PdM Technology Certification Levels

LevelCapabilities
Category IData collection, route-based monitoring, basic anomaly identification, escalation to higher level
Category IIAnalysis and diagnosis, report generation, maintenance task recommendations
Category IIIAdvanced analysis, program design, calibration, training of lower levels
Category IVExpert-level, method development, technical authority, standards development

Knowledge Management

When experienced technicians retire, they take institutional knowledge with them unless it's systematically captured. Strategies: document job plans with lessons learned (Palmer Principle 5), require CMMS failure code notes, formal mentoring programs, accessible technical libraries with OEM manuals and RCA reports.

Pillar 5 of 5 · Engineering
Reliability Engineering
The analytical engine of the reliability program. Converts failure data into optimized maintenance strategies and failure prevention. Covers RCM, FMEA, FRACAS, and failure statistics.
📐 RCM — The 7 Questions (SAE JA1011)
  1. Functions & Performance Standards — What is the asset supposed to do, and how well? A pump circulating 300 GPM when 500 GPM is required is in functional failure even if it's running.
  2. Functional Failures — In what ways can it fail to fulfill its functions? Not failure modes yet — the inability to meet the performance standard.
  3. Failure Modes — What causes each functional failure? The specific physical event. "Impeller erosion reduces flow below 500 GPM" is a failure mode.
  4. Failure Effects — What happens when each failure mode occurs? Factual description only — evidence the failure is occurring, physical damage, safety/environmental/production impact.
  5. Failure Consequences — Why does each failure matter? RCM hierarchy: Safety/Environmental → Hidden → Operational → Non-operational. Consequence classification drives task selection.
  6. Proactive Tasks — What should be done to predict or prevent each failure? Task selection hierarchy: On-condition → Scheduled restoration → Scheduled discard → Failure-finding. Task must be technically feasible AND worth doing.
  7. Default Actions — What if no suitable proactive task exists? Options: RTF (if consequences acceptable), redesign, accept and document the risk.
RCM vs. FMEA distinction: FMEA is a bottom-up analysis identifying failure modes and effects. RCM is a top-down decision process that uses FMEA as input to select maintenance strategies. RCM answers "what should we do?" — FMEA answers "what can fail and what happens?"
🔄 FRACAS — Closed-Loop Failure System

Failure Reporting, Analysis & Corrective Action System. Without FRACAS, failures are repaired but not learned from. The cycle:

  1. Report — Every failure documented in CMMS with failure code, downtime, and technician observations
  2. Analyze — Bad actors escalated for formal RCA; all failures coded for trending
  3. Corrective Action — Root causes addressed with specific tracked actions (not just repairs)
  4. Verify — Corrective actions confirmed effective; recurrence rate tracked
  5. Feed Back — Lessons update job plans, PM program, spare parts strategy, and engineering standards
Domain 1 · 15% of Exam · ~16–17 Questions
Business & Management
Financial justification, KPIs, benchmarking, ISO 55000, and strategic alignment of the maintenance program to organizational objectives.
OEE · MTBF · MTTRMC/RAV · COURNPV · LCC · ROIISO 55000
💰 Financial Justification Tools
OEE
OEE = Availability × Performance × Quality
World-class ≥ 85% · Typical industry average 50–65%
Availability (Asset)
A = MTBF / (MTBF + MTTR)
MC/RAV
MC/RAV (%) = (Annual Maint Cost / Replacement Asset Value) × 100
World-class 2–3% · Reactive 8–15%+
Life Cycle Cost
LCC = Acquisition + Installation + Training + Operating + Maint + Downtime + Disposal

Cost of Unreliability (COUR) — 6 Categories

CategoryComponents
Production lossesLost throughput × margin per unit; missed customer orders, late fees
Emergency maint premiumOvertime labor, expedited freight, emergency contractor rates (3–10× planned cost)
Quality defectsScrap, rework, customer returns — equipment in degraded condition causes quality problems before outright failure
Safety incidentsMedical costs, investigation, regulatory fines, legal liability
Environmental incidentsRemediation, fines, permit violations
Customer/market impactLost contracts, relationship damage — often the largest long-term cost
📈 KPIs — Leading vs. Lagging
💡
Leading indicators predict future performance (PM compliance predicts future MTBF). Lagging indicators report past performance (MTBF reports historical reliability). A balanced scorecard uses both — lagging to confirm outcomes, leading to intervene before outcomes deteriorate.
KPIFormulaTargetType
PM Compliance(PMs done on time / PMs scheduled) × 100≥90%Leading
Schedule Compliance(Sched hrs completed / Sched hrs) × 100≥90%Leading
Wrench TimeHands-on time / Total available time55–65%Leading
Planned Work %Planned hrs / Total work hrs>85%Leading
Emergency Work %Emergency hrs / Total hrs<10%Lagging
MTBFTotal uptime / # failuresTrending upLagging
MTTRTotal repair time / # repairsTrending downLagging
MC/RAV(Annual maint cost / RAV) × 1002–3%Lagging
Domain 2 · 15% of Exam · ~16–17 Questions
Manufacturing Process Reliability
System reliability, failure theory, reliability statistics, FMEA, RCM, fault trees, and the 6 failure patterns from Nowlan & Heap.
📉 The 6 Failure Patterns (Nowlan & Heap 1978)
⚠️
Critical insight: 68% of industrial equipment follows Pattern F — infant mortality dominant. This means most failures are caused by installation errors and quality defects, NOT age-related wear. Time-based PM is therefore ineffective for the majority of equipment.
PatternFailure Rate% of EquipmentBest Strategy
A — BathtubHigh → constant → increasing4%CBM for useful life; time-based for wear-out
B — Wear-outSteadily increasing2%Time-based scheduled replacement
C — Gradual degradationSlowly increasing5%CBM or wide-interval time-based
D — Late increaseLow then increasing7%Condition-based monitoring
E — RandomConstant throughout life14%CBM; RTF if consequence allows
F — Infant mortalityHigh decreasing → low constant68%Improve installation quality; CBM
🔗 Reliability Mathematics
Failure Rate
λ = Number of Failures / Total Operating Time = 1 / MTBF
Reliability Function (Exponential / Constant Failure Rate)
R(t) = e^(−λt) = e^(−t/MTBF)
⚠️ At t=MTBF: R = e^(−1) = 0.368 (36.8% survive — NOT 50%)
Series System
R_s = R₁ × R₂ × R₃ × … × Rₙ
Any component failure = system failure. Always less than lowest individual reliability.
Parallel System (2 components)
R_p = 1 − (1 − R₁)(1 − R₂)
All components must fail for system failure. Always greater than highest individual reliability.
FMEA Risk Priority Number
RPN = Severity × Occurrence × Detectability (each 1–10, max = 1,000)
Always address Severity = 9–10 regardless of RPN. High severity + low probability = low RPN but catastrophic if it occurs.
Domain 3 · 45% of Exam · Part A
Vibration Analysis
The most tested PdM technology on the exam. Know fault frequencies, measurement parameter selection, bearing fault signatures, ISO severity zones, and the distinction between spectrum analysis and high-frequency techniques.
📳 FFT Fault Frequency Reference
FrequencyPrimary CauseDirection
1× RPMRotor unbalance (dominant radial); bent shaft; also misalignmentRadial (H & V)
2× RPMAngular misalignment (especially axial); mechanical looseness; cracked shaftRadial + Axial
Sub-harmonic (0.5×, 0.33×)Fluid instability — oil whirl/whip; internal rubRadial
BPFOOuter race bearing defect; non-synchronousRadial (load zone)
BPFIInner race defect; ±1× sidebands (inner race rotates)Radial
BSFBall/roller defect; sub-harmonicRadial
FTFCage defect; very low freq (0.35–0.48 × RPM)Radial
Gear Mesh Freq (GMF)# teeth × RPM; sidebands = wear/eccentricityRadial
2× Line Freq (120 Hz)Electrical fault in motor (USA); stator eccentricityRadial

Measurement Parameter Selection

  • Displacement (mils p-p): Low frequency (<10 Hz) — measures how far the shaft moves
  • Velocity (in/sec or mm/sec RMS): Mid frequency (10–1000 Hz) — ISO 10816 uses velocity for machine severity
  • Acceleration (g): High frequency (>1000 Hz) — bearing damage detection via Spike Energy, kurtosis
ISO 10816 zones: A (new acceptable) → B (long-term acceptable) → C (plan maintenance) → D (immediate action). Boundaries depend on machine class. The exam tests understanding of the zone system, not specific numbers.
Domain 3 · 45% of Exam · Part B
Infrared Thermography
Thermography measures surface temperature via infrared radiation. Critical concept: emissivity. Applications span electrical, mechanical, process, and refractory systems.
🌡️ Physics, Emissivity & Applications

Emissivity is the ratio of radiation emitted by a surface vs. a perfect blackbody (ε = 1.0). Shiny metals have very low emissivity (0.03–0.15) — they primarily reflect surrounding temperatures rather than emitting their own. This causes false readings unless corrected. Apply matte paint or tape, or use emissivity-corrected settings.

Applications by Category

CategoryWhat Thermography Finds
ElectricalLoose connections (resistance heating: P = I²R), overloaded circuits, phase imbalance, failing switches/breakers, transformer problems
MechanicalBearing overheating, motor winding hotspots through vent slots, misaligned couplings, seized conveyor rollers
Process / RefractoryFurnace refractory degradation (hot spots on shell), heat exchanger fouling (cold spots), failed-open steam traps (hotter than cycling traps)
Building / InsulationMoisture infiltration, insulation gaps, roof anomalies
Domain 3 · 45% of Exam · Part C
Oil Analysis
Three test categories: wear debris analysis, oil condition testing, and contamination analysis. Understand what each detects and when each is used.
🛢️ Three Test Categories — Complete Coverage

1. Wear Debris Analysis

  • ICP Spectrometry: Measures dissolved metals ≤8–10 µm. Key metals: Fe (steel wear), Cu (bearings/bushings), Pb (babbitt), Cr (ring/liner), Al (piston), Si (dirt contamination)
  • Particle Count (ISO 4406): Counts particles by size; measures total contamination load
  • Analytical Ferrography: Most detailed method. Magnetically deposits particles on slide for microscopic examination — identifies particle type, shape, size, and failure mode

2. Oil Condition Testing

  • Viscosity: Most critical property. ±15% of new oil = warning; ±25% = change required
  • TAN (Total Acid Number): Rising TAN = oil oxidation/degradation
  • TBN (Total Base Number): Alkaline reserve in engine oils. When TBN drops to 50% of new value, change required
  • RPVOT: Measures remaining antioxidant life in turbine/hydraulic oils

3. Contamination Analysis

  • Water (Karl Fischer): Even 100 ppm can reduce bearing life 50%. Alert: >500 ppm for most hydraulics
  • Fuel dilution (FTIR): Thins viscosity in engine oils
  • Glycol/coolant: Causes sludge and severe oxidation; detected by glycol test or elevated potassium
  • High Si without elevated Al: Soil/dirt ingestion (breather failure, seal failure)
Domain 3 · 45% of Exam · Part D
Lubrication Management
40–50% of bearing failures trace to lubrication problems. Viscosity, NLGI grades, ISO 4406 cleanliness codes, contamination control, and filtration beta ratios.
🔬 Viscosity, NLGI Grades & ISO 4406

Viscosity is the single most important lubricant property. The lubricant must maintain a hydrodynamic film separating metal surfaces under all operating conditions. Too low → film collapse, metal contact. Too high → excessive heat from shear and churning.

ISO VG Grades

The ISO VG number = nominal kinematic viscosity in cSt at 40°C, ±10%. Common grades: 32, 46, 68, 100, 150, 220, 320, 460, 680. Each grade is approximately double the previous in viscosity.

NLGI Grease Grades

NLGIConsistencyApplications
000–00Semi-fluidEnclosed gearboxes, centralized systems
0–1Very softCentralized systems, cold climate
2Medium (peanut butter)~80% of bearing applications — most common
3FirmHigh-speed bearings, vertical orientation
4–6Hard to blockOpen gears, wire ropes, extreme pressure

ISO 4406 Cleanliness Code

Format: X/Y/Z — particle count range codes at ≥4µm, ≥6µm, ≥14µm per mL. Each number increment doubles the particle count. Typical targets: servo hydraulics 15/13/10; gearboxes 17/15/12.

Filtration Beta Ratio (βₓ): Upstream count ÷ downstream count at particle size x. β₁₀ = 200 means 99.5% efficiency at 10µm.

Domain 3 · 45% of Exam · Part E
Precision Alignment & Balancing
🎯 Shaft Alignment — Types, Methods, Soft Foot

Misalignment is the leading cause of premature bearing and seal failure in rotating equipment.

Types

  • Parallel (offset): Shaft centerlines parallel but offset. Dominant 2× RPM radial vibration.
  • Angular: Centerlines meet at an angle. Strong 1× and 2× RPM axial vibration.
  • Combined: Both — most real-world cases.

Soft Foot — Correct BEFORE Alignment Begins

  • Parallel: One foot shorter → solid shims
  • Angular: Canted foot → tapered shims
  • Induced: Pipe/conduit strain on frame → fix the piping, not the shimming
  • Squishy: Corrosion or debris under foot → clean and reshim

Alignment Tolerances

Speed (RPM)Offset (mils)Angularity (mils/inch)
<1,800±3.0±0.7
1,800–3,600±2.0±0.5
>3,600±1.0±0.3

ISO 1940 Balance Grades

G GradeApplication
G 0.4Gyroscopes, precision spindles
G 2.5Gas/steam turbines, centrifuges
G 6.3Industrial fans, pumps, motors — most common
G 16Agricultural machinery, large crankshafts
G 40Car/truck wheels
Domain 3 · 45% of Exam · Part F
Root Cause Analysis
🔍 Three Root Cause Levels
LevelDefinitionExample
PhysicalThe material or mechanical event that caused the failureRolling element bearing failed from subsurface fatigue spalling
HumanThe human decision or omission that enabled the physical causeBearing over-greased — excess pressure forced grease past seals
Latent/SystemicThe organizational weakness that enabled the human causeNo written greasing procedure exists; no quantity/interval specification; no competency verification for lubrication tasks

RCA Methods — Selection Guide

  • 5-Why: Simple linear failure chains; quick team facilitation. Limitation: can reach different root causes depending on which "why" path is followed.
  • Fishbone/Ishikawa: Brainstorming with the 6Ms (Man, Machine, Method, Material, Measurement, Mother Nature). Good for ensuring no cause category is overlooked.
  • Fault Tree Analysis (FTA): Top-down, AND/OR gate logic. Best for complex multi-cause failures, safety events, and when probability quantification is needed.
  • Apollo RCA: Cause-and-effect charting with evidence requirements at each causal link. Defensible and auditable.
  • TapRooT®: Structured investigation with specific root cause categories. Most rigorous — used for serious safety incidents.
Domain 4 · 10% of Exam · ~11 Questions
Organization & Leadership
Culture, TPM, team development, change management. Questions are typically situational — choose the approach that builds commitment, not compliance.
👥 TPM, OEE & Organizational Design

See Pillar 3 for full TPM 8-pillar detail. Domain 4 exam focus areas:

  • TPM requires operator ownership — not just maintenance executing more PMs. The transfer of basic care is essential.
  • Autonomous Maintenance 7-step progression must be understood sequentially
  • OEE is the scorecard for TPM effectiveness: ≥85% world-class
  • The Maintenance-Operations interface is the most critical organizational relationship for reliability
  • Kotter's 8 steps (see Pillar 1) are the primary change management framework tested
Domain 5 · 15% of Exam · ~16–17 Questions
Work Management
Planning, scheduling, CMMS, MRO/stores, contractor management, shutdown planning. See Pillar 2 for full Palmer's Principles coverage.
📦 MRO Storeroom — Key Formulas & KPIs
Reorder Point
ROP = (Average Daily Usage × Lead Time in Days) + Safety Stock
Economic Order Quantity (EOQ)
EOQ = √(2DS / H)
D = annual demand · S = ordering cost per order · H = annual holding cost per unit
Inventory Turns
Turns = Annual Issues ($ value) / Average Inventory Value ($)

World-Class Storeroom KPIs

  • Fill rate: ≥95% requests filled from stock on first request
  • Inventory accuracy: ≥98% physical count matches system at line-item level
  • Stockout rate: <5% of requests result in emergency procurement
  • ABC classification: A items (top 10–20% of SKUs, 70–80% of value) get tight controls; C items use simple min/max
Advanced Topic · Domains 2 & 5 · Pillar 5
Weibull Analysis
The primary statistical tool for analyzing failure time data. The exam tests understanding of β (shape parameter) and η (characteristic life) — and what each tells you about maintenance strategy.
📊 β Shape Parameter — Maintenance Strategy Implications
Weibull Reliability Function
R(t) = e^[−(t/η)^β]
β = shape parameter · η = characteristic life (time at which 63.2% have failed) · t = time
β ValueFailure PatternFailure RateMaintenance Strategy
β < 1Infant mortalityDecreasingImprove installation quality & commissioning. Time-based PM makes it WORSE.
β = 1Random (exponential)ConstantCBM or RTF. Time-based replacement does NOT reduce failure frequency.
1 < β < 3Early wear-out onsetSlowly increasingCBM preferred; time-based at wide interval acceptable
β ≈ 3.44Normal distribution approxSymmetric around meanTime-based PM; set at 70–80% of η
β > 3Wear-outRapidly increasingTime-based PM IS effective. Set interval at 70–80% of η (characteristic life).
⚠️
η (eta) trap: η is NOT the MTBF. η is the time at which exactly 63.2% of the population has failed — regardless of β. MTBF can be calculated from β and η but they are not the same value.
Advanced Topic · Domain 3 · Equipment Reliability
Non-Destructive Testing (NDT)
Know which NDT method is appropriate for which application. Key decision variables: surface vs. subsurface, ferromagnetic vs. non-ferromagnetic, radiation constraints, speed requirements.
🔬 All NDT Methods — Comparison Table
MethodHow It WorksBest ForKey Limitation
VT (Visual)Direct or aided visual examinationSurface defects; first step in any inspectionSurface only; requires access
PT (Liquid Penetrant)Dye drawn into surface-breaking cracks by capillary actionSurface-breaking defects in any non-porous materialSurface-breaking only; rough surfaces reduce sensitivity
MT (Magnetic Particle)Magnetic flux leakage at defects attracts particlesSurface & near-surface defectsFerromagnetic materials only (carbon/alloy steel)
UT (Ultrasonic)Sound waves reflect from internal defectsInternal defects, wall thickness measurementRequires couplant; operator skill critical
PAUTMulti-element UT with electronic steering; produces cross-section imageComplex geometry welds, code-compliant inspectionHigher cost; specialized operator
RT (Radiography)X-ray/gamma ray differential absorptionInternal defects; permanent record; complex geometryRadiation safety required; two-sided access needed
ET (Eddy Current)Induced currents; defects disturb current flowHeat exchanger tube inspection (fast, no contact); surface cracksConductive materials only; limited depth
AE (Acoustic Emission)Passive — detects stress waves emitted by active defect growthMonitoring pressure vessels during pressurization; active crackingNoise discrimination; multi-sensor required for location
NDT selection logic: (1) Surface or subsurface? → PT/MT for surface; UT/RT for subsurface. (2) Ferromagnetic? → If no (stainless, aluminum), eliminate MT. (3) Thickness measurement? → UT. (4) Radiation constraint? → UT instead of RT. (5) Fast heat exchanger tube inspection? → Eddy Current.
Advanced Topic · Domain 3 · Equipment Reliability
Motor Failure Modes
⚡ AC Motor Failure Modes — Complete Table
Failure ModeRoot CauseDetectionPrevention
Bearing failureOver/under lubrication, contamination, VFD fluting, misalignmentVibration, ultrasound, temperatureLube program, ultrasound-guided greasing, shaft grounding rings
Stator winding failureThermal cycling, voltage spikes, partial discharge, over-temperatureSurge test, partial discharge, Megger/PI trendingThermal monitoring, VFD voltage spike filters
Rotor bar failureThermal cycling fatigue, casting defects, overloadingMCSA (sidebands at 1 ± 2s × line frequency)Avoid frequent starts; VFD soft-starting
VFD bearing flutingHigh-freq common-mode voltage → capacitive coupling → discharge current → raceway cratersVisual (washboard pattern), vibration, MCSAShaft grounding rings; insulated bearings on NDE
OverheatingOverloading, blocked ventilation, high ambient, frequent starts, voltage unbalanceThermal monitoring, current monitoring, thermographyProper motor sizing, clear ventilation, voltage balancing

Motor Insulation Testing

TestWhat It MeasuresPass Criteria
Megger (IR Test)Winding-to-ground insulation resistanceVaries by voltage class; trending over time is more valuable than a single reading
Polarization Index (PI)IR(10 min) / IR(1 min)>2.0 = good; <1.0 = contaminated/damaged
Surge/Impulse TestTurn-to-turn insulationDetects inter-turn shorts other tests miss
Hi-PotDielectric withstand (proof test)Used for acceptance testing; destructive if overused
Advanced Topic · Domain 5 · Work Management
Shutdown & Turnaround Management
🏭 Turnaround Planning, CPM & Scope Control

Planning Timeline

PhaseTimelineKey Activities
Scope Development12–18 months priorInspection list, regulatory requirements, deferred maintenance, process improvements. Freeze scope 3 months before execution.
Engineering & Procurement6–12 monthsDetailed work packages, long-lead materials, contractor bids
Detailed Planning3–6 monthsJob plans, resource leveling, critical path, permit coordination
ExecutionShutdown periodDaily progress meetings, critical path monitoring, MOC for scope additions
CloseoutPost-shutdownCost vs. budget, schedule vs. plan, lessons learned

Critical Path Method (CPM)

  • Critical path: Longest chain of dependent activities — determines minimum turnaround duration
  • Float: Time an activity can be delayed without affecting the end date. Critical path activities have zero float.
  • Total Float = Late Finish − Early Start − Duration
⚠️
Scope creep flag: >10% scope addition after freeze = poor initial scope development or ineffective freeze enforcement. Leading cause of turnaround overruns. All post-freeze additions require formal MOC with documented cost and schedule impact.
Working Calculators

Reliability Engineering
Calculators

Six calculators covering the most commonly used reliability and maintenance formulas. Every result is benchmarked against SMRP world-class targets.

⏱️
MTBF / MTTR / Availability
Calculate Mean Time Between Failures, Mean Time to Repair, and asset availability from raw operating data.
MTBF (hrs)
MTTR (hrs)
Availability
🏭
OEE Breakdown
Calculate Overall Equipment Effectiveness and identify which of the 6 Big Losses is your primary constraint.
Availability
Performance
Quality
OEE
📈
Reliability R(t) Calculator
Calculate the probability of survival to time t for an asset with a known MTBF (constant failure rate / exponential distribution).
R(t) — Survival Prob
F(t) — Failure Prob
λ (failures/1000hr)
🔗
Series & Parallel Reliability
Calculate system reliability for components configured in series or parallel. Enter up to 5 component reliabilities.
System Reliability
vs. Lowest Component
📊
Weibull β Interpreter
Enter your Weibull β (shape) and η (characteristic life) parameters — get the failure pattern, recommended maintenance strategy, and PM interval if applicable.
Failure Pattern
📦
Storeroom: EOQ & Reorder Point
Calculate the Economic Order Quantity and Reorder Point for a spare part. Optimize between ordering frequency and carrying costs.
EOQ (units/order)
Reorder Point (units)
Orders per Year
Total Annual Cost
Reference

Reliability Glossary

🔍
Loading...
CMRPstudy

Get in Touch

Found an error? Have a recommendation? Want to suggest a practice question? Every message goes directly to Chad.

⚙️
Chad Zahner — CMRPstudy
Reads every message
CMRPstudy
Hey! Thanks for using the site. Found something wrong, have a suggestion, or want to contribute a practice question? Drop a message below.

Messages go directly to [email protected] · Usually responded to within 48 hours

CMRPstudy · Practice Tools

Flashcards &
Practice Exam

200 flashcards and 100 exam-style questions covering all 5 SMRP domains and 5 Pillars of Reliability. Track your score, identify weak domains, and study to world-class benchmarks.

200
Flashcards
100
Practice Questions
5
SMRP Domains
~520
Passing Score Target
📇
Flashcard Deck
200 cards spanning all 5 domains, 5 Pillars, Weibull analysis, NDT, motor failure modes, lubrication, alignment, and work management. Filter by category, mark known/unknown, track your progress.
200 cards 8 categories Spaced repetition
Open Flashcards →
📝
Practice Exam
100 multiple-choice questions with full explanations. Choose all domains or focus on one. Each answer reveals a detailed rationale explaining why it's correct and why the wrong answers are wrong.
100 questions Full explanations Domain breakdown
Start Practice Exam →
Domain Weight Guide — Study Priority
200 Cards · All Domains

Flashcard Deck

Click any card to flip. Mark Known or Unknown to track mastery. Filter by category to focus your study session.

200 cards
Tap to reveal definition
Definition
Click card to flip · Use arrow keys to navigate
1 / 200
0
Known
0
Study More
200
Remaining
0%
Mastery
100 Questions · Full Explanations

Practice Exam

Exam-style questions with detailed rationale. Select domains to focus or run all 100 for a full mock exam. Aim for 85%+ before sitting the real exam.

Select domains to include:
100 questions selected