Why OREDA Won't Fix Your Compressor: SME-Led RCM and What the Field Actually Knows
In the monthly integrity review, someone asks: "Are we better than industry?" A failure rate from OREDA is pulled up. The meeting moves on. Weeks later the export compressor trips again—after a hot restart, outside the approved operating envelope, with a seal cartridge that sat in stores past its preservation limit. No handbook warned you, because the handbook was never the problem.
If you lead RCM or asset integrity, you already know: major equipment failures on a running plant rarely come from not knowing an industry λ. They come from how the machine is operated, started, stopped, preserved, designed, maintained, and governed. This article— informed by field practitioners and OREDA Vol. 1 (2015)—uses one topside compressor as a lens for what actually drives reliability, and where industry data fits (and does not).
Illustrative Field Story (Composite — Not an OREDA Case)
Imagine a gas-turbine-driven export compressor on a mature offshore platform: dry gas seals, antisurge recycle, repeated "nuisance trips" logged as compressor failures. Over 18 months, eleven unplanned events hit production. Root cause reviews cite vibration, control logic, and operator error. An external team runs a compressed "RCM" study without operators, instrument technicians, or shop-floor mechanics in the room. Management approves task duration cuts to show cost benefit on a slide. Six months later, the same failure modes return.
When records are reconciled properly, many events trace not to mysterious rotor failure but to envelope excursions, restart procedure gaps, and seal spares that were never audited—none of which are resolved by quoting an industry average failure rate.
Seven Drivers Experts See More Often Than Missing OREDA Data
Across upstream operations, practitioners consistently rank these above benchmark statistics as causes of major compressor distress:
- Operating outside the equipment envelope — sustained run in surge margin, overspeed, or off-design conditions that accelerate seal and antisurge distress long before "random" hardware failure.
- Incorrect start-up, shutdown, and preservation — hot restarts, short-notice trips, and long idle periods without proper preservation drive seal and control failures that CMMS codes as "trip" or "operator error."
- Design flaws and engineering compromises — retrofit antisurge logic, seal systems, or capacity upgrades that never made it into the RCM baseline; your machine is not the anonymous average in a handbook row.
- Plant personnel excluded from RCM — operators and maintainers who know the asset's history are absent while consultants produce tasks that do not match field reality.
- PM optimisation disguised as RCM — task removal and duration reduction to impress cost KPIs is not Reliability Centred Maintenance; it is PM optimisation. Deferred cost reappears as trips and deferment.
- Rushed studies and poor history — no time for quality failure data, no participation down to technician level, no equipment experts in the workshop—outputs look polished, foundations are hollow.
- No spare parts maintenance or audit discipline — critical seal cartridges, valve internals, and rotors stored without preservation control; the part fails on install, not in OREDA statistics.
OREDA cannot substitute for correcting these. At best, it hints at which failure mode classes exist in pooled topside experience—useful for taxonomy, not for telling your running train what to do next week.
What OREDA Actually Is (Reference Data, Not O&M Gospel)
For topside compressors (Taxonomy 1.1), OREDA Vol. 1 aggregates 47 units across 11 installations and roughly 2.13 million operational hours [1]. Experienced integrity teams use this in the design and pre-commissioning / RAMS phase—as an order-of-magnitude view of mode types—not as a live failure-rate target for a operating asset with its own envelope, procedures, and history.
What the handbook can inform (with caution):
• Equipment boundary: driver (gas turbine / motor) is outside the compressor class [1]
• Pooled critical failure rate (operational time, mean): ≈ 155 per 10⁶ op. hrs [1]
• Frequent mode classes in the dataset: spurious stop (~34), vibration (~32), parameter deviation (~32) per 10⁶ op. hrs [1]
• Maintainable-item focus in tables: dry gas seals, antisurge, control units, instrumentation [1]
What it must not be used for on a running plant:
Site-specific λ for PM intervals · spare strategy without plant data · proof that your RCM is "aligned with industry" · replacement for SME-led failure analysis. Failure rates vary by industry, machine, duty, and governance—pooled λ is probability estimation across strangers' machines, not yours in O&M.
SME-Led RCM (RCM3): A Group Task, Not a Consultant Deliverable
RCM3 is Meta Infa's label for facilitator-led Reliability Centred Maintenance aligned to SAE JA1011 [2]—the same RCM2 / RCM3 methodology used in our training. It is not a separate global standard or a consultant deliverable. It is a facilitated group analysis in which plant SMEs own the failure logic; external support provides structure, taxonomy (ISO 14224 / OREDA), and facilitation—not imported failure rates. The right outcome requires every SME in the room:
- Operations — envelope, start/stop, preservation reality
- Mechanical, electrical, and instrument — maintainable items and failure mechanisms
- F&G and safety — consequence and detection context
- Shop floor and technicians — what actually fails, what gets coded wrong, what spares condition looks like
Plant personnel know their asset and history better than any external team alone. Consultants add structure, ISO 14224 / OREDA taxonomy alignment, and facilitation—but tasks must be justified from plant failure history and consequences, not imported λ. If management intent is only to shorten PM for a cost chart, call it what it is: PM optimisation. Do not label it RCM.
Recommendations: RCM Leads and Asset Integrity Executives
For RCM / maintenance leads:
- Gate any study on failure history quality and CMMS coding aligned to ISO 14224 boundaries before debating benchmarks.
- Run SME-led RCM (RCM3) with mandatory participation from operations through technician level; document who was in the room.
- Map the seven field drivers (envelope, procedures, design, spares audit) into failure modes—not only hardware PM tasks.
- Use OREDA only to sense-check mode taxonomy or early-phase RAMS; build running-plant strategy from site data.
- Implement spare parts preservation and audit procedures for seal systems and critical rotors.
For asset integrity executives:
- Ask: "Was this RCM or PM optimisation?" Challenge task removals that have no failure linkage.
- Fund adequate study time and data cleanup before approving deliverables—rushed RCM is expensive theatre.
- Hold integrity accountable for envelope compliance and preservation standards, not only MTBF slides.
- Measure success in trips avoided and deferment reduced—not maintenance hours deleted from a spreadsheet.
Closing Thought
OREDA remains a valuable shared reference for topside equipment—when respected for what it is. It does not replace the people who operate and maintain compressors daily, and it does not forgive operating outside envelope, skipping preservation, or stripping tasks to please a budget review.
The running plant wins when SME-led RCM is a group outcome, history is honest, spares are audited, and industry data stays in its lane. Your compressor did not fail because you missed a handbook page. It failed because the system around the machine did not act on what the field already knew.
Need facilitator-led RCM3 with your plant SMEs?
Meta Infa supports upstream teams with structured RCM3 workshops—operations, mechanical, electrical, instrument, F&G, safety, and shop floor included—plus ISO 14224 taxonomy and asset data quality, without mistaking OREDA averages for your plant's truth.
Talk to Our RCM Team →References:
1. OREDA Participants. Offshore and Onshore Reliability Data Handbook, Volume 1 – Topside Equipment, 6th ed. (2015). Compressors (Taxonomy 1.1): population, limitations, failure rates, boundary definition. SINTEF / DNV GL.
2. SAE International. SAE JA1011: Evaluation Criteria for Reliability-Centered Maintenance (RCM) Processes.
3. ISO 14224:2016. Petroleum, petrochemical and natural gas industries — Collection and exchange of reliability and maintenance data for equipment.