Subject: ASM Health Check Report – New Failures Detected

Date: [Insert Date]
Host/Cluster: [Insert Environment Name]
Severity: Warning / Critical (as applicable)

Step 3: Clear the Alert

Step 4: Prevent Future Issues

Conclusion

The "ASM Health Checker found 1 new failures updated" alert requires prompt attention to prevent data loss, performance degradation, or system downtime. By understanding the cause, taking corrective action, and implementing preventive measures, database administrators can ensure the reliability and performance of their Oracle databases. Always refer to Oracle documentation or consult with Oracle Support for specific guidance tailored to your environment.


Introduction

If you are an Oracle Database Administrator (DBA) managing an Oracle Real Application Clusters (RAC) environment, you have likely encountered a cryptic but critical message in your alert logs or monitoring console: "ASM Health Checker found 1 new failures updated."

At first glance, this message can induce panic. Does it mean data loss? Is your disk group about to crash? Will your production database go offline? Fortunately, in most cases, this alert is a proactive warning from Oracle’s Automatic Storage Management (ASM) diagnostics framework. However, ignoring it can lead to severe performance degradation or service interruption.

This comprehensive guide will dissect every aspect of this error message. We will explore what the ASM Health Checker is, why it triggers this alert, how to diagnose the specific failure, and step-by-step remediation strategies.


Mitigation Strategies

Scenario A: Transient Failure

If the underlying issue was a temporary glitch (e.g., a loose fiber cable or a brief network blip), the disk itself may be intact and can be brought back without replacement. Once the OS can see the disk again, you may be able to issue:

ALTER DISKGROUP <diskgroup_name> ONLINE DISK <disk_name>;

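This starts a resync that copies the extents changed while the disk was offline back onto it. The fast resync is only available if the disk returns before the disk group's DISK_REPAIR_TIME expires; after that, ASM drops the disk, and it must be added back with a full rebalance.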
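To confirm the disk's state before and after bringing it online, the standard ASM views can be queried. The following is a minimal sketch, assuming a connection to the ASM instance with SYSASM privileges; the filter and column selection are illustrative rather than a prescribed check.

```sql
-- Disks that are not fully online, with the seconds left before ASM drops them
SELECT group_number, name, path, mode_status, state, repair_timer
FROM   v$asm_disk
WHERE  mode_status <> 'ONLINE';

-- Progress of any resync or rebalance currently running
SELECT group_number, operation, state, power, sofar, est_work, est_minutes
FROM   v$asm_operation;
```

When the first query returns no rows and the second shows no running operation, the disk group should be back to its normal state.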

Scenario B: Permanent Hardware Failure

If the disk has physically failed, you must replace it at the hardware level.

  1. Identify the physical slot.
  2. Drop the disk from the ASM configuration (if not already done automatically).
  3. Replace the physical drive.
  4. Add the new disk back to the Disk Group.
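
On the ASM side, steps 2 and 4 above might look like the following sketch. It assumes a NORMAL or HIGH redundancy disk group and a SYSASM connection; <new_disk_path> and <new_disk_name> are placeholders introduced here for illustration and do not come from any specific environment.

```sql
-- Step 2: drop the failed disk if ASM has not already done so.
-- FORCE is typically needed when the disk can no longer be read,
-- and is only valid for NORMAL or HIGH redundancy disk groups.
ALTER DISKGROUP <diskgroup_name> DROP DISK <disk_name> FORCE;

-- Step 4: add the replacement disk; ASM rebalances data onto it.
ALTER DISKGROUP <diskgroup_name> ADD DISK '<new_disk_path>' NAME <new_disk_name>;

-- Monitor the rebalance; it is finished when no rows are returned.
SELECT operation, state, est_minutes FROM v$asm_operation;
```

Check the disk's current status in V$ASM_DISK first, since ASM may already have dropped it, in which case the DROP statement is unnecessary.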

Interpreting False Positives

Rarely, the "ASM Health Checker found 1 new failures updated" message appears without any actual hardware or storage issue. This can happen due to:

If you cannot find any underlying failure, search My Oracle Support for known bugs affecting your ASM version. Apply the latest Grid Infrastructure patch set if needed.