Configure a Fault-Tolerant Environment

When you first install or upgrade the data aggregator to DX NetOps Performance Management release 3.5 or higher, you are prompted to configure a fault-tolerant environment. After the initial installation or upgrade to a fault-tolerant environment, DX NetOps Performance Management saves your responses to the fault-tolerant environment prompts, so the prompts do not appear during future upgrades of the fault-tolerant environment.
A fault-tolerant environment requires a new shared directory (example: /DASharedRepo) to help limit data loss. The shared drive stores customized metric families, DTO files, and the ActiveMQ Kaha database. When a hardware failure or network issue occurs, the newly active data aggregator accesses the shared drive and picks up where the now-inactive data aggregator left off. The user ID that the shared drive is created with must be synchronized across both data aggregators so that both data aggregators have read and write permissions to that directory.
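The read and write requirement on the shared directory can be checked before installation. The following is a minimal sketch, not part of the product: the function name check_shared_dir and the returned keys are hypothetical, and it assumes a POSIX host with Python 3 available. Run it on each data aggregator host as the user ID that owns the shared directory.

```python
import os
import tempfile
from pathlib import Path


def check_shared_dir(path):
    """Report whether the current user can read and write the shared directory.

    Hypothetical helper for illustration; it is not shipped with the product.
    """
    directory = Path(path)
    result = {
        "exists": directory.is_dir(),
        "readable": False,
        "writable": False,
        "owner_uid": None,
        "current_uid": os.getuid(),  # POSIX only
    }
    if not result["exists"]:
        return result

    result["owner_uid"] = directory.stat().st_uid
    # Listing a directory needs both read and execute (search) permission.
    result["readable"] = os.access(directory, os.R_OK | os.X_OK)
    try:
        # Create and automatically remove a scratch file to prove that
        # writes really succeed on the (possibly network-mounted) directory.
        with tempfile.NamedTemporaryFile(dir=directory, prefix=".ft_check_"):
            pass
        result["writable"] = True
    except OSError:
        result["writable"] = False
    return result
```

For example, check_shared_dir("/DASharedRepo") should report the same owner_uid on both hosts, with readable and writable both True. Differing owner_uid or current_uid values between the hosts suggest that the user ID is not synchronized.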
Use the following process to configure a fault-tolerant environment:
  1. Follow the installation or upgrade procedure for the data repository.
  2. Ensure that you have a new shared data directory (for example, /DASharedRepo) and that the same user ID is shared between the data aggregator hosts. Data from the active data aggregator is stored in this directory.
    If you are using Network File System (NFS), DX NetOps Performance Management supports only NFSv4 and higher because of the ActiveMQ Kaha locking requirements.
    To allow data to be loaded and to prevent data loss, the shared data directory must be up and accessible at all times.
  3. Install the proxy server.
    For more information, see Install or Uninstall the Proxy Server.
  4. Install or upgrade the active data aggregator by following the installation or upgrade procedure.
    The database user credentials for both data aggregators must match.
  5. Install the secondary inactive data aggregator.
    One of the two available data aggregators becomes the active data aggregator. The other data aggregator is available for failover.
  6. Install or upgrade each data collector by following the installation or upgrade procedure.
    During the data collector installation or upgrade, the installer prompts you for a failover location for fault tolerance: the host of the inactive data aggregator.
  7. Install NetOps Portal by following the installation or upgrade procedure.
  8. Open the following ports so that communication functions properly in the fault-tolerant environment:
    • TCP 8300
      In a fault-tolerant environment, enables communication between the proxy server and the data aggregators.
    • TCP/UDP 8301
      In a fault-tolerant environment, enables LAN communication between the proxy server and the data aggregators.
    • TCP 8500
      In a fault-tolerant environment, enables the proxy server and the data aggregators to communicate with the HTTP API.
  9. Verify the configured fault-tolerant environment by viewing the overall health status of the data aggregators.
    For more information, see View the Health of the System.
  10. Enable the fault-tolerant data aggregators to use HTTPS.
    For more information, see Enable HTTPS for Fault-Tolerant Data Aggregators.
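Before you verify system health, it can help to confirm that the ports opened in step 8 are reachable between the proxy server and each data aggregator. The following is a minimal sketch under stated assumptions: the function name check_tcp_ports and the constant FAULT_TOLERANCE_TCP_PORTS are hypothetical and are not part of the product. The probe covers TCP only, so the UDP side of port 8301 is not tested.

```python
import socket

# TCP ports listed in step 8 of the procedure. Port 8301 is also used
# over UDP, which this TCP-only probe does not cover.
FAULT_TOLERANCE_TCP_PORTS = (8300, 8301, 8500)


def check_tcp_ports(host, ports=FAULT_TOLERANCE_TCP_PORTS, timeout=3.0):
    """Return {port: bool}, True where a TCP connection to host succeeded.

    Hypothetical helper for illustration; it is not shipped with the product.
    """
    results = {}
    for port in ports:
        try:
            # create_connection resolves the host and attempts a TCP connect.
            with socket.create_connection((host, port), timeout=timeout):
                results[port] = True
        except OSError:
            # Refused, timed out, unreachable, or name resolution failed.
            results[port] = False
    return results
```

For example, running check_tcp_ports("da1.example.com") on the proxy server host, where da1.example.com is a placeholder for a data aggregator host name, returns a dictionary keyed by port number. A False value points to a firewall rule to review, or to a service that is not yet listening on that port.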