It is a cluster resource used to control an application that must be kept highly available. It includes start and stop scripts. Output from these scripts will be logged in the hacmp.out file, if "set -x" is defined within the script. The exit code from the script will be monitored by PowerHA.
Once the application is started, there is a stabilization period, where appl. monitoring does not attempt to determine if the appl. is alive or not. This is called the stabilization period. Once the stabilization period has expired the appl. monitor is executed.
In long running mode, the monitor periodically checks that the application is running successfully. In startup mode only at startup.
If the appl. detected as failed, the retry counter is examined. If it is not zero, it is decremented and an appl. cleanup and restart is attempted. This process continnues until the retry counter is zero.
If retry counter reached 0 and the appl. failed again the failover action is examined.
As a protection mechanism, prior to invoking the application server start script, the cluster manager uses an application monitor to determine the status of the application
You can define Proces Appl. Monitoring (which will check if a process is running) or Custom Appl. Monitoring (which is a user defined script and its exit code will be checked by HACMP).
Process Application Monitoring Config:
Ext. Conf. -> Ext. Resources ->Conf. HACMP Appl. Servers -> Conf. HACMP Appl. Mon. ...
Process Monitor:(the values are in seconds)
* Monitor Name [X11_APPL]
* Application Server(s) to Monitor
* Monitor Mode [Long-running monitoring]
* Processes to Monitor [xxx]
* Process Owner [root]
Instance Count 
* Stabilization Interval 
* Restart Count 
Restart Interval 
* Action on Application Failure [fallover]
Notify Method 
Cleanup Method [/usr/local/scripts/cluster/X11_stop.ksh]
Restart Method [/usr/local/scripts/cluster/X11_start.ksh]
Processes to Monitor: give th name of the process from the output of ps -el (not ps -ef)
Stabilization Interval: The length of time the monitor will wait before resuming monitoring
Restart Count: The maximum times the application will be (re)started (the Failure Counter is connected with it), if it is not successful, then Action on Application Failure will be.
Restart Interval: the elapsed time the application must run before the Failure Counter is reset.
If the Restart Interval time is reached the Failure Count is reset to 0.
(is the time during which attempts will be made to restart the application)
Monitor Inetrval: (this is only at custom monitor) time between execution of the monitor event to see if the appl. is running.
Suspend/Resume Application monitoring:
smitty hacmp -> System Management (C-SPOC) -> HACMP Resource Group and Application Management -> Suspend/Resume Application Monitoring - > Suspend Application Monitoring
smitty hacmp -> System Management (C-SPOC) -> HACMP Resource Group and Application Management -> Suspend/Resume Application Monitoring - > Resume Application Monitoring
- FS - LVM
- STORAGE - BACKUP
- UPD. - INSTALL