Azure Site Recovery orchestrates and manages disaster recovery for Azure VMs, on-premises VMs and physical servers.
This step-by-step guide explains how to set up and monitor Azure Site Recovery using CloudMonix.
In this article
1. Monitoring setup
2. Collect, understand and use your data
3. Setup verification and troubleshooting
Did you know?
CloudMonix extends native Azure Site Recovery monitoring with advanced metrics and features. Notable points:
- CloudMonix receives data from the Azure Management API and Azure Monitor API
- Pre-configured metrics: backup items count, backup usage, failed jobs count, jobs in progress count, recommended actions, replication events, replication protected items, replication usage, size GRS, size LRS, resource status, suspended jobs count, unhealthy providers count, unhealthy VMs count, vault health issues
- Pre-configured alerts: failed replication jobs detected, unhealthy providers detected, unhealthy replicated items detected, unhealthy VMs detected
Monitoring setup
a. Run the Setup Wizard in the portal (preferred way):
This article explains how to add resources to CloudMonix via the Setup Wizard.
b. Tweak settings in the Definition tab (optional):
The Definition tab for an existing resource can be accessed by clicking the resource's monitoring settings in the performance dashboard:
The Definition tab provides optional settings for the resource name, Azure resource management token, Azure resource group, Azure resource name, configuration template and categories:
The Configuration Template setting lists the pre-defined configuration templates available in CloudMonix by default, as well as any previously stored custom templates. See predefined templates for Azure Site Recovery for reference.
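The Azure resource group and Azure resource name settings in the Definition tab together identify the monitored vault. As a rough illustration, assuming the standard Azure Resource Manager resource ID layout for a Recovery Services vault, the values could be combined as in the sketch below. The function name and the sample values are hypothetical, and this is not CloudMonix's own code:

```python
def vault_resource_id(subscription_id: str, resource_group: str, vault_name: str) -> str:
    """Build an Azure Resource Manager ID for a Recovery Services vault.

    Hypothetical helper for illustration only; the argument values come
    from your own Azure subscription.
    """
    parts = {
        "subscription_id": subscription_id,
        "resource_group": resource_group,
        "vault_name": vault_name,
    }
    for label, value in parts.items():
        # Reject empty values and embedded slashes, which would corrupt the path.
        if not value or "/" in value:
            raise ValueError(f"invalid {label}: {value!r}")
    return (
        f"/subscriptions/{subscription_id}"
        f"/resourceGroups/{resource_group}"
        f"/providers/Microsoft.RecoveryServices/vaults/{vault_name}"
    )


# Sample values are placeholders, not real identifiers.
print(vault_resource_id("00000000-0000-0000-0000-000000000000", "my-rg", "my-vault"))
```

If the resource group or resource name entered in the Definition tab does not match the vault in Azure, the resulting identifier points at nothing, which is a common reason for a failed setup test.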
c. Manual setup (optional alternative to the Setup Wizard):
Click the Add New button in the top right corner of your dashboard:
Fill in the required information in the Definition tab as described in the previous step.
d. Advanced configuration:
The Advanced Configuration tab provides additional monitoring settings, which are preset to defaults that suit most use cases.
Collect, understand and use your data
Specific Metrics, Templates, Alerts and Automation Actions for Azure Site Recovery:
Azure Site Recovery Settings
Diagnostic data points retrieved from the monitored resource are referred to as metrics. CloudMonix provides default templates for the metrics recommended for common configurations. Metrics can be added, removed or customized in the Metrics tab of the Azure Site Recovery resource configuration dialog:
CloudMonix features an alert engine that publishes alerts for specific conditions, either pre-defined by a template configuration or custom-defined on any of the available metrics. Alerts can be added, removed or customized in the Alerts tab of the Azure Site Recovery resource configuration dialog:
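To make the idea of an alert as a condition over metrics concrete, here is a minimal sketch. It does not use CloudMonix's alert expression syntax; the `AlertRule` type, the metric key names and the thresholds are all assumptions chosen to mirror the pre-configured alerts listed earlier:

```python
from dataclasses import dataclass
from typing import Callable, Dict, List

Metrics = Dict[str, float]


@dataclass(frozen=True)
class AlertRule:
    """Hypothetical alert rule: a name plus a condition over metric values."""
    name: str
    condition: Callable[[Metrics], bool]


# Metric keys below are illustrative, not CloudMonix's actual identifiers.
RULES = [
    AlertRule("Failed replication jobs detected",
              lambda m: m.get("FailedJobsCount", 0) > 0),
    AlertRule("Unhealthy providers detected",
              lambda m: m.get("UnhealthyProvidersCount", 0) > 0),
    AlertRule("Unhealthy VMs detected",
              lambda m: m.get("UnhealthyVMsCount", 0) > 0),
]


def evaluate(rules: List[AlertRule], metrics: Metrics) -> List[str]:
    """Return the names of all rules whose condition holds for the metrics."""
    return [rule.name for rule in rules if rule.condition(metrics)]


sample = {"FailedJobsCount": 2, "UnhealthyProvidersCount": 0, "UnhealthyVMsCount": 1}
print(evaluate(RULES, sample))
# → ['Failed replication jobs detected', 'Unhealthy VMs detected']
```

A custom alert corresponds to adding another rule with its own condition, for example one that fires only when several metrics cross their thresholds at once.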
Actions are automation features that can be configured to fire on specific conditions or on a schedule. Actions can be added and configured in the Actions tab of the Azure Site Recovery resource configuration dialog:
Automation executes actions when specified conditions are met. While the Azure Site Recovery resource does not support any actions of its own, CloudMonix can call custom API endpoints and execute Azure Functions, Runbooks and WebJobs on other resources.
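As an illustration of what a call to a custom API endpoint might look like, the sketch below builds an HTTP POST request with Python's standard library. The endpoint URL, the payload fields and the function name are hypothetical; the request format CloudMonix actually sends is not described here, so treat this as one plausible shape rather than the product's behavior:

```python
import json
import urllib.request


def build_action_request(endpoint_url: str, resource_name: str, alert_name: str) -> urllib.request.Request:
    """Build (but do not send) a JSON POST request describing a raised alert.

    The payload fields are assumptions made for this example.
    """
    payload = json.dumps({"resource": resource_name, "alert": alert_name}).encode("utf-8")
    return urllib.request.Request(
        endpoint_url,
        data=payload,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


request = build_action_request(
    "https://example.com/hooks/site-recovery",  # placeholder endpoint
    "my-vault",
    "Failed replication jobs detected",
)
print(request.get_method(), request.full_url)

# Sending is left out on purpose; it would look like:
# urllib.request.urlopen(request, timeout=10)
```

An endpoint that receives such a call could, for instance, open a ticket or trigger a runbook on a different resource.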
Setup verification and troubleshooting
a. Setup verification:
Successful resource setup can be verified by clicking the Test button in the resource configuration dialog and visiting the Test Results tab:
b. Troubleshooting monitoring issues:
CloudMonix provides deep insight into resource monitoring issues via the Status Dashboard screen. The screen gives an overview of the resources that have raised alerts and lets you troubleshoot them by diving into the monitoring logs.
Read the full article on how to use the Status Dashboard to diagnose resource monitoring issues.