Monitor the health of azure services

Azure Service Health continuously notifies you of issues that may affect the availability of your environment, such as service incidents, planned maintenance periods, or regional outages.

We’ve recently enhanced our Azure integration to include additional support for monitoring Service Health issues, enabling you to keep tabs on the health of your Azure environment and take proactive measures to mitigate downtime. Within minutes of setting up the integration, you’ll see rich, contextual Service Health events appear within your event stream, where you can monitor and correlate them with data from more than 600 infrastructure technologies (including other Azure services), all in one place.

When a new issue is identified, Azure reports an Azure Service Health event indicating the nature of the problem and affected resources and regions. Azure then continuously updates the status of the issue via a series of events until it is finally resolved.

With Datadog, you can clearly monitor every stage of Service Health issues within our event stream under the “Azure Service Health” namespace. Datadog collects these events automatically for all subscriptions being monitored with our Azure integration. You’ll see each issue cohesively grouped by its Tracking ID, enabling you to get full visibility into its current status and progression, from start to finish. This makes it easier for you to keep track of high-priority issues and follow up on their progress.

Monitor Azure Service Health events in the Datadog event stream.

The Azure Events API provides valuable metadata around each Service Health event. Datadog automatically converts this metadata into key:value tags that you can use to easily filter and search through all your events. To point out a few:

  • service: The impacted Azure service(s) (e.g., Azure Virtual Machines)
  • status: The status of the event (i.e., active or resolved)
  • region: The impacted Azure region(s) (e.g., US East, Global)
  • incident_type: The type of Service Health event (ServiceIssue, PlannedMaintenance, SecurityAdvisory, HealthAdvisory)
  • level: The severity level of the event (i.e., informational, warning, or critical)

In addition to these tags, each Azure Service Health event includes a description that captures the essence of the issue from the perspective of the Azure engineers investigating the problem. Some events may also contain mitigation steps for addressing the issue and reducing its impact.

Once you are capturing Azure Service Health events with Datadog, you can set up event monitors to get notified when a specific type of Azure Service Health issue occurs, using string matching, tags, and more to narrow down the scope. For example, you could use the “Azure Service Health” source and a few tags (status:active, incident_type:serviceissue) to quickly create a monitor that will notify you if any of your Azure services has an active issue. This will help you keep consistent tabs on your mission-critical Azure services and regions—and you won’t have to worry about constantly refreshing an events feed.

Set up an event monitor to notify you if any of your Azure services has an active issue detected through Azure Service Health Monitoring with Datadog.

Scheduled maintenance events can be extremely annoying if you failed to prepare in advance to offset the performance and availability decline. Now, you can create event monitors to immediately alert you of an upcoming maintenance session, so you can proactively make adjustments to your sprint plans and engineering commitments without missing a beat in productivity.

Set up an event monitor in Datadog to alert on Azure Service Health events and stay informed about planned maintenance sessions.

Dashboards help you visualize the state of your infrastructure—but it can sometimes be difficult to fully understand the data displayed in your graphs if you don’t have the added context of what is taking place behind the scenes. With Azure Service Health events in Datadog, you can easily overlay them on graphs to get helpful context for interpreting unusual trends in your metrics and troubleshooting issues.

You can overlay Service Health events on top of mission-critical metrics within your favorite dashboards. In the example below, we are using event overlays to correlate Azure Service Health events with the status of your Azure Virtual Machines. This can help you understand how a single Service Health event, such as a network outage, has affected the status of your entire cloud environment.

You can monitor Azure Service Health events in context with metrics by overlaying them on your Azure dashboards in Datadog.

If you’re already using the Azure integration, you should automatically have access to these enhancements—navigate to the event stream and filter for the “Azure Service Health” namespace to see your Azure Service Health events. Otherwise, install Datadog’s Azure integration on the to start monitoring Azure Service Health events.

If you don’t yet have a Datadog account, sign up for a to get complete visibility into the health of your Azure environment.

Which blade should you identify to Monitor the health of Azure services?

Azure Monitor is used to monitor the health of Azure services.

What is Azure service health alerts?

Service Health keeps you informed about the health of your environment. It provides a personalized view of the status of your Azure services and regions, includes information about planned maintenance and current incidents, and offers richer functionality, including alerting and RCAs.

What can I Monitor with Azure Monitor?

Azure users can use this tool to monitor the status of events in their cloud environment and to plan ahead for maintenance. Azure Network Watcher offers network monitoring for network performance. This tool can provide insights and metrics on Azure Virtual Networks (VNet), VMs and application gateways.