AZURE
|
Count databases that are publicly accessible, without replication, without high availability configuration, with high CPU usage, high memory usage, high cache miss rate, and low availability in Azure
Tasks:
Tasks:
- Score Database Availability in resource group `${AZURE_RESOURCE_GROUP}`
- Count Publicly Accessible Databases in resource group `${AZURE_RESOURCE_GROUP}`
- Count Databases Without Replication in resource group `${AZURE_RESOURCE_GROUP}`
- Count Databases Without High Availability in resource group `${AZURE_RESOURCE_GROUP}`
- Count Databases With High CPU Usage in resource group `${AZURE_RESOURCE_GROUP}`
- Count Databases With High Memory Usage in resource group `${AZURE_RESOURCE_GROUP}`
- Count Redis Caches With High Cache Miss Rate in resource group `${AZURE_RESOURCE_GROUP}`
- Count Databases With Health Issues in resource group `${AZURE_RESOURCE_GROUP}`
- Generate Health Score
List databases that are publicly accessible, without replication, without high availability configuration, with high CPU usage, high memory usage, high cache miss rate, and low availability in Azure
Tasks:
Tasks:
- List Database Availability in resource group `AZURE_RESOURCE_GROUP`
- List Publicly Accessible Databases in resource group `AZURE_RESOURCE_GROUP`
- List Databases Without Replication in resource group `AZURE_RESOURCE_GROUP`
- List Databases Without High Availability in resource group `AZURE_RESOURCE_GROUP`
- List Databases With High CPU Usage in resource group `AZURE_RESOURCE_GROUP`
- List All Databases With High Memory Usage in resource group `AZURE_RESOURCE_GROUP`
- List Redis Caches With High Cache Miss Rate in resource group `AZURE_RESOURCE_GROUP`
- List Database Resource Health in resource group `AZURE_RESOURCE_GROUP`
Check Azure storage health by identifying unused disks, snapshots, and storage accounts
Tasks:
Tasks:
- Count Azure Storage Accounts with Health Status of `Available` in resource group `${AZURE_RESOURCE_GROUP}`
- Count Unused Disks in resource group `${AZURE_RESOURCE_GROUP}`
- Count Unused Snapshots in resource group `${AZURE_RESOURCE_GROUP}`
- Count Unused Storage Accounts in resource group `${AZURE_RESOURCE_GROUP}`
- Count Public Accessible Storage Accounts in resource group `${AZURE_RESOURCE_GROUP}`
- Generate Health Score
Check Azure storage health by identifying unused disks, snapshots, and storage accounts
Tasks:
Tasks:
- Check Azure Storage Resource Health in resource group `AZURE_RESOURCE_GROUP`
- List Unused Azure Disks in resource group `AZURE_RESOURCE_GROUP`
- List Unused Azure Snapshots in resource group `AZURE_RESOURCE_GROUP`
- List Unused Azure Storage Accounts in resource group `AZURE_RESOURCE_GROUP`
- List Public Accessible Azure Storage Accounts in resource group `AZURE_RESOURCE_GROUP`
Count Virtual machines that are publicly accessible, have high CPU usage, underutilized memory, stopped state, unused network interfaces, and unused public IPs in Azure
Tasks:
Tasks:
- Check Azure VM Health in resource group `${AZURE_RESOURCE_GROUP}`
- Check for VMs With Public IP in resource group `${AZURE_RESOURCE_GROUP}`
- Check for VMs With High CPU Usage in resource group `${AZURE_RESOURCE_GROUP}`
- Check for Stopped VMs in resource group `${AZURE_RESOURCE_GROUP}`
- Check for Underutilized VMs Based on CPU Usage in resource group `${AZURE_RESOURCE_GROUP}`
- Check for VMs With High Memory Usage in resource group `${AZURE_RESOURCE_GROUP}`
- Check for Underutilized VMs Based on Memory Usage in resource group `${AZURE_RESOURCE_GROUP}`
- Check for Unused Network Interfaces in resource group `${AZURE_RESOURCE_GROUP}`
- Check for Unused Public IPs in resource group `${AZURE_RESOURCE_GROUP}`
- Generate Health Score
List Virtual machines that are publicly accessible, have high CPU usage, underutilized memory, stopped state, unused network interfaces, and unused public IPs in Azure
Tasks:
Tasks:
- Check Azure VM Health in resource group `AZURE_RESOURCE_GROUP`
- List VMs With Public IP in resource group `AZURE_RESOURCE_GROUP`
- List for Stopped VMs in resource group `AZURE_RESOURCE_GROUP`
- List VMs With High CPU Usage in resource group `AZURE_RESOURCE_GROUP`
- List Underutilized VMs Based on CPU Usage in resource group `AZURE_RESOURCE_GROUP`
- List VMs With High Memory Usage in resource group `AZURE_RESOURCE_GROUP`
- List Underutilized VMs Based on Memory Usage in resource group `AZURE_RESOURCE_GROUP`
- List Unused Network Interfaces in resource group `AZURE_RESOURCE_GROUP`
- List Unused Public IPs in resource group `AZURE_RESOURCE_GROUP`
Queries the health of an Azure Application Gateway, returning 1 when it's healthy and 0 when it's unhealthy.
Tasks:
Tasks:
- Check for Resource Health Issues Affecting Application Gateway `${APP_GATEWAY_NAME}` In Resource Group `${AZ_RESOURCE_GROUP}`
- Check Configuration Health of Application Gateway `${APP_GATEWAY_NAME}` In Resource Group `${AZ_RESOURCE_GROUP}`
- Check Backend Pool Health for Application Gateway `${APP_GATEWAY_NAME}` In Resource Group `${AZ_RESOURCE_GROUP}`
- Fetch Metrics for Application Gateway `${APP_GATEWAY_NAME}` In Resource Group `${AZ_RESOURCE_GROUP}`
- Check SSL Certificate Health for Application Gateway `${APP_GATEWAY_NAME}` In Resource Group `${AZ_RESOURCE_GROUP}`
- Check Logs for Errors with Application Gateway `${APP_GATEWAY_NAME}` In Resource Group `${AZ_RESOURCE_GROUP}`
- Generate Application Gateway Health Score
Performs a health check on Azure Application Gateways and the backend pools used by them, generating a report of issues and next steps.
Tasks:
Tasks:
- Check for Resource Health Issues Affecting Application Gateway `APP_GATEWAY_NAME` In Resource Group `AZ_RESOURCE_GROUP`
- Check Configuration Health of Application Gateway `APP_GATEWAY_NAME` In Resource Group `AZ_RESOURCE_GROUP`
- Check Backend Pool Health for Application Gateway `APP_GATEWAY_NAME` In Resource Group `AZ_RESOURCE_GROUP`
- Fetch Log Analytics for Application Gateway `APP_GATEWAY_NAME` In Resource Group `AZ_RESOURCE_GROUP`
- Fetch Metrics for Application Gateway `APP_GATEWAY_NAME` In Resource Group `AZ_RESOURCE_GROUP`
- Check SSL Certificate Health for Application Gateway `APP_GATEWAY_NAME` In Resource Group `AZ_RESOURCE_GROUP`
- Check Logs for Errors with Application Gateway `APP_GATEWAY_NAME` In Resource Group `AZ_RESOURCE_GROUP`
- List Related Azure Resources for Application Gateway `APP_GATEWAY_NAME` In Resource Group `AZ_RESOURCE_GROUP`
This CodeBundle counts the number of container images (from a configured list) outdated. It compares upstream images with those in the registry and counts the number that are outdated.
Tasks:
Tasks:
- Count Outdated Images in Azure Container Registry `${ACR_REGISTRY}`
This CodeBundle syncs images from public repostitories into an Azure Container Registry.
Tasks:
Tasks:
- Sync Container Images into Azure Container Registry `ACR_REGISTRY`
Triages issues related to a Azure Loadbalancers and its activity logs.
Tasks:
Tasks:
- Check Activity Logs for Azure Load Balancer `AZ_LB_NAME`
Azure Data Factories health checks including resource health status, frequent pipeline errors, failed pipeline runs, and large data operations monitoring.
Tasks:
Tasks:
- Identify Health Issues Affecting Data Factories in resource group `${AZURE_RESOURCE_GROUP}`
- Count Frequent Pipeline Errors in Data Factories in resource group `${AZURE_RESOURCE_GROUP}`
- Count Failed Pipelines in Data Factories in resource group `${AZURE_RESOURCE_GROUP}`
- Count Large Data Operations in Data Factories in resource group `${AZURE_RESOURCE_GROUP}`
- Count Long Running Pipeline Runs in Data Factories in resource group `${AZURE_RESOURCE_GROUP}`
- Generate Health Score
Azure Data Factories health checks including resource health status, frequent pipeline errors, failed pipeline runs, and large data operations monitoring.
Tasks:
Tasks:
- Check for Resource Health Issues Affecting Data Factories in resource group `AZURE_RESOURCE_GROUP`
- List Frequent Pipeline Errors in Data Factories in resource group `AZURE_RESOURCE_GROUP`
- List Failed Pipelines in Data Factories in resource group `AZURE_RESOURCE_GROUP`
- Find Large Data Operations in Data Factories in resource group `AZURE_RESOURCE_GROUP`
- Fetch Azure Data Factory Details in resource group `AZURE_RESOURCE_GROUP`
- List Long Running Pipeline Runs in Data Factories in resource group `AZURE_RESOURCE_GROUP`
Checks VM Scale Set key metrics and returns a 1 when healthy, or 0 when not healthy.
Tasks:
Tasks:
- Check Scale Set `${VMSCALESET}` Key Metrics In Resource Group `${AZ_RESOURCE_GROUP}`
Runs diagnostic checks against virtual machine scaled sets and generates reports from key metrics.
Tasks:
Tasks:
- Check Scale Set `VMSCALESET` Key Metrics In Resource Group `AZ_RESOURCE_GROUP`
- Fetch VM Scale Set `VMSCALESET` Config In Resource Group `AZ_RESOURCE_GROUP`
- Fetch Activities for VM Scale Set `VMSCALESET` In Resource Group `AZ_RESOURCE_GROUP`
Counts Azure Key Vault health by checking availability metrics, configuration settings, expiring items (secrets/certificates/keys), log issues, and performance metrics
Tasks:
Tasks:
- Count Key Vault Resource Health in resource group `${AZURE_RESOURCE_GROUP}` in Subscription `${AZURE_SUBSCRIPTION_NAME}`
- Count Key Vault Availability in resource group `${AZURE_RESOURCE_GROUP}` in Subscription `${AZURE_SUBSCRIPTION_NAME}`
- Count Key Vault configuration in resource group `${AZURE_RESOURCE_GROUP}` in Subscription `${AZURE_SUBSCRIPTION_NAME}`
- Count Expiring Key Vault Items in resource group `${AZURE_RESOURCE_GROUP}` in Subscription `${AZURE_SUBSCRIPTION_NAME}`
- Count Key Vault Log Issues in resource group `${AZURE_RESOURCE_GROUP}` in Subscription `${AZURE_SUBSCRIPTION_NAME}`
- Count Key Vault Performance Metrics in resource group `${AZURE_RESOURCE_GROUP}` in Subscription `${AZURE_SUBSCRIPTION_NAME}`
- Generate Comprehensive Key Vault Health Score
Check Azure Key Vault health by checking availability metrics, configuration settings, expiring items (secrets/certificates/keys), log issues, and performance metrics
Tasks:
Tasks:
- Check Key Vault Resource Health in resource group `AZURE_RESOURCE_GROUP` in Subscription `AZURE_SUBSCRIPTION_NAME`
- Check Key Vault Availability in resource group `AZURE_RESOURCE_GROUP` in Subscription `AZURE_SUBSCRIPTION_NAME`
- Check Key Vault Configuration in resource group `AZURE_RESOURCE_GROUP` in Subscription `AZURE_SUBSCRIPTION_NAME`
- Check Expiring Key Vault Items in resource group `AZURE_RESOURCE_GROUP` in Subscription `AZURE_SUBSCRIPTION_NAME`
- Check Key Vault Logs for Issues in resource group `AZURE_RESOURCE_GROUP` in Subscription `AZURE_SUBSCRIPTION_NAME`
- Check Key Vault Performance Metrics in resource group `AZURE_RESOURCE_GROUP` in Subscription `AZURE_SUBSCRIPTION_NAME`
Runs diagnostic checks to check the health of APIM instances
Tasks:
Tasks:
- Check for Resource Health Issues Affecting APIM `${APIM_NAME}` in Resource Group `${AZ_RESOURCE_GROUP}`
- Fetch Key Metrics for APIM `${APIM_NAME}` in Resource Group `${AZ_RESOURCE_GROUP}`
- Check Logs for Errors with APIM `${APIM_NAME}` in Resource Group `${AZ_RESOURCE_GROUP}`
- Verify APIM Policy Configurations for `${APIM_NAME}` in Resource Group `${AZ_RESOURCE_GROUP}`
- Check APIM SSL Certificates for `${APIM_NAME}` in Resource Group `${AZ_RESOURCE_GROUP}`
- Inspect Dependencies and Related Resources for APIM `${APIM_NAME}` in Resource Group `${AZ_RESOURCE_GROUP}`
- Generate APIM Health Score
Runs diagnostic checks to check the health of APIM instances
Tasks:
Tasks:
- Gather APIM Resource Information for APIM `APIM_NAME` in Resource Group `AZ_RESOURCE_GROUP`
- Check for Resource Health Issues Affecting APIM `APIM_NAME` in Resource Group `AZ_RESOURCE_GROUP`
- Fetch Key Metrics for APIM `APIM_NAME` in Resource Group `AZ_RESOURCE_GROUP`
- Check Logs for Errors with APIM `APIM_NAME` in Resource Group `AZ_RESOURCE_GROUP`
- Verify APIM Policy Configurations for `APIM_NAME` in Resource Group `AZ_RESOURCE_GROUP`
- Check APIM SSL Certificates for `APIM_NAME` in Resource Group `AZ_RESOURCE_GROUP`
- Inspect Dependencies and Related Resources for APIM `APIM_NAME` in Resource Group `AZ_RESOURCE_GROUP`
Generates a composite score about the health of an AKS cluster using the AZ CLI. Returns a 1 if all checks pass, 0 if they all fail, and value between 0 and 1 for partial success/fail. Checks the upstream service for reported errors. Looks for Critical or Error activities within a specified time period. Checks the overall configuration for provisioning failures.
Tasks:
Tasks:
- Check for Resource Health Issues Affecting AKS Cluster `${AKS_CLUSTER}` In Resource Group `${AZ_RESOURCE_GROUP}`
- Fetch Activities for AKS Cluster `${AKS_CLUSTER}` In Resource Group `${AZ_RESOURCE_GROUP}`
- Check Configuration Health of AKS Cluster `${AKS_CLUSTER}` In Resource Group `${AZ_RESOURCE_GROUP}`
- Generate AKS Cluster Health Score
Runs diagnostic checks against an AKS cluster.
Tasks:
Tasks:
- Check for Resource Health Issues Affecting AKS Cluster `AKS_CLUSTER` In Resource Group `AZ_RESOURCE_GROUP`
- Check Configuration Health of AKS Cluster `AKS_CLUSTER` In Resource Group `AZ_RESOURCE_GROUP`
- Check Network Configuration of AKS Cluster `AKS_CLUSTER` In Resource Group `AZ_RESOURCE_GROUP`
- Fetch Activities for AKS Cluster `AKS_CLUSTER` In Resource Group `AZ_RESOURCE_GROUP`
Operational tasks for an Azure App Services, such as restarting, scaling or re-deploying.
Tasks:
Tasks:
- Restart App Service `APP_SERVICE_NAME` in Resource Group `AZ_RESOURCE_GROUP`
- Swap Deployment Slots for App Service `APP_SERVICE_NAME` in Resource Group `AZ_RESOURCE_GROUP`
- Scale Up App Service `APP_SERVICE_NAME` in Resource Group `AZ_RESOURCE_GROUP`
- Scale Down App Service `APP_SERVICE_NAME` in Resource Group `AZ_RESOURCE_GROUP`
- Scale Out Instances for App Service `APP_SERVICE_NAME` in Resource Group `AZ_RESOURCE_GROUP` by SCALE_OUT_FACTORx
- Scale In Instances for App Service `APP_SERVICE_NAME` in Resource Group `AZ_RESOURCE_GROUP` to 1/SCALE_IN_FACTOR
- Redeploy App Service `APP_SERVICE_NAME` from Latest Source in Resource Group `AZ_RESOURCE_GROUP`
Queries the health status of an Function App, and returns 0 when it's not healthy, and 1 when it is.
Tasks:
Tasks:
- Check for Resource Health Issues Affecting Function App `${FUNCTION_APP_NAME}` In Resource Group `${AZ_RESOURCE_GROUP}`
- Check Function App `${FUNCTION_APP_NAME}` Health Check Metrics In Resource Group `${AZ_RESOURCE_GROUP}`
- Check Function App `${FUNCTION_APP_NAME}` Configuration Health In Resource Group `${AZ_RESOURCE_GROUP}`
- Check Deployment Health of Function App `${FUNCTION_APP_NAME}` In Resource Group `${AZ_RESOURCE_GROUP}`
- Fetch Function App `${FUNCTION_APP_NAME}` Activities In Resource Group `${AZ_RESOURCE_GROUP}`
- Generate Function App Health Score for `${FUNCTION_APP_NAME}` in resource group `${AZ_RESOURCE_GROUP}`
Triages an Azure Function App and its workloads, checking its status and logs and verifying key metrics.
Tasks:
Tasks:
- Check for Resource Health Issues Affecting Function App `FUNCTION_APP_NAME` In Resource Group `AZ_RESOURCE_GROUP`
- Check Function App `FUNCTION_APP_NAME` Health in Resource Group `AZ_RESOURCE_GROUP`
- Fetch Function App `FUNCTION_APP_NAME` Plan Utilization Metrics In Resource Group `AZ_RESOURCE_GROUP`
- Get Function App `FUNCTION_APP_NAME` Logs In Resource Group `AZ_RESOURCE_GROUP`
- Check Configuration Health of Function App `FUNCTION_APP_NAME` In Resource Group `AZ_RESOURCE_GROUP`
- Check Deployment Health of Function App `FUNCTION_APP_NAME` In Resource Group `AZ_RESOURCE_GROUP`
- Fetch Function App `FUNCTION_APP_NAME` Activities In Resource Group `AZ_RESOURCE_GROUP`
- Check Logs for Errors in Function App `FUNCTION_APP_NAME` In Resource Group `AZ_RESOURCE_GROUP`
Queries the health status of an App Service, and returns 0 when it's not healthy, and 1 when it is.
Tasks:
Tasks:
- Check for Resource Health Issues Affecting App Service `${APP_SERVICE_NAME}` In Resource Group `${AZ_RESOURCE_GROUP}`
- Check App Service `${APP_SERVICE_NAME}` Health Check Metrics In Resource Group `${AZ_RESOURCE_GROUP}`
- Check App Service `${APP_SERVICE_NAME}` Configuration Health In Resource Group `${AZ_RESOURCE_GROUP}`
- Check Deployment Health of App Service `${APP_SERVICE_NAME}` In Resource Group `${AZ_RESOURCE_GROUP}`
- Fetch App Service `${APP_SERVICE_NAME}` Activities In Resource Group `${AZ_RESOURCE_GROUP}`
- Generate App Service Health Score for `${APP_SERVICE_NAME}` in resource group `${AZ_RESOURCE_GROUP}`
Triages an Azure App Service and its workloads, checking its status and logs and verifying key metrics.
Tasks:
Tasks:
- Check for Resource Health Issues Affecting App Service `APP_SERVICE_NAME` In Resource Group `AZ_RESOURCE_GROUP`
- Check App Service `APP_SERVICE_NAME` Health in Resource Group `AZ_RESOURCE_GROUP`
- Fetch App Service `APP_SERVICE_NAME` Utilization Metrics In Resource Group `AZ_RESOURCE_GROUP`
- Get App Service `APP_SERVICE_NAME` Logs In Resource Group `AZ_RESOURCE_GROUP`
- Check Configuration Health of App Service `APP_SERVICE_NAME` In Resource Group `AZ_RESOURCE_GROUP`
- Check Deployment Health of App Service `APP_SERVICE_NAME` In Resource Group `AZ_RESOURCE_GROUP`
- Fetch App Service `APP_SERVICE_NAME` Activities In Resource Group `AZ_RESOURCE_GROUP`
- Check Logs for Errors in App Service `APP_SERVICE_NAME` In Resource Group `AZ_RESOURCE_GROUP`
Determines if any RunWhen CodeCollection or private runner components require image updates.
Tasks:
Tasks:
- Check for CodeCollection Updates against ACR Registry`${REGISTRY_NAME}`
- Check for RunWhen Local Image Updates against ACR Registry`${REGISTRY_NAME}`
- Count Images Needing Update and Push Metric
Synchronizes CodeCollection and Helm Images for the RunWhen Runner into a private ACR Registry
Tasks:
Tasks:
- Sync CodeCollection Images to ACR Registry `REGISTRY_NAME`
- Sync RunWhen Local Image Updates to ACR Registry`REGISTRY_NAME`
Determines if any RunWhen Local images have available updates in the private Azure Container Registry service.
Tasks:
Tasks:
- Check for Available RunWhen Helm Images in ACR Registry`${REGISTRY_NAME}`
Checks (or applies) RunWhen image updates with Helm CLI if any updated images exist in the upstream ACR registry.
Tasks:
Tasks:
- Apply Available RunWhen Helm Images in ACR Registry`REGISTRY_NAME`
Runs an ad-hoc user-provided command, and if the provided command outputs a non-empty string to stdout then a health score of 0 (unhealthy) is pushed, otherwise if there is no output, indicating no issues, then a 1 is pushed.
User commands should filter expected/healthy content (eg: with grep) and only output found errors.
Tasks:
Tasks:
- ${TASK_TITLE}
Runs an ad-hoc user-provided command, and if the provided command outputs a non-empty string to stdout then an issue is generated with a configurable title and content.
User commands should filter expected/healthy content (eg: with grep) and only output found errors.
Tasks:
Tasks:
- TASK_TITLE
This sli runs a user provided azure cli command and pushes the metric. The supplied command must result in distinct single metric. Command line tools like jq are available.
Tasks:
Tasks:
- ${TASK_TITLE}
This taskset runs a user provided azure command and adds the output to the report. Command line tools like jq are available.
Tasks:
Tasks:
- TASK_TITLE