Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Update azure-vm-watch.md #147

Open
wants to merge 2 commits into
base: main
Choose a base branch
from
Open
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
29 changes: 25 additions & 4 deletions articles/virtual-machines/azure-vm-watch.md
Original file line number Diff line number Diff line change
Expand Up @@ -28,10 +28,11 @@ VM watch is delivered via the [Application Health VM extension](/azure/virtual-m
|:---:|:---:|:---:|
| **Outbound connectivity** | Check | Verify the network outbound connectivity from the Azure VM. |
| **DNS Resolution** | Check | Verify if one or more DNS names can be resolved. |
| **TCPSynRetransmits (Linux Only)** | Metric | The number of times the system retransmits a TCP SYN and SYN/ACK packet before giving up on establishing a connection.|
| **SegmentsRetransmitted** | Metric | The number of transmitted TCP segments that contain one or more previously transmitted octets. |
| **NormalizedSegmentsRetransmitted** | Metric | **SegmentsRetransmitted** / (**SegmentsSent** + **SegmentsReceived**) |
| **NormalizedSegmentsRetransmitted** | Metric | **SegmentsRetransmitted** / (**SegmentsSent** + **SegmentsRetransmitted**) |
| **ConnectionResets** | Metric | The number of times that TCP connections made a direct transition to the `CLOSED` state from either the `ESTABLISHED` state or the `CLOSE_WAIT` state. |
| **NormalizedConnectionResets** | Metric | **ConnectionResets** / **CurrentConnections** |
| **NormalizedConnectionResets** | Metric | The percentage of connections that were reset during the last measurement interval. |
| **FailedConnectionAttempts** | Metric |The number of times that TCP connections made a direct transition to the `CLOSED` state from either the `SYN_SENT` state or the `SYN_RCVD` state. |
| **NormalizedFailedConnectionAttempts** | Metric | **FailedConnectionAttempts** / (**ActiveConnectionOpenings** + **PassiveConnectionOpenings**) |
| **ActiveConnectionOpenings** | Metric | The number of times that TCP connections made a direct transition to the `SYN_SENT` state from the `CLOSED` state. |
Expand All @@ -40,6 +41,7 @@ VM watch is delivered via the [Application Health VM extension](/azure/virtual-m
| **SegmentsReceived** | Metric | The number of segments received, including segments received in error. |
| **SegmentsSent** | Metric | The number of segments sent, including segments on current connections but excluding segments that contain only retransmitted octets. |


### Disk

| **Signal name** | **Type** | **Description** |
Expand All @@ -56,10 +58,20 @@ VM watch is delivered via the [Application Health VM extension](/azure/virtual-m

| **Signal name** | **Type** | **Description** |
|:---:|:---:|:---:|
| **ProcessCoreUsage** | Metric | An instantaneous measurement of the percentage of a single CPU core that the target process is using (100 = 100%, a whole core). |
| **ProcessMachineUsage** | Metric | The percentage of the machine's total CPU that this process is using. |
| **ProcessCPUCoreUsage** | Metric | An instantaneous measurement of the percentage of a single CPU core that the target process is using (100 = 100%, a whole core). |
| **ProcessCPUMachineUsage** | Metric | The percentage of the machine's total CPU that this process is using. |
| **MachineTotalCpuUsage** | Metric | The VM's total instantaneous CPU utilization. |

### Memory

| **Signal name** | **Type** | **Description** |
|:---:|:---:|:---:|
| **ProcessRSSPercent** | Metric | **Process RSS** / (**Machine Total Memory** * **100%**) |
| **ProcessPageFaults** | Metric | The number of page faults since the process started. |
| **MachineMemoryTotalInBytes** | Metric | The VM's total Memory in Bytes. |
| **MachineMemoryUsedPercent** | Metric | **Machine Used Memory** / (**Machine Total Memory** * **100%**) |
| **TotalPageFaults** | Metric | The total number of page faults for all running processes since they started. |

### Process

| **Signal name** | **Type** | **Description** |
Expand All @@ -80,6 +92,12 @@ VM watch is delivered via the [Application Health VM extension](/azure/virtual-m
|:---:|:---:|:---:|
| **Clock Skew** | Check | Verify the clock skew between the remote Network Time Protocol (NTP) server and the Azure VM. For a Windows VM, fall back to check if the Windows Time service is synced with `w32tm` if the remote NTP server is inaccessible. |

### OS

| **Signal name** | **Type** | **Description** |
|:---:|:---:|:---:|
| **System Errors** | Metric | Collect the number of errors from the system-level event log (Windows only) where the SystemData <=2 (including LOG_ALWAYS, Critical, Error). The measurementTarget is defined as the Source_EventId of the EventLog using default Windows locale. Each collection is limited to more than 10 different measurement targets. |

### azblob

| **Signal name** | **Type** | **Description** |
Expand All @@ -91,7 +109,10 @@ VM watch is delivered via the [Application Health VM extension](/azure/virtual-m
| **Signal name** | **Type** | **Description** |
|:---:|:---:|:---:|
| **Hardware Health Monitor** | EventLog | Collect hardware health info from the Windows event log. Currently, only disk-related critical events are collected, including events with ID 7, 500, 504, 505, 512, and 549. |
| **Hardware Health Nvidia Smi** | EventLog | Collect GPU stats including memory and GPU usage, temp and others by running nvidia-smi command (Linux Ubuntu only) |

## Related content

- VM watch Collector Suites
- [Install VM watch](install-vm-watch.md)
- Configure VM watch