
tidb cloud: add more Prometheus monitoring metrics #21075


Merged
merged 5 commits into from
Jun 3, 2025


huoyao1125

@huoyao1125 huoyao1125 commented May 29, 2025

First-time contributors' checklist

What is changed, added or deleted? (Required)

Which TiDB version(s) do your changes apply to? (Required)

Tips for choosing the affected version(s):

By default, CHOOSE MASTER ONLY so your changes will be applied to the next TiDB major or minor releases. If your PR involves a product feature behavior change or a compatibility change, CHOOSE THE AFFECTED RELEASE BRANCH(ES) AND MASTER.

For details, see tips for choosing the affected versions.

  • master (the latest development version)
  • v9.0 (TiDB 9.0 versions)
  • v8.5 (TiDB 8.5 versions)
  • v8.4 (TiDB 8.4 versions)
  • v8.3 (TiDB 8.3 versions)
  • v8.1 (TiDB 8.1 versions)
  • v7.5 (TiDB 7.5 versions)
  • v7.1 (TiDB 7.1 versions)
  • v6.5 (TiDB 6.5 versions)
  • v6.1 (TiDB 6.1 versions)
  • v5.4 (TiDB 5.4 versions)

What is the related PR or file link(s)?

  • This PR is translated from:
  • Other reference link(s):

Do your changes match any of the following descriptions?

  • Delete files
  • Change aliases
  • Need modification after applied to another branch
  • Might cause conflicts after applied to another branch

@CLAassistant

CLAassistant commented May 29, 2025

CLA assistant check
All committers have signed the CLA.

@ti-chi-bot ti-chi-bot bot added the contribution This PR is from a community contributor. label May 29, 2025

ti-chi-bot bot commented May 29, 2025

Welcome @huoyao1125!

It looks like this is your first PR to pingcap/docs 🎉.

I'm the bot that helps you request reviewers, add labels, and more. See available commands.

We want to make sure your contribution gets all the attention it needs!



Thank you, and welcome to pingcap/docs. 😃

@ti-chi-bot ti-chi-bot bot added first-time-contributor Indicates that the PR was contributed by an external member and is a first-time contributor. needs-ok-to-test Indicates a PR created by contributors and need ORG member send '/ok-to-test' to start testing. labels May 29, 2025

ti-chi-bot bot commented May 29, 2025

Hi @huoyao1125. Thanks for your PR.

I'm waiting for a pingcap member to verify that this patch is reasonable to test. If it is, they should reply with /ok-to-test on its own line. Until that is done, I will not automatically test new commits in this PR, but the usual testing commands by org members will still work. Regular contributors should join the org to skip this step.

Once the patch is verified, the new status will be reflected by the ok-to-test label.

I understand the commands that are listed here.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository.

@ti-chi-bot ti-chi-bot bot added missing-translation-status This PR does not have translation status info. size/S Denotes a PR that changes 10-29 lines, ignoring generated files. labels May 29, 2025

@gemini-code-assist gemini-code-assist bot left a comment


Hello @huoyao1125, I'm Gemini Code Assist! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed!

Summary of Changes

Hello team, Gemini here with a summary of this pull request. This PR focuses on updating the documentation for integrating Prometheus and Grafana with TiDB Cloud. The primary changes involve adding guidance on how to handle existing Grafana dashboards when new metrics become available and listing several new Prometheus metrics that users can now scrape and visualize.

Highlights

  • Grafana Dashboard Guidance: Added a note in the Grafana integration steps recommending users create a new dashboard when incorporating newly available metrics, rather than directly updating an existing one. This helps avoid potential issues with existing configurations.
  • New Prometheus Metrics Added: Included a list of several new Prometheus metrics that are now available for monitoring TiDB Cloud. These new metrics cover areas such as node storage availability, disk read/write latency, KV request duration, component uptime, TiCDC changefeed lag and status, and Resource Manager Request Unit consumption.

Changelog

  • tidb-cloud/monitor-prometheus-and-grafana-integration.md
    • Added a sentence to step 2 of the Grafana integration section advising on dashboard updates (line 58).
    • Added a table row for the tidbcloud_node_storage_available_bytes metric (line 91).
    • Added a table row for the tidbcloud_disk_read_latency metric (line 92).
    • Added a table row for the tidbcloud_disk_write_latency metric (line 93).
    • Added a table row for the tidbcloud_kv_request_duration metric (line 94).
    • Added a table row for the tidbcloud_component_uptime metric (line 95).
    • Added a table row for the tidbcloud_ticdc_owner_resolved_ts_lag metric (line 96).
    • Added a table row for the tidbcloud_changefeed_status metric (line 97).
    • Added a table row for the tidbcloud_resource_manager_resource_unit_read_request_unit metric (line 98).
    • Added a table row for the tidbcloud_resource_manager_resource_unit_write_request_unit metric (line 99).
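
One way to confirm that the metrics listed above actually show up in a scrape is to compare their names against the scraped text. The following is a minimal sketch, not part of this PR: it assumes the scrape output is in the standard Prometheus text exposition format and is already available as a string. The helper names `metric_names` and `missing_metrics` and the sample label values are hypothetical. It matches names exactly, so a metric exposed with suffixes such as `_bucket` or `_sum` would need extra handling.

```python
import re

# Metric names taken from the changelog above.
NEW_METRICS = [
    "tidbcloud_node_storage_available_bytes",
    "tidbcloud_disk_read_latency",
    "tidbcloud_disk_write_latency",
    "tidbcloud_kv_request_duration",
    "tidbcloud_component_uptime",
    "tidbcloud_ticdc_owner_resolved_ts_lag",
    "tidbcloud_changefeed_status",
    "tidbcloud_resource_manager_resource_unit_read_request_unit",
    "tidbcloud_resource_manager_resource_unit_write_request_unit",
]

# A sample line starts with the metric name, optionally followed by
# {labels}, then the value. This pattern captures only the name.
_SAMPLE_NAME = re.compile(r"^([a-zA-Z_:][a-zA-Z0-9_:]*)")


def metric_names(exposition_text: str) -> set:
    """Return the set of metric names found in exposition-format text."""
    names = set()
    for line in exposition_text.splitlines():
        line = line.strip()
        # Skip blank lines and "# HELP" / "# TYPE" comment lines.
        if not line or line.startswith("#"):
            continue
        match = _SAMPLE_NAME.match(line)
        if match:
            names.add(match.group(1))
    return names


def missing_metrics(exposition_text: str, expected=NEW_METRICS) -> list:
    """Return the expected names that do not appear in the text (exact match)."""
    present = metric_names(exposition_text)
    return [name for name in expected if name not in present]


if __name__ == "__main__":
    # Hypothetical scrape output; the label values are made up for illustration.
    sample = (
        '# TYPE tidbcloud_component_uptime gauge\n'
        'tidbcloud_component_uptime{cluster_name="example"} 120\n'
        'tidbcloud_changefeed_status{cluster_name="example"} 1\n'
    )
    for name in missing_metrics(sample):
        print("not found in scrape:", name)
```

Running the same check before and after adding the new metrics to a scrape job gives a quick list of names that are still absent, which can help decide what belongs on a new dashboard.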


@gemini-code-assist gemini-code-assist bot left a comment


Code Review

This pull request updates the Prometheus and Grafana integration guide by adding a helpful recommendation for dashboard updates and expanding the list of available metrics. These are valuable additions that enhance the document's utility for users.

I've included a couple of minor suggestions focusing on conciseness and formatting consistency to further improve readability, in line with our documentation style guide. Specifically, these relate to simplifying phrasing [^1] and maintaining consistent formatting for placeholders, ellipsis, and HTML tags within the metrics table [^2][^3].

Additionally, please remember to complete the 'Which TiDB version(s) do your changes apply to?' section in the PR description, as this is crucial information for our users.

Overall, great work on improving this documentation!

Summary of Findings

  • Documentation Clarity: A minor suggestion was made to improve the conciseness and simplicity of the new recommendation for Grafana dashboard updates.
  • Table Formatting Consistency: Several minor inconsistencies were noted in the newly added metric table rows regarding placeholder style for cluster_name, ellipsis usage, and HTML break tags. A combined suggestion was provided to align them with existing entries for better readability and consistency.
  • PR Metadata: The 'Which TiDB version(s) do your changes apply to?' section in the PR description needs to be completed to provide necessary context for users and maintainers.

Merge Readiness

The changes are a valuable update to the monitoring documentation, adding new metrics and a helpful recommendation for dashboard updates. The suggested minor consistency improvements in the table formatting and a small tweak for conciseness will further enhance readability.

Please also ensure the 'Which TiDB version(s) do your changes apply to?' section in the PR description is completed.

As per the contribution guidelines, I've added comments directly. This PR appears to be in good shape for merging once these minor points are considered. As a reviewer, I am not authorized to approve pull requests; this PR should be reviewed and approved by other maintainers before merging.

@hfxsd hfxsd self-assigned this May 30, 2025
@huoyao1125 huoyao1125 closed this May 30, 2025
@huoyao1125 huoyao1125 reopened this May 30, 2025
@hfxsd hfxsd self-requested a review May 30, 2025 07:25
@hfxsd hfxsd added translation/no-need No need to translate this PR. area/tidb-cloud This PR relates to the area of TiDB Cloud. and removed missing-translation-status This PR does not have translation status info. labels May 30, 2025
@hfxsd
Collaborator

hfxsd commented May 30, 2025

/retest

@huoyao1125 huoyao1125 closed this May 30, 2025
@huoyao1125 huoyao1125 reopened this May 30, 2025
@hfxsd hfxsd changed the title from "Update monitor-prometheus-and-grafana-integration.md" to "tidb cloud: add more Prometheus monitoring metrics" Jun 3, 2025
@hfxsd
Collaborator

hfxsd commented Jun 3, 2025

/retest


ti-chi-bot bot commented Jun 3, 2025

@jinshayumi: adding LGTM is restricted to approvers and reviewers in OWNERS files.

In response to this:


@ti-chi-bot ti-chi-bot bot added the needs-1-more-lgtm Indicates a PR needs 1 more LGTM. label Jun 3, 2025
@Oreoxmt Oreoxmt self-requested a review June 3, 2025 06:41
@ti-chi-bot ti-chi-bot bot added lgtm and removed needs-1-more-lgtm Indicates a PR needs 1 more LGTM. labels Jun 3, 2025

ti-chi-bot bot commented Jun 3, 2025

[LGTM Timeline notifier]

Timeline:

  • 2025-06-03 06:40:31.319754083 +0000 UTC m=+337517.522099348: ☑️ agreed by hfxsd.
  • 2025-06-03 06:57:15.4905558 +0000 UTC m=+338521.692901101: ☑️ agreed by Oreoxmt.

@hfxsd
Collaborator

hfxsd commented Jun 3, 2025

/approve


ti-chi-bot bot commented Jun 3, 2025

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: hfxsd

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@ti-chi-bot ti-chi-bot bot added the approved label Jun 3, 2025
@ti-chi-bot ti-chi-bot bot merged commit f57ba7a into pingcap:release-8.1 Jun 3, 2025
9 checks passed
Labels
approved area/tidb-cloud This PR relates to the area of TiDB Cloud. contribution This PR is from a community contributor. first-time-contributor Indicates that the PR was contributed by an external member and is a first-time contributor. lgtm needs-ok-to-test Indicates a PR created by contributors and need ORG member send '/ok-to-test' to start testing. size/S Denotes a PR that changes 10-29 lines, ignoring generated files. translation/no-need No need to translate this PR.

5 participants