Skip to content

ticdc: add new doc for TiCDC data replication capabilities (#20967) #21018

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged

Conversation

ti-chi-bot
Copy link
Member

This is an automated cherry-pick of #20967

First-time contributors' checklist

What is changed, added or deleted? (Required)

Which TiDB version(s) do your changes apply to? (Required)

Tips for choosing the affected version(s):

By default, CHOOSE MASTER ONLY so your changes will be applied to the next TiDB major or minor releases. If your PR involves a product feature behavior change or a compatibility change, CHOOSE THE AFFECTED RELEASE BRANCH(ES) AND MASTER.

For details, see tips for choosing the affected versions.

  • master (the latest development version)
  • v9.0 (TiDB 9.0 versions)
  • v8.5 (TiDB 8.5 versions)
  • v8.4 (TiDB 8.4 versions)
  • v8.3 (TiDB 8.3 versions)
  • v8.1 (TiDB 8.1 versions)
  • v7.5 (TiDB 7.5 versions)
  • v7.1 (TiDB 7.1 versions)
  • v6.5 (TiDB 6.5 versions)
  • v6.1 (TiDB 6.1 versions)
  • v5.4 (TiDB 5.4 versions)

What is the related PR or file link(s)?

Do your changes match any of the following descriptions?

  • Delete files
  • Change aliases
  • Need modification after applied to another branch
  • Might cause conflicts after applied to another branch

@ti-chi-bot ti-chi-bot added area/ticdc Indicates that the Issue or PR belongs to the area of TiCDC. do-not-merge/hold Indicates that a PR should not merge because someone has issued a /hold command. lgtm needs-1-more-lgtm Indicates a PR needs 1 more LGTM. ONCALL Relates to documentation oncall. size/M Denotes a PR that changes 30-99 lines, ignoring generated files. type/cherry-pick-for-release-8.1 This PR is cherry-picked to release-8.1 from a source PR. type/enhancement The issue or PR belongs to an enhancement. labels May 23, 2025
@ti-chi-bot
Copy link
Member Author

@hongyunyan This PR has conflicts, I have hold it.
Please resolve them or ask others to resolve them, then comment /unhold to remove the hold label.

Copy link

@gemini-code-assist gemini-code-assist bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Hello @ti-chi-bot, I'm Gemini Code Assist1! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed!

Summary of Changes

As gemini-code-assist, I've reviewed this pull request. The primary goal is to add a new documentation page detailing the data replication capabilities of TiCDC. This PR, which is an automated cherry-pick from a Chinese documentation repository, introduces the new document and updates the table of contents and the TiCDC overview page to link to it.

Highlights

  • New Documentation Page: A new document, ticdc/ticdc-data-replication-capabilities.md, has been added. It covers how TiCDC works, supported downstream systems, the scope of replicated data (supported/unsupported DDL/DML), and limitations.
  • Table of Contents Update: The TOC.md file has been updated to include a new "Reference" section under the TiCDC documentation, adding a link to the new capabilities document along with many other existing TiCDC reference links.
  • Overview Page Link: The ticdc/ticdc-overview.md file has been modified to include a link to the new data replication capabilities document in its introductory paragraph.

Changelog

  • TOC.md
    • Added a new 'Reference' section under TiCDC documentation.
    • Included a link to the new data replication capabilities document.
    • Added links to numerous other existing TiCDC reference documents.
  • ticdc/ticdc-data-replication-capabilities.md
    • New file added.
    • Contains detailed information on TiCDC's data replication capabilities, including how it works, supported downstreams, replication scope, and limitations.
  • ticdc/ticdc-overview.md
    • Added a link to the new 'TiCDC Data Replication Capabilities' document in the introduction.
Using Gemini Code Assist

The full guide for Gemini Code Assist can be found on our documentation page, here are some quick tips.

Invoking Gemini

You can request assistance from Gemini at any point in your pull request via creating an issue comment (i.e. comment on the pull request page) using either /gemini <command> or @gemini-code-assist <command>. Below is a summary of the supported commands.

Feature Command Description
Code Review /gemini review Performs a code review for the current pull request in its current state.
Pull Request Summary /gemini summary Provides a summary of the current pull request in its current state.
Comment @gemini-code-assist Responds in comments when explicitly tagged, both in issue comments and review comments.
Help /gemini help Displays a list of available commands.

Customization

To customize Gemini Code Assist for GitHub experience, repository maintainers can create a configuration file and/or provide a custom code review style guide (such as PEP-8 for Python) by creating and adding files to a .gemini/ folder in the base of the repository. Detailed instructions can be found here.

Limitations & Feedback

Gemini Code Assist is currently in preview and may make mistakes. Please leave feedback on any instances where its feedback is incorrect or counter productive. You can react with 👍 and 👎 on @gemini-code-assist comments to provide feedback.

You can also get AI-powered code generation, chat, as well as code reviews directly in the IDE at no cost with the Gemini Code Assist IDE Extension.

Footnotes

  1. Review the Privacy Notices, Generative AI Prohibited Use Policy, Terms of Service, and learn how to configure Gemini Code Assist in GitHub here. Gemini can make mistakes, so double check it and use code with caution.

Copy link

@gemini-code-assist gemini-code-assist bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Code Review

This pull request introduces a new document, ticdc-data-replication-capabilities.md, detailing TiCDC's replication capabilities, and updates the Table of Contents and the TiCDC overview page to include links to this new document. The new content is generally clear and informative.

However, there are critical merge conflict markers in TOC.md and ticdc/ticdc-overview.md that must be resolved before merging. Additionally, the new documentation file has a few minor stylistic and formatting issues that could be improved for consistency and readability.

I've reviewed the changes based on the provided style guide, focusing on clarity, technical accuracy, formatting, and adherence to Markdown best practices.

Summary of Findings

  • Merge Conflict Markers: Files TOC.md and ticdc/ticdc-overview.md contain merge conflict markers (e.g., <<<<<<< HEAD, =======, >>>>>>> ...). These are critical issues that will break the documentation and must be resolved.
  • Markdown Formatting: In ticdc/ticdc-data-replication-capabilities.md, there are minor formatting issues such as non-standard list markers (+ instead of - or *), a missing space, a zero-width space in a heading, and a missing newline at the end of the file. Addressing these will improve consistency and readability.
  • Style Guide Adherence: The review considered elements from the style guide, including writing in the second person, using present tense, consistent terminology, sentence case for headings, proper use of backticks, and Markdown formatting best practices. The new document generally adheres well, with minor areas for improvement noted.

Merge Readiness

This pull request adds valuable documentation on TiCDC's data replication capabilities. However, it cannot be merged in its current state due to critical merge conflict markers in TOC.md and ticdc/ticdc-overview.md. These must be resolved first. There are also some minor stylistic suggestions for the new ticdc-data-replication-capabilities.md file that would enhance its quality.

Once the critical issues are addressed, this PR will be in much better shape. As a reviewer, I am not authorized to approve pull requests, so please ensure these changes are made and then seek further review and approval from the maintainers.

TOC.md Outdated
Comment on lines 159 to 160
<<<<<<< HEAD
=======

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

critical

It appears there are merge conflict markers (<<<<<<< HEAD and =======) in the file. These need to be removed to ensure the document renders correctly. Could you please resolve these conflicts?

TOC.md Outdated
- [Troubleshoot](/ticdc/troubleshoot-ticdc.md)
- [FAQs](/ticdc/ticdc-faq.md)
- [Glossary](/ticdc/ticdc-glossary.md)
>>>>>>> 2ebdc6db76 (ticdc: add new doc for TiCDC data replication capabilities (#20967))

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

critical

This line seems to be a merge conflict marker (>>>>>>> ...). This should be removed. Could you please resolve this?

@@ -5,7 +5,11 @@ summary: Learn what TiCDC is, what features TiCDC provides, and how to install a

# TiCDC Overview

<<<<<<< HEAD

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

critical

This line appears to be a merge conflict marker (<<<<<<< HEAD). This needs to be removed to ensure the document renders correctly. Could you please resolve this conflict?

[TiCDC](https://github.com/pingcap/tiflow/tree/release-8.1/cdc) is a tool used to replicate incremental data from TiDB. Specifically, TiCDC pulls TiKV change logs, sorts captured data, and exports row-based incremental data to downstream databases.
=======

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

critical

This line appears to be a merge conflict marker (=======). This needs to be removed. Could you please resolve this?

[TiCDC](https://github.com/pingcap/tiflow/tree/release-8.1/cdc) is a tool used to replicate incremental data from TiDB. Specifically, TiCDC pulls TiKV change logs, sorts captured data, and exports row-based incremental data to downstream databases.
=======
[TiCDC](https://github.com/pingcap/tiflow/tree/master/cdc) is a tool used to replicate incremental data from TiDB. Specifically, TiCDC pulls TiKV change logs, sorts captured data, and exports row-based incremental data to downstream databases. For detailed data replication capabilities, see [TiCDC Data Replication Capabilities](/ticdc/ticdc-data-replication-capabilities.md).
>>>>>>> 2ebdc6db76 (ticdc: add new doc for TiCDC data replication capabilities (#20967))

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

critical

This line appears to be a merge conflict marker (>>>>>>> ...). This needs to be removed. Could you please resolve this?


+ **Supported:**

- DDL and DML statements(excluding system tables).

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

low

It looks like there's a missing space between 'statements' and '(excluding system tables)'. Adding a space would improve readability. 1

Style Guide References

Suggested change
- DDL and DML statements(excluding system tables).
- DDL and DML statements (excluding system tables).

Footnotes

  1. The style guide emphasizes correct English grammar and punctuation. A missing space is a minor punctuation/grammatical issue. (link)

- DDL and DML statements executed in upstream temporary tables.
- DQL (Data Query Language) and DCL (Data Control Language) statements.

## Limitations​

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

low

There seems to be a zero-width space character (U+200B) at the end of the heading 'Limitations​'. This character is invisible but can sometimes cause rendering or processing issues. It's best to remove it. 1

Style Guide References

Suggested change
## Limitations
## Limitations

Footnotes

  1. Ensuring clarity, simplicity, and readability includes avoiding non-standard or potentially problematic characters. (link)

Copy link

ti-chi-bot bot commented May 23, 2025

@gemini-code-assist[bot]: adding LGTM is restricted to approvers and reviewers in OWNERS files.

In response to this:

Code Review

This pull request introduces a new document, ticdc-data-replication-capabilities.md, detailing TiCDC's replication capabilities, and updates the Table of Contents and the TiCDC overview page to include links to this new document. The new content is generally clear and informative.

However, there are critical merge conflict markers in TOC.md and ticdc/ticdc-overview.md that must be resolved before merging. Additionally, the new documentation file has a few minor stylistic and formatting issues that could be improved for consistency and readability.

I've reviewed the changes based on the provided style guide, focusing on clarity, technical accuracy, formatting, and adherence to Markdown best practices.

Summary of Findings

  • Merge Conflict Markers: Files TOC.md and ticdc/ticdc-overview.md contain merge conflict markers (e.g., <<<<<<< HEAD, =======, >>>>>>> ...). These are critical issues that will break the documentation and must be resolved.
  • Markdown Formatting: In ticdc/ticdc-data-replication-capabilities.md, there are minor formatting issues such as non-standard list markers (+ instead of - or *), a missing space, a zero-width space in a heading, and a missing newline at the end of the file. Addressing these will improve consistency and readability.
  • Style Guide Adherence: The review considered elements from the style guide, including writing in the second person, using present tense, consistent terminology, sentence case for headings, proper use of backticks, and Markdown formatting best practices. The new document generally adheres well, with minor areas for improvement noted.

Merge Readiness

This pull request adds valuable documentation on TiCDC's data replication capabilities. However, it cannot be merged in its current state due to critical merge conflict markers in TOC.md and ticdc/ticdc-overview.md. These must be resolved first. There are also some minor stylistic suggestions for the new ticdc-data-replication-capabilities.md file that would enhance its quality.

Once the critical issues are addressed, this PR will be in much better shape. As a reviewer, I am not authorized to approve pull requests, so please ensure these changes are made and then seek further review and approval from the maintainers.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository.

@lilin90 lilin90 assigned lilin90 and unassigned hongyunyan May 23, 2025
Copy link
Member

@lilin90 lilin90 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

/approve

Copy link

ti-chi-bot bot commented May 23, 2025

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: lilin90

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@ti-chi-bot ti-chi-bot bot added the approved label May 23, 2025
@lilin90
Copy link
Member

lilin90 commented May 23, 2025

/unhold

@ti-chi-bot ti-chi-bot bot removed the do-not-merge/hold Indicates that a PR should not merge because someone has issued a /hold command. label May 23, 2025
@ti-chi-bot ti-chi-bot bot merged commit f05e5b4 into pingcap:release-8.1 May 23, 2025
9 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
approved area/ticdc Indicates that the Issue or PR belongs to the area of TiCDC. lgtm needs-1-more-lgtm Indicates a PR needs 1 more LGTM. ONCALL Relates to documentation oncall. size/M Denotes a PR that changes 30-99 lines, ignoring generated files. type/cherry-pick-for-release-8.1 This PR is cherry-picked to release-8.1 from a source PR. type/enhancement The issue or PR belongs to an enhancement.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants