Update McpToolUtils.java #4192

shishuiwuhen2009 · 2025-08-20T04:10:59Z

fix McpToolUtils.prefixedToolName formatted in order to support Chinese characters in toolName

fix McpToolUtils.prefixedToolName formatted in order to support Chinese characters in toolName Signed-off-by: shishuiwuhen2009 <[email protected]>

fix McpToolUtils.prefixedToolName formatted in order to support Chinese characters in toolName. Signed-off-by: shishuiwuhen2009 <[email protected]>

Replace hard-coded Unicode range with comprehensive Unicode property approach to fix incomplete Han character coverage in MCP tool name formatting. Changes: - Replace \u4e00-\u9fa5 range with union of Unicode Script and Block properties - Use \p{IsHan} + \p{InCJK_Unified_Ideographs} + \p{InCJK_Compatibility_Ideographs} - Fix boundary case where \u9fff was incorrectly excluded by script-only approach - Add comprehensive test coverage for all Han character blocks and edge cases Technical details: - Addresses Unicode Script vs Block classification differences across JDK versions - \u9fff (鿿) is in CJK Unified Ideographs block but not Han script in some JDKs - Union approach ensures complete coverage while maintaining exclusion of other scripts - Future-proof solution that automatically includes new Han characters in Unicode updates Test coverage added: - CJK Unified Ideographs boundary cases (\u4e00, \u9fff) - CJK Extension A characters (\u3400) - CJK Compatibility Ideographs (\uf900) - Mixed character block scenarios - Proper exclusion verification for non-Han scripts (Hiragana, Emoji, etc.) Fixes incomplete Chinese character support while maintaining backward compatibility and minimal risk profile of the original change. Signed-off-by: shishuiwuhen2009 Signed-off-by: Mark Pollack <[email protected]> Fixes #4192 (cherry picked from commit 35486e9)

markpollack · 2025-08-21T03:59:38Z

Thanks!!! had some help from ai! let me know what you think of the final changes @shishuiwuhen2009 . it is also backported.

shishuiwuhen2009 · 2025-08-21T04:25:44Z

Thank you so much for your detailed modification suggestions! This Unicode property-based approach for Chinese character support is much more standardized and comprehensive than my original hard-coded range method. I’ve learned a lot about robust Unicode handling—especially the nuances of Script vs. Block classification across JDK versions and how to ensure full Han character coverage. It’s really helpful for improving the quality of my code. Great work on pointing out these key details! 👍

shishuiwuhen2009 added 2 commits August 20, 2025 12:09

Update McpToolUtils.java

4e7252c

fix McpToolUtils.prefixedToolName formatted in order to support Chinese characters in toolName Signed-off-by: shishuiwuhen2009 <[email protected]>

Update McpToolUtils.java

e95e632

fix McpToolUtils.prefixedToolName formatted in order to support Chinese characters in toolName. Signed-off-by: shishuiwuhen2009 <[email protected]>

markpollack closed this in 35486e9 Aug 21, 2025

spring-builds added the status: backported label Aug 21, 2025

spring-builds mentioned this pull request Aug 21, 2025

Update McpToolUtils.java #4202

Closed

markpollack self-assigned this Aug 21, 2025

markpollack added the MCP label Aug 21, 2025

markpollack added this to the 1.1.0.M1 milestone Aug 21, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Update McpToolUtils.java #4192

Update McpToolUtils.java #4192

shishuiwuhen2009 commented Aug 20, 2025

Uh oh!

markpollack commented Aug 21, 2025

Uh oh!

shishuiwuhen2009 commented Aug 21, 2025

Uh oh!

Uh oh!

Update McpToolUtils.java #4192

Update McpToolUtils.java #4192

Conversation

shishuiwuhen2009 commented Aug 20, 2025

Uh oh!

markpollack commented Aug 21, 2025

Uh oh!

shishuiwuhen2009 commented Aug 21, 2025

Uh oh!

Uh oh!