Allow URLs with non-ASCII characters to be parsed #1063

azrogers · 2025-01-15T19:02:46Z

As described in #973, glTFs that include URIs with non-ASCII characters in them don't currently work with Cesium Native. That's because the library we use for URI parsing, uriparser, follows the RFC 3986 standard. According to RFC 3986, only ASCII characters are allowed in URLs, and all other characters must be escaped. Unicode support came later, in RFC 3987 and now the WhatWG URL specification which seems to be the modern standard browsers aim to support. The glTF spec allows Unicode characters in URIs "as-is," meaning a strictly RFC 3986-compliant parser won't do the job. One option would be to substitute a different, WhatWG-compliant parser in its place - but as I described in this comment, this introduces more problems than it solves.

Instead, the solution I went with was to encode all non-ASCII characters in the string before passing it to uriparser, then decoding them again before returning from the method. This seems to work flawlessly - uriparser gets the RFC 3986-compliant URLs it expects, and the user gets the WhatWG-compliant URLs they expect. Unfortunately, this solution does introduce an extra layer of string copies into URL parsing. This isn't ideal, but considering URLs are usually fairly short, I think it's a worthwhile tradeoff to gain this compliance with the glTF spec.

azrogers · 2025-01-17T22:04:56Z

Closed in favor of #1072.

azrogers added 2 commits January 15, 2025 13:53

Support non-ascii characters in URLs

71ff30f

Format, CHANGES

3735a11

azrogers closed this Jan 17, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Allow URLs with non-ASCII characters to be parsed #1063

Allow URLs with non-ASCII characters to be parsed #1063

azrogers commented Jan 15, 2025

azrogers commented Jan 17, 2025

Allow URLs with non-ASCII characters to be parsed #1063

Allow URLs with non-ASCII characters to be parsed #1063

Conversation

azrogers commented Jan 15, 2025

azrogers commented Jan 17, 2025