change escaping to hex escape sequences #65

michaelficarra · 2023-11-21T01:01:29Z

There's no need to add the complexity of single-character identity escapes for every ASCII punctuator. I would prefer escaping using hex escape sequences instead, as discussed in #58. The only argument given against this is that you'd have to copy-paste any RegExp constructed using this function into a RegExp explainer to understand it, but let's be honest, you were going to have to do that anyway. @sophiebits also points out that by not modifying the grammar, we allow this feature to be polyfilled in older browsers.
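To make the grammar point concrete, here is a minimal sketch of how the two escape styles behave under the RegExp grammar as it stands today. The variable names are illustrative only. It assumes an engine that supports the `u` flag; the hex form is accepted in both modes, while the identity escape `\&` is rejected in Unicode mode unless the grammar is extended.

```javascript
// Hex escape of "&" (U+0026): valid with or without the `u` flag.
const hexPlain = new RegExp("a\\x26b");
const hexUnicode = new RegExp("a\\x26b", "u");

// Identity escape of "&": accepted in non-Unicode mode only.
const identityPlain = new RegExp("a\\&b");

// In Unicode mode the same pattern is a SyntaxError under today's grammar.
let identityUnicodeError = null;
try {
  new RegExp("a\\&b", "u");
} catch (err) {
  identityUnicodeError = err;
}

console.log(hexPlain.test("a&b"), hexUnicode.test("a&b"), identityPlain.test("a&b"));
console.log(identityUnicodeError instanceof SyntaxError);
```

Because the hex form already parses everywhere, a userland implementation could emit it without any engine changes, which is the polyfilling argument.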

bakkot · 2023-11-21T01:11:16Z

What's the argument for doing this, other than the polyfilling thing?

michaelficarra · 2023-11-21T01:20:00Z

Less RegExp grammar complexity. While I still assert that nobody should be reading the output of RegExp.escape, these grammar additions apply to all RegExps, which will mean I will have to read (or at least be on the lookout for) escaped ASCII punctuators in any RegExp context. I don't want them if they serve no purpose other than to make it harder for me to mentally parse a RegExp.

bakkot · 2023-11-21T01:43:19Z

I'd prefer to encounter \& rather than \x26. At least I have some hope of figuring out what the first one means (i.e., &, the same as how \- means -, etc).

ljharb · 2023-11-21T01:44:31Z

I agree; I would expect developers are quite comfortable with a backslash being a noop for the character, whereas hex escapes would be wildly unfamiliar.

oliverfoster · 2023-11-21T07:31:57Z

As a lay person, if I may, I've got some questions.

Punctuator escaping

a) As hex

Polyfillable
Less complex

b) As human readable characters

More easily human readable
Shorter, prettier

Potential additional complexity

It sounds to me like a one- or two-line change, with a lookup table or equivalent for the current punctuators. Is that a fair assessment? Or is it considerably more complex to produce one over the other?

Preference

I'm in favour of whichever is simpler. I'd be happy if anything that impedes the progress of .escape is parked for a later date.
I don't think hex escaping is wildly unfamiliar (encodeURI, HTML special characters), and I agree that \& feels perfectly readable, if not normal (regex escape sequences).

ljharb · 2023-11-21T12:23:12Z

@oliverfoster this can’t be parked for later; it has to be decided before the feature ships and likely can never be changed in the future.

Spec complexity will likely be about the same with either approach; a line or two of grammar vs a line or two to do the hex escape.

DJ-Laser · 2023-12-06T20:44:20Z

I feel like polyfilling for older browsers is more important, and there can always be a function to translate hex codes into backslash-escaped characters.

ljharb · 2023-12-06T20:55:48Z

We don't generally make changes to proposals solely due to polyfillability.

ljharb · 2024-02-07T23:06:11Z

Rough consensus was to make this change; I'll do that, and then come back in a future meeting to seek stage 2.7.

Fixes #65

bakkot · 2024-03-22T22:41:08Z

Couple comments:

uppercase or lowercase?
some whitespace is not ASCII and so needs \u rather than \x
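The `\x` versus `\u` distinction can be sketched as follows. `hexEscapeCodeUnit` is a hypothetical helper written for illustration, not the proposal's spec text, and it uses lowercase hex digits as an assumption. A `\x` escape takes exactly two hex digits, so it only reaches code units up to 0xFF; anything above that needs the four-digit `\u` form.

```javascript
// Hypothetical helper: escape a single UTF-16 code unit.
// Assumes `ch` is a one-code-unit string (all ECMAScript whitespace is in the BMP).
function hexEscapeCodeUnit(ch) {
  const code = ch.charCodeAt(0);
  const hex = code.toString(16); // lowercase digits
  return code <= 0xff
    ? "\\x" + hex.padStart(2, "0")  // two-digit form, e.g. \x20
    : "\\u" + hex.padStart(4, "0"); // four-digit form, e.g. \u2028
}

const space = hexEscapeCodeUnit(" ");        // ASCII space
const lineSep = hexEscapeCodeUnit("\u2028"); // LINE SEPARATOR, non-ASCII whitespace

// The escaped forms still match the original characters.
const spaceMatches = new RegExp("^" + space + "$").test(" ");
const lineSepMatches = new RegExp("^" + lineSep + "$", "u").test("\u2028");

console.log(space, lineSep, spaceMatches, lineSepMatches);
```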

ljharb · 2024-03-22T22:41:13Z

Filed #67. Currently goes with lowercase.

michaelficarra · 2024-03-22T23:06:33Z

The Encode AO (currently used by encodeURI and encodeURIComponent) uses uppercase.

Let hex be the String representation of octet, formatted as an uppercase hexadecimal number.

ljharb · 2024-03-22T23:14:07Z

True, but the base64 proposal uses lowercase, as does Number.prototype.toString.
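The two existing conventions mentioned here are easy to compare side by side. This is a small illustration using only built-ins; the variable names are made up for the example. It also shows that the casing choice is purely cosmetic for matching, since hex digits in a RegExp escape are case-insensitive.

```javascript
const slash = "/".charCodeAt(0); // 47, i.e. 0x2F

// Number.prototype.toString produces lowercase hex digits.
const lower = slash.toString(16);

// encodeURIComponent (via the Encode operation) produces uppercase hex digits.
const upper = encodeURIComponent("/");

// Either casing matches the same character in a pattern.
const bothMatch = /^\x2f$/.test("/") && /^\x2F$/.test("/");

console.log(lower, upper, bothMatch);
```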

ljharb mentioned this issue Nov 21, 2023

Path to Stage 4! #58

Open

32 tasks

ljharb added a commit that referenced this issue Mar 22, 2024

[spec] hex-escape punctuators

a293f72

Fixes #65

ljharb mentioned this issue Mar 22, 2024

[spec] hex-escape punctuators #67

Merged

ljharb closed this as completed in f01c310 Mar 25, 2024
