Types: unreserve more characters #18

RileyApeldoorn · 2023-12-20T12:49:13Z

adds all missing characters specified in the sub-delims ABNF rule in RFC 3986 to the unreserved lists in Network.HTTP.Types.URI, with two exceptions for the query string list: ; is excluded because the library already treats it as a query key-value pair delimiter (so adding it would break compatibility, and does indeed cause tests to fail), and + is excluded because it has special handling in parseQueryReplacePlus.
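
For illustration, a minimal sketch of the character sets described above, using plain String and only base. The names subDelims, isUnreserved, pathExtra, queryExtra and percentEncode are hypothetical, not the library's; the actual lists in Network.HTTP.Types.URI may differ, and the sketch assumes ASCII input.

```haskell
import Data.Char (intToDigit, isAsciiLower, isAsciiUpper, isDigit, ord, toUpper)

-- sub-delims as given in RFC 3986, section 2.2.
subDelims :: String
subDelims = "!$&'()*+,;="

-- unreserved as given in RFC 3986, section 2.3:
-- ALPHA / DIGIT / "-" / "." / "_" / "~".
isUnreserved :: Char -> Bool
isUnreserved c = isAsciiUpper c || isAsciiLower c || isDigit c || c `elem` "-._~"

-- Extra characters left unencoded in a path segment (hypothetical name).
pathExtra :: String
pathExtra = subDelims

-- The same for query strings, minus ';' and '+' as described above.
queryExtra :: String
queryExtra = filter (`notElem` ";+") subDelims

-- Percent-encode every character that is neither unreserved nor in `extra`.
-- Assumes ASCII input (code points below 128).
percentEncode :: String -> String -> String
percentEncode extra = concatMap enc
  where
    enc c
      | isUnreserved c || c `elem` extra = [c]
      | otherwise = ['%', hex (ord c `div` 16), hex (ord c `mod` 16)]
    hex = toUpper . intToDigit
```

With these lists, percentEncode pathExtra leaves ; and + alone, while percentEncode queryExtra escapes them as %3B and %2B.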

Vlix · 2023-12-22T13:16:00Z

Thank you for the PR, I hope to take some time this weekend to check the correctness of these changes and see what the best course of action is in releasing, since it feels like this change might break people's previous expectations of what this code does.

Vlix · 2023-12-23T23:32:43Z

Seeing as this will change the output of a function that is already used in the wild, I'm hesitant to add this change "in-place". It would be better to add urlEncodePath and urlEncodeQuery functions, so that previous code keeps working as expected, and then slowly phase out/deprecate the urlEncode :: Bool -> ... function.
(I already hate the Bool arguments, so this might be a good way to slowly change the API)

The problem is that urlEncodeBuilder is used by both renderQueryBuilder and encodePathSegment... so the options are:

we don't change those functions ever and make new ones
we have those functions produce different output than in http-types <= 0.12.4 (at which point, why bother with phasing out urlEncode)
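
To make the first option concrete, here is a rough sketch of what keeping the old function next to two new ones could look like. It works on String with only base and assumes ASCII input; encodeWith is a hypothetical stand-in for the role urlEncodeBuilder plays, and all four character lists are placeholders rather than the library's actual lists.

```haskell
import Data.Char (intToDigit, isAsciiLower, isAsciiUpper, isDigit, ord, toUpper)

-- Shared worker (hypothetical): percent-encode everything that is not
-- alphanumeric and not in `extra`. Assumes ASCII input.
encodeWith :: String -> String -> String
encodeWith extra = concatMap enc
  where
    enc c
      | isAsciiUpper c || isAsciiLower c || isDigit c || c `elem` extra = [c]
      | otherwise = ['%', hex (ord c `div` 16), hex (ord c `mod` 16)]
    hex = toUpper . intToDigit

-- Placeholder lists, for illustration only.
oldQuery, oldPath, newQuery, newPath :: String
oldQuery = "-_.~"
oldPath = "-_.~:@&=+$,"
newQuery = oldQuery ++ ":@!$'()*,"
newPath = oldPath ++ "!'()*;"

-- String-based stand-in for the existing Bool-flag function:
-- its output never changes.
urlEncode :: Bool -> String -> String
urlEncode inQuery = encodeWith (if inQuery then oldQuery else oldPath)

-- New entry points that pick up the wider lists.
urlEncodeQuery, urlEncodePath :: String -> String
urlEncodeQuery = encodeWith newQuery
urlEncodePath = encodeWith newPath
```

In this sketch urlEncode True "x:z" still yields "x%3Az", while urlEncodeQuery "x:z" yields "x:z", so existing callers see no difference until they opt in.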

I'm already in the process of adding regression tests and other missing tests, so I'll keep this PR open until I'm happy with the coverage and then we can see if this would break anything after those tests are merged in.

Could you:

adjust the doctest to not fail on:

./Network/HTTP/Types/URI.hs:529: failure in expression `renderQueryPartialEscape True [("a", [QN "x:z + ", QE (encodeUtf8 "They said: \"שלום\"")])]'
expected: "?a=x:z + They%20said%3A%20%22%D7%A9%D7%9C%D7%95%D7%9D%22"
 but got: "?a=x:z + They%20said:%20%22%D7%A9%D7%9C%D7%95%D7%9D%22"

So that I can see that the other tests aren't failing too?

add url(En|De)code roundtrip (property) tests for all combinations of True and False, to show more obviously that this doesn't break anything.

Also, RFC 3986 allows non-percent-encoded / and ? in the query (and fragment) part as well, though not inside a path segment.

Vlix · 2024-01-07T01:50:35Z

I've added a bunch of tests and also regression tests, so this change will most certainly fail on the golden test I added. Please make sure to run the test and mv test/.golden/urlEncode-*/actual test/.golden/urlEncode-*/golden before updating this PR. 👍

RileyApeldoorn · 2024-01-31T16:59:40Z

adding / to the list of unreserved characters for query strings is no biggie, but adding the ? causes at least the following tests to fail sporadically:

parseQuery, is parsed the same regardless of question mark
encode/decode query, add ? in front of Query if and only if necessary
encode/decode query, is identity to convert to and from Simple

i have no idea how to remedy this situation, though.

also, the roundtrip test i added for urlDecode True and urlEncode False is always falsified, which makes a certain amount of sense for that combination. perhaps it should be converted into a test that is expected to fail, what do you think?
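
a small sketch of why that one combination cannot round-trip, using simplified String-based stand-ins (encode, decode, samples and roundtrips are hypothetical names, and the character lists are placeholders). the assumption is that the path list leaves + unencoded while the decoder, when its flag is True, turns + into a space.

```haskell
import Control.Monad (replicateM)
import Data.Char (chr, digitToInt, intToDigit, isAsciiLower, isAsciiUpper,
                  isDigit, isHexDigit, ord, toUpper)

-- Simplified encoder: True = query string, False = path segment.
-- The path list contains '+', the query list does not (placeholder lists).
encode :: Bool -> String -> String
encode inQuery = concatMap enc
  where
    extra = if inQuery then "-_.~" else "-_.~:@&=+$,"
    enc c
      | isAsciiUpper c || isAsciiLower c || isDigit c || c `elem` extra = [c]
      | otherwise = ['%', hex (ord c `div` 16), hex (ord c `mod` 16)]
    hex = toUpper . intToDigit

-- Simplified decoder: when the flag is True, '+' is read as a space.
decode :: Bool -> String -> String
decode plusToSpace = go
  where
    go ('%' : a : b : rest)
      | isHexDigit a && isHexDigit b = chr (digitToInt a * 16 + digitToInt b) : go rest
    go ('+' : rest)
      | plusToSpace = ' ' : go rest
    go (c : rest) = c : go rest
    go [] = []

-- Every string of length 0 to 3 over a small ASCII alphabet.
samples :: [String]
samples = concat [replicateM n "a +%/" | n <- [0 .. 3]]

-- Does decoding undo encoding for every sample?
roundtrips :: Bool -> Bool -> Bool
roundtrips plusToSpace inQuery =
  all (\s -> decode plusToSpace (encode inQuery s) == s) samples
```

under these assumptions three of the four combinations hold on the samples; roundtrips True False does not, because "+" is encoded as "+" and then decoded as " ".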

Vlix · 2024-02-02T12:40:30Z

The tests that fail when ? is not percent-encoded do so because the output will then sometimes have a ? at the front, and some tests want to make sure the function itself doesn't add a ? to the front.
The failing ones are all property tests, not unit tests, right?
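
A rough sketch of the effect, with simplified String-based stand-ins (encodeWith, decode, render and parse are hypothetical names, not the library's functions). It assumes the parser drops one optional leading ?, and that a property test will occasionally generate a key that starts with ?.

```haskell
import Data.Char (chr, digitToInt, intToDigit, isAsciiLower, isAsciiUpper,
                  isDigit, isHexDigit, ord, toUpper)
import Data.List (intercalate)

-- Percent-encode everything that is not alphanumeric and not in `extra`.
-- Assumes ASCII input.
encodeWith :: String -> String -> String
encodeWith extra = concatMap enc
  where
    enc c
      | isAsciiUpper c || isAsciiLower c || isDigit c || c `elem` extra = [c]
      | otherwise = ['%', hex (ord c `div` 16), hex (ord c `mod` 16)]
    hex = toUpper . intToDigit

-- Undo percent-encoding.
decode :: String -> String
decode ('%' : a : b : rest)
  | isHexDigit a && isHexDigit b = chr (digitToInt a * 16 + digitToInt b) : decode rest
decode (c : rest) = c : decode rest
decode [] = []

-- Render key-value pairs WITHOUT adding a leading '?'.
render :: String -> [(String, String)] -> String
render extra kvs =
  intercalate "&" [encodeWith extra k ++ "=" ++ encodeWith extra v | (k, v) <- kvs]

-- Parse a query, dropping one optional leading '?'.
parse :: String -> [(String, String)]
parse s =
  [ (decode k, decode (drop 1 v))
  | piece <- splitOn '&' (dropQ s)
  , not (null piece)
  , let (k, v) = break (== '=') piece
  ]
  where
    dropQ ('?' : rest) = rest
    dropQ other = other
    splitOn d str = case break (== d) str of
      (a, []) -> [a]
      (a, _ : rest) -> a : splitOn d rest
```

With ? escaped, render "-_.~" [("?a", "1")] gives "%3Fa=1" and parses back to the same pair. With ? left unreserved, render "-_.~/?" [("?a", "1")] gives "?a=1", which looks like a query with a prefix, so parse returns [("a", "1")] instead.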

I'll think about the urlDecode True <-> urlEncode False case, I'll come back to you on that at a later date.

Types: unreserve more characters

3cfbb53

RileyApeldoorn and others added 5 commits January 31, 2024 15:28

Tests: add property tests for url encoding/decoding roundtrips

fc98d54

Merge branch 'Vlix:master' into unreserve-more-characters

9956636

Types: un-urlencode : in renderQueryPartialEscape doctest

0239ee2

Tests: update golden test files

31db21a

Types: add ? and / to unreserved query string characters

1d46145
