Replace chunked write API #10138

yawkat · 2023-11-20T14:57:29Z

This PR replaces the writeChunked and writeFile APIs with a new writeStream API that takes an InputStream. This removes the need for the ChunkedWriteHandler.

Chunked writes were used for two purposes: Sending file regions and sending InputStreams. This has always complicated the HTTP pipeline somewhat as the pipeline had to deal with not just HttpContent objects but also ChunkedInput and FileRegion objects.

This PR replaces the machinery for InputStream writing with a more straight-forward solution that reads the data on the IO thread and then sends it down the channel.

Additionally, the file-specific APIs based on RandomAccessFile are removed. The body writer now just creates an InputStream for the file region in question and sends that. This removes support for zero-copy transfers, however that is a niche feature anyway because it doesn't work with TLS or HTTP/2. If someone wants a performant HTTP server, HTTP/2 takes priority over zero-copy so it makes little sense.

This PR may have small conflicts with #10131 as that PR changed the PipeliningServerHandler body handling a little bit. Otherwise this PR should have no visible impact on users.

This PR replaces the writeChunked and writeFile APIs with a new writeStream API that takes an InputStream. This removes the need for the ChunkedWriteHandler. Chunked writes were used for two purposes: Sending file regions and sending InputStreams. This has always complicated the HTTP pipeline somewhat as the pipeline had to deal with not just HttpContent objects but also ChunkedInput and FileRegion objects. This PR replaces the machinery for InputStream writing with a more straight-forward solution that reads the data on the IO thread and then sends it down the channel. Additionally, the file-specific APIs based on RandomAccessFile are removed. The body writer now just creates an InputStream for the file region in question and sends that. This removes support for zero-copy transfers, however that is a niche feature anyway because it doesn't work with TLS or HTTP/2. If someone wants a performant HTTP server, HTTP/2 takes priority over zero-copy so it makes little sense. This PR may have small conflicts with #10131 as that PR changed the PipeliningServerHandler body handling a little bit. Otherwise this PR should have no visible impact on users.

timyates · 2023-11-20T16:16:50Z

This removes support for zero-copy transfers, however that is a niche feature anyway because it doesn't work with TLS or HTTP/2. If someone wants a performant HTTP server, HTTP/2 takes priority over zero-copy so it makes little sense.

Does the techempower test take advantage of zero copy transfers? 🤔

timyates

I think there's an unused class, and I'm not sure who closes FileInputStreams

But that's probably a gap in my knowledge, rather than an issue here

I always feel out of my depth with these reviews 😉

timyates · 2023-11-20T16:22:03Z

http-server-netty/src/main/java/io/micronaut/http/server/netty/body/SystemFileBodyWriter.java

@@ -175,4 +174,90 @@ private static class IntRange {
        }
    }

+    private static class RafInputStream extends InputStream {


Is this class used?

nope, will remove

timyates · 2023-11-20T16:23:37Z

http-server-netty/src/main/java/io/micronaut/http/server/netty/body/SystemFileBodyWriter.java

+                File file = systemFile.getFile();
+                InputStream is;
+                try {
+                    is = new FileInputStream(file);


Who closes this?

it is passed on to PipeliningServerHandler where BlockingOutboundHandler.work closes it using try-with-resources

yawkat · 2023-11-20T16:35:04Z

@timyates it does not, it doesn't serve file system files afaik.

timyates · 2023-11-20T16:47:09Z

@dstepanov Do you know if any of the techempower tests serve files from the filesystem and might be affected by the removal of zero-copy transfers?

yawkat · 2023-11-20T16:51:45Z

denis is on vacation.

ive looked, and they don't use the file system.

sonarcloud · 2023-11-21T07:31:52Z

Kudos, SonarCloud Quality Gate passed!

0 Bugs
0 Vulnerabilities
0 Security Hotspots
5 Code Smells

70.5% Coverage
0.0% Duplication

PipeliningServerHandler was supposed to implement backpressure, but it turns out that auto read was still enabled and that the implementation didn't really work. This means that it would keep reading even if that means buffering data when the downstream can't keep up. This PR disables auto read and fixes the read implementation in PipeliningServerHandler. In principle there should be no change to users, this just means that instead of buffering any incoming data internally, backpressure is now applied to the client. This PR is based on #10138 but is separate for easier review. It also has conflicts with #10131.

* Replace chunked write API This PR replaces the writeChunked and writeFile APIs with a new writeStream API that takes an InputStream. This removes the need for the ChunkedWriteHandler. Chunked writes were used for two purposes: Sending file regions and sending InputStreams. This has always complicated the HTTP pipeline somewhat as the pipeline had to deal with not just HttpContent objects but also ChunkedInput and FileRegion objects. This PR replaces the machinery for InputStream writing with a more straight-forward solution that reads the data on the IO thread and then sends it down the channel. Additionally, the file-specific APIs based on RandomAccessFile are removed. The body writer now just creates an InputStream for the file region in question and sends that. This removes support for zero-copy transfers, however that is a niche feature anyway because it doesn't work with TLS or HTTP/2. If someone wants a performant HTTP server, HTTP/2 takes priority over zero-copy so it makes little sense. This PR may have small conflicts with #10131 as that PR changed the PipeliningServerHandler body handling a little bit. Otherwise this PR should have no visible impact on users. * remove unused class * remove unused class * Fix request backpressure PipeliningServerHandler was supposed to implement backpressure, but it turns out that auto read was still enabled and that the implementation didn't really work. This means that it would keep reading even if that means buffering data when the downstream can't keep up. This PR disables auto read and fixes the read implementation in PipeliningServerHandler. In principle there should be no change to users, this just means that instead of buffering any incoming data internally, backpressure is now applied to the client. This PR is based on #10138 but is separate for easier review. It also has conflicts with #10131. * fix test

* Replace chunked write API This PR replaces the writeChunked and writeFile APIs with a new writeStream API that takes an InputStream. This removes the need for the ChunkedWriteHandler. Chunked writes were used for two purposes: Sending file regions and sending InputStreams. This has always complicated the HTTP pipeline somewhat as the pipeline had to deal with not just HttpContent objects but also ChunkedInput and FileRegion objects. This PR replaces the machinery for InputStream writing with a more straight-forward solution that reads the data on the IO thread and then sends it down the channel. Additionally, the file-specific APIs based on RandomAccessFile are removed. The body writer now just creates an InputStream for the file region in question and sends that. This removes support for zero-copy transfers, however that is a niche feature anyway because it doesn't work with TLS or HTTP/2. If someone wants a performant HTTP server, HTTP/2 takes priority over zero-copy so it makes little sense. This PR may have small conflicts with #10131 as that PR changed the PipeliningServerHandler body handling a little bit. Otherwise this PR should have no visible impact on users. * remove unused class * remove unused class * Fix request backpressure PipeliningServerHandler was supposed to implement backpressure, but it turns out that auto read was still enabled and that the implementation didn't really work. This means that it would keep reading even if that means buffering data when the downstream can't keep up. This PR disables auto read and fixes the read implementation in PipeliningServerHandler. In principle there should be no change to users, this just means that instead of buffering any incoming data internally, backpressure is now applied to the client. This PR is based on #10138 but is separate for easier review. It also has conflicts with #10131. * Implement decompression in PipeliningServerHandler This patch implements the logic of HttpContentDecompressor in PipeliningServerHandler. This allows us to shrink the pipeline a little. The perf impact for uncompressed requests should basically be zero. This builds on the changes in #10142. * address review * revert * add DecompressionSpec

* Replace chunked write API This PR replaces the writeChunked and writeFile APIs with a new writeStream API that takes an InputStream. This removes the need for the ChunkedWriteHandler. Chunked writes were used for two purposes: Sending file regions and sending InputStreams. This has always complicated the HTTP pipeline somewhat as the pipeline had to deal with not just HttpContent objects but also ChunkedInput and FileRegion objects. This PR replaces the machinery for InputStream writing with a more straight-forward solution that reads the data on the IO thread and then sends it down the channel. Additionally, the file-specific APIs based on RandomAccessFile are removed. The body writer now just creates an InputStream for the file region in question and sends that. This removes support for zero-copy transfers, however that is a niche feature anyway because it doesn't work with TLS or HTTP/2. If someone wants a performant HTTP server, HTTP/2 takes priority over zero-copy so it makes little sense. This PR may have small conflicts with #10131 as that PR changed the PipeliningServerHandler body handling a little bit. Otherwise this PR should have no visible impact on users. * remove unused class * remove unused class * Fix request backpressure PipeliningServerHandler was supposed to implement backpressure, but it turns out that auto read was still enabled and that the implementation didn't really work. This means that it would keep reading even if that means buffering data when the downstream can't keep up. This PR disables auto read and fixes the read implementation in PipeliningServerHandler. In principle there should be no change to users, this just means that instead of buffering any incoming data internally, backpressure is now applied to the client. This PR is based on #10138 but is separate for easier review. It also has conflicts with #10131. * Implement decompression in PipeliningServerHandler This patch implements the logic of HttpContentDecompressor in PipeliningServerHandler. This allows us to shrink the pipeline a little. The perf impact for uncompressed requests should basically be zero. This builds on the changes in #10142. * address review * revert * add DecompressionSpec * Compression support in PipeliningServerHandler Like #10155

yawkat added the type: improvement A minor improvement to an existing feature label Nov 20, 2023

yawkat added this to the 4.3.0 milestone Nov 20, 2023

yawkat requested review from timyates and graemerocher November 20, 2023 14:57

timyates approved these changes Nov 20, 2023

View reviewed changes

remove unused class

590f987

remove unused class

f7e0f25

yawkat mentioned this pull request Nov 21, 2023

Fix request backpressure #10142

Merged

yawkat requested review from dstepanov and removed request for graemerocher December 7, 2023 10:09

dstepanov approved these changes Dec 8, 2023

View reviewed changes

yawkat merged commit cfc3092 into 4.3.x Dec 8, 2023
16 checks passed

yawkat deleted the no-chunks branch December 8, 2023 11:09

sdelamo mentioned this pull request Jan 5, 2024

Merge 4.2.x into 4.3.x #10329

Merged

This was referenced May 5, 2024

Update dependency io.micronaut:micronaut-inject-java to v4 piomin/sample-micronaut-applications#36

Open

Update dependency io.micronaut:micronaut-inject-java to v4 - autoclosed itobey/datadog-api-collector#29

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Replace chunked write API #10138

Replace chunked write API #10138

yawkat commented Nov 20, 2023

timyates commented Nov 20, 2023

timyates left a comment

timyates Nov 20, 2023

yawkat Nov 20, 2023

timyates Nov 20, 2023

yawkat Nov 20, 2023

yawkat commented Nov 20, 2023

timyates commented Nov 20, 2023

yawkat commented Nov 20, 2023

sonarcloud bot commented Nov 21, 2023 •

edited

Loading

Replace chunked write API #10138

Replace chunked write API #10138

Conversation

yawkat commented Nov 20, 2023

timyates commented Nov 20, 2023

timyates left a comment

Choose a reason for hiding this comment

timyates Nov 20, 2023

Choose a reason for hiding this comment

yawkat Nov 20, 2023

Choose a reason for hiding this comment

timyates Nov 20, 2023

Choose a reason for hiding this comment

yawkat Nov 20, 2023

Choose a reason for hiding this comment

yawkat commented Nov 20, 2023

timyates commented Nov 20, 2023

yawkat commented Nov 20, 2023

sonarcloud bot commented Nov 21, 2023 • edited Loading

sonarcloud bot commented Nov 21, 2023 •

edited

Loading