Implement streaming approach for DeviceSelect
and DevicePartition
#2238
Closed
3 tasks done
DeviceSelect
and DevicePartition
#2238
PR #2205 was a prototypical implementation of a streaming implementation for
DeviceSelect
, which was good enough to gather some performance data but had several limitation:the number of selected items actually was not written to a temporary to propagate to the
i+1
-th invocationDevicePartition
does not work out of the box, because the rejected items are written in reverse order from the end of the given output iterator. This cannot trivially be implemented using a fancy iterator approach as originally used in [DRAFT]: Experimental: StreamingDeviceSelect
#2205Implement [FEA]: Tabulate Output Iterator #1486 to redirect
num_selected_out
to either a temporary or the user-providednum_selected_out
, depending on whether it's the last invocationAdd a kernel parameter that captures the
streaming state
of previous invocationsAdapt
AgenetSelectIf
to incorporate thestreaming state
The text was updated successfully, but these errors were encountered: