[FEA]: Output tensors should be a constructor argument to the TritonInferenceStage #1458
Open
2 tasks done
Labels
feature request
New feature or request
Is this a new feature, an improvement, or a change to existing functionality?
New Feature
How would you describe the priority of this feature request
Medium
Please provide a clear description of problem this feature solves
Currently the
TritonInferenceStage
only operates on a single output tensor, the probabilities tensor.The need to handle an additional output tensor is currently the primary reason why a user would need to subclass both the
TritonInferenceStage
and theTritonInferenceWorker
Describe your ideal solution
The
inout_mapping
gets us about 90% of the way there, but currently we only use it to determine the name of the probabilities orprobs
tensor.This would allow us to make
TritonInferenceWorker
a private class. We could potentially remove the worker folding it into the stage, but the separation of the stage & worker is something baked into the baseInferenceStage
class.Additional context
Relates to PR #1402
Code of Conduct
The text was updated successfully, but these errors were encountered: