This package provides a MLServer runtime compatible with MLflow models.
You can install the runtime, alongside mlserver
, as:
pip install mlserver mlserver-mlflow
The MLflow inference runtime introduces a new dict
content type, which
decodes an incoming V2 request as a dictionary of
tensors.
This is useful for certain MLflow-serialised models, which will expect that the
model inputs are serialised in this format.
The `dict` content type can be _stacked_ with other content types, like
[`np`](../../docs/user-guide/content-type).
This allows the user to use a different set of content types to decode each of
the dict entries.