Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

support decoupled mode in perf_analyzer #4530

Open
Jackiexiao opened this issue Jun 20, 2022 · 1 comment
Open

support decoupled mode in perf_analyzer #4530

Jackiexiao opened this issue Jun 20, 2022 · 1 comment
Labels
enhancement New feature or request

Comments

@Jackiexiao
Copy link

when using perf_analyzer to analyze a python decoupled model like triton-decoupled using command below

perf_analyzer -i grpc --streaming -m repeat --concurrency-range 1:2  -vv --input-data repeat_data.json

repeat_data.json

  {
    "data" :
     [
        {
          "IN" : [5]
        }
      ]
  }

which 5 means how many response will server (decouple mode) will send to client

the results will show error

No valid requests recorded within time interval. Please use a larger time window.

it seems like perf_analyzer will send a request when it get a response, but in decouple mode, perf_analyzer should not send request again until it get final response from decouple mode

@Tabrizian
Copy link
Member

Tabrizian commented Jun 20, 2022

@Jackiexiao Thanks for your feature request. Perf Analyzer has limited support for decoupled mode by measuring the time between the request and first response. However, as you mentioned in the issue, it is not suitable for all the different combinations that can be expressed using the decoupled mode. I'll mark this ticket as an enhancement.

@Tabrizian Tabrizian added the enhancement New feature or request label Jun 20, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Development

No branches or pull requests

2 participants