Create an API for measuring the total runtime of an arbitrary ttnn op chain #16920

arminaleTT · 2025-01-20T21:00:09Z

Prerequisite for compile-time perf measurements in tt-mlir

… call end_trace_capture and release_trace

…n for use during forge compilation (#16921) ### Ticket #16920 ### Problem description Provide an API for the forge optimizer to run arbitrary ttnn ops **during** forge compilation and measure their runtime. These compile-time perf measurements are an alternative to offline perf models while those are being developed for each op. API should: - take an arbitrary callable of ttnn ops and an arbitrary set of arguments - return the runtime of the callable by actually running it on the device - should match the interface and nomenclature of the L1 constraints API - see PR #15046 and ticket #15291 ### What's changed - Create a new `get_op_runtime()` API with identical interface to `get_op_constraints()` - Use trace capture for perf measurement - Given an op chain, capture the trace of the op chain. Then execute the trace and report the runtime of the trace as the perf measurement - Enables end-to-end perf measurement without using a profiler-enabled build or any dependency on the device profiler - Unit tests to demonstrate functionality for single op and a chain of ops. Note: the forge consumer for this API has not been built yet ### Checklist - [x] Post commit CI passes - [ ] Blackhole Post commit (if applicable) - [ ] Model regression CI testing passes (if applicable) - [ ] Device performance regression CI testing passes (if applicable) - [ ] **(For models and ops writers)** Full [new models](https://github.com/tenstorrent/tt-metal/actions/workflows/full-new-models-suite.yaml) tests passes - [x] New/Existing tests provide coverage for changes

arminaleTT added feature ttnn labels Jan 20, 2025

arminaleTT self-assigned this Jan 20, 2025

arminaleTT changed the title ~~Create an API for measuring the total runtime of an arbitrary tonne op chain~~ Create an API for measuring the total runtime of an arbitrary ttnn op chain Jan 20, 2025

arminaleTT added a commit that referenced this issue Jan 20, 2025

#16920: add comment on required trace region size

8771352

arminaleTT added a commit that referenced this issue Jan 20, 2025

#16920: remove unncessary disabling of async mode

6fbdb7f

arminaleTT added the forge label Jan 20, 2025

arminaleTT added a commit that referenced this issue Jan 20, 2025

#16920: test cleanup

db1a446

arminaleTT mentioned this issue Jan 20, 2025

Create an API for running and measuring the runtime of a ttnn op chain for use during forge compilation #16921

Merged

6 tasks

arminaleTT added a commit that referenced this issue Jan 21, 2025

#16920: removed unncessary test naming + minor fix

012e7f1

arminaleTT added a commit that referenced this issue Jan 21, 2025

#16920: clean up of test code

01b29e5

arminaleTT added a commit that referenced this issue Jan 22, 2025

#16920: fix license, code cleanup

589adbc

arminaleTT added a commit that referenced this issue Jan 22, 2025

#16920: fix license, restructure the code to make sure all code paths…

ad95bfd

… call end_trace_capture and release_trace

arminaleTT added a commit that referenced this issue Jan 22, 2025

#16920: add missing call to release_trace

49960a8

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Create an API for measuring the total runtime of an arbitrary ttnn op chain #16920

Create an API for measuring the total runtime of an arbitrary ttnn op chain #16920

arminaleTT commented Jan 20, 2025

Create an API for measuring the total runtime of an arbitrary ttnn op chain #16920

Create an API for measuring the total runtime of an arbitrary ttnn op chain #16920

Comments

arminaleTT commented Jan 20, 2025