Releases: tenstorrent/tt-metal
v0.53.1-rc19
Note
If you are installing from a release, please refer to the README, INSTALLATION instructions, and any other documentation packaged with the release, rather than the versions on the main branch. There may be differences between the latest main and the previous release.
The changelog follows, showing the changes since the last release.
This release was generated by the CI workflow https://github.com/tenstorrent/tt-metal/actions/runs/12267951232
- no changes
v0.53.1-rc18
This release was generated by the CI workflow https://github.com/tenstorrent/tt-metal/actions/runs/12265838284
- no changes
v0.53.1-rc17
This release was generated by the CI workflow https://github.com/tenstorrent/tt-metal/actions/runs/12261030742
- no changes
v0.53.1-rc16
This release was generated by the CI workflow https://github.com/tenstorrent/tt-metal/actions/runs/12247657687
📦 Uncategorized
- ARCH_NAME related header cleanup
- PR: #15634
- Set owners for fold and untilize_with_halo_v2
- PR: #15651
- Update Mamba expected demo outputs
- PR: #15447
- #0: Clean up bfloat8/4 pack/unpack functions
- PR: #15610
- [skip ci] Update Readme.#14257
- PR: #15641
- #0: Add overloaded helper to tt::stl
- PR: #15606
- [tt-train] Gradient accumulation steps
- PR: #15658
- Supporting non-overlapping output core grid for create_qkv_heads
- PR: #15341
- Global CB Support
- PR: #15180
- Re-enable multicore untilize on blackhole
- PR: #15626
- #14050: Timestamped data and event recording for device side
- PR: #15620
- #0: Remove warning log: "Specifying tile shape for a row major layout is deprecated"
- PR: #15663
- don't run UMD tests in post commit anymore
- PR: #15664
- Add new HAL APIs
- PR: #15645
- [tt-train] Improve logging information
- PR: #15673
- Remove inclusion of ARCH_NAME include from command_queue.cpp
- PR: #15676
- #0: Fix missing brace from bad conflict resolution
- PR: #15678
v0.53.1-rc15
This release was generated by the CI workflow https://github.com/tenstorrent/tt-metal/actions/runs/12227280390
📦 Uncategorized
- [skip ci] Update CONTRIBUTING.md with pre-commit info
- PR: #15537
- Sharded sweep tests
- PR: #15246
- #0: Fix typos in ttnn docs
- PR: #15517
- #0: Make sub-device bank ids for cores the same as global bank ids
- PR: #15579
- #12151: Generalize max_pool2d code into generic pool op
- PR: #15561
- Update INSTALLING.md
- PR: #15512
- #13944: Prohibit copying memory objects
- PR: #15434
- #15267: Merge erisc kernel data and bss sections
- PR: #15457
- Update perf and latest features for llm models (Dec 2)
- PR: #15600
- Update llms.md
- PR: #15612
- #11795: Update pgm dispatch golden file
- PR: #15614
- [tt-train] Add kahan summation in AdamW
- PR: #15518
- Remove core_config.h from host code inclusion in dev_msgs.h
- PR: #15584
- Add new Hal API "valid_reg_addr"
- PR: #15559
- #0: Fix DPRINT_DATA macro bug
- PR: #15617
- [tt-train] Help find UMD library for tt-train build
- PR: #15631
- #14257: few more optimization for yolo
- PR: #15582
- Fix simulator setup
- PR: #15227
- #15548: SELU op update
- PR: #15588
- #13779: Optimize pow
- PR: #15534
- [skip ci] Use @tenstorrent/metalium-developers-infra in CODEOWNERS
- PR: #15560
- ARCH_NAME related header cleanup
- PR: #15634
- Add N300 perf to pipeline
- PR: #14937
- Set owners for fold and untilize_with_halo_v2
- PR: #15651
- Update Mamba expected demo outputs
- PR: #15447
- #0: Clean up bfloat8/4 pack/unpack functions
- PR: #15610
- [skip ci] Update Readme.#14257
- PR: #15641
- #0: Add overloaded helper to tt::stl
- PR: #15606
- [tt-train] Gradient accumulation steps
- PR: #15658
- Supporting non-overlapping output core grid for create_qkv_heads
- PR: #15341
- Global CB Support
- PR: #15180
- Re-enable multicore untilize on blackhole
- PR: #15626
- #14050: Timestamped data and event recording for device side
- PR: #15620
- #0: Remove warning log: "Specifying tile shape for a row major layout is deprecated"
- PR: #15663
- don't run UMD tests in post commit anymore
- PR: #15664
- Add new HAL APIs
- PR: #15645
- [tt-train] Improve logging information
- PR: #15673
- Remove inclusion of ARCH_NAME include from command_queue.cpp
- PR: #15676
- #0: Fix missing brace from bad conflict resolution
- PR: #15678
v0.53.1-rc14
This release was generated by the CI workflow https://github.com/tenstorrent/tt-metal/actions/runs/12208751194
📦 Uncategorized
- [skip ci] Update CONTRIBUTING.md with pre-commit info
- PR: #15537
- #14982: Update Unary examples
- PR: #15421
- #14982: Update Unary example and docs
- PR: #15453
- #5893: Fix ops with low PCC
- PR: #14652
- #13127: Clean up layout conversion
- PR: #15393
- Sharded sweep tests
- PR: #15246
- #0: Fix typos in ttnn docs
- PR: #15517
- #0: Make sub-device bank ids for cores the same as global bank ids
- PR: #15579
- #12151: Generalize max_pool2d code into generic pool op
- PR: #15561
- Update INSTALLING.md
- PR: #15512
- #13944: Prohibit copying memory objects
- PR: #15434
- #15267: Merge erisc kernel data and bss sections
- PR: #15457
- Update perf and latest features for llm models (Dec 2)
- PR: #15600
- Update llms.md
- PR: #15612
- #11795: Update pgm dispatch golden file
- PR: #15614
- [tt-train] Add kahan summation in AdamW
- PR: #15518
- Remove core_config.h from host code inclusion in dev_msgs.h
- PR: #15584
- Add new Hal API "valid_reg_addr"
- PR: #15559
- #0: Fix DPRINT_DATA macro bug
- PR: #15617
- [tt-train] Help find UMD library for tt-train build
- PR: #15631
- #14257: few more optimization for yolo
- PR: #15582
- Fix simulator setup
- PR: #15227
- #15548: SELU op update
- PR: #15588
- #13779: Optimize pow
- PR: #15534
- [skip ci] Use @tenstorrent/metalium-developers-infra in CODEOWNERS
- PR: #15560
- ARCH_NAME related header cleanup
- PR: #15634
- Add N300 perf to pipeline
- PR: #14937
- Set owners for fold and untilize_with_halo_v2
- PR: #15651
- Update Mamba expected demo outputs
- PR: #15447
- #0: Clean up bfloat8/4 pack/unpack functions
- PR: #15610
- [skip ci] Update Readme.#14257
- PR: #15641
- #0: Add overloaded helper to tt::stl
- PR: #15606
- [tt-train] Gradient accumulation steps
- PR: #15658
- Supporting non-overlapping output core grid for create_qkv_heads
- PR: #15341
- Global CB Support
- PR: #15180
- Re-enable multicore untilize on blackhole
- PR: #15626
- #14050: Timestamped data and event recording for device side
- PR: #15620
- #0: Remove warning log: "Specifying tile shape for a row major layout is deprecated"
- PR: #15663
- don't run UMD tests in post commit anymore
- PR: #15664
- Add new HAL APIs
- PR: #15645
- [tt-train] Improve logging information
- PR: #15673
- Remove inclusion of ARCH_NAME include from command_queue.cpp
- PR: #15676
- #0: Fix missing brace from bad conflict resolution
- PR: #15678
v0.53.1-rc13
This release was generated by the CI workflow https://github.com/tenstorrent/tt-metal/actions/runs/12193793478
📦 Uncategorized
- [skip ci] Update CONTRIBUTING.md with pre-commit info
- PR: #15537
- #13415: Update left shift
- PR: #15538
- #14982: Update Unary examples
- PR: #15421
- #14982: Update Unary example and docs
- PR: #15453
- #5893: Fix ops with low PCC
- PR: #14652
- #13127: Clean up layout conversion
- PR: #15393
- Sharded sweep tests
- PR: #15246
- #0: Fix typos in ttnn docs
- PR: #15517
- #0: Make sub-device bank ids for cores the same as global bank ids
- PR: #15579
- #12151: Generalize max_pool2d code into generic pool op
- PR: #15561
- Update INSTALLING.md
- PR: #15512
- #13944: Prohibit copying memory objects
- PR: #15434
- #15267: Merge erisc kernel data and bss sections
- PR: #15457
- Update perf and latest features for llm models (Dec 2)
- PR: #15600
- Update llms.md
- PR: #15612
- #11795: Update pgm dispatch golden file
- PR: #15614
- [tt-train] Add kahan summation in AdamW
- PR: #15518
- Remove core_config.h from host code inclusion in dev_msgs.h
- PR: #15584
- Add new Hal API "valid_reg_addr"
- PR: #15559
- #0: Fix DPRINT_DATA macro bug
- PR: #15617
- [tt-train] Help find UMD library for tt-train build
- PR: #15631
- #14257: few more optimization for yolo
- PR: #15582
- Fix simulator setup
- PR: #15227
- #15548: SELU op update
- PR: #15588
- #13779: Optimize pow
- PR: #15534
- [skip ci] Use @tenstorrent/metalium-developers-infra in CODEOWNERS
- PR: #15560
- ARCH_NAME related header cleanup
- PR: #15634
- Add N300 perf to pipeline
- PR: #14937
- Set owners for fold and untilize_with_halo_v2
- PR: #15651
- Update Mamba expected demo outputs
- PR: #15447
- #0: Clean up bfloat8/4 pack/unpack functions
- PR: #15610
- [skip ci] Update Readme.#14257
- PR: #15641
- #0: Add overloaded helper to tt::stl
- PR: #15606
- [tt-train] Gradient accumulation steps
- PR: #15658
- Supporting non-overlapping output core grid for create_qkv_heads
- PR: #15341
- Global CB Support
- PR: #15180
- Re-enable multicore untilize on blackhole
- PR: #15626
- #14050: Timestamped data and event recording for device side
- PR: #15620
- #0: Remove warning log: "Specifying tile shape for a row major layout is deprecated"
- PR: #15663
- don't run UMD tests in post commit anymore
- PR: #15664
- Add new HAL APIs
- PR: #15645
- [tt-train] Improve logging information
- PR: #15673
- Remove inclusion of ARCH_NAME include from command_queue.cpp
- PR: #15676
- #0: Fix missing brace from bad conflict resolution
- PR: #15678
v0.53.1-rc12
This release was generated by the CI workflow https://github.com/tenstorrent/tt-metal/actions/runs/12191260435
📦 Uncategorized
- [skip ci] Update CONTRIBUTING.md with pre-commit info
- PR: #15537
- #13415: Update left shift
- PR: #15538
- #14982: Update Unary examples
- PR: #15421
- #14982: Update Unary example and docs
- PR: #15453
- #5893: Fix ops with low PCC
- PR: #14652
- #13127: Clean up layout conversion
- PR: #15393
- Sharded sweep tests
- PR: #15246
- #0: Fix typos in ttnn docs
- PR: #15517
- #0: Make sub-device bank ids for cores the same as global bank ids
- PR: #15579
- #12151: Generalize max_pool2d code into generic pool op
- PR: #15561
- Update INSTALLING.md
- PR: #15512
- #13944: Prohibit copying memory objects
- PR: #15434
- #15267: Merge erisc kernel data and bss sections
- PR: #15457
- Update perf and latest features for llm models (Dec 2)
- PR: #15600
- Update llms.md
- PR: #15612
- #11795: Update pgm dispatch golden file
- PR: #15614
- [tt-train] Add kahan summation in AdamW
- PR: #15518
- Remove core_config.h from host code inclusion in dev_msgs.h
- PR: #15584
- Add new Hal API "valid_reg_addr"
- PR: #15559
- #0: Fix DPRINT_DATA macro bug
- PR: #15617
- [tt-train] Help find UMD library for tt-train build
- PR: #15631
- #14257: few more optimization for yolo
- PR: #15582
- Fix simulator setup
- PR: #15227
- #15548: SELU op update
- PR: #15588
- #13779: Optimize pow
- PR: #15534
- [skip ci] Use @tenstorrent/metalium-developers-infra in CODEOWNERS
- PR: #15560
- ARCH_NAME related header cleanup
- PR: #15634
- Add N300 perf to pipeline
- PR: #14937
- Set owners for fold and untilize_with_halo_v2
- PR: #15651
- Update Mamba expected demo outputs
- PR: #15447
- #0: Clean up bfloat8/4 pack/unpack functions
- PR: #15610
- [skip ci] Update Readme.#14257
- PR: #15641
- #0: Add overloaded helper to tt::stl
- PR: #15606
- [tt-train] Gradient accumulation steps
- PR: #15658
- Supporting non-overlapping output core grid for create_qkv_heads
- PR: #15341
- Global CB Support
- PR: #15180
- Re-enable multicore untilize on blackhole
- PR: #15626
- #14050: Timestamped data and event recording for device side
- PR: #15620
- #0: Remove warning log: "Specifying tile shape for a row major layout is deprecated"
- PR: #15663
- don't run UMD tests in post commit anymore
- PR: #15664
- Add new HAL APIs
- PR: #15645
- [tt-train] Improve logging information
- PR: #15673
- Remove inclusion of ARCH_NAME include from command_queue.cpp
- PR: #15676
- #0: Fix missing brace from bad conflict resolution
- PR: #15678
v0.53.1-rc11
This release was generated by the CI workflow https://github.com/tenstorrent/tt-metal/actions/runs/12171537860
📦 Uncategorized
- [skip ci] Update CONTRIBUTING.md with pre-commit info
- PR: #15537
- Fix race condition in DRAM sharded MM
- PR: #15569
- Fix to concat support for tensors with tile padding
- PR: #15513
- Adjust perf test targets for Falcon7b-t3k-decode-noasync to account for CI instability
- PR: #15573
- Format Subset of tests/ directory
- PR: #15576
- [skip ci] Update git-blame-ignore-revs
- PR: #15577
- #14982: Update Unary examples docs
- PR: #15417
- fix graph_processor buffer_alloc_id calculus
- PR: #15426
- #13415: Update left shift
- PR: #15538
- #14982: Update Unary examples
- PR: #15421
- #14982: Update Unary example and docs
- PR: #15453
- #5893: Fix ops with low PCC
- PR: #14652
- #13127: Clean up layout conversion
- PR: #15393
- Sharded sweep tests
- PR: #15246
- #0: Fix typos in ttnn docs
- PR: #15517
- #0: Make sub-device bank ids for cores the same as global bank ids
- PR: #15579
- #12151: Generalize max_pool2d code into generic pool op
- PR: #15561
- Update INSTALLING.md
- PR: #15512
- #13944: Prohibit copying memory objects
- PR: #15434
- #15267: Merge erisc kernel data and bss sections
- PR: #15457
- Update perf and latest features for llm models (Dec 2)
- PR: #15600
- Update llms.md
- PR: #15612
- #11795: Update pgm dispatch golden file
- PR: #15614
- [tt-train] Add kahan summation in AdamW
- PR: #15518
- Remove core_config.h from host code inclusion in dev_msgs.h
- PR: #15584
- Add new Hal API "valid_reg_addr"
- PR: #15559
- #0: Fix DPRINT_DATA macro bug
- PR: #15617
- [tt-train] Help find UMD library for tt-train build
- PR: #15631
- #14257: few more optimization for yolo
- PR: #15582
- Fix simulator setup
- PR: #15227
- #15548: SELU op update
- PR: #15588
- #13779: Optimize pow
- PR: #15534
- [skip ci] Use @tenstorrent/metalium-developers-infra in CODEOWNERS
- PR: #15560
- ARCH_NAME related header cleanup
- PR: #15634
- Add N300 perf to pipeline
- PR: #14937
- Set owners for fold and untilize_with_halo_v2
- PR: #15651
- Update Mamba expected demo outputs
- PR: #15447
- #0: Clean up bfloat8/4 pack/unpack functions
- PR: #15610
- [skip ci] Update Readme.#14257
- PR: #15641
- #0: Add overloaded helper to tt::stl
- PR: #15606
- [tt-train] Gradient accumulation steps
- PR: #15658
- Supporting non-overlapping output core grid for create_qkv_heads
- PR: #15341
- Global CB Support
- PR: #15180
- Re-enable multicore untilize on blackhole
- PR: #15626
- #14050: Timestamped data and event recording for device side
- PR: #15620
- #0: Remove warning log: "Specifying tile shape for a row major layout is deprecated"
- PR: #15663
- don't run UMD tests in post commit anymore
- PR: #15664
- Add new HAL APIs
- PR: #15645
- [tt-train] Improve logging information
- PR: #15673
- Remove inclusion of ARCH_NAME include from command_queue.cpp
- PR: #15676
- #0: Fix missing brace from bad conflict resolution
- PR: #15678
v0.53.1-rc9
This release was generated by the CI workflow https://github.com/tenstorrent/tt-metal/actions/runs/12151320227
🚀 Features
📦 Uncategorized
- Fix the test for whether to install the wheel, and also exit the script on the first error
- PR: #15480
- #12558: TTNN implementation of MNIST model
- PR: #12647
- Remove unsupported shapes to make pipeline green
- PR: #15531
- #13401: Add data parallel support for Bert-Tiny model
- PR: #14033
- #15297: Allow MeshDevice to be initialized for chips without eth coordinates
- PR: #15475
- #0: Disable clang-format precommit check once again due to errors
- PR: #15556
- #15337: Fix incorrectly sized cb in remote cb microbenchmark
- PR: #15506
- [skip ci] Update CONTRIBUTING.md with pre-commit info
- PR: #15537
- Remove ClusterDescriptor path from constructor
- PR: #15554
- Add performance and accuracy configurations to Llama 3
- PR: #15545
- Disable upblock 3 and 4 unet unit tests
- PR: #15568
- Re-enable git-clang-format for pre-commit again
- PR: #15562
- Fix race condition in DRAM sharded MM
- PR: #15569
- Fix to concat support for tensors with tile padding
- PR: #15513
- Adjust perf test targets for Falcon7b-t3k-decode-noasync to account for CI instability
- PR: #15573
- Format Subset of tests/ directory
- PR: #15576
- [skip ci] Update git-blame-ignore-revs
- PR: #15577
- #14982: Update Unary examples docs
- PR: #15417
- fix graph_processor buffer_alloc_id calculus
- PR: #15426
- #13415: Update left shift
- PR: #15538
- #14982: Update Unary examples
- PR: #15421
- #14982: Update Unary example and docs
- PR: #15453
- #5893: Fix ops with low PCC
- PR: #14652
- #13127: Clean up layout conversion
- PR: #15393
- Sharded sweep tests
- PR: #15246
- #0: Fix typos in ttnn docs
- PR: #15517
- #0: Make sub-device bank ids for cores the same as global bank ids
- PR: #15579
- #12151: Generalize max_pool2d code into generic pool op
- PR: #15561
- Update INSTALLING.md
- PR: #15512
- #13944: Prohibit copying memory objects
- PR: #15434
- #15267: Merge erisc kernel data and bss sections
- PR: #15457
- Update perf and latest features for llm models (Dec 2)
- PR: #15600
- Update llms.md
- PR: #15612
- #11795: Update pgm dispatch golden file
- PR: #15614
- [tt-train] Add kahan summation in AdamW
- PR: #15518
- Remove core_config.h from host code inclusion in dev_msgs.h
- PR: #15584
- Add new Hal API "valid_reg_addr"
- PR: #15559
- #0: Fix DPRINT_DATA macro bug
- PR: #15617
- [tt-train] Help find UMD library for tt-train build
- PR: #15631
- #14257: few more optimization for yolo
- PR: #15582
- Fix simulator setup
- PR: #15227
- #15548: SELU op update
- PR: #15588
- #13779: Optimize pow
- PR: #15534
- [skip ci] Use @tenstorrent/metalium-developers-infra in CODEOWNERS
- PR: #15560
- ARCH_NAME related header cleanup
- PR: #15634
- Add N300 perf to pipeline
- PR: #14937
- Set owners for fold and untilize_with_halo_v2
- PR: #15651
- Update Mamba expected demo outputs
- PR: #15447
- #0: Clean up bfloat8/4 pack/unpack functions
- PR: #15610
- [skip ci] Update Readme.#14257
- PR: #15641
- #0: Add overloaded helper to tt::stl
- PR: #15606
- [tt-train] Gradient accumulation steps
- PR: #15658
- Supporting non-overlapping output core grid for create_qkv_heads
- PR: #15341
- Global CB Support
- PR: #15180
- Re-enable multicore untilize on blackhole
- PR: #15626
- #14050: Timestamped data and event recording for device side
- PR: #15620
- #0: Remove warning log: "Specifying tile shape for a row major layout is deprecated"
- PR: #15663
- don't run UMD tests in post commit anymore
- PR: #15664
- Add new HAL APIs
- PR: #15645
- [tt-train] Improve logging information
- PR: #15673
- Remove inclusion of ARCH_NAME include from command_queue.cpp
- PR: #15676
- #0: Fix missing brace from bad conflict resolution
- PR: #15678