Skip to content

Releases: tenstorrent/tt-metal

v0.53.1-rc19

11 Dec 02:01
eb9f4c3
Compare
Choose a tag to compare
v0.53.1-rc19 Pre-release
Pre-release

Note

If you are installing from a release, please refer to the README, INSTALLATION instructions, and any other documentation packaged with the release, not on the main branch. There may be differences between the latest main and the previous release.

The changelog will now follow, showing the changes from last release.

This release was generated by the CI workflow https://github.com/tenstorrent/tt-metal/actions/runs/12267951232

  • no changes

v0.53.1-rc18

10 Dec 22:54
Compare
Choose a tag to compare
v0.53.1-rc18 Pre-release
Pre-release

Note

If you are installing from a release, please refer to the README, INSTALLATION instructions, and any other documentation packaged with the release, not on the main branch. There may be differences between the latest main and the previous release.

The changelog will now follow, showing the changes from last release.

This release was generated by the CI workflow https://github.com/tenstorrent/tt-metal/actions/runs/12265838284

  • no changes

v0.53.1-rc17

10 Dec 17:28
664f5d9
Compare
Choose a tag to compare
v0.53.1-rc17 Pre-release
Pre-release

Note

If you are installing from a release, please refer to the README, INSTALLATION instructions, and any other documentation packaged with the release, not on the main branch. There may be differences between the latest main and the previous release.

The changelog will now follow, showing the changes from last release.

This release was generated by the CI workflow https://github.com/tenstorrent/tt-metal/actions/runs/12261030742

  • no changes

v0.53.1-rc16

10 Dec 02:00
e3526de
Compare
Choose a tag to compare
v0.53.1-rc16 Pre-release
Pre-release

Note

If you are installing from a release, please refer to the README, INSTALLATION instructions, and any other documentation packaged with the release, not on the main branch. There may be differences between the latest main and the previous release.

The changelog will now follow, showing the changes from last release.

This release was generated by the CI workflow https://github.com/tenstorrent/tt-metal/actions/runs/12247657687

📦 Uncategorized

  • ARCH_NAME related header cleanup
  • Set owners for fold and untilize_with_halo_v2
  • Update Mamba expected demo outputs
  • #0: Clean up bfloat8/4 pack/unpack functions
  • [skip ci] Update Readme.#14257
  • #0: Add overloaded helper to tt::stl
  • [tt-train] Gradient accumulation steps
  • Supporting non-overlapping output core grid for create_qkv_heads
  • Global CB Support
  • Re-enable multicore untilize on blackhole
  • #14050: Timestamped data and event recording for device side
  • #0: Remove warning log: "Specifying tile shape for a row major layout is deprecated"
  • don't run UMD tests in post commit anymore
  • Add new HAL APIs
  • [tt-train] Improve logging information
  • Remove inclusion of ARCH_NAME include from command_queue.cpp
  • #0: Fix missing brace from bad conflict resolution

v0.53.1-rc15

09 Dec 02:00
9eada08
Compare
Choose a tag to compare
v0.53.1-rc15 Pre-release
Pre-release

Note

If you are installing from a release, please refer to the README, INSTALLATION instructions, and any other documentation packaged with the release, not on the main branch. There may be differences between the latest main and the previous release.

The changelog will now follow, showing the changes from last release.

This release was generated by the CI workflow https://github.com/tenstorrent/tt-metal/actions/runs/12227280390

📦 Uncategorized

  • [skip ci] Update CONTRIBUTING.md with pre-commit info
  • Sharded sweep tests
  • #0: Fix typos in ttnn docs
  • #0: Make sub-device bank ids for cores the same as global bank ids
  • #12151: Generalize max_pool2d code into generic pool op
  • Update INSTALLING.md
  • #13944: Prohibit copying memory objects
  • #15267: Merge erisc kernel data and bss sections
  • Update perf and latest features for llm models (Dec 2)
  • Update llms.md
  • #11795: Update pgm dispatch golden file
  • [tt-train] Add kahan summation in AdamW
  • Remove core_config.h from host code inclusion in dev_msgs.h
  • Add new Hal API "valid_reg_addr"
  • #0: Fix DPRINT_DATA macro bug
  • [tt-train] Help find UMD library for tt-train build
  • #14257: few more optimization for yolo
  • Fix simulator setup
  • #15548: SELU op update
  • #13779: Optimize pow
  • [skip ci] Use @tenstorrent/metalium-developers-infra in CODEOWNERS
  • ARCH_NAME related header cleanup
  • Add N300 perf to pipeline
  • Set owners for fold and untilize_with_halo_v2
  • Update Mamba expected demo outputs
  • #0: Clean up bfloat8/4 pack/unpack functions
  • [skip ci] Update Readme.#14257
  • #0: Add overloaded helper to tt::stl
  • [tt-train] Gradient accumulation steps
  • Supporting non-overlapping output core grid for create_qkv_heads
  • Global CB Support
  • Re-enable multicore untilize on blackhole
  • #14050: Timestamped data and event recording for device side
  • #0: Remove warning log: "Specifying tile shape for a row major layout is deprecated"
  • don't run UMD tests in post commit anymore
  • Add new HAL APIs
  • [tt-train] Improve logging information
  • Remove inclusion of ARCH_NAME include from command_queue.cpp
  • #0: Fix missing brace from bad conflict resolution

v0.53.1-rc14

07 Dec 01:59
71a4cff
Compare
Choose a tag to compare
v0.53.1-rc14 Pre-release
Pre-release

Note

If you are installing from a release, please refer to the README, INSTALLATION instructions, and any other documentation packaged with the release, not on the main branch. There may be differences between the latest main and the previous release.

The changelog will now follow, showing the changes from last release.

This release was generated by the CI workflow https://github.com/tenstorrent/tt-metal/actions/runs/12208751194

📦 Uncategorized

  • [skip ci] Update CONTRIBUTING.md with pre-commit info
  • #14982: Update Unary examples
  • #14982: Update Unary example and docs
  • #5893: Fix ops with low PCC
  • #13127: Clean up layout conversion
  • Sharded sweep tests
  • #0: Fix typos in ttnn docs
  • #0: Make sub-device bank ids for cores the same as global bank ids
  • #12151: Generalize max_pool2d code into generic pool op
  • Update INSTALLING.md
  • #13944: Prohibit copying memory objects
  • #15267: Merge erisc kernel data and bss sections
  • Update perf and latest features for llm models (Dec 2)
  • Update llms.md
  • #11795: Update pgm dispatch golden file
  • [tt-train] Add kahan summation in AdamW
  • Remove core_config.h from host code inclusion in dev_msgs.h
  • Add new Hal API "valid_reg_addr"
  • #0: Fix DPRINT_DATA macro bug
  • [tt-train] Help find UMD library for tt-train build
  • #14257: few more optimization for yolo
  • Fix simulator setup
  • #15548: SELU op update
  • #13779: Optimize pow
  • [skip ci] Use @tenstorrent/metalium-developers-infra in CODEOWNERS
  • ARCH_NAME related header cleanup
  • Add N300 perf to pipeline
  • Set owners for fold and untilize_with_halo_v2
  • Update Mamba expected demo outputs
  • #0: Clean up bfloat8/4 pack/unpack functions
  • [skip ci] Update Readme.#14257
  • #0: Add overloaded helper to tt::stl
  • [tt-train] Gradient accumulation steps
  • Supporting non-overlapping output core grid for create_qkv_heads
  • Global CB Support
  • Re-enable multicore untilize on blackhole
  • #14050: Timestamped data and event recording for device side
  • #0: Remove warning log: "Specifying tile shape for a row major layout is deprecated"
  • don't run UMD tests in post commit anymore
  • Add new HAL APIs
  • [tt-train] Improve logging information
  • Remove inclusion of ARCH_NAME include from command_queue.cpp
  • #0: Fix missing brace from bad conflict resolution

v0.53.1-rc13

06 Dec 06:23
4952f01
Compare
Choose a tag to compare
v0.53.1-rc13 Pre-release
Pre-release

Note

If you are installing from a release, please refer to the README, INSTALLATION instructions, and any other documentation packaged with the release, not on the main branch. There may be differences between the latest main and the previous release.

The changelog will now follow, showing the changes from last release.

This release was generated by the CI workflow https://github.com/tenstorrent/tt-metal/actions/runs/12193793478

📦 Uncategorized

  • [skip ci] Update CONTRIBUTING.md with pre-commit info
  • #13415: Update left shift
  • #14982: Update Unary examples
  • #14982: Update Unary example and docs
  • #5893: Fix ops with low PCC
  • #13127: Clean up layout conversion
  • Sharded sweep tests
  • #0: Fix typos in ttnn docs
  • #0: Make sub-device bank ids for cores the same as global bank ids
  • #12151: Generalize max_pool2d code into generic pool op
  • Update INSTALLING.md
  • #13944: Prohibit copying memory objects
  • #15267: Merge erisc kernel data and bss sections
  • Update perf and latest features for llm models (Dec 2)
  • Update llms.md
  • #11795: Update pgm dispatch golden file
  • [tt-train] Add kahan summation in AdamW
  • Remove core_config.h from host code inclusion in dev_msgs.h
  • Add new Hal API "valid_reg_addr"
  • #0: Fix DPRINT_DATA macro bug
  • [tt-train] Help find UMD library for tt-train build
  • #14257: few more optimization for yolo
  • Fix simulator setup
  • #15548: SELU op update
  • #13779: Optimize pow
  • [skip ci] Use @tenstorrent/metalium-developers-infra in CODEOWNERS
  • ARCH_NAME related header cleanup
  • Add N300 perf to pipeline
  • Set owners for fold and untilize_with_halo_v2
  • Update Mamba expected demo outputs
  • #0: Clean up bfloat8/4 pack/unpack functions
  • [skip ci] Update Readme.#14257
  • #0: Add overloaded helper to tt::stl
  • [tt-train] Gradient accumulation steps
  • Supporting non-overlapping output core grid for create_qkv_heads
  • Global CB Support
  • Re-enable multicore untilize on blackhole
  • #14050: Timestamped data and event recording for device side
  • #0: Remove warning log: "Specifying tile shape for a row major layout is deprecated"
  • don't run UMD tests in post commit anymore
  • Add new HAL APIs
  • [tt-train] Improve logging information
  • Remove inclusion of ARCH_NAME include from command_queue.cpp
  • #0: Fix missing brace from bad conflict resolution

v0.53.1-rc12

06 Dec 01:59
aee56ac
Compare
Choose a tag to compare
v0.53.1-rc12 Pre-release
Pre-release

Note

If you are installing from a release, please refer to the README, INSTALLATION instructions, and any other documentation packaged with the release, not on the main branch. There may be differences between the latest main and the previous release.

The changelog will now follow, showing the changes from last release.

This release was generated by the CI workflow https://github.com/tenstorrent/tt-metal/actions/runs/12191260435

📦 Uncategorized

  • [skip ci] Update CONTRIBUTING.md with pre-commit info
  • #13415: Update left shift
  • #14982: Update Unary examples
  • #14982: Update Unary example and docs
  • #5893: Fix ops with low PCC
  • #13127: Clean up layout conversion
  • Sharded sweep tests
  • #0: Fix typos in ttnn docs
  • #0: Make sub-device bank ids for cores the same as global bank ids
  • #12151: Generalize max_pool2d code into generic pool op
  • Update INSTALLING.md
  • #13944: Prohibit copying memory objects
  • #15267: Merge erisc kernel data and bss sections
  • Update perf and latest features for llm models (Dec 2)
  • Update llms.md
  • #11795: Update pgm dispatch golden file
  • [tt-train] Add kahan summation in AdamW
  • Remove core_config.h from host code inclusion in dev_msgs.h
  • Add new Hal API "valid_reg_addr"
  • #0: Fix DPRINT_DATA macro bug
  • [tt-train] Help find UMD library for tt-train build
  • #14257: few more optimization for yolo
  • Fix simulator setup
  • #15548: SELU op update
  • #13779: Optimize pow
  • [skip ci] Use @tenstorrent/metalium-developers-infra in CODEOWNERS
  • ARCH_NAME related header cleanup
  • Add N300 perf to pipeline
  • Set owners for fold and untilize_with_halo_v2
  • Update Mamba expected demo outputs
  • #0: Clean up bfloat8/4 pack/unpack functions
  • [skip ci] Update Readme.#14257
  • #0: Add overloaded helper to tt::stl
  • [tt-train] Gradient accumulation steps
  • Supporting non-overlapping output core grid for create_qkv_heads
  • Global CB Support
  • Re-enable multicore untilize on blackhole
  • #14050: Timestamped data and event recording for device side
  • #0: Remove warning log: "Specifying tile shape for a row major layout is deprecated"
  • don't run UMD tests in post commit anymore
  • Add new HAL APIs
  • [tt-train] Improve logging information
  • Remove inclusion of ARCH_NAME include from command_queue.cpp
  • #0: Fix missing brace from bad conflict resolution

v0.53.1-rc11

05 Dec 01:59
Compare
Choose a tag to compare
v0.53.1-rc11 Pre-release
Pre-release

Note

If you are installing from a release, please refer to the README, INSTALLATION instructions, and any other documentation packaged with the release, not on the main branch. There may be differences between the latest main and the previous release.

The changelog will now follow, showing the changes from last release.

This release was generated by the CI workflow https://github.com/tenstorrent/tt-metal/actions/runs/12171537860

📦 Uncategorized

  • [skip ci] Update CONTRIBUTING.md with pre-commit info
  • Fix race condition in DRAM sharded MM
  • Fix to concat support for tensors with tile padding
  • Adjust perf test targets for Falcon7b-t3k-decode-noasync to account for CI instability
  • Format Subset of tests/ directory
  • [skip ci] Update git-blame-ignore-revs
  • #14982: Update Unary examples docs
  • fix graph_processor buffer_alloc_id calculus
  • #13415: Update left shift
  • #14982: Update Unary examples
  • #14982: Update Unary example and docs
  • #5893: Fix ops with low PCC
  • #13127: Clean up layout conversion
  • Sharded sweep tests
  • #0: Fix typos in ttnn docs
  • #0: Make sub-device bank ids for cores the same as global bank ids
  • #12151: Generalize max_pool2d code into generic pool op
  • Update INSTALLING.md
  • #13944: Prohibit copying memory objects
  • #15267: Merge erisc kernel data and bss sections
  • Update perf and latest features for llm models (Dec 2)
  • Update llms.md
  • #11795: Update pgm dispatch golden file
  • [tt-train] Add kahan summation in AdamW
  • Remove core_config.h from host code inclusion in dev_msgs.h
  • Add new Hal API "valid_reg_addr"
  • #0: Fix DPRINT_DATA macro bug
  • [tt-train] Help find UMD library for tt-train build
  • #14257: few more optimization for yolo
  • Fix simulator setup
  • #15548: SELU op update
  • #13779: Optimize pow
  • [skip ci] Use @tenstorrent/metalium-developers-infra in CODEOWNERS
  • ARCH_NAME related header cleanup
  • Add N300 perf to pipeline
  • Set owners for fold and untilize_with_halo_v2
  • Update Mamba expected demo outputs
  • #0: Clean up bfloat8/4 pack/unpack functions
  • [skip ci] Update Readme.#14257
  • #0: Add overloaded helper to tt::stl
  • [tt-train] Gradient accumulation steps
  • Supporting non-overlapping output core grid for create_qkv_heads
  • Global CB Support
  • Re-enable multicore untilize on blackhole
  • #14050: Timestamped data and event recording for device side
  • #0: Remove warning log: "Specifying tile shape for a row major layout is deprecated"
  • don't run UMD tests in post commit anymore
  • Add new HAL APIs
  • [tt-train] Improve logging information
  • Remove inclusion of ARCH_NAME include from command_queue.cpp
  • #0: Fix missing brace from bad conflict resolution

v0.53.1-rc9

04 Dec 01:59
51bc24e
Compare
Choose a tag to compare
v0.53.1-rc9 Pre-release
Pre-release

Note

If you are installing from a release, please refer to the README, INSTALLATION instructions, and any other documentation packaged with the release, not on the main branch. There may be differences between the latest main and the previous release.

The changelog will now follow, showing the changes from last release.

This release was generated by the CI workflow https://github.com/tenstorrent/tt-metal/actions/runs/12151320227

🚀 Features

📦 Uncategorized

  • Fix the test for whether to install the wheel, and also exit the script on the first error
  • #12558: TTNN implementation of MNIST model
  • Remove unsupported shapes to make pipeline green
  • #13401: Add data parallel support for Bert-Tiny model
  • #15297: Allow MeshDevice to be initialized for chips without eth coordinates
  • #0: Disable clang-format precommit check once again due to errors
  • #15337: Fix incorrectly sized cb in remote cb microbenchmark
  • [skip ci] Update CONTRIBUTING.md with pre-commit info
  • Remove ClusterDescriptor path from constructor
  • Add performance and accuracy configurations to Llama 3
  • Disable upblock 3 and 4 unet unit tests
  • Re-enable git-clang-format for pre-commit again
  • Fix race condition in DRAM sharded MM
  • Fix to concat support for tensors with tile padding
  • Adjust perf test targets for Falcon7b-t3k-decode-noasync to account for CI instability
  • Format Subset of tests/ directory
  • [skip ci] Update git-blame-ignore-revs
  • #14982: Update Unary examples docs
  • fix graph_processor buffer_alloc_id calculus
  • #13415: Update left shift
  • #14982: Update Unary examples
  • #14982: Update Unary example and docs
  • #5893: Fix ops with low PCC
  • #13127: Clean up layout conversion
  • Sharded sweep tests
  • #0: Fix typos in ttnn docs
  • #0: Make sub-device bank ids for cores the same as global bank ids
  • #12151: Generalize max_pool2d code into generic pool op
  • Update INSTALLING.md
  • #13944: Prohibit copying memory objects
  • #15267: Merge erisc kernel data and bss sections
  • Update perf and latest features for llm models (Dec 2)
  • Update llms.md
  • #11795: Update pgm dispatch golden file
  • [tt-train] Add kahan summation in AdamW
  • Remove core_config.h from host code inclusion in dev_msgs.h
  • Add new Hal API "valid_reg_addr"
  • #0: Fix DPRINT_DATA macro bug
  • [tt-train] Help find UMD library for tt-train build
  • #14257: few more optimization for yolo
  • Fix simulator setup
  • #15548: SELU op update
  • #13779: Optimize pow
  • [skip ci] Use @tenstorrent/metalium-developers-infra in CODEOWNERS
  • ARCH_NAME related header cleanup
  • Add N300 perf to pipeline
  • Set owners for fold and untilize_with_halo_v2
  • Update Mamba expected demo outputs
  • #0: Clean up bfloat8/4 pack/unpack functions
  • [skip ci] Update Readme.#14257
  • #0: Add overloaded helper to tt::stl
  • [tt-train] Gradient accumulation steps
  • Supporting non-overlapping output core grid for create_qkv_heads
  • Global CB Support
  • Re-enable multicore untilize on blackhole
  • #14050: Timestamped data and event recording for device side
  • #0: Remove warning log: "Specifying tile shape for a row major layout is deprecated"
  • don't run UMD tests in post commit anymore
  • Add new HAL APIs
  • [tt-train] Improve logging information
  • Remove inclusion of ARCH_NAME include from command_queue.cpp
  • #0: Fix missing brace from bad conflict resolution