-
Notifications
You must be signed in to change notification settings - Fork 1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
oneDNN v3.7 release notes #2481
base: rls-v3.7
Are you sure you want to change the base?
Conversation
17acd19
to
23e00b1
Compare
Co-authored-by: Mourad Gouicem <[email protected]>
Co-authored-by: Mourad Gouicem <[email protected]>
Co-authored-by: Mourad Gouicem <[email protected]>
Co-authored-by: Mourad Gouicem <[email protected]>
Co-authored-by: Mourad Gouicem <[email protected]>
Co-authored-by: Mourad Gouicem <[email protected]>
* Improved performance of the following subgraphs with Graph API | ||
* Scaled dot-product Attention (SDPA) [with causal mask](https://oneapi-src.github.io/oneDNN/dev_guide_graph_sdpa.html#doxid-dev-guide-graph-sdpa) | ||
* Scaled dot-product Attention (SDPA) [with compressed key and value](https://oneapi-src.github.io/oneDNN/dev_guide_graph_sdpa_compressed_kv.html#doxid-dev-guide-graph-sdpa-compressed-kv) | ||
## AArch64-based Processors |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@jondea, @theComputeKid, could you please help summarizing AArch64 improvements?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@Sqvid I think you have a list of our improvements?
cc: @Radu2k @Ryo-not-rio
Co-authored-by: Mourad Gouicem <[email protected]>
Co-authored-by: Tao Lv <[email protected]>
Co-authored-by: Vadim Pirogov <[email protected]>
Co-authored-by: Tao Lv <[email protected]>
Co-authored-by: Tao Lv <[email protected]>
Co-authored-by: Tao Lv <[email protected]>
Co-authored-by: Vadim Pirogov <[email protected]>
Co-authored-by: Vadim Pirogov <[email protected]>
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Minor changes suggested, please incorporate as you see fit
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Minor comments suggested, please incorporate as you see fit! Thanks!
Co-authored-by: Ranu Kundu <[email protected]>
Co-authored-by: Ranu Kundu <[email protected]>
Co-authored-by: Ranu Kundu <[email protected]>
Co-authored-by: Ranu Kundu <[email protected]>
Co-authored-by: Ranu Kundu <[email protected]>
Co-authored-by: Ranu Kundu <[email protected]>
Co-authored-by: Ranu Kundu <[email protected]>
Co-authored-by: Ranu Kundu <[email protected]>
Co-authored-by: Ranu Kundu <[email protected]>
@@ -0,0 +1,89 @@ | |||
# Performance Optimizations |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@sgeor255, @ShanoToni, @t4c1, @Rbiessy, could you please help with release notes content for NVIDIA backend and generic SYCL kernels?
We are primarily looking for two things: performance improvements (stuff that works faster) and new features (stuff that did not work before).
This PR includes a release notes draft based on the information from the PRs for the contributors to review. Your additions and corrections are highly appreciated.