Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Rework our mdspan implementation #3343

Draft
wants to merge 53 commits into
base: main
Choose a base branch
from
Draft

Conversation

miscco
Copy link
Collaborator

@miscco miscco commented Jan 10, 2025

This completely rewrites our mdspan implementation.

It drop a lot of the old workarounds and follows the recent standard changes.

This is especially prominent for submdspan which has been reworked a ton during standardization

This did not yet implement the padded layouts which have been merged recently

I started using C++17 exclusively which hase been soo awesome

Copy link

copy-pr-bot bot commented Jan 10, 2025

Auto-sync is disabled for draft pull requests in this repository. Workflows must be run manually.

Contributors can view more details about this message here.

@fbusato
Copy link
Contributor

fbusato commented Jan 10, 2025

it seems that the new implementation of submdspan works correctly when the data type of mdspan is const and returns the new mdspan with the right accessor. Anyway, please add a test to check the behavior of the accessor.

@miscco miscco force-pushed the update_mdspan branch 4 times, most recently from 0ce2d94 to 4847384 Compare January 20, 2025 08:05
@miscco
Copy link
Collaborator Author

miscco commented Jan 20, 2025

/ok to test

@miscco
Copy link
Collaborator Author

miscco commented Jan 20, 2025

/ok to test

@miscco miscco force-pushed the update_mdspan branch 3 times, most recently from df41ca8 to 29cba88 Compare January 21, 2025 10:00
@miscco
Copy link
Collaborator Author

miscco commented Jan 21, 2025

/ok to test

@miscco
Copy link
Collaborator Author

miscco commented Jan 21, 2025

/ok to test

@miscco miscco force-pushed the update_mdspan branch 2 times, most recently from 3153baf to cb6065d Compare January 21, 2025 12:22
@miscco
Copy link
Collaborator Author

miscco commented Jan 21, 2025

/ok to test

1 similar comment
@miscco
Copy link
Collaborator Author

miscco commented Jan 21, 2025

/ok to test

@miscco
Copy link
Collaborator Author

miscco commented Jan 21, 2025

/ok to test

@miscco
Copy link
Collaborator Author

miscco commented Jan 22, 2025

/ok to test

@miscco
Copy link
Collaborator Author

miscco commented Jan 22, 2025

/ok to test

@miscco
Copy link
Collaborator Author

miscco commented Jan 30, 2025

/ok to test

Copy link
Contributor

🟨 CI finished in 1h 43m: Pass: 96%/156 | Total: 3d 12h | Avg: 32m 29s | Max: 1h 24m | Hits: 158%/16672
  • 🟨 libcudacxx: Pass: 86%/43 | Total: 11h 49m | Avg: 16m 30s | Max: 47m 57s | Hits: 384%/5254

    🔍 cpu: amd64 🔍
      🔍 amd64              Pass:  85%/41  | Total: 11h 26m | Avg: 16m 44s | Max: 47m 57s | Hits: 384%/5254  
      🟩 arm64              Pass: 100%/2   | Total: 23m 36s | Avg: 11m 48s | Max: 12m 30s
    🔍 cudacxx_family: nvcc 🔍
      🟩 ClangCUDA          Pass: 100%/4   | Total:  1h 10m | Avg: 17m 36s | Max: 22m 27s
      🔍 nvcc               Pass:  84%/39  | Total: 10h 39m | Avg: 16m 23s | Max: 47m 57s | Hits: 384%/5254  
    🚨 sm: 75 🚨
      🔥 75                 Pass:   0%/2   | Total: 39m 23s | Avg: 19m 41s | Max: 19m 57s
      🟩 90                 Pass: 100%/1   | Total: 14m 15s | Avg: 14m 15s | Max: 14m 15s
      🟩 90a                Pass: 100%/2   | Total: 21m 39s | Avg: 10m 49s | Max: 13m 48s
    🟨 ctk
      🟨 12.0               Pass:  80%/5   | Total:  1h 13m | Avg: 14m 46s | Max: 31m 01s
      🟩 12.5               Pass: 100%/2   | Total:  1h 09m | Avg: 34m 37s | Max: 35m 16s
      🟨 12.6               Pass:  86%/36  | Total:  9h 26m | Avg: 15m 44s | Max: 47m 57s | Hits: 384%/5254  
    🟨 cudacxx
      🟩 ClangCUDA18        Pass: 100%/4   | Total:  1h 10m | Avg: 17m 36s | Max: 22m 27s
      🟨 nvcc12.0           Pass:  80%/5   | Total:  1h 13m | Avg: 14m 46s | Max: 31m 01s
      🟩 nvcc12.5           Pass: 100%/2   | Total:  1h 09m | Avg: 34m 37s | Max: 35m 16s
      🟨 nvcc12.6           Pass:  84%/32  | Total:  8h 16m | Avg: 15m 30s | Max: 47m 57s | Hits: 384%/5254  
    🟨 cxx
      🟩 Clang14            Pass: 100%/4   | Total: 44m 50s | Avg: 11m 12s | Max: 12m 27s
      🟩 Clang15            Pass: 100%/2   | Total: 25m 27s | Avg: 12m 43s | Max: 12m 59s
      🟩 Clang16            Pass: 100%/2   | Total: 22m 27s | Avg: 11m 13s | Max: 11m 49s
      🟩 Clang17            Pass: 100%/2   | Total: 23m 53s | Avg: 11m 56s | Max: 12m 40s
      🟨 Clang18            Pass:  87%/8   | Total:  2h 14m | Avg: 16m 51s | Max: 27m 36s
      🟩 GCC7               Pass: 100%/2   | Total: 20m 47s | Avg: 10m 23s | Max: 10m 25s
      🟩 GCC8               Pass: 100%/1   | Total: 11m 41s | Avg: 11m 41s | Max: 11m 41s
      🟩 GCC9               Pass: 100%/2   | Total: 23m 10s | Avg: 11m 35s | Max: 11m 47s
      🟩 GCC10              Pass: 100%/2   | Total: 22m 16s | Avg: 11m 08s | Max: 11m 28s
      🟩 GCC11              Pass: 100%/2   | Total: 22m 49s | Avg: 11m 24s | Max: 11m 37s
      🟩 GCC12              Pass: 100%/2   | Total: 24m 29s | Avg: 12m 14s | Max: 12m 20s
      🟨 GCC13              Pass:  62%/8   | Total:  1h 49m | Avg: 13m 40s | Max: 25m 32s
      🟥 MSVC14.29          Pass:   0%/2   | Total:  1h 07m | Avg: 33m 47s | Max: 36m 33s
      🟩 MSVC14.39          Pass: 100%/2   | Total:  1h 26m | Avg: 43m 26s | Max: 47m 57s | Hits: 384%/5254  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  1h 09m | Avg: 34m 37s | Max: 35m 16s
    🟨 cxx_family
      🟨 Clang              Pass:  94%/18  | Total:  4h 11m | Avg: 13m 58s | Max: 27m 36s
      🟨 GCC                Pass:  84%/19  | Total:  3h 54m | Avg: 12m 20s | Max: 25m 32s
      🟨 MSVC               Pass:  50%/4   | Total:  2h 34m | Avg: 38m 36s | Max: 47m 57s | Hits: 384%/5254  
      🟩 NVHPC              Pass: 100%/2   | Total:  1h 09m | Avg: 34m 37s | Max: 35m 16s
    🟨 jobs
      🟨 Build              Pass:  94%/38  | Total: 10h 15m | Avg: 16m 11s | Max: 47m 57s | Hits: 384%/5254  
      🟥 NVRTC              Pass:   0%/2   | Total: 39m 23s | Avg: 19m 41s | Max: 19m 57s
      🟥 Test               Pass:   0%/2   | Total: 53m 08s | Avg: 26m 34s | Max: 27m 36s
      🟩 VerifyCodegen      Pass: 100%/1   | Total:  1m 58s | Avg:  1m 58s | Max:  1m 58s
    🟨 gpu
      🟨 rtx2080            Pass:  33%/6   | Total:  1h 57m | Avg: 19m 32s | Max: 27m 36s
      🟨 v100               Pass:  94%/37  | Total:  9h 52m | Avg: 16m 00s | Max: 47m 57s | Hits: 384%/5254  
    🟨 std
      🟨 17                 Pass:  85%/21  | Total:  5h 50m | Avg: 16m 40s | Max: 38m 55s | Hits: 385%/2597  
      🟨 20                 Pass:  85%/21  | Total:  5h 57m | Avg: 17m 01s | Max: 47m 57s | Hits: 384%/2657  
    
  • 🟩 cub: Pass: 100%/44 | Total: 1d 16h | Avg: 54m 51s | Max: 1h 19m | Hits: 35%/3512

    🟩 cpu
      🟩 amd64              Pass: 100%/42  | Total:  1d 14h | Avg: 54m 17s | Max:  1h 19m | Hits:  35%/3512  
      🟩 arm64              Pass: 100%/2   | Total:  2h 13m | Avg:  1h 06m | Max:  1h 07m
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  4h 59m | Avg: 59m 59s | Max:  1h 04m | Hits:  37%/878   
      🟩 12.5               Pass: 100%/2   | Total:  2h 18m | Avg:  1h 09m | Max:  1h 10m
      🟩 12.6               Pass: 100%/37  | Total:  1d 08h | Avg: 53m 22s | Max:  1h 19m | Hits:  35%/2634  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  2h 23m | Avg:  1h 11m | Max:  1h 19m
      🟩 nvcc12.0           Pass: 100%/5   | Total:  4h 59m | Avg: 59m 59s | Max:  1h 04m | Hits:  37%/878   
      🟩 nvcc12.5           Pass: 100%/2   | Total:  2h 18m | Avg:  1h 09m | Max:  1h 10m
      🟩 nvcc12.6           Pass: 100%/35  | Total:  1d 06h | Avg: 52m 18s | Max:  1h 12m | Hits:  35%/2634  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  2h 23m | Avg:  1h 11m | Max:  1h 19m
      🟩 nvcc               Pass: 100%/42  | Total:  1d 13h | Avg: 54m 02s | Max:  1h 12m | Hits:  35%/3512  
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total:  4h 01m | Avg:  1h 00m | Max:  1h 02m
      🟩 Clang15            Pass: 100%/2   | Total:  1h 57m | Avg: 58m 51s | Max: 59m 06s
      🟩 Clang16            Pass: 100%/2   | Total:  2h 03m | Avg:  1h 01m | Max:  1h 01m
      🟩 Clang17            Pass: 100%/2   | Total:  2h 03m | Avg:  1h 01m | Max:  1h 03m
      🟩 Clang18            Pass: 100%/7   | Total:  6h 20m | Avg: 54m 25s | Max:  1h 19m
      🟩 GCC7               Pass: 100%/2   | Total:  2h 00m | Avg:  1h 00m | Max:  1h 04m
      🟩 GCC8               Pass: 100%/1   | Total:  1h 01m | Avg:  1h 01m | Max:  1h 01m
      🟩 GCC9               Pass: 100%/2   | Total:  2h 01m | Avg:  1h 00m | Max:  1h 02m
      🟩 GCC10              Pass: 100%/2   | Total:  2h 00m | Avg:  1h 00m | Max:  1h 00m
      🟩 GCC11              Pass: 100%/2   | Total:  2h 01m | Avg:  1h 00m | Max:  1h 00m
      🟩 GCC12              Pass: 100%/4   | Total:  2h 57m | Avg: 44m 25s | Max:  1h 04m
      🟩 GCC13              Pass: 100%/8   | Total:  4h 54m | Avg: 36m 49s | Max:  1h 05m
      🟩 MSVC14.29          Pass: 100%/2   | Total:  2h 12m | Avg:  1h 06m | Max:  1h 12m | Hits:  37%/1756  
      🟩 MSVC14.39          Pass: 100%/2   | Total:  2h 18m | Avg:  1h 09m | Max:  1h 09m | Hits:  34%/1756  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  2h 18m | Avg:  1h 09m | Max:  1h 10m
    🟩 cxx_family
      🟩 Clang              Pass: 100%/17  | Total: 16h 26m | Avg: 58m 03s | Max:  1h 19m
      🟩 GCC                Pass: 100%/21  | Total: 16h 57m | Avg: 48m 27s | Max:  1h 05m
      🟩 MSVC               Pass: 100%/4   | Total:  4h 30m | Avg:  1h 07m | Max:  1h 12m | Hits:  35%/3512  
      🟩 NVHPC              Pass: 100%/2   | Total:  2h 18m | Avg:  1h 09m | Max:  1h 10m
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 51m 41s | Avg: 25m 50s | Max: 26m 43s
      🟩 rtxa6000           Pass: 100%/8   | Total:  4h 08m | Avg: 31m 06s | Max:  1h 03m
      🟩 v100               Pass: 100%/34  | Total:  1d 11h | Avg:  1h 02m | Max:  1h 19m | Hits:  35%/3512  
    🟩 jobs
      🟩 Build              Pass: 100%/37  | Total:  1d 13h | Avg:  1h 01m | Max:  1h 19m | Hits:  35%/3512  
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 22m 42s | Avg: 22m 42s | Max: 22m 42s
      🟩 GraphCapture       Pass: 100%/1   | Total: 16m 20s | Avg: 16m 20s | Max: 16m 20s
      🟩 HostLaunch         Pass: 100%/3   | Total:  1h 13m | Avg: 24m 26s | Max: 26m 43s
      🟩 TestGPU            Pass: 100%/2   | Total: 42m 38s | Avg: 21m 19s | Max: 22m 24s
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 51m 41s | Avg: 25m 50s | Max: 26m 43s
      🟩 90a                Pass: 100%/1   | Total: 25m 09s | Avg: 25m 09s | Max: 25m 09s
    🟩 std
      🟩 17                 Pass: 100%/20  | Total: 21h 06m | Avg:  1h 03m | Max:  1h 19m | Hits:  37%/2634  
      🟩 20                 Pass: 100%/24  | Total: 19h 07m | Avg: 47m 47s | Max:  1h 10m | Hits:  31%/878   
    
  • 🟩 thrust: Pass: 100%/42 | Total: 1d 02h | Avg: 38m 32s | Max: 1h 24m | Hits: 60%/7384

    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 45m 43s | Avg: 22m 51s | Max: 34m 29s
    🟩 cpu
      🟩 amd64              Pass: 100%/40  | Total:  1d 01h | Avg: 38m 44s | Max:  1h 24m | Hits:  60%/7384  
      🟩 arm64              Pass: 100%/2   | Total:  1h 08m | Avg: 34m 24s | Max: 36m 36s
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  3h 29m | Avg: 41m 54s | Max:  1h 04m | Hits:  58%/1846  
      🟩 12.5               Pass: 100%/2   | Total:  2h 27m | Avg:  1h 13m | Max:  1h 15m
      🟩 12.6               Pass: 100%/35  | Total: 21h 01m | Avg: 36m 02s | Max:  1h 24m | Hits:  61%/5538  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  1h 03m | Avg: 31m 40s | Max: 31m 59s
      🟩 nvcc12.0           Pass: 100%/5   | Total:  3h 29m | Avg: 41m 54s | Max:  1h 04m | Hits:  58%/1846  
      🟩 nvcc12.5           Pass: 100%/2   | Total:  2h 27m | Avg:  1h 13m | Max:  1h 15m
      🟩 nvcc12.6           Pass: 100%/33  | Total: 19h 58m | Avg: 36m 18s | Max:  1h 24m | Hits:  61%/5538  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  1h 03m | Avg: 31m 40s | Max: 31m 59s
      🟩 nvcc               Pass: 100%/40  | Total:  1d 01h | Avg: 38m 52s | Max:  1h 24m | Hits:  60%/7384  
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total:  2h 21m | Avg: 35m 23s | Max: 35m 27s
      🟩 Clang15            Pass: 100%/2   | Total:  1h 22m | Avg: 41m 07s | Max: 45m 29s
      🟩 Clang16            Pass: 100%/2   | Total:  1h 15m | Avg: 37m 36s | Max: 40m 38s
      🟩 Clang17            Pass: 100%/2   | Total:  1h 16m | Avg: 38m 27s | Max: 39m 29s
      🟩 Clang18            Pass: 100%/7   | Total:  3h 05m | Avg: 26m 27s | Max: 37m 52s
      🟩 GCC7               Pass: 100%/2   | Total:  1h 11m | Avg: 35m 48s | Max: 35m 52s
      🟩 GCC8               Pass: 100%/1   | Total: 38m 39s | Avg: 38m 39s | Max: 38m 39s
      🟩 GCC9               Pass: 100%/2   | Total:  1h 17m | Avg: 38m 50s | Max: 38m 54s
      🟩 GCC10              Pass: 100%/2   | Total:  1h 16m | Avg: 38m 11s | Max: 39m 11s
      🟩 GCC11              Pass: 100%/2   | Total:  1h 15m | Avg: 37m 47s | Max: 39m 09s
      🟩 GCC12              Pass: 100%/2   | Total:  1h 22m | Avg: 41m 07s | Max: 41m 33s
      🟩 GCC13              Pass: 100%/8   | Total:  3h 20m | Avg: 25m 01s | Max: 39m 52s
      🟩 MSVC14.29          Pass: 100%/2   | Total:  2h 09m | Avg:  1h 04m | Max:  1h 05m | Hits:  83%/3692  
      🟩 MSVC14.39          Pass: 100%/2   | Total:  2h 38m | Avg:  1h 19m | Max:  1h 24m | Hits:  38%/3692  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  2h 27m | Avg:  1h 13m | Max:  1h 15m
    🟩 cxx_family
      🟩 Clang              Pass: 100%/17  | Total:  9h 21m | Avg: 33m 00s | Max: 45m 29s
      🟩 GCC                Pass: 100%/19  | Total: 10h 22m | Avg: 32m 45s | Max: 41m 33s
      🟩 MSVC               Pass: 100%/4   | Total:  4h 47m | Avg:  1h 11m | Max:  1h 24m | Hits:  60%/7384  
      🟩 NVHPC              Pass: 100%/2   | Total:  2h 27m | Avg:  1h 13m | Max:  1h 15m
    🟩 gpu
      🟩 rtx4090            Pass: 100%/8   | Total:  2h 35m | Avg: 19m 29s | Max: 37m 52s
      🟩 v100               Pass: 100%/34  | Total:  1d 00h | Avg: 43m 00s | Max:  1h 24m | Hits:  60%/7384  
    🟩 jobs
      🟩 Build              Pass: 100%/37  | Total:  1d 02h | Avg: 42m 22s | Max:  1h 24m | Hits:  60%/7384  
      🟩 TestCPU            Pass: 100%/2   | Total: 17m 08s | Avg:  8m 34s | Max:  9m 17s
      🟩 TestGPU            Pass: 100%/3   | Total: 33m 22s | Avg: 11m 07s | Max: 11m 36s
    🟩 sm
      🟩 90a                Pass: 100%/1   | Total: 23m 59s | Avg: 23m 59s | Max: 23m 59s
    🟩 std
      🟩 17                 Pass: 100%/20  | Total: 14h 37m | Avg: 43m 52s | Max:  1h 13m | Hits:  75%/5538  
      🟩 20                 Pass: 100%/20  | Total: 11h 35m | Avg: 34m 45s | Max:  1h 24m | Hits:  18%/1846  
    
  • 🟩 cudax: Pass: 100%/20 | Total: 4h 30m | Avg: 13m 30s | Max: 17m 32s | Hits: 90%/522

    🟩 cpu
      🟩 amd64              Pass: 100%/16  | Total:  3h 36m | Avg: 13m 33s | Max: 17m 32s | Hits:  90%/522   
      🟩 arm64              Pass: 100%/4   | Total: 53m 13s | Avg: 13m 18s | Max: 13m 55s
    🟩 ctk
      🟩 12.0               Pass: 100%/1   | Total:  9m 35s | Avg:  9m 35s | Max:  9m 35s | Hits:  90%/261   
      🟩 12.5               Pass: 100%/2   | Total: 19m 34s | Avg:  9m 47s | Max: 10m 06s
      🟩 12.6               Pass: 100%/17  | Total:  4h 00m | Avg: 14m 10s | Max: 17m 32s | Hits:  90%/261   
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/1   | Total:  9m 35s | Avg:  9m 35s | Max:  9m 35s | Hits:  90%/261   
      🟩 nvcc12.5           Pass: 100%/2   | Total: 19m 34s | Avg:  9m 47s | Max: 10m 06s
      🟩 nvcc12.6           Pass: 100%/17  | Total:  4h 00m | Avg: 14m 10s | Max: 17m 32s | Hits:  90%/261   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/20  | Total:  4h 30m | Avg: 13m 30s | Max: 17m 32s | Hits:  90%/522   
    🟩 cxx
      🟩 Clang14            Pass: 100%/1   | Total: 13m 29s | Avg: 13m 29s | Max: 13m 29s
      🟩 Clang15            Pass: 100%/1   | Total: 16m 50s | Avg: 16m 50s | Max: 16m 50s
      🟩 Clang16            Pass: 100%/1   | Total: 16m 21s | Avg: 16m 21s | Max: 16m 21s
      🟩 Clang17            Pass: 100%/1   | Total: 17m 01s | Avg: 17m 01s | Max: 17m 01s
      🟩 Clang18            Pass: 100%/4   | Total: 53m 20s | Avg: 13m 20s | Max: 15m 06s
      🟩 GCC10              Pass: 100%/1   | Total: 16m 06s | Avg: 16m 06s | Max: 16m 06s
      🟩 GCC11              Pass: 100%/1   | Total: 14m 44s | Avg: 14m 44s | Max: 14m 44s
      🟩 GCC12              Pass: 100%/2   | Total: 29m 21s | Avg: 14m 40s | Max: 17m 32s
      🟩 GCC13              Pass: 100%/4   | Total: 51m 34s | Avg: 12m 53s | Max: 13m 55s
      🟩 MSVC14.36          Pass: 100%/1   | Total:  9m 35s | Avg:  9m 35s | Max:  9m 35s | Hits:  90%/261   
      🟩 MSVC14.39          Pass: 100%/1   | Total: 12m 06s | Avg: 12m 06s | Max: 12m 06s | Hits:  90%/261   
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 19m 34s | Avg:  9m 47s | Max: 10m 06s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/8   | Total:  1h 57m | Avg: 14m 37s | Max: 17m 01s
      🟩 GCC                Pass: 100%/8   | Total:  1h 51m | Avg: 13m 58s | Max: 17m 32s
      🟩 MSVC               Pass: 100%/2   | Total: 21m 41s | Avg: 10m 50s | Max: 12m 06s | Hits:  90%/522   
      🟩 NVHPC              Pass: 100%/2   | Total: 19m 34s | Avg:  9m 47s | Max: 10m 06s
    🟩 gpu
      🟩 rtx2080            Pass: 100%/4   | Total: 56m 07s | Avg: 14m 01s | Max: 17m 32s
      🟩 v100               Pass: 100%/16  | Total:  3h 33m | Avg: 13m 22s | Max: 17m 01s | Hits:  90%/522   
    🟩 jobs
      🟩 Build              Pass: 100%/18  | Total:  4h 06m | Avg: 13m 41s | Max: 17m 32s | Hits:  90%/522   
      🟩 Test               Pass: 100%/2   | Total: 23m 29s | Avg: 11m 44s | Max: 11m 49s
    🟩 sm
      🟩 90                 Pass: 100%/1   | Total: 13m 11s | Avg: 13m 11s | Max: 13m 11s
      🟩 90a                Pass: 100%/1   | Total: 11m 44s | Avg: 11m 44s | Max: 11m 44s
    🟩 std
      🟩 17                 Pass: 100%/4   | Total: 48m 20s | Avg: 12m 05s | Max: 13m 11s
      🟩 20                 Pass: 100%/16  | Total:  3h 41m | Avg: 13m 51s | Max: 17m 32s | Hits:  90%/522   
    
  • 🟩 cccl: Pass: 100%/4 | Total: 15m 36s | Avg: 3m 54s | Max: 4m 43s

    🟩 cpu
      🟩 amd64              Pass: 100%/4   | Total: 15m 36s | Avg:  3m 54s | Max:  4m 43s
    🟩 ctk
      🟩 12.0               Pass: 100%/2   | Total:  8m 10s | Avg:  4m 05s | Max:  4m 43s
      🟩 12.6               Pass: 100%/2   | Total:  7m 26s | Avg:  3m 43s | Max:  4m 12s
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/2   | Total:  8m 10s | Avg:  4m 05s | Max:  4m 43s
      🟩 nvcc12.6           Pass: 100%/2   | Total:  7m 26s | Avg:  3m 43s | Max:  4m 12s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/4   | Total: 15m 36s | Avg:  3m 54s | Max:  4m 43s
    🟩 cxx
      🟩 Clang14            Pass: 100%/1   | Total:  3m 27s | Avg:  3m 27s | Max:  3m 27s
      🟩 Clang18            Pass: 100%/1   | Total:  4m 12s | Avg:  4m 12s | Max:  4m 12s
      🟩 GCC12              Pass: 100%/1   | Total:  4m 43s | Avg:  4m 43s | Max:  4m 43s
      🟩 GCC13              Pass: 100%/1   | Total:  3m 14s | Avg:  3m 14s | Max:  3m 14s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/2   | Total:  7m 39s | Avg:  3m 49s | Max:  4m 12s
      🟩 GCC                Pass: 100%/2   | Total:  7m 57s | Avg:  3m 58s | Max:  4m 43s
    🟩 gpu
      🟩 rtx2080            Pass: 100%/4   | Total: 15m 36s | Avg:  3m 54s | Max:  4m 43s
    🟩 jobs
      🟩 Infra              Pass: 100%/4   | Total: 15m 36s | Avg:  3m 54s | Max:  4m 43s
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 6m 59s | Avg: 3m 29s | Max: 4m 44s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total:  6m 59s | Avg:  3m 29s | Max:  4m 44s
    🟩 ctk
      🟩 12.6               Pass: 100%/2   | Total:  6m 59s | Avg:  3m 29s | Max:  4m 44s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/2   | Total:  6m 59s | Avg:  3m 29s | Max:  4m 44s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total:  6m 59s | Avg:  3m 29s | Max:  4m 44s
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total:  6m 59s | Avg:  3m 29s | Max:  4m 44s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total:  6m 59s | Avg:  3m 29s | Max:  4m 44s
    🟩 gpu
      🟩 rtx2080            Pass: 100%/2   | Total:  6m 59s | Avg:  3m 29s | Max:  4m 44s
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  2m 15s | Avg:  2m 15s | Max:  2m 15s
      🟩 Test               Pass: 100%/1   | Total:  4m 44s | Avg:  4m 44s | Max:  4m 44s
    
  • 🟩 python: Pass: 100%/1 | Total: 33m 49s | Avg: 33m 49s | Max: 33m 49s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 33m 49s | Avg: 33m 49s | Max: 33m 49s
    🟩 ctk
      🟩 12.6               Pass: 100%/1   | Total: 33m 49s | Avg: 33m 49s | Max: 33m 49s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/1   | Total: 33m 49s | Avg: 33m 49s | Max: 33m 49s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 33m 49s | Avg: 33m 49s | Max: 33m 49s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 33m 49s | Avg: 33m 49s | Max: 33m 49s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 33m 49s | Avg: 33m 49s | Max: 33m 49s
    🟩 gpu
      🟩 rtx2080            Pass: 100%/1   | Total: 33m 49s | Avg: 33m 49s | Max: 33m 49s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 33m 49s | Avg: 33m 49s | Max: 33m 49s
    

👃 Inspect Changes

Modifications in project?

Project
+/- CCCL Infrastructure
+/- libcu++
CUB
Thrust
+/- CUDA Experimental
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
+/- CCCL Infrastructure
+/- libcu++
+/- CUB
+/- Thrust
+/- CUDA Experimental
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 156)

# Runner
110 linux-amd64-cpu16
14 windows-amd64-cpu16
12 linux-amd64-gpu-rtx2080-latest-1
10 linux-arm64-cpu16
6 linux-amd64-gpu-rtxa6000-latest-1
3 linux-amd64-gpu-rtx4090-latest-1
1 linux-amd64-gpu-h100-latest-1

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
Status: In Progress
Development

Successfully merging this pull request may close these issues.

Replace mdspan implementation with one derived from the newer LLVM implementation
3 participants