- 浦口
-
23:03
- 8h ahead
Stars
图片/字体取模软件,跨平台。Cross platform GUI converting images or fonts into array data.
A Fast and Extensible DRAM Simulator, with built-in support for modeling many different DRAM technologies including DDRx, LPDDRx, GDDRx, WIOx, HBMx, and various academic proposals. Described in the…
Ramulator 2.0 is a modern, modular, extensible, and fast cycle-accurate DRAM simulator. It provides support for agile implementation and evaluation of new memory system designs (e.g., new DRAM stan…
Ecosystem of libraries and tools for writing and executing fast GPU code fully in Rust.
Online compiler for HIP and NVIDIA® CUDA® code to WebGPU
📚200+ Tensor/CUDA Cores Kernels, ⚡️flash-attn-mma, ⚡️hgemm with WMMA, MMA and CuTe (98%~100% TFLOPS of cuBLAS/FA2 🎉🎉).
The experimental work to rewrite Chisel in pure Scala 3 and the Panama Project
likelovewant / ollama-for-amd
Forked from ollama/ollamaGet up and running with Llama 3, Mistral, Gemma, and other large language models.by adding more amd gpu support.
A machine learning compiler for GPUs, CPUs, and ML accelerators
Backward compatible ML compute opset inspired by HLO/MHLO
HIP: C++ Heterogeneous-Compute Interface for Portability
A guide that explains how high level programming language constructs are mapped to the LLVM intermediate language.
Smaller, easier, more powerful, and more reliable than make. An implementation of djb's redo.
Intermediate Language (IL) for Hardware Accelerator Generators
A flexible, high-performance, user-friendly computer architecture simulator engine