- Lausanne, Switzerland
- yconquesty.github.io
Stars
A repository to unravel the language of GPUs, making their kernel conversations easy to understand
RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable). We are at RWKV-7 "Goose". So it's combining the best of RN…
A lightweight data processing framework built on DuckDB and 3FS.
A high-performance distributed file system designed to address the challenges of AI training and inference workloads.
Analyze computation-communication overlap in V3/R1.
A bidirectional pipeline parallelism algorithm for computation-communication overlap in V3/R1 training.
rocSHMEM intra-kernel networking runtime for AMD dGPUs on the ROCm platform.
DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling
DeepEP: an efficient expert-parallel communication library
A list of companies of possible interest for mathematicians (or related) that are looking for a job in quantitative finance in Zurich.
🎨 Refly is an open-source AI-native creation engine. Its intuitive free-form canvas interface combines multi-threaded dialogues, AI knowledge base integration, chrome extension clip & save, context…
nanobind: tiny and efficient C++/Python bindings
Lean 4 programming language and theorem prover
Source code examples from the Parallel Forall Blog
A lightweight (3 file, single function) library for running micro-benchmarks on C++ code
A fast single-producer, single-consumer lock-free queue for C++
Repo for counting stars and contributing. Press F to pay respect to glorious developers.
AXI, AXI stream, Ethernet, and PCIe components in SystemVerilog
User-friendly Desktop Client App for AI Models/LLMs (GPT, Claude, Gemini, Ollama...)
Clean, minimal, accessible reproduction of DeepSeek R1-Zero
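A simple collection of technical explainer tutorials, focused on interesting, cutting-edge technical concepts and principles. Each article aims to be readable in under 5 minutes.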