Skip to content

Pull requests: flashinfer-ai/flashinfer

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

feat: add memcpy and memset to CUPTI timing method
#2223 opened Dec 16, 2025 by nv-yunzheq Loading…
5 tasks
[WIP] Add norm fp4quant fusion
#2220 opened Dec 15, 2025 by nv-yunzheq Loading…
5 tasks
feat: add fused top-k page construction kernels for DSA
#2215 opened Dec 13, 2025 by yzh119 Loading…
5 tasks
misc: support checks for gemm
#2214 opened Dec 13, 2025 by jimmyzho Loading…
5 tasks
feat: Cold L2 Cache Benchmarking with Rotating Buffers
#2213 opened Dec 12, 2025 by bkryu Loading…
3 of 5 tasks
2025 Dec
cicd: Add sanity test script
#2212 opened Dec 12, 2025 by kahyunnam Loading…
5 tasks done
refactor: update fa3 codebase [part 2]
#2192 opened Dec 9, 2025 by yzh119 Loading…
4 of 5 tasks
Add CUDA graph buffers for persistent attention
#2185 opened Dec 7, 2025 by Edenzzzz Loading…
5 tasks
Fix/moe_sm110 (to be tested)
#2183 opened Dec 6, 2025 by aleozlx Draft
5 tasks
Enable Hopper FA3 FP8 attention in decode.py
#2148 opened Nov 28, 2025 by nvpohanh Loading…
5 tasks done
make DeepGEMM swapAB available for linear gemm SM90
#2131 opened Nov 22, 2025 by katec846 Loading…
3 of 5 tasks
make DeepGEMM swapAB available for linear gemm SM90
#2101 opened Nov 17, 2025 by xuanzic Loading…
5 tasks
feat: add sink to flashinfer decode
#2087 opened Nov 13, 2025 by djmmoss Loading…
feat: BF16 GEMM using CUTLASS backend for SM100
#2070 opened Nov 10, 2025 by raayandhar Loading…
5 tasks done
ProTip! Type g i on any issue or pull request to go back to the issue listing page.