Pull requests: AI-Hypercomputer/maxtext
This change introduces a pure JAX implementation of flash attention to MaxText, designed as a drop-in replacement for the existing Pallas kernel. This CL sets the stage by integrating it with MaxText in FSDP mode. Further optimizations are planned to close the gap with Pallas using different techniques such as:
#2793
opened Dec 5, 2025 by copybara-service bot
Add concat_then_split packing in Grain pipeline
#2786
opened Dec 4, 2025 by aireenmei
4 tasks done
Onboard explicit sharding to deepseek [split version]
gemini-review
#2783
opened Dec 3, 2025 by NuojCheng
4 tasks done
Fixes Issue #2781 for gcloud command in getting number of workers
#2782
opened Dec 3, 2025 by MrGeislinger
4 tasks done
Enable to autotune maxtext extra_flags and maxtext_flags.
#2779
opened Dec 2, 2025 by copybara-service bot
Support Custom MaxText model (with vLLM engine) in RL rollouts.
#2778
opened Dec 2, 2025 by NicoGrande
4 tasks done
Use MaxText max_segments_per_seq config variable to control Grain batch packing
#2774
opened Dec 2, 2025 by gabeweisz
4 tasks done
Support checkpointing TE quantizations with new remat policies
#2773
opened Dec 2, 2025 by jberchtold-nvidia
4 tasks done
Docs: Improve TPU Runtime & Colab setup guide
#2768
opened Dec 1, 2025 by RexBearIU
4 tasks done
Add vLLM Support for GPT OSS and its mapping generator for tunix.
#2766
opened Dec 1, 2025 by abhinavclemson
4 tasks done
Update documentation links and remove an old requirements file.
#2763
opened Nov 28, 2025 by copybara-service bot
[pyproject.toml] Use already-in-use hatch plugin over custom hook
#2762
opened Nov 27, 2025 by SamuelMarks
4 tasks done
Fix tuple unpacking regression in Decoder Layers
#2753
opened Nov 25, 2025 by gagika
4 tasks done
Implement option to perform yarn embedding using a rotation matrix in real vector space.
#2752
opened Nov 25, 2025 by copybara-service bot
Adding option for TE Dot with BF16 + unittest
#2750
opened Nov 25, 2025 by phu0ngng
4 tasks done
Fix the grpo on llama3.1-8b-instruction and its notebook example.
gemini-review
#2746
opened Nov 24, 2025 by mathczh
4 tasks done