Skip to content

Pull requests: axolotl-ai-cloud/axolotl

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Add peft_autocast_adapter_dtype config option
#3311 opened Dec 9, 2025 by xzuyn Loading…
[WIP] Fix SP tests
#3309 opened Dec 8, 2025 by SalmanMohammadi Loading…
support for xformers wheels for torch 2.9
#3308 opened Dec 8, 2025 by winglian Loading…
add liger support kernal for dpo
#3302 opened Dec 4, 2025 by ved1beta Loading…
upgrade dependencies dec 2025
#3299 opened Dec 3, 2025 by winglian Loading…
compute loss only if training
#3293 opened Dec 2, 2025 by ved1beta Loading…
fix: perplexity metric
#3288 opened Dec 1, 2025 by xzuyn Draft
Add QAT NVFP4 configs for blogpost
#3280 opened Nov 26, 2025 by SalmanMohammadi Loading…
Prepare for transformers v5 upgrade
#3272 opened Nov 20, 2025 by winglian Loading…
fix: Fix evaluation loss in KD trainer
#3271 opened Nov 20, 2025 by roycho96 Loading…
1 task done
Dist muon
#3264 opened Nov 14, 2025 by SalmanMohammadi Loading…
pass base_model insted of model_type
#3263 opened Nov 14, 2025 by ved1beta Loading…
MoE Grouped MM support (5X+ MoE training perf gains)
#3260 opened Nov 12, 2025 by lhl Loading…
Moe aux loss free
#3259 opened Nov 11, 2025 by lhl Loading…
Allow muon optimizer with DeepSpeed Zero 1-2
#3258 opened Nov 11, 2025 by lhl Loading…
sample gen support sft
#3240 opened Oct 30, 2025 by ved1beta Loading…
include tool in default message_property_mappings hold don't merge this yet
#3228 opened Oct 23, 2025 by winglian Loading…
Fix: Resolve torch_dtype correctly for AMP
#3189 opened Sep 27, 2025 by onesnep Loading…
Increased test coverage for lora/qlora
#3147 opened Sep 10, 2025 by ved1beta Loading…
ProTip! Find all pull requests that aren't related to any open issues with -linked:issue.