Issues: pytorch/torchtune

v0.6.0 tracker
#2232 opened Jan 6, 2025 by joecummings
Open
Testing tracker
#1890 opened Oct 23, 2024 by felipemello1
Open

Issues list

Grad Norm Differences Across Nodes (labels: discussion)
#2240 opened Jan 9, 2025 by EugenHotaj
Finetune meta-llama/Llama-Guard-3-1B (labels: triaged)
#2237 opened Jan 8, 2025 by jingzhaoou
v0.6.0 tracker
#2232 opened Jan 6, 2025 by joecummings
quantization recipe should mimic checkpointer.save_checkpoint (labels: better engineering)
#2229 opened Jan 4, 2025 by felipemello1
Hugging Face from_pretrained() using merged weights KeyError: 'base_model_name_or_path' (labels: bug, triaged)
#2224 opened Jan 2, 2025 by chg0901
How to use train and test split with the recipes? (labels: enhancement, triaged)
#2222 opened Jan 1, 2025 by 7rabbit
packed errors (labels: bug, triaged)
#2218 opened Dec 31, 2024 by chg0901
More Chat Loss Masking Strategies
#2214 opened Dec 30, 2024 by EugenHotaj
how to estimate gpu memory needed for knowledge distillation? (labels: discussion, triaged)
#2213 opened Dec 30, 2024 by chuangzhidan
Llama3.1 models do not allow configuring max_seq_len (labels: bug, triaged)
#2202 opened Dec 23, 2024 by akashc1
How to use float8 for training?
#2201 opened Dec 23, 2024 by vgoklani