Issues: allenai/OLMo
Issues list
make it available as gguf and available in llama.cpp and ollama
type/feature
An issue or pull request that introduces a new feature
#772
opened Dec 25, 2024 by
olumolu
Single Accelerator training and MPS support (PR #769)
type/feature
An issue or pull request that introduces a new feature
#770
opened Dec 21, 2024 by
peter-sk
Sudden data error during training
type/bug
An issue about a bug
#766
opened Dec 16, 2024 by
faresobeid
tokenizer.encode function`s param add_special_tokens=False not work.
type/bug
An issue about a bug
#765
opened Dec 12, 2024 by
xiaohan2909
Difference Between DDP and FSDP Modes
type/question
An issue that's a question
#762
opened Dec 6, 2024 by
lllabmaster
About eos_token_id in config file (20M, 1B)
type/question
An issue that's a question
#757
opened Nov 29, 2024 by
lllabmaster
Fail to load tokenizer for checkpoints
type/bug
An issue about a bug
#741
opened Oct 24, 2024 by
tresiwald
Error Encountered During Multi-Node Pretraining with Torchrun
type/bug
An issue about a bug
#737
opened Oct 21, 2024 by
Zehui127
8-bit allgather support
type/question
An issue that's a question
#722
opened Sep 19, 2024 by
yaroslavvb
Which mmlu validation setting is recommend?
type/question
An issue that's a question
#714
opened Aug 27, 2024 by
mathfinder
[Quick question]: How do I turn off FSDP?
type/question
An issue that's a question
#703
opened Aug 15, 2024 by
candygocandy
RuntimeError: Triton Error [CUDA]: invalid device context
type/bug
An issue about a bug
#700
opened Aug 13, 2024 by
andymvp2018
slurm script for: configs/official/OLMo-7B.yaml
type/question
An issue that's a question
#699
opened Aug 13, 2024 by
andymvp2018
Gflops computation is faulty for FSDP due to bug in OLMo.num_params()
#695
opened Aug 7, 2024 by
AkshitaB
why CrossEntropyLoss is zero,i
type/question
An issue that's a question
#692
opened Aug 6, 2024 by
aizhweiwei
Olmo 0724 -hf checkpoints don't load the proper config when instantiating with OLMoForCausalLM
type/bug
An issue about a bug
#689
opened Aug 5, 2024 by
sarahwie
Model ladder has no documentation
type/documentation
An issue or pull request related to documentation
#683
opened Jul 31, 2024 by
IanMagnusson
mlp_ratio not adjusted in config if mlp_hidden_size is set
type/bug
An issue about a bug
#673
opened Jul 21, 2024 by
Muennighoff
Does global_train_batch_size support gradient accumulation?
type/question
An issue that's a question
#672
opened Jul 21, 2024 by
jinzhuoran
Is there explicitly instruction-following data in the version of Dolma used to train v1?
type/question
An issue that's a question
#658
opened Jul 15, 2024 by
john-hewitt
Can long text be splitted into short texts?
type/question
An issue that's a question
#655
opened Jul 12, 2024 by
CoinCheung
Cannot convert internal OLMo checkpoint to HF
type/bug
An issue about a bug
#654
opened Jul 11, 2024 by
viking-sudo-rm
start_index not getting reset in data loader when moving to new epoch
type/bug
An issue about a bug
#650
opened Jul 10, 2024 by
leon-g-xu