-
Notifications
You must be signed in to change notification settings - Fork 1.2k
Issues: Lightning-AI/litgpt
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
Mistral7B pretraining convergence is too slow
question
Further information is requested
#697
opened Nov 5, 2023 by
LamOne1
[question] how to set the max_iters value
question
Further information is requested
#682
opened Oct 29, 2023 by
nevermet
[question] how to finetune efficiently
question
Further information is requested
#680
opened Oct 28, 2023 by
nevermet
Custom 4k context length supporting and converting model config to huggingface supportted config file
question
Further information is requested
#666
opened Oct 24, 2023 by
SinclairCoder
Add NEFTune to improve fine-tuning?
enhancement
New feature or request
fine-tuning
#642
opened Oct 13, 2023 by
nkasmanoff
AttributeError: <class 'lit_gpt.utils.NotYetLoadedTensor'> does not have permute
bug
Something isn't working
#639
opened Oct 11, 2023 by
azinman
Any finetuning is having No effect on small models
question
Further information is requested
#627
opened Oct 8, 2023 by
VatsaDev
Sample packing for pretraining/fine-tuning
enhancement
New feature or request
pre-training
#620
opened Oct 6, 2023 by
alitirmizi23
Will longlora finetune supported in the future?
model-weights
#616
opened Oct 5, 2023 by
universewill
QA-LoRA: Quantization Aware Low-Rank Adaptation
enhancement
New feature or request
fine-tuning
#595
opened Sep 28, 2023 by
Andrei-Aksionov
How to use Activation Checkpointing and Parameter Offloading in a single GPU?
bug
Something isn't working
#594
opened Sep 28, 2023 by
ifshine
[Question] How to decrease my loss?
question
Further information is requested
#575
opened Sep 21, 2023 by
ifshine
Adding RLHF support
enhancement
New feature or request
fine-tuning
#557
opened Sep 16, 2023 by
rasbt
Can I use lora or adapter to fine-tune some non-instruction set data?
question
Further information is requested
#550
opened Sep 15, 2023 by
hello-eternity
TPU RuntimeError: Expected query, key, and value to have the same dtype, but got query.dtype: float key.dtype: c10::BFloat16 and value.dtype: c10::BFloat16 instead.
bug
Something isn't working
#518
opened Sep 9, 2023 by
ethanhe42
LoRA with quantization: Something isn't working
quantization
micro_batch_size
effect on memory footprint
bug
#501
opened Sep 6, 2023 by
Andrei-Aksionov
3 tasks
OOM with bf16-true, Quantization, for long context length.
bug
Something isn't working
quantization
#477
opened Aug 29, 2023 by
KOVVURISATYANARAYANAREDDY
QLoRA for Multi-GPU Settings
enhancement
New feature or request
fine-tuning
#449
opened Aug 22, 2023 by
rasbt
Look into adding Dolma pretraining dataset
enhancement
New feature or request
#439
opened Aug 18, 2023 by
rasbt
Evaluation with triviaqa
bug
Something isn't working
evaluation
#424
opened Aug 16, 2023 by
sanyalsunny111
ProTip!
Add no:assignee to see everything that’s not assigned.