Lightning-AI / litgpt Public

Notifications You must be signed in to change notification settings
Fork 1.2k
Star 11.8k

Code
Issues 260
Pull requests 16
Actions
Security
Insights

Additional navigation options

Code
Issues
Pull requests
Actions
Security
Insights

Issues: Lightning-AI/litgpt

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

260 Open 589 Closed

Author

Filter by author

Label

Filter by label

Use alt + click/return to exclude labels

or ⇧ + click/return for logical OR

Projects

Filter by project

Milestones

Filter by milestone

Assignee

Filter by who’s assigned

Assigned to nobody

Sort

Sort by

Newest Oldest Most commented Least commented Recently updated Least recently updated Best match

Most reactions

Issues list

Mistral7B pretraining convergence is too slow question

Further information is requested

#697 opened Nov 5, 2023 by LamOne1

Significantly different results with inference using a saved checkpoint v/s inferencing during fine-tuning

#686 opened Oct 30, 2023 by madhurapande19

[question] how to set the max_iters value question

Further information is requested

#682 opened Oct 29, 2023 by nevermet

[question] how to finetune efficiently question

Further information is requested

#680 opened Oct 28, 2023 by nevermet

Custom 4k context length supporting and converting model config to huggingface supportted config file question

Further information is requested

#666 opened Oct 24, 2023 by SinclairCoder

Finetuning bug bug

Something isn't working

#645 opened Oct 15, 2023 by Gideonah

Add NEFTune to improve fine-tuning? enhancement

New feature or request

fine-tuning

#642 opened Oct 13, 2023 by nkasmanoff

AttributeError: <class 'lit_gpt.utils.NotYetLoadedTensor'> does not have permute bug

Something isn't working

#639 opened Oct 11, 2023 by azinman

Any finetuning is having No effect on small models question

Further information is requested

#627 opened Oct 8, 2023 by VatsaDev

Sample packing for pretraining/fine-tuning enhancement

New feature or request

pre-training

#620 opened Oct 6, 2023 by alitirmizi23

Will longlora finetune supported in the future? model-weights

#616 opened Oct 5, 2023 by universewill

QA-LoRA: Quantization Aware Low-Rank Adaptation enhancement

New feature or request

fine-tuning

#595 opened Sep 28, 2023 by Andrei-Aksionov

How to use Activation Checkpointing and Parameter Offloading in a single GPU? bug

Something isn't working

#594 opened Sep 28, 2023 by ifshine

Add tutorial for custom checkpoints model-weights

#591 opened Sep 27, 2023 by JOHW85

[Question] How to decrease my loss? question

Further information is requested

#575 opened Sep 21, 2023 by ifshine

Adding RLHF support enhancement

New feature or request

fine-tuning

#557 opened Sep 16, 2023 by rasbt

Can I use Wizardcoder to finetune？ model-weights

#555 opened Sep 16, 2023 by hello-eternity

Can I use lora or adapter to fine-tune some non-instruction set data? question

Further information is requested

#550 opened Sep 15, 2023 by hello-eternity

slow finetuning on TPUv3-8 bug

Something isn't working

#519 opened Sep 10, 2023 by ethanhe42

TPU RuntimeError: Expected query, key, and value to have the same dtype, but got query.dtype: float key.dtype: c10::BFloat16 and value.dtype: c10::BFloat16 instead. bug

Something isn't working

#518 opened Sep 9, 2023 by ethanhe42

LoRA with quantization: micro_batch_size effect on memory footprint bug

Something isn't working

quantization

#501 opened Sep 6, 2023 by Andrei-Aksionov

3 tasks

OOM with bf16-true, Quantization, for long context length. bug

Something isn't working

quantization

#477 opened Aug 29, 2023 by KOVVURISATYANARAYANAREDDY

QLoRA for Multi-GPU Settings enhancement

New feature or request

fine-tuning

#449 opened Aug 22, 2023 by rasbt

Look into adding Dolma pretraining dataset enhancement

New feature or request

#439 opened Aug 18, 2023 by rasbt

Evaluation with triviaqa bug

Something isn't working

evaluation

#424 opened Aug 16, 2023 by sanyalsunny111

Previous 1 2 … 7 8 9 10 11 Next

Previous Next

ProTip! Add no:assignee to see everything that’s not assigned.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly