
Dora_datacollector_updated #2197

Merged
merged 39 commits into from
Nov 4, 2024
Conversation

shirinyamani
Contributor

This PR just updates the data collator for fine-tuning using DoRA.
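For context on what a data collator in a causal-LM fine-tuning setup does, here is a minimal pure-Python sketch: it right-pads each batch to the longest sequence and masks padding positions out of the loss. All names and the padding scheme are illustrative assumptions, not the code from this PR (the actual collator lives in the PEFT DoRA example).

```python
def collate_causal_lm(batch, pad_token_id=0, label_pad=-100):
    """Toy causal-LM data collator (illustrative sketch, not the PR's code).

    Right-pads every sequence of token ids to the longest one in the
    batch, builds an attention mask, and copies the ids into labels
    with padding replaced by `label_pad` so it is ignored by the loss.
    """
    max_len = max(len(ids) for ids in batch)
    input_ids, attention_mask, labels = [], [], []
    for ids in batch:
        pad = max_len - len(ids)
        input_ids.append(ids + [pad_token_id] * pad)
        attention_mask.append([1] * len(ids) + [0] * pad)
        labels.append(ids + [label_pad] * pad)  # loss is masked on padding
    return {"input_ids": input_ids,
            "attention_mask": attention_mask,
            "labels": labels}
```

In practice a library collator would also return tensors rather than lists, but the padding and label-masking logic is the core idea.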

shirinyamani and others added 30 commits June 19, 2024 23:15
Co-authored-by: Benjamin Bossan <BenjaminBossan@users.noreply.github.com>
@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

Member

@BenjaminBossan left a comment

Thanks for fixing this.

I think when creating the PR, something went wrong, as it shows 39 commits, even though there is only one change. But it's no big deal, I can still merge.

@BenjaminBossan BenjaminBossan merged commit 4e57aa5 into huggingface:main Nov 4, 2024
14 checks passed
@shirinyamani
Contributor Author

Hi Benjamin @BenjaminBossan
Thanks for all the good work you do. I just added the implementation of the SnapKV Cache paper to the cache_utils.py file. To reflect the SnapKV approach, I added the implementation on llama_modeling under llama_snapkv.py so that users can see what changes have to be applied to flash_attention2 to reflect SnapKV. However, I'm not entirely sure that this is the best location for the implementation. I'm confident that the main SnapKV logic belongs in cache_utils.py, but I would really appreciate your insights on whether the llama_snapkv.py example is well placed or whether it should be integrated differently.
Much appreciated!
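For readers unfamiliar with the method being discussed: SnapKV compresses the KV cache by scoring each cached key position by the attention mass it receives from a recent "observation window" of queries, then keeping only the top-scoring positions. The sketch below is a toy, dependency-free illustration of that selection idea; all names are hypothetical, and this is not the implementation from cache_utils.py or llama_snapkv.py.

```python
import math

def snapkv_select(keys, queries_window, keep):
    """Toy SnapKV-style selection (illustrative sketch, not the PR's code).

    Scores each cached key position by the softmax attention mass it
    receives from the queries in a recent observation window, then
    returns the indices of the `keep` highest-scoring positions in
    their original order. (The real method also pools scores and
    always retains the window tokens themselves; omitted for brevity.)
    """
    d = len(keys[0])
    scores = [0.0] * len(keys)
    for q in queries_window:
        # scaled dot-product attention logits of this query over all keys
        logits = [sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(d)
                  for k in keys]
        m = max(logits)
        exps = [math.exp(x - m) for x in logits]
        z = sum(exps)
        for i, e in enumerate(exps):
            scores[i] += e / z  # accumulate attention mass per position
    top = sorted(range(len(keys)), key=lambda i: scores[i], reverse=True)[:keep]
    return sorted(top)

# Key at index 1 aligns with both window queries, so it is retained.
keys = [[1.0, 0.0], [0.0, 1.0], [-1.0, 0.0]]
window = [[0.0, 2.0], [0.0, 3.0]]
print(snapkv_select(keys, window, keep=2))
```

In an actual cache implementation this selection would be applied per attention head to the cached key/value tensors before continuing generation.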

@BenjaminBossan
Member

@shirinyamani Thanks for working on this. I'm not a transformers maintainer, so I can't really tell you where best to implement this feature; it's best if you reach out to the transformers folks. What you could also do is create a draft PR with your implementation the way you suggested — I'm sure they'll let you know if you have to change something.
