Migrate hf trainer #287

gkumbhat · 2023-11-30T16:08:33Z

Changes

Replace custom training loop with HF Trainer

Co-authored-by: Alex-Brooks <[email protected]> Signed-off-by: gkumbhat <[email protected]>

Signed-off-by: gkumbhat <[email protected]>

Co-authored-by: Alex-Brooks <[email protected]> Signed-off-by: gkumbhat <[email protected]>

Signed-off-by: gkumbhat <[email protected]>

Co-authored-by: Alex-Brooks <[email protected]> Signed-off-by: gkumbhat <[email protected]>

Ssukriti

from the changes it seems like fine tuning was already using Trainer, you have now added it to prompt tuning as well. I think you meant to re-use some functions, hence you moved them to toolkit.
Prompt tuning is using launch_training and preprocessing from toolkit, but text_generation_local has its own definition of both functions, _launch_training. Code looks the same between both launch training, so maybe you forgot to change fine tuning to use toolkit as well?

caikit_nlp/modules/text_generation/peft_prompt_tuning.py

gkumbhat · 2023-12-04T00:05:40Z

@Ssukriti , I did this on purpose to avoid making too many refactors in one PR. I added a comment regarding that: https://github.com/caikit/caikit-nlp/pull/287/files#diff-3ca8e28141febc0ff5a812a8b7a2f92997f0098406b924bd4bdbcab680813cc3R73

…down Signed-off-by: [email protected] <[email protected]>

Ssukriti

I have tested PT before and after the change with llama model on some quick text samples, and results are same.

gkumbhat and others added 7 commits November 26, 2023 14:49

🚧 Initiate HF Trainer

f6ae261

Co-authored-by: Alex-Brooks <[email protected]> Signed-off-by: gkumbhat <[email protected]>

🚧 Replace custom training loop with HF Trainer

da9418d

Co-authored-by: Alex-Brooks <[email protected]> Signed-off-by: gkumbhat <[email protected]>

🔥 Remove old training execution functions

c442b06

Signed-off-by: gkumbhat <[email protected]>

🚧 Fix missing imports and arguments

565cb3c

Co-authored-by: Alex-Brooks <[email protected]> Signed-off-by: gkumbhat <[email protected]>

🐛 Remove explicit conversion of model to dtype

fc6e689

Co-authored-by: Alex-Brooks <[email protected]> Signed-off-by: gkumbhat <[email protected]>

🎨 Fix formatting

3498d3d

Co-authored-by: Alex-Brooks <[email protected]> Signed-off-by: gkumbhat <[email protected]>

🎨 Fix linting

0fbac9f

Signed-off-by: gkumbhat <[email protected]>

gkumbhat requested review from alex-jw-brooks, evaline-ju, gabe-l-hart and tharapalanivel as code owners November 30, 2023 16:08

gkumbhat and others added 2 commits November 30, 2023 10:15

🎨 Fix formatting

f051d83

Signed-off-by: gkumbhat <[email protected]>

🎨 Fix linting

4561ae1

Co-authored-by: Alex-Brooks <[email protected]> Signed-off-by: gkumbhat <[email protected]>

gkumbhat mentioned this pull request Dec 1, 2023

Add multigpu pt #288

Draft

Ssukriti requested changes Dec 3, 2023

View reviewed changes

caikit_nlp/modules/text_generation/peft_prompt_tuning.py Outdated Show resolved Hide resolved

Add accumulation steps back with new experimental results show noslow…

d77417c

…down Signed-off-by: [email protected] <[email protected]>

Ssukriti approved these changes Dec 5, 2023

View reviewed changes

Ssukriti merged commit bc595c6 into caikit:main Dec 5, 2023

gkumbhat deleted the migrate_hf_trainer branch December 5, 2023 22:44

Ssukriti restored the migrate_hf_trainer branch December 13, 2023 00:27

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Migrate hf trainer #287

Migrate hf trainer #287

Uh oh!

gkumbhat commented Nov 30, 2023

Uh oh!

Ssukriti left a comment

Uh oh!

Uh oh!

gkumbhat commented Dec 4, 2023

Uh oh!

Ssukriti left a comment

Uh oh!

Uh oh!

Migrate hf trainer #287

Migrate hf trainer #287

Uh oh!

Conversation

gkumbhat commented Nov 30, 2023

Changes

Uh oh!

Ssukriti left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

gkumbhat commented Dec 4, 2023

Uh oh!

Ssukriti left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!