
Commit 64f0faa

akoumpa and ko3n1g authored

[automodel] fallback FP8 + LCE -> FP8 + CE (#13349)

* fix

* make fp8 tests non-optional

* switch to gemma

---------

Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
Co-authored-by: oliver könig <okoenig@nvidia.com>

1 parent 28db904 · commit 64f0faa

File tree

3 files changed: +9 -3 lines changed

.github/workflows/cicd-main-automodel.yml

Lines changed: 0 additions & 2 deletions

```diff
@@ -84,7 +84,6 @@ jobs:
           script: L2_HF_Transformer_PEFT_2gpu_FSDP2_liger
         - runner: azure-gpu-vm-runner1-h100
           script: L2_HF_Transformer_PEFT_2gpu_FSDP2_fp8
-          is_optional: true
         - runner: self-hosted-azure
           script: L2_HF_Transformer_PEFT_2gpu_FSDP2
         - runner: self-hosted-azure
@@ -95,7 +94,6 @@ jobs:
           script: L2_HF_Transformer_SFT_2gpu_FSDP2
         - runner: azure-gpu-vm-runner1-h100
           script: L2_HF_Transformer_SFT_2gpu_FSDP2_fp8
-          is_optional: true
         - runner: self-hosted-azure
           script: L2_HF_Transformer_SFT_2gpu_nemorun
         - runner: self-hosted-azure
```

nemo/collections/llm/gpt/model/hf_auto_model_for_causal_lm.py

Lines changed: 8 additions & 0 deletions

```diff
@@ -265,6 +265,14 @@ def configure_model(self):
 
             te_accelerate(self.model, self.model_accelerator.fp8_autocast)
 
+        if self.use_linear_ce_loss:
+            # scan the model for fp8 layers, if found disable lce
+            for module in self.model.modules():
+                if hasattr(module, 'fp8'):
+                    logging.warning("LCE does not support FP8, switching to regular CE.")
+                    self.use_linear_ce_loss = False
+                    break
+
         if self.enable_grad_ckpt:
             if getattr(self.model, 'supports_gradient_checkpointing', False):
                 self.model.gradient_checkpointing_enable()
```
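The fallback above can be sketched as a standalone check. This is a minimal illustration, not the NeMo method: `Module` here is a hypothetical stand-in that models only the `modules()` traversal of `torch.nn.Module`, and `maybe_disable_lce` is an assumed helper name. The core pattern matches the commit: walk every submodule and look for an `fp8` attribute (which Transformer Engine layers carry), and if one is found, fall back from linear cross-entropy (LCE) to regular CE.

```python
import logging

class Module:
    """Hypothetical stand-in for torch.nn.Module; only modules() is modeled,
    which is all the fallback check needs."""
    def __init__(self, children=()):
        self._children = list(children)

    def modules(self):
        # Yield self and all descendants, like nn.Module.modules().
        yield self
        for child in self._children:
            yield from child.modules()

def maybe_disable_lce(model, use_linear_ce_loss):
    """Return the effective LCE flag: disabled if any submodule exposes
    an `fp8` attribute, mirroring the check added in this commit."""
    if not use_linear_ce_loss:
        return False
    for module in model.modules():
        if hasattr(module, 'fp8'):
            logging.warning("LCE does not support FP8, switching to regular CE.")
            return False
    return True

# Plain modules carry no `fp8` attribute, so LCE stays enabled.
plain = Module([Module(), Module()])
print(maybe_disable_lce(plain, True))   # True

# Attach an `fp8` attribute to simulate a Transformer Engine FP8 layer.
fp8_layer = Module()
fp8_layer.fp8 = True
print(maybe_disable_lce(Module([fp8_layer]), True))   # False
```

Scanning attributes rather than checking layer types keeps the check independent of Transformer Engine imports: any module that advertises FP8 state triggers the fallback, and the loop stops at the first match.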
Lines changed: 1 addition & 1 deletion

```diff
@@ -1,7 +1,7 @@
 export TRANSFORMERS_OFFLINE=1
 export HF_HOME=/home/TestData/automodel/hf_home
 coverage run -a --data-file=/workspace/.coverage --source=/workspace/nemo examples/llm/peft/automodel.py \
-    --model /home/TestData/akoumparouli/hf_mixtral_2l/ \
+    --model /home/TestData/akoumparouli/hf_gemma_38m/ \
     --max-steps 3 \
     --devices 2 \
     --strategy fsdp2 --fp8
```

Comments (0)
