Add support for Paged Optimizers (Adam, Adamw), 8-bit optimizers, and new optimizers: LARS, LAMB and LION #3588

Merged: 30 commits merged into master from paged_adamw on Sep 8, 2023

Conversation

arnavgarg1 (Contributor) commented Sep 6, 2023

This PR adds support for a variety of new optimizers, including paged and 8-bit variants, all of which are useful for fine-tuning. I will add better descriptions for the parameters in a follow-up PR once I better understand some of the underlying papers for LAMB, LARS, and LION. I will also add tests once PR #3578 lands.

What are paged optimizers?

Paged optimizers use NVIDIA's unified memory feature, which performs automatic page-to-page transfers between the CPU and GPU so that GPU processing can continue without errors when the GPU occasionally runs out of memory. The feature works like regular memory paging between CPU RAM and disk: the optimizer states are allocated in paged memory, automatically evicted to CPU RAM when the GPU runs out of memory, and paged back into GPU memory when they are needed for the optimizer update step.
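
For reference, here's a minimal sketch (not Ludwig code) of how a paged optimizer from bitsandbytes can stand in for a regular PyTorch optimizer. It assumes a CUDA GPU and a recent bitsandbytes release (0.41+) where PagedAdamW is available:

import torch
import bitsandbytes as bnb

model = torch.nn.Linear(1024, 1024).cuda()

# Optimizer states live in paged (unified) memory: they can be evicted to
# CPU RAM if the GPU runs out of memory and are paged back in when the
# update step needs them.
optimizer = bnb.optim.PagedAdamW(model.parameters(), lr=1e-4)

x = torch.randn(8, 1024, device="cuda")
loss = model(x).pow(2).mean()
loss.backward()
optimizer.step()  # any paging happens transparently here
optimizer.zero_grad()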


What new optimizers are being added to Ludwig?

Here's a summary of the new optimizers that will be added with this PR:

  • SGD: 8-bit support
  • Adam: 8-bit, Paged Adam, Paged Adam 8-bit
  • AdamW: 8-bit, Paged AdamW, Paged AdamW 8-bit
  • Adagrad: 8-bit
  • RMSProp: 8-bit
  • LAMB, LAMB 8-bit
  • LARS, LARS 8-bit
  • LION, LION 8-bit, Paged LION, Paged LION 8-bit

Here's an example of how to configure each of these variants:

Regular AdamW

trainer:
  optimizer:
    type: adamw

8-Bit AdamW

trainer:
  optimizer:
    type: adamw_8bit

Paged AdamW

trainer:
  optimizer:
    type: paged_adamw

Paged AdamW 8-bit

trainer:
  optimizer:
    type: paged_adamw_8bit

All of this is made possible through a deeper integration with the bitsandbytes library. Note that the 8-bit and paged optimizers only work on GPU machines, and they are not compatible with DeepSpeed.
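
As a rough illustration of what that integration looks like, here's a hypothetical sketch that maps config type strings to bitsandbytes optimizer classes. The class names come from bitsandbytes (0.41+); the registry and build_optimizer helper below are illustrative only and are not the actual code in ludwig/schema/optimizers.py:

import bitsandbytes as bnb

# Hypothetical mapping from Ludwig config `type` strings to bitsandbytes
# optimizer classes; the real registry may differ.
BNB_OPTIMIZERS = {
    "adamw_8bit": bnb.optim.AdamW8bit,
    "paged_adamw": bnb.optim.PagedAdamW,
    "paged_adamw_8bit": bnb.optim.PagedAdamW8bit,
    "lamb": bnb.optim.LAMB,
    "lars": bnb.optim.LARS,
    "lion_8bit": bnb.optim.Lion8bit,
}

def build_optimizer(type_name, params, **kwargs):
    # 8-bit and paged variants require a CUDA device and do not work with
    # DeepSpeed, per the note above.
    return BNB_OPTIMIZERS[type_name](params, **kwargs)

# Example: build_optimizer("paged_adamw_8bit", model.parameters(), lr=1e-4)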

github-actions bot commented Sep 6, 2023

Unit Test Results

6 files ±0, 6 suites ±0, duration 1h 40m 56s (+16m 20s)
2,830 tests +3: 2,794 passed +4, 12 skipped ±0, 24 failed −1
2,873 runs +3: 2,828 passed +4, 21 skipped ±0, 24 failed −1

For more details on these failures, see this check.

Results for commit 0158d03. ± Comparison against base commit 3d2ff0b.

♻️ This comment has been updated with latest results.

@arnavgarg1 arnavgarg1 marked this pull request as ready for review September 6, 2023 18:07
@arnavgarg1 arnavgarg1 changed the title from "[WIP] Add support for Paged Optimizers (Adam, Adamw), 8-bit optimizers, and new optimizers: LARS, LAMB and LION" to "Add support for Paged Optimizers (Adam, Adamw), 8-bit optimizers, and new optimizers: LARS, LAMB and LION" Sep 6, 2023
Review comments on ludwig/config_validation/checks.py (outdated, resolved)
Review comments on ludwig/schema/optimizers.py (resolved)
@arnavgarg1 arnavgarg1 requested a review from tgaddair September 6, 2023 20:47
Review comments on ludwig/models/llm.py (outdated, resolved)
@arnavgarg1 arnavgarg1 merged commit 6f9ed8f into master Sep 8, 2023
@arnavgarg1 arnavgarg1 deleted the paged_adamw branch September 8, 2023 17:36