Add weight_decay_filter and lars_adaptation_filter to LARS #1432
Comments
Hi, @turian - Thank you for creating the issue. Just to let you know, I have this on my list to take a look at, and I'll try to get back by this weekend. A bit occupied, apologies for the delay.
Hi, @turian - Thank you for giving the context, I went through the discussion on the PyTorch forum. I think it's fair to give an option to the user to disable this based on the condition (
@krshrimali I am not sure that I would be able to create a PR that covers all corner cases. :(
No worries at all! I will try to take a look. We are working towards a release tomorrow, so I will need some time, but I have added this to my list. Thank you again!!
@krshrimali Great! I am following this issue.
I'll try to pick this up over the coming weekend. 🤞🏻 Thanks for your patience, @turian 🚀
@krshrimali Thanks! And I am happy to help with code review if you tag me in the PR.
Thanks! I'll make sure to request your review :)
This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.
🚀 Feature
Add weight_decay_filter and lars_adaptation_filter to LARS

Motivation
Weight decay typically shouldn't be applied to BatchNorm parameters. See fast.ai and this PyTorch discussion thread.
The Facebook VICReg code has parameters weight_decay_filter and lars_adaptation_filter, which they set to True for any parameter that has ndim == 1.

Pitch
There should be a simple way to disable weight decay and LARS adaptation on ndim==1 parameters.
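To make the pitch concrete, here is a minimal sketch of how such a filter could partition parameters into optimizer param groups. This is an illustration, not the Lightning Flash API: the function names, the split_param_groups helper, and the lars_adaptation param-group key are all assumptions, and it follows the VICReg convention of excluding any parameter with ndim == 1 (biases and norm-layer weights).

```python
from types import SimpleNamespace

def exclude_bias_and_norm(p):
    # VICReg-style filter: 1-D parameters (biases, BatchNorm/LayerNorm
    # weights) skip both weight decay and LARS adaptation.
    return p.ndim == 1

def split_param_groups(params, weight_decay=1e-6):
    # Hypothetical helper: build two param groups, one with weight decay
    # and LARS adaptation enabled, one with both disabled.
    decay, no_decay = [], []
    for p in params:
        (no_decay if exclude_bias_and_norm(p) else decay).append(p)
    return [
        {"params": decay, "weight_decay": weight_decay, "lars_adaptation": True},
        {"params": no_decay, "weight_decay": 0.0, "lars_adaptation": False},
    ]

# Quick demo with stand-in parameters that only carry an ndim attribute;
# with torch, real nn.Parameter objects would work the same way.
params = [SimpleNamespace(ndim=2), SimpleNamespace(ndim=1), SimpleNamespace(ndim=4)]
groups = split_param_groups(params)
```

A LARS implementation would then check the group's flags inside its step, applying the trust-ratio scaling and weight decay only where they are enabled.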
Alternatives
Port the Facebook LARS code and use it instead of the Lightning Flash LARS code.