
Possibility to sparsify convolutional layers (torch.nn.Conv2d)? #2

Open
SunHaozhe opened this issue Jan 19, 2023 · 7 comments

@SunHaozhe

I would like to turn an existing convolutional neural network (ResNet) into a sparse model. This model contains torch.nn.Conv2d but not torch.nn.Linear. Does sten support this operation?

I checked the tutorial notebook examples/modify_existing.ipynb. It seems that sten can only sparsify torch.nn.Linear and cannot sparsify torch.nn.Conv2d. Is that the case?

@and-ivanov
Contributor

Sten can help to sparsify any operator, but you need to provide a custom implementation for it, as shown in examples/custom_implementations.ipynb. The choice of implementation depends on the sparse format you want to use and the target architecture (GPU or CPU).

@SunHaozhe
Author

SunHaozhe commented Jan 20, 2023

Since torch.nn.Conv2d is a standard layer/operator in modern neural networks, does sten plan to officially include a sparsified version of torch.nn.Conv2d?

I see that the actual implementation depends on the choice of sparse format and architecture (GPU or CPU), but I believe providing a default implementation of a sparsified torch.nn.Conv2d would make sten more impactful. Users may just want to test the effect of sparsifying torch.nn.Conv2d on ordinary CPUs and/or GPUs.

@and-ivanov
Contributor

We currently propose to use sparse convolution, implemented through multiplication by a dense matrix. The last time I checked the available libraries, there were no implementations of sparse convolution that offered a significant performance improvement over dense convolution. If you can suggest any libraries, we can add support for them.

@SunHaozhe
Author

SunHaozhe commented Jan 22, 2023

Sorry, could you please clarify what you mean by "sparse convolution, implemented through multiplication by a dense matrix"? Is there any existing implementation of what you are describing here?

@and-ivanov
Contributor

I mean: first do an element-wise multiplication of the input tensor and/or the filter tensor by a mask tensor of zeros and ones, then use a dense convolution as usual.
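For illustration, here is a minimal pure-PyTorch sketch of that approach (not taken from sten; the 90% magnitude-based mask is just an arbitrary example choice):

```python
# Minimal sketch of the approach described above (not part of sten):
# element-wise masking of the filter tensor followed by an ordinary dense convolution.
import torch
import torch.nn.functional as F

conv = torch.nn.Conv2d(3, 16, kernel_size=3, padding=1)
x = torch.randn(1, 3, 32, 32)

# Build a 0/1 mask by zeroing the 90% smallest-magnitude weights.
w = conv.weight.detach()
threshold = w.abs().flatten().quantile(0.9)
mask = (w.abs() >= threshold).to(w.dtype)

# "Sparse" convolution: mask the weights, then run dense conv2d as usual.
y = F.conv2d(x, conv.weight * mask, conv.bias,
             stride=conv.stride, padding=conv.padding,
             dilation=conv.dilation, groups=conv.groups)
```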

@SunHaozhe
Author

SunHaozhe commented Jan 23, 2023

> I mean: first do an element-wise multiplication of the input tensor and/or the filter tensor by a mask tensor of zeros and ones, then use a dense convolution as usual.

On what kinds of hardware would this implementation provide real speedup (in your opinion)?

@and-ivanov
Contributor

This implementation is not supposed to give any speedup; quite the opposite, it will be slower than the non-sparse version. However, it may still be useful, for example to evaluate how much accuracy is preserved after sparsifying the model.
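As a rough sketch of that use case, one could apply this masking to every Conv2d layer of an existing network such as torchvision's resnet18 and then run the usual evaluation; the 90% sparsity level and magnitude-based masking below are illustrative assumptions, not something prescribed by sten:

```python
# Sketch: zero out the smallest-magnitude weights of every Conv2d layer in place,
# then evaluate the masked model to see how much accuracy is preserved.
# Requires a recent torchvision for the `weights=` argument.
import torch
import torchvision

model = torchvision.models.resnet18(weights="IMAGENET1K_V1")

with torch.no_grad():
    for module in model.modules():
        if isinstance(module, torch.nn.Conv2d):
            w = module.weight
            threshold = w.abs().flatten().quantile(0.9)
            w.mul_((w.abs() >= threshold).to(w.dtype))  # keep only the top 10% of weights

model.eval()
# ... run the usual validation loop here to measure the accuracy drop.
```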
