Comments (5)
Hi, --save saves a HuggingFace checkpoint of the sparse model in which the pruned weights are exactly 0. In principle, you should be able to use this with an appropriate finetuning script; however, if you want to keep the model sparse, you have to make sure that the exactly-zero weights remain zero. A simple way to accomplish this is to store the mask at the beginning (e.g. via p == 0 for each parameter in the saved checkpoint) and then zero out the corresponding weights directly after each gradient update (i.e., after each optimizer.step()).
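A minimal sketch of that recipe, assuming a standard PyTorch training loop (the model, dataloader, and optimizer names here are illustrative placeholders):

```python
import torch

def finetune_preserving_sparsity(model, dataloader, optimizer):
    # Record which weights are exactly zero in the saved sparse checkpoint.
    masks = {name: p.detach() == 0 for name, p in model.named_parameters()}

    for batch in dataloader:
        loss = model(**batch).loss
        loss.backward()
        optimizer.step()
        optimizer.zero_grad()
        # Re-zero the pruned positions so the sparsity pattern survives the update.
        with torch.no_grad():
            for name, p in model.named_parameters():
                p.masked_fill_(masks[name], 0.0)
```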
from sparsegpt.
Thanks for your reply!
Let me ask you one more question.
Below is the code that outputs the number of parameters for the dense and pruned models.
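(Something along these lines; a minimal sketch assuming both checkpoints load with Hugging Face transformers, with the checkpoint names/paths only illustrative.)

```python
import torch
from transformers import AutoModelForCausalLM

def count_params(model):
    total = sum(p.numel() for p in model.parameters())
    nonzero = sum(int(torch.count_nonzero(p)) for p in model.parameters())
    return total, nonzero

dense = AutoModelForCausalLM.from_pretrained("facebook/opt-125m")
pruned = AutoModelForCausalLM.from_pretrained("path/to/pruned-checkpoint")  # hypothetical path

print("dense :", count_params(dense))   # total == nonzero
print("pruned:", count_params(pruned))  # same total, ~50% nonzero at sparsity 0.5
```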
- If the number of parameters is exactly the same, is there no advantage in model size?
- When I set the sparsity to 0.5, 50% of all parameters are set to zero, but it seems those parameters are still involved in the multiply and add operations during the forward pass. So is there a computational cost benefit to the pruned model?
Thank you.
from sparsegpt.
Sparsity here is an unstructured pruning method, so it would not change the size of the model.
As for the second question, I am interested too. It seems the computational cost benefit is not optimal, but the model can be sped up with CUTLASS.
This may not be correct; looking forward to the author's reply.
from sparsegpt.
Maybe some optimized storage method could be used to save the sparse model? Otherwise, saving large models such as the 175B one will become a problem.
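For what it's worth, a minimal sketch of one such option using PyTorch's CSR sparse format (the matrix size is illustrative; this only affects on-disk/in-memory storage, not what sparsegpt itself saves):

```python
import torch

# A 50%-sparse weight stored densely vs. in CSR form.
w = torch.randn(4096, 4096)
w[torch.rand_like(w) < 0.5] = 0.0

w_csr = w.to_sparse_csr()

dense_bytes = w.numel() * w.element_size()
csr_bytes = sum(t.numel() * t.element_size()
                for t in (w_csr.values(), w_csr.col_indices(), w_csr.crow_indices()))
print(dense_bytes, csr_bytes)
# With fp32 values and int64 column indices, CSR is actually *larger* at 50% sparsity;
# a more compact index encoding (or higher sparsity) is needed before this pays off.
```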
from sparsegpt.
Speedup with CUTLASS is now available with PyTorch 2.1, but the storage issue is unlikely to be resolved until something else comes along.
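A rough sketch of that path, assuming PyTorch >= 2.1, an Ampere-or-newer GPU, and a weight that already follows the 2:4 pattern (the layer shape here is illustrative):

```python
import torch
from torch.sparse import to_sparse_semi_structured

# A linear layer whose fp16 weight satisfies 2:4 sparsity (2 zeros in every block of 4).
linear = torch.nn.Linear(4096, 4096, bias=False).half().cuda()
mask = torch.Tensor([0, 0, 1, 1]).tile((4096, 1024)).half().cuda()
with torch.no_grad():
    linear.weight.mul_(mask)

# Swap the dense weight for a semi-structured sparse tensor (CUTLASS-backed kernels).
linear.weight = torch.nn.Parameter(to_sparse_semi_structured(linear.weight))

x = torch.rand(64, 4096, dtype=torch.float16, device="cuda")
with torch.inference_mode():
    y = linear(x)
```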
Closing this issue.
from sparsegpt.
Related Issues (20)
- OOM:cannot download opt-30b, opt-66b
- How should I verify the speedup effect of the algorithm? HOT 4
- Purpose of this update
- Inference Speedup HOT 3
- Dependencies are wrong HOT 3
- Would sparsegpt be available for Llama2? HOT 3
- When would the code for GPT-J-6B be released?
- Adaptation for Pruning Conv2d or Conv3d Layers? HOT 1
- Can SparseGPT be used on BERT ?
- Using llama.py silently fails and occasionally causes system instability
- transformers version is not correct
- Mistral Support HOT 2
- how to use for Baichuan?
- 2:4 sparsity with to_sparse_semi_structured method from pytorch results in memory issue
- Why Hessian can get by activation ($H = XX^T$) ? HOT 4
- Why transpose the input when in case of nn.Linear or nn.Conv1d?
- AttributeError: 'NoneType' object has no attribute 'shape' HOT 7
- AWQ alongside sparsegpt
- why i can't reproduce the result of paper? HOT 7