EIDOSLAB / simplify

Simplification of pruned models for accelerated inference | SoftwareX
https://doi.org/10.1016/j.softx.2021.100907
BSD 3-Clause "New" or "Revised" License

Are transformer layers supported? #6

Closed OriAlpha closed 2 years ago

OriAlpha commented 2 years ago

I am trying to pass a transformer model, but I run into an error while doing so:

simplified_model = simplify(model, dummy_input, fuse_bn=False)

I get an AssertionError. Are transformer layers supported at the moment?

AndreaBrg commented 2 years ago

Hi, could you please provide a minimal reproducible example? And, if possible, the whole error you encounter.

OriAlpha commented 2 years ago

I would be happy to create an example, which should help improve the library.

OriAlpha commented 2 years ago

Please refer to this example. I am using the gelectra model; you can get it from https://huggingface.co/deepset/gelectra-base/tree/main, or you can use a small version of BERT.

Steps: first, load the model:

from transformers import AutoTokenizer, AutoModelForSequenceClassification

tokenizer = AutoTokenizer.from_pretrained("./gelectra")
model = AutoModelForSequenceClassification.from_pretrained("./gelectra")

then you can call the PyTorch pruning utilities:

import torch as t
import torch.nn.utils.prune as prune

for name, module in model.named_modules():
    # prune 20% of connections in all embedding layers
    if isinstance(module, t.nn.Embedding):
        prune.l1_unstructured(module, name='weight', amount=0.2)
    # prune 10% of connections in all linear layers
    elif isinstance(module, t.nn.Linear):
        prune.l1_unstructured(module, name='weight', amount=0.1)
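As background on what that call does (a minimal sketch on a toy layer, not the gelectra model): `prune.l1_unstructured` does not delete anything, it reparameterizes the pruned tensor.

```python
import torch
import torch.nn.utils.prune as prune

lin = torch.nn.Linear(16, 8)
prune.l1_unstructured(lin, name='weight', amount=0.1)

# After pruning, the original values live in `weight_orig`, a binary
# `weight_mask` is registered, and `weight` is recomputed on the fly
# as weight_orig * weight_mask.
print(hasattr(lin, 'weight_orig'))   # True
print(hasattr(lin, 'weight_mask'))   # True
```

This is why the `prune.remove` loop below is needed: it folds the mask back in so `weight` becomes a plain tensor again.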

Then remove the pruning reparameterization so the zeroed weights become permanent:

for name, module in model.named_modules():
    if isinstance(module, t.nn.Embedding):
        prune.remove(module, 'weight')
    elif isinstance(module, t.nn.Linear):
        prune.remove(module, 'weight')

which results in zeroing some weights inside the model. Now comes the simplify step, and this is where loading the model fails. If you run into any issues, I can assist you.
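The zeroed weights can be verified with a quick sparsity check (again a sketch on a toy Linear layer, not the gelectra model):

```python
import torch
import torch.nn.utils.prune as prune

lin = torch.nn.Linear(16, 8)
prune.l1_unstructured(lin, name='weight', amount=0.1)
prune.remove(lin, 'weight')

# Fraction of exactly-zero entries in the (now plain) weight tensor;
# it should sit close to the requested pruning amount.
sparsity = float((lin.weight == 0).float().mean())
print(f"sparsity: {sparsity:.2f}")  # roughly 0.10
```

Note that this only zeroes entries; the tensors keep their original shapes, which is exactly the situation simplify is meant to exploit.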

AndreaBrg commented 2 years ago

Ok, thank you. We will get back to you asap.

AndreaBrg commented 2 years ago

@OriAlpha I'm not really familiar with NLP models; could you please give me an example of the dummy_input you are using?

OriAlpha commented 2 years ago

This is where it gets confusing: passing the data. Usually you can pass inputs as below:

input_ids = tokenizer("Studies have been shown that owning a dog is good for you", return_tensors="pt").input_ids  # Batch size 1
decoder_input_ids = tokenizer("Studies show that", return_tensors="pt").input_ids  # Batch size 1

out = model(input_ids)

simplified_model = simplify(model, input_ids)  # fails here

AndreaBrg commented 2 years ago

@OriAlpha Ok, so from what I could find there are a couple of problems with this model:

I currently wouldn't know how to solve these issues, but you are welcome to propose a PR in the meantime.

OriAlpha commented 2 years ago

Thanks for looking into this. While testing, I also came across this issue; you could try passing:

x = torch.LongTensor(1, 512).random_(0, 2^53)  # note: in Python, ^ is XOR, so this bound is 55, not 2**53
model(x)
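A side note on that upper bound (an observation, not from the original thread): in Python, `^` is bitwise XOR, not exponentiation, so `2^53` is a small integer rather than two to the 53rd power:

```python
# ^ is bitwise XOR; ** is exponentiation.
xor_result = 2 ^ 53    # 0b10 XOR 0b110101 == 0b110111
pow_result = 2 ** 53

print(xor_result)  # 55
print(pow_result)  # 9007199254740992
```

That XOR accident is arguably harmless here, since dummy token ids must stay below the model's vocabulary size anyway, and 55 does.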

I will look into this TorchScript issue.