microsoft / TransformerCompression

For releasing code related to compression methods for transformers, accompanying our publications
MIT License
354 stars 31 forks source link

model inference #160

Open ChrisXULC opened 3 months ago

ChrisXULC commented 3 months ago

Hi, could you provide an example of how to perform inference using a sliced model?