studio-ousia / luke

LUKE -- Language Understanding with Knowledge-based Embeddings
Apache License 2.0
705 stars 102 forks source link

How to use LUKE for long input text? #178

Open anushkasw opened 1 year ago

anushkasw commented 1 year ago

Hello I am trying to finetune LUKE on the DOCRED dataset, which is a document-level dataset for relation extraction. I see that the max length of the sequence with LUKE can be 512 tokens. Is there any other workaround other than truncation of the sample for this problem?