I was just reading about the past flag, for caching state, and wondered whether it's possible to use that in CoreML?
Also, is this repo still active, or has CoreML 4 (and the new pytorch->CoreML conversion tools) kind of eclipsed the work being done here? I'm using your GPT2, trained on custom data, and would be curious about experimenting with XLNet, but activity here seems pretty quiet... ?
I was just reading about the
past
flag, for caching state, and wondered whether it's possible to use that in CoreML? Also, is this repo still active, or has CoreML 4 (and the new pytorch->CoreML conversion tools) kind of eclipsed the work being done here? I'm using your GPT2, trained on custom data, and would be curious about experimenting with XLNet, but activity here seems pretty quiet... ?