microsoft / unilm

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
https://aka.ms/GeneralAI
MIT License
19.62k stars 2.5k forks source link

funsd_preprocess.py does not exist #212

Open elnazsn1988 opened 4 years ago

elnazsn1988 commented 4 years ago

Hi - Am trying to debug a few issues I am facing here, upon detailed examination it seems that this part of the readme does not exist anymore, i.e. I cannot find the 'scripts' folder anywhere inside layoutlm to find the funsd_preprocess.py:

First, we need to preprocess the JSON file into txt. You can run the preprocessing scripts funsd_preprocess.py in the scripts directory. For more options, please refer to the arguments.

elnazsn1988 commented 4 years ago

image

r000bin commented 4 years ago

Guess they renamed funsd_preprocess.py to preprocess.py and stored it here: https://github.com/microsoft/unilm/tree/master/layoutlm/examples/seq_labeling

I'm currently working with the code in preprocess.py

elnazsn1988 commented 4 years ago

@r000bin thanks for the info, to anyone else facing issue - I had previously forked old version, this version works for me: https://github.com/elnazsn1988/unilm/tree/master/layoutlm , however not validated full end-to end yet.