tensorflow / models

Models and examples built with TensorFlow
Other
77.16k stars 45.77k forks source link

Hassle free own data input to Transformer #5976

Closed abhishek-niranjan closed 5 years ago

abhishek-niranjan commented 5 years ago

System information

Describe the problem

Is there a hassle free way to input your own data to transformer model, say train.src and train.tgt without taking the code-bits from data_download.py and modifying to make it work?

hichiaty commented 5 years ago

If you use the tensor2tensor version, you can create a very simple script to register your "problem" in almost no time as described in the docs. https://github.com/tensorflow/tensor2tensor/blob/master/docs/new_problem.md

abhishek-niranjan commented 5 years ago

@hichiaty , actually I want to modify the architecture a little and prefer to work on this version as I feel it's fairly easy to make changes here. Besides, I believe that this implementation should also have an easier way if you want to train the model on your own data.

abhishek-niranjan commented 5 years ago

@hichiaty , actually I want to modify the architecture a little and prefer to work on this version as I feel it's fairly easy to make changes here. Besides, I believe that this implementation should also have an easier way if you want to train the model on your own data.

abhishek-niranjan commented 5 years ago

UPDATE: I've fixed a script from the pieces of data_download.py to be able to run the model on our own data. Closing the issue.