Open luozhouyang opened 4 years ago
We need a more flexible and powerful data pipeline when training on very large corpus.
Use tf.data API to build the high performance and flexible data pipeline.
tf.data
Do you mean the tf.data can not handle large-scale dataset? Did you try the MatchZoo-py version?
Is your feature request related to a problem? Please describe.
We need a more flexible and powerful data pipeline when training on very large corpus.
Describe the solution you'd like
Use
tf.data
API to build the high performance and flexible data pipeline.