intel-analytics / analytics-zoo

Distributed Tensorflow, Keras and PyTorch on Apache Spark/Flink & Ray
https://analytics-zoo.readthedocs.io/
Apache License 2.0
16 stars 3 forks source link

Pytorch next steps #1027

Open hhbyyh opened 5 years ago

hhbyyh commented 5 years ago

Use this issue as a reminder to track the Pytorch-related work items

  1. Support Table/multiple input and output models. [done]

  2. JNI memory optimization. Avoid mem leak and unnecessary copy.

  3. Scalability test for training and inference

  4. Support export to Pytorch model.

  5. Utility to convert data loader to RDD/DataFrame

  6. RNN/LSTM training.

  7. Confirm tracing batch normalization works for training.

hhbyyh commented 5 years ago
  1. Support transfer learning with Pytorch model, i.e. lock weights etc.
hhbyyh commented 5 years ago
  1. Investigate the performance impact of short cut the backward process from loss to model.
hhbyyh commented 5 years ago
  1. Docs
  2. Better exception handling and error messages.