Open keertanavc opened 5 years ago
Very interesting paper! I know that ANN models can be trained in parallel using data parallelism. In that setting, are there any challenges to implementing this context-dependent gating method?