Parallelizing non-linear sequential models over the sequence length
BSD 3-Clause "New" or "Revised" License
40 stars, 1 fork
Set an option for deer iteration or Newton's method to return the best error in case of no convergence #32
Open
mfkasim1 opened 2 months ago