switch default line_search_fn of L-BFGS to 'strong_wolfe'
'n_iter' in the L-BFGS of paddle and torch will be increased by one at the beginning. Therefore, the original code cannot achieve the expected effect of exiting after training converges because 'prev_n_iter' always equal to 'n_iter - 1' in that line of code. Fixed it in the PR (by the way, for backend torch).