Closed artek0chumak closed 7 months ago
Fix allocation of the dummy key_value cache. It's not used in actual computations, but torch checkers require them to be on the correct device.
Fix allocation of the dummy key_value cache. It's not used in actual computations, but torch checkers require them to be on the correct device.