hi, i do to training VRT model currently.
but reached to memory lack problem.
i used to NVIDIA RTX 2080 Ti which device memory amount is 11GB for training.
so i was adjusted batch-size from 8 to 4 and torch.utils.checkpoint used.
(use_checkpoint_attn: true, use_checkpoint_ffn: true, gt_size=256)
but arise to memory lack (cannot allocate memory).
hi, i do to training VRT model currently. but reached to memory lack problem.
i used to NVIDIA RTX 2080 Ti which device memory amount is 11GB for training. so i was adjusted batch-size from 8 to 4 and torch.utils.checkpoint used. (use_checkpoint_attn: true, use_checkpoint_ffn: true, gt_size=256)
but arise to memory lack (cannot allocate memory).
is it possible training using RTX 2080 Ti?