added the ability to compute logits on the fly as well.

Enhance `compute_loss` Method for Dynamic Logit Generation

This PR enhances the compute_loss method in the DAMTrainer class to support dynamic generation of logits on the fly. The key changes include:

Dynamic Logit Generation: Introduced a new parameter generate_logits_on_fly to control whether logits should be generated dynamically during loss computation.
Conditional Logic: Added conditional logic to handle both scenarios:
- If generate_logits_on_fly is True, the method generates logits for each model in merged_model.num_models and computes the individual logit losses.
- If generate_logits_on_fly is False, it uses the precomputed topk_logits and topk_indices from the inputs.

These changes improve the flexibility of the compute_loss method, allowing it to adapt to different use cases and optimize performance based on the specific requirements of the training process.

arcee-ai / DAM

added the ability to compute logits on the fly as well. #18

Enhance `compute_loss` Method for Dynamic Logit Generation

arcee-ai / DAM

added the ability to compute logits on the fly as well. #18

Enhance compute_loss Method for Dynamic Logit Generation

Enhance `compute_loss` Method for Dynamic Logit Generation