kevinyaobytedance / llm_unlearn

LLM Unlearning
MIT License

Is this the right way to compute KL divergence? #4

Open himalalps opened 7 months ago

himalalps commented 7 months ago

The code in utils.py that computes the KL divergence is as follows, but I think this may not actually be the KL divergence but rather the cross entropy.

https://github.com/kevinyaobytedance/llm_unlearn/blob/647f309519f91c29d87e62cf63d9a43759810040/utils.py#L199-L203

Why not directly use PyTorch KLDivLoss?
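For context, a minimal sketch of the relationship in question (pure Python, no PyTorch, with illustrative distributions `p` and `q` chosen here): KL(p‖q) equals the cross entropy H(p, q) minus the entropy H(p). When `p` comes from a frozen reference model, H(p) is a constant with respect to the trained model, so cross entropy and KL divergence have the same gradient even though their values differ.

```python
import math

def cross_entropy(p, q):
    # H(p, q) = -sum_i p_i * log q_i
    return -sum(pi * math.log(qi) for pi, qi in zip(p, q))

def entropy(p):
    # H(p) = -sum_i p_i * log p_i
    return -sum(pi * math.log(pi) for pi in p)

def kl_divergence(p, q):
    # KL(p || q) = H(p, q) - H(p); differs from cross entropy
    # only by the (q-independent) entropy of p
    return cross_entropy(p, q) - entropy(p)

# Illustrative distributions (not from the repo's code)
p = [0.7, 0.2, 0.1]
q = [0.5, 0.3, 0.2]

print(cross_entropy(p, q))
print(kl_divergence(p, q))
```

Note that `torch.nn.KLDivLoss` (and `torch.nn.functional.kl_div`) expects the *input* as log-probabilities and the *target* as probabilities by default, which is a common source of confusion when swapping it in.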