arthurdouillard / incremental_learning.pytorch

A collection of incremental learning paper implementations including PODNet (ECCV20) and Ghost (CVPR-W21).
MIT License

Question on the basic settings #54

Closed: kcs6568 closed this issue 2 years ago

kcs6568 commented 2 years ago

Hello, I'm a beginner in continual learning. First of all, thank you for making this wonderful codebase available.

I have a question while looking at your code.

I was planning to study LwF. In your code, the variable "distillation_config" exists in the LwF class:

class LwM(IncrementalLearner):

    def __init__(self, args):
        self._device = args["device"][0]
        self._multiple_devices = args["device"]

        self._opt_name = args["optimizer"]
        self._lr = args["lr"]
        self._lr_decay = args["lr_decay"]
        self._weight_decay = args["weight_decay"]
        self._n_epochs = args["epochs"]
        self._scheduling = args["scheduling"]

        self._distillation_config = args["distillation_config"]
        self._attention_config = args.get("attention_config", {})

        logger.info("Initializing LwM")

        self._network = network.BasicNet(
            args["convnet"],
            convnet_kwargs=args.get("convnet_config", {}),
            classifier_kwargs=args.get("classifier_config", {
                "type": "fc",
                "use_bias": True
            }),
            device=self._device,
            gradcam_hook=True
        )

        self._n_classes = 0
        self._old_model = None

May I ask what the role of this variable is? Also, is it valid to train only the LwF model with your code?

arthurdouillard commented 2 years ago

This is a dictionary that is usually defined in an options file.

It is used here https://github.com/arthurdouillard/incremental_learning.pytorch/blob/master/inclearn/models/lwf.py#L143 and there https://github.com/arthurdouillard/incremental_learning.pytorch/blob/master/inclearn/models/lwf.py#L152 to set the temperature and the lambda weight of the distillation loss for LwF.
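Roughly, the distillation term looks like this (a minimal sketch of an LwF-style loss, not the repo's exact code; the key names `temperature` and `factor` are illustrative assumptions):

```python
import torch.nn.functional as F

def lwf_distillation(old_logits, new_logits, temperature=2.0, factor=1.0):
    """LwF-style knowledge distillation (sketch).

    `temperature` softens both distributions; `factor` is the lambda
    weight scaling the whole term. Both would come from
    distillation_config in this sketch.
    """
    old_probs = F.softmax(old_logits / temperature, dim=1)
    new_log_probs = F.log_softmax(new_logits / temperature, dim=1)
    # Cross-entropy of the new model against the old model's softened outputs.
    return factor * -(old_probs * new_log_probs).sum(dim=1).mean()
```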

And here https://github.com/arthurdouillard/incremental_learning.pytorch/blob/master/inclearn/models/lwm.py#L154 for LwM.
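For LwM, the extra ingredient is attention distillation on Grad-CAM maps (hence the `gradcam_hook=True` in the code you pasted). A rough sketch of that term, paraphrasing the LwM paper rather than this repo's exact implementation:

```python
import torch

def attention_distillation(old_attention, new_attention):
    """LwM-style attention distillation (sketch, per the paper).

    Compares L2-normalized Grad-CAM attention maps from the old and
    new models with an L1 penalty.
    """
    old_flat = old_attention.flatten(1)
    new_flat = new_attention.flatten(1)
    old_norm = old_flat / (old_flat.norm(dim=1, keepdim=True) + 1e-8)
    new_norm = new_flat / (new_flat.norm(dim=1, keepdim=True) + 1e-8)
    return torch.abs(old_norm - new_norm).sum(dim=1).mean()
```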

You can take inspiration from the other models' options files.
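For illustration, the parsed options might look roughly like this. The top-level keys mirror the constructor quoted above, but the inner keys of `distillation_config` and all the values are guesses; check the real options files in the repo for the exact schema:

```python
# Hypothetical parsed options dict (values and inner keys are assumptions).
args = {
    "device": [0],
    "optimizer": "sgd",
    "lr": 0.1,
    "lr_decay": 0.1,
    "weight_decay": 5e-4,
    "epochs": 70,
    "scheduling": [50, 60],
    "convnet": "rebuffi",
    "distillation_config": {"temperature": 2.0, "factor": 1.0},
    "attention_config": {},  # only relevant for LwM
}
```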

Beware, you are probably confusing LwF (what you wrote) with LwM (the code you pasted). These are two different models.

I think my implementation of LwF works OK, but I've never been able to make LwM work as well as in the original paper. To be honest, I'm not sure anyone has managed to, and the official code was never released, so...

Does that answer your question?