FedNLP: An Industry- and Research-Integrated Platform for Federated Learning in Natural Language Processing, Backed by FedML, Inc. The previous research version was accepted to NAACL 2022.
Some suggestions:
I suggest keeping each function within one screen length (fewer than 100 lines). In some companies this is a hard rule. Even in the NLP domain, projects like Hugging Face follow it well, and their code is readable to me.
I don't see any special training tricks, so the training loop can fit within one screen length.
When we want to reuse a piece of functionality, it is better to extract it into a function. For example: 1) the early-stopping code appears twice in the training loop, but the content is nearly the same; 2) defining the trainable parameters (the beginning of train()) can be shrunk into a function.
Code for many Transformer variants is merged through different branches. I suggest considering only the models used in FL; as a benchmark, two models will be enough.
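As a minimal sketch of the extraction suggestion above: the early-stopping check that repeats twice in the training loop could live in one small class. All names here (EarlyStopper, step) are illustrative, not FedNLP's actual API.

```python
# Hypothetical helper: the duplicated early-stopping logic becomes one
# reusable class, so the training loop only calls stopper.step(metric).

class EarlyStopper:
    """Tracks the best metric seen and signals when to stop training."""

    def __init__(self, patience: int = 3, min_delta: float = 0.0):
        self.patience = patience      # rounds without improvement to tolerate
        self.min_delta = min_delta    # minimum improvement that counts
        self.best = float("-inf")
        self.bad_rounds = 0

    def step(self, metric: float) -> bool:
        """Record one evaluation result; return True when training should stop."""
        if metric > self.best + self.min_delta:
            self.best = metric
            self.bad_rounds = 0
        else:
            self.bad_rounds += 1
        return self.bad_rounds >= self.patience
```

Inside the loop, both occurrences then collapse to `if stopper.step(eval_metric): break`, which keeps the loop itself within a screen length.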
Some issues:
This class also performs data loading inside the training and evaluation loops, which should be handled outside the trainer class in FL; otherwise performance may degrade and hidden issues may arise. Under the FedML framework, the design pattern is to finish each client's data loading before starting the training loop.
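A minimal sketch of the design pattern described above, with entirely hypothetical names (load_client_data, train_one_round are not FedML's real API): each client's data is fully loaded before the training loop starts, so the trainer never touches the data-loading code path.

```python
# Sketch: separate the data-loading phase from the training phase.

def load_client_data(client_ids):
    """Load every client's dataset up front (stub loader for illustration)."""
    return {cid: [(f"sample-{cid}-{i}", i % 2) for i in range(4)]
            for cid in client_ids}

def train_one_round(client_data, selected_clients):
    """The training loop only consumes pre-loaded data; no I/O happens here."""
    updates = {}
    for cid in selected_clients:
        batch = client_data[cid]      # already in memory, loaded before training
        updates[cid] = len(batch)     # stand-in for computing a local update
    return updates

client_data = load_client_data([0, 1, 2])        # done once, before training
updates = train_one_round(client_data, [0, 2])   # loop stays free of loading
```

Keeping loading out of the trainer also makes the per-round cost predictable, since no client can stall a round on disk or network I/O mid-training.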
I am trying to simplify the code.
(May update once I find more.)