som-shahlab / ehr_ml

Code for doing machine learning with various EHRs
MIT License

Create a more standard training loop interface for pretraining #8

Open · jason-fries opened this issue 3 years ago

jason-fries commented 3 years ago

Currently, clmbr_train_model hides the more familiar training loop structure from users. In most demos and APIs, the boilerplate looks like what's outlined at https://github.com/PyTorchLightning/pytorch-lightning, with this structure:

import os
import pytorch_lightning as pl
from torch.utils.data import DataLoader, random_split
from torchvision import transforms
from torchvision.datasets import MNIST

dataset = MNIST(os.getcwd(), download=True, transform=transforms.ToTensor())
train, val = random_split(dataset, [55000, 5000])

autoencoder = LitAutoEncoder()  # LightningModule from the Lightning README example
trainer = pl.Trainer()
trainer.fit(autoencoder, DataLoader(train), DataLoader(val))

This is basically the standard form: specific details around the loss are configured in the model architecture, and the trainer class handles things like progress bars, choice of optimizer, etc.
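For reference, a minimal LitAutoEncoder roughly along the lines of the linked Lightning README (a sketch, not code from this repo) shows where the loss and optimizer live: inside the LightningModule rather than in the training loop.

import torch
from torch import nn
import pytorch_lightning as pl

class LitAutoEncoder(pl.LightningModule):
    def __init__(self):
        super().__init__()
        # encoder/decoder sizes match the 28x28 MNIST images used above
        self.encoder = nn.Sequential(nn.Linear(28 * 28, 64), nn.ReLU(), nn.Linear(64, 3))
        self.decoder = nn.Sequential(nn.Linear(3, 64), nn.ReLU(), nn.Linear(64, 28 * 28))

    def training_step(self, batch, batch_idx):
        # the loss is defined here, in the model, not in the outer loop
        x, _ = batch
        x = x.view(x.size(0), -1)
        x_hat = self.decoder(self.encoder(x))
        return nn.functional.mse_loss(x_hat, x)

    def configure_optimizers(self):
        # the optimizer choice is also part of the model definition
        return torch.optim.Adam(self.parameters(), lr=1e-3)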

What is the lift required to provide a demo and to refactor the code to support this type of workflow?

woffett commented 3 years ago

The refactor PR puts pre-training into this kind of API:

model = CLMBRFeaturizer(config, info)
dataset = PatientTimelineDataset(extract_path, ontology_path, info_path)
model.fit(dataset)  # runs CLMBR pretraining on the dataset

The original clmbr_train_model still works; it just uses this API. I'll leave this issue open until piton_private is updated to reflect these changes.
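A minimal sketch, assuming the CLMBRFeaturizer / PatientTimelineDataset API above and a hypothetical signature for the wrapper (not the actual ehr_ml code), of how the old entry point could sit on top of the new interface:

# Hypothetical sketch, not the actual ehr_ml implementation; imports for
# CLMBRFeaturizer and PatientTimelineDataset are omitted because the exact
# module paths depend on the refactor PR.
def clmbr_train_model(config, info, extract_path, ontology_path, info_path):
    # Build the model and dataset exactly as in the new fit-based API,
    # then run pretraining, keeping the old entry point as a thin wrapper.
    model = CLMBRFeaturizer(config, info)
    dataset = PatientTimelineDataset(extract_path, ontology_path, info_path)
    model.fit(dataset)
    return model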