mlexchange / mlex_dlsia_segmentation_prototype


Refactor dataloaders #18

Closed TibbersHao closed 8 months ago

TibbersHao commented 8 months ago

This PR changes where qlty cropping and stitching happen during training and inference.

To specify:

  1. During training: all labeled frames and their corresponding masks are loaded into memory and cropped into patches by qlty. batch_size_train now controls how many patches are loaded onto the device per batch. Previously this parameter controlled the number of frames loaded per batch, which was likely to run into memory issues when big images produce large numbers of patches.

  2. During inference: grab a single frame at a time, crop it into patches, then use batch_size_inference to control how many patches are predicted per batch. Previously this parameter controlled how many images were passed to the device per batch, which caused the same issue described above.

  3. Added data standardization for input images.
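The patch-based batching described in points 1 and 2 can be sketched as follows. This is a minimal illustration, not the project's actual qlty-based implementation: crop_to_patches and batches are hypothetical helper names, and qlty's real API may differ (NumPy is used here in place of torch tensors for brevity).

```python
import numpy as np

def crop_to_patches(frames, patch, step):
    """Crop a stack of frames (N, Y, X) into square patches (P, patch, patch).

    A simplified stand-in for the qlty cropping step: slide a window of
    size `patch` over each frame with stride `step`.
    """
    patches = []
    n, ny, nx = frames.shape
    for i in range(n):
        for y in range(0, ny - patch + 1, step):
            for x in range(0, nx - patch + 1, step):
                patches.append(frames[i, y:y + patch, x:x + patch])
    return np.stack(patches)

def batches(patches, batch_size):
    """Yield patches in chunks of `batch_size`.

    This is what batch_size_train / batch_size_inference now control:
    the number of *patches* per device batch, not the number of frames.
    """
    for start in range(0, len(patches), batch_size):
        yield patches[start:start + batch_size]

# Two 8x8 frames cropped into 4x4 patches -> 4 patches per frame, 8 total.
patches = crop_to_patches(np.zeros((2, 8, 8)), patch=4, step=4)
sizes = [len(b) for b in batches(patches, batch_size=3)]  # batches of 3, 3, 2
```

Because batching happens after cropping, the memory cost per batch is bounded by the patch size rather than the full image size.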

phzwart commented 8 months ago

    + if images.ndim == 3:
    +     images = images.unsqueeze(1)

This looks a bit suspicious. Could you explain when this case occurs, and why we are adding a dimension at axis=1 rather than 0?

I think that if you pull in N images at a time, you get an (N, Y, X) array, so you need to unsqueeze at dim 1 to get (N, 1, Y, X).
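The axis choice can be illustrated concretely. This is a sketch using NumPy's expand_dims, which behaves like torch's unsqueeze for this purpose; the shapes are made-up example values.

```python
import numpy as np

# A stack of N single-channel frames arrives as (N, Y, X).
images = np.zeros((5, 64, 32))

# Conv layers expect (N, C, Y, X), so the missing channel axis must be
# inserted at position 1; images.unsqueeze(1) in torch corresponds to
# np.expand_dims(images, 1) here.
batched = np.expand_dims(images, 1)  # shape (5, 1, 64, 32)

# Inserting at axis 0 instead would give (1, 5, 64, 32): a single
# "image" with 5 channels, which is not what a per-frame model expects.
wrong = np.expand_dims(images, 0)
```

So the `ndim == 3` branch fires whenever a batch of grayscale frames is loaded without an explicit channel axis, and unsqueezing at dim 1 restores the expected (N, 1, Y, X) layout.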