Question: models that handle (or not) probabilistic labels

Hi, after reading your papers and the first of your tutorials, I'm still not so sure about which models can handle probabilistic labels and which cannot. Until now, I made the following distinction:

Standard machine learning algorithms (such as Random Forest, Gradient Boosting Machine, XGBoost, Logistic Regression, etc., the ones you can find in scikit-learn) cannot handle probabilistic labels and therefore we need to transform the output from predict_proba() method of the LabelModel into binary values.
Neural networks (from pytorch, keras/tensorflow, etc.) can handle probabilistic labels during training, and it's better to use them since we don't lose confidence or information about them.

Is this distinction correct or can it be integrated/improved?

Moreover, another question regarding this topic. If I need to discretize the predicted probabilities obtained by predict_proba() - for instance, I want to assign label 1 to the observations whose positive-class probability predicted by the LabelModel is larger than a threshold t - does it make sense to use a validation set with gold labels (distinct from the development and the test sets) and tune the threshold t in order to obtain the maximum accuracy/F1-score on this validation set, and then apply the optimized threshold to discretize the predicted probabilities of the unlabeled training set too?

I hope I've been clear in presenting my questions; in case I will edit them.

P.S. Great job with the Snorkel project, I find the applications very interesting and useful!

snorkel-team / snorkel

Question: models that handle (or not) probabilistic labels #1574