malucalle / selbal

selbal: selection of balances for microbial signatures
32 stars 15 forks source link

accuracy per prediction group & groups of numerator and demominator #13

Closed L-Boonstra closed 4 years ago

L-Boonstra commented 4 years ago

Hello, I have a question regarding the output given by the Selbal output.

I would like to know what percentage of each group got predicted correctly besides the total AUC the accuracy plot shows. Besides that, I would like to know which group gets selected as the numerator and the denominator. Does Selbal store this information in its output?

Thank you in advance for your help.

UVic-omics commented 4 years ago

Hi @L-Boonstra ,

First of all thank you for using selbal.

First of all, let me tell you that you can see more information about the output of the functions here (https://htmlpreview.github.io/?https://github.com/UVic-omics/selbal/blob/master/vignettes/vignette.html). There you can read that:

1) You can get the AUC values through $cv.tab 2) The plots show you which are the variables included in the numerator and the numerator of the balance. Nevertheless, if you change them, remember that the AUC will be the same and the only thing that changes is the sign of the balance.

L-Boonstra commented 4 years ago

Hello, thank you very much for your answer and the information, but it wasn’t an answer to the question I tried to ask, so to explain it a bit better this time:

With CV.BAL.dic$accuracy.nvar I can view the accuracies for the number of variables used in the balances but this is only one value per number and with cv.tab I can see the frequencies of the balances and the percentage of the features.

If I for example have 100 samples in total of which 50 are case samples and 50 are control samples and I’m trying to predict whether a samples is case or control, then I can with accuracy.nvar see that x percent of those 100 samples are predicted correctly when using a balances of y variables. I would besides that like the know the AUC per group I’m trying to predict so x percent of the 50 case and x percent of the 50 control. That way I can see if one group is getting predicted a lot better than the other or if the accuracies are similar. Does Selbal store the AUC’s per group (case vs control) in the output?

Then the var.barplot shows which features are included in the numerator and the denominator, but does the output of Selbal say anywhere if the numerator is the case group or the control group?

UVic-omics commented 4 years ago

Hello!

Ok, this time I understood better your questions:

L-Boonstra commented 4 years ago

Thank you for answering