Open wanmeijianchi opened 2 years ago
Hi This is not an issue. You are over-stating the order of the mixture. The data are generated from a mixture of 3-component but you are setting the order to 4. Therefore, in the best-case scenario, one of the components will split and the estimated coefficients are going to be similar. In other scenarios, either dispersion, mixing proportion, or the coefficients are going to be very small. There are techniques to penalize the order of mixture models such as BIC, AIC, or modern ones which are based on the lasso.
When I use
fmrs.mle
to fit an FMR model, I find that sometimes there will be some groups with exactly the same estimates (coefficients, dispersion, and evenmixProp
). For example, here the data are generated byfmrs.gendata
with 3 components.Then I fit a FMR model
res.fmr0
usingfmrs.mle
,fmrs.tunsel
andfmrs.varsel
. The result shows that, the 1st and 2nd (the same for 3rd and 4th) compnent has exactly the same estimates of coefficients, dispersion, and evenmixProp
.I wonder why did this happen?
Does your algorithm have the function of adaptively selecting the number of groups? For example, there are 3 groups here in reality. When there are at most 4 groups (set
nComp = 4
, andnComp
is just an upper bound of the order of FMR), the number of groups estimated by your algorithm is 2?Or is it just because of the problem of unidentifiability, and has nothing to do with your algorithm implementation?
The result of
res.fmr0
isThe code for fitting FMR model is as following: