Open TalWac opened 2 years ago
Dear developer,
thank you for the package and the examples code!
Wanted to ask what are the meaning of the ts
attribute.
in the code written
ts dict: length R. Each element is a nXK matrix : the scores per axis per block
If R in the number of principal components,
n number of data points\ observations and K is the number of blocks in the X matrix.
you used norm of ts
in the code above and I do not understand what it represent (mod.model.ts[0]
).
Kindly your help
Dear phscha, A new version of py_ddspls has bean released (version 1.1.1) taking into account deflation and Ridge regularization. Thanks to this new version, I can show you those figures:
One the first figure you can see the RMSEP (
kfolds=10
) forR=1
(blue),R=2
(red) andR=3
(green). You can notice that errors are similar for \lambda>~0.67 for the 3 solutions, while error is largely higher forR=1
as soon as \lambda<~0.67 and also higher forR=2
as soon as \lambda<~0.45. One the second figure are given the norms of the 1st (black) 2nd (green) and 3rd (purple) component for the model withR=3
. You can see that \lambda~0.67 corresponds to the annulation of the 2nd component while \lambda~0.45 to the annulation of the 3rd component.According to that analysis it would be difficult to find uncorrelated components (2 components) describing that Y response better than a model with a single component unless you accept variables correlated with less than ~0.67 with the response. This corresponds to:
Thank you for your help PS: This issue has allowed me to fix a bug on the R package. Thank you for this too. PPS: Here is the code used (with version ddspls >1.1.1) :
_Originally posted by @hlorenzo in https://github.com/hlorenzo/py_ddspls/issues/1#issuecomment-600247211_