Closed davideferrari92 closed 3 years ago
Hi Davide - never seen this error before. Can you please send the dimension of the dataset and the range of values for X? Thanks.
Hi! Thank you for the very fast answer.
The dimension of my dataset is 963 rows x 121 columns. Overall in the dataset the minimum and maximum values that I can have are -1 and 726964.
They are almost all integer from 1 to 10 and few columns with larger number. All of them are as float64.
Davide
Davide Ferrari Graduate Research Fellow @ UNIMORE
Dipartimento Chirurgico, Medico, Odontoiatrico e di Scienze Morfologiche con interesse Trapiantologico, Oncologico e di Medicina Dipartimento di Scienze Matematiche, Fisiche e Informatiche
On 29 Oct 2020, at 21:11, Ahmed M. Alaa notifications@github.com wrote:
Hi Davide - never seen this error before. Can you please send the dimension of the dataset and the range of values for X? Thanks.
— You are receiving this because you authored the thread. Reply to this email directly, view it on GitHub https://github.com/ahmedmalaa/Symbolic-Metamodeling/issues/4#issuecomment-718993618, or unsubscribe https://github.com/notifications/unsubscribe-auth/AAQIFDNLBPIGOPPGA2CXTN3SNHD5NANCNFSM4TDNIGHQ.
I am not sure if that's the source of error but 726964 is pretty big and maybe causing some numerical problems. I recommend you normalize all columns to [0, 1] and if that still giving an error you may try fitting a subset of the columns first e.g. 10 features only. Also note that this method does not scale well for very high dimensions, with 121 feature you will get super long equations that are not really interpretable anymore.
That’s sounds good. I’ll try!
Thank you!
I’ll get to you if things still does not go well.
Have a nice day!!
Davide
On 30 Oct 2020, at 11:00, Ahmed M. Alaa notifications@github.com wrote:
I am not sure if that's the source of error but 726964 is pretty big and maybe causing some numerical problems. I recommend you normalize all columns to [0, 1] and if that still giving an error you may try fitting a subset of the columns first e.g. 10 features only. Also note that this method does not scale well for very high dimensions, with 121 feature you will get super long equations that are not really interpretable anymore.
— You are receiving this because you authored the thread. Reply to this email directly, view it on GitHub https://github.com/ahmedmalaa/Symbolic-Metamodeling/issues/4#issuecomment-719458582, or unsubscribe https://github.com/notifications/unsubscribe-auth/AAQIFDIAVTYLNF3NBIPHG4DSNKFCVANCNFSM4TDNIGHQ.
Hi! I've got this problem trying to build the metamodel on my dataset. This is a complete dataset (no NaN) and they are all float64 values, as shown here.
I leave here the trace of the error in which is shown that the ValueError is not explanatory on what is going on.
What do you think is happening?
Thank you very much!
Davide