EiffL / sfh-inference-hackathon

Repository for SFH inference hackathon at AstroInfo 2021
MIT License
7 stars 2 forks source link

Training a model to predict time to last merger from kinetic data #19

Closed EiffL closed 2 years ago

Siouar commented 2 years ago

@ppfn mean and std from 100 samples : sigma: [18212.175615239412, 47.085832102830075, 29.16657788363947] mean: [167915.38, 1.2179685, inf] there are some issues with velocity dispersion NaN values found in velocity dispersion (replaced by 0) then Inf values,...

ppfn commented 2 years ago

Thanks @Siouar. For the NaN values, we have to decide to do: Replacing with zeros, Interpolating data from nearby pixels, exclude the affected FITS file. @mhuertascompany, any thought on that point ?

ppfn commented 2 years ago

Otherwise, I was able to train a model from a reduced dataset not affected by those NaN values. Here is a plot of the predicted loopback times (in Gy) as a function of the true loopback times: Screenshot 2021-12-08 at 23 47 39 This is promising but, again, it was just on a reduced part of the dataset...

Siouar commented 2 years ago

sigma: [18212.175615239412, 47.085832102830075, 29.16650153197798] mean: [167915.38, 1.2179685, 80.03517]

mhuertascompany commented 2 years ago

HI guys. For the NaN values, I do not know to be honest. Do we know how many images are affected by those? If the fraction is small I'd remove them from the training set. If it's large then, replacing by zeros is an option. Do we know how these NaN values are distributed in each galaxy? It is just one or two pixels? Or more than that?