Closed drphilmarshall closed 10 years ago
The difference between the two corners is whether I plot in log space or not.
These are with the training enabled. I should make some plots with that factor disabled.
I'm not sure what to make of the floating central cluster for the stage 2 probabilities. They seem to be points the online system is not totally set on, but the offline system is pretty confident about. I think the broad distribution at low P somewhat reflects the fact that the offline system doesn't clip at low P values.
NB. All the axes are showing the subject P values, here.
Did you check in an offline catalog into projects/CFHTLS? We are set up to do our own "expert" inspection but we need a catalog for input. Thanks!
On Tue, Aug 5, 2014 at 7:25 PM, cpadavis notifications@github.com wrote:
The difference between the two corners is whether I plot in log space or not.
These are with the training enabled. I should make some plots with that factor disabled.
I'm not sure what to make of the floating central cluster for the stage 2 probabilities. They seem to be points the online system is not totally set on, but the offline system is pretty confident about. I think the broad distribution at low P somewhat reflects the fact that the offline system doesn't clip at low P values.
— Reply to this email directly or view it on GitHub https://github.com/drphilmarshall/SpaceWarps/issues/57#issuecomment-51286780 .
what's the Prejection for offline?
I don't think there is a rejection threshold in P in the offline version... To emulate retirement we'd have to run the offline analysis every week's worth of classifications or so. This would be interesting to do with our current stage 1 data, but maybe not necessary now. for paper 1, we should focus on understanding stage 2, I think.
On Wed, Aug 6, 2014 at 7:05 PM, anupreeta27 notifications@github.com wrote:
what's the Prejection for offline?
— Reply to this email directly or view it on GitHub https://github.com/drphilmarshall/SpaceWarps/issues/57#issuecomment-51422379 .
Quick update on this one: Anu's pulling out the P_offline values for the things we have already inspected, and also the offline sample ranked by P_offline. I am optimistic of a better correlation between P and expert grade! We'll see.
Poffline and Ponline vs. expert grades: no significant difference found. P values from both swap runs do not show any obvious correlation with expert grades of the known+new lens candidates
Poffline vs Ponline: Subjects with lens candidates (known+new from onl.) get systematically higher P offl. values, roughly these subjects have P>0.4
Here are some plots sure to generate some discussion between us:
(I also did it for stage 1)
stage2:
stage1: