Closed (ctroupin closed this 1 year ago)
Should we move this to https://github.com/gher-uliege/DINCAE.jl ?
Why does the input have to be written in the contiguous ragged array representation?
For altimetry: we have a vector of tracks (a vector of vectors), and when we split the data (training vs. test) we do not split the tracks. So it is good to keep the information about which data points belong to the same track.
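To illustrate splitting by track rather than by point, here is a minimal sketch in Python. It is not DINCAE code: the function name `split_by_track`, the `test_fraction` parameter and the toy data are all made up for illustration.

```python
import random

def split_by_track(tracks, test_fraction=0.2, seed=0):
    """Split a list of tracks into (train, test) without cutting any track.

    Hypothetical helper: whole tracks are assigned to one side or the other,
    so points of a given track never end up in both sets.
    """
    rng = random.Random(seed)
    indices = list(range(len(tracks)))
    rng.shuffle(indices)
    n_test = round(test_fraction * len(tracks))
    test_idx = set(indices[:n_test])
    train = [t for i, t in enumerate(tracks) if i not in test_idx]
    test = [t for i, t in enumerate(tracks) if i in test_idx]
    return train, test

# Toy data: five tracks of different lengths (vector of vectors).
tracks = [[0.1, 0.2, 0.3], [0.4], [0.5, 0.6], [0.7, 0.8, 0.9, 1.0], [1.1]]
train, test = split_by_track(tracks, test_fraction=0.4)
```

Every track appears in exactly one of `train` or `test`, which is only possible if the file records which points belong to which track.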
Maybe for the CPRs this is relevant too (where tracks are campaigns).
yes sure I wrote it to the wrong repository!!! should be DINCAE.jl
Is there an example of such a file, so I can use ncgen on it?
The docstring of DINCAE.reconstruct_points contains the output of ncgen of the altimetry test case.
https://github.com/gher-uliege/DINCAE.jl/blob/main/src/points.jl#L492-L508
And this info:
The file should contain the variables lon (longitude), lat (latitude), dtime (time of measurement) and id (numeric identifier, only used by post processing scripts) and dates (time instance of the gridded field). The file should be in the contiguous ragged array representation as specified by the CF convention allowing to group data points into "features" (e.g. tracks for altimetry). Every feature can also contain a single data point.
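As a rough illustration of what the contiguous ragged array representation means, the Python sketch below flattens a vector of tracks into one flat array plus a per-feature count, and then reverses it. The function names and the count name `row_size` are assumptions for this example (CF identifies the count variable by its `sample_dimension` attribute, not by its name), and this is not how DINCAE itself reads the file.

```python
def to_contiguous_ragged(features):
    """Flatten features into one flat list plus the number of points per feature."""
    flat = [value for feature in features for value in feature]
    row_size = [len(feature) for feature in features]
    return flat, row_size

def from_contiguous_ragged(flat, row_size):
    """Rebuild the list of features from the flat list and the per-feature counts."""
    features, start = [], 0
    for n in row_size:
        features.append(flat[start:start + n])
        start += n
    if start != len(flat):
        raise ValueError("counts do not add up to the number of data points")
    return features

# Toy longitudes for three tracks; the second feature holds a single point.
tracks = [[10.0, 10.5, 11.0], [20.0], [30.0, 30.5]]
lon, row_size = to_contiguous_ragged(tracks)
restored = from_contiguous_ragged(lon, row_size)
```

Here `lon` is the flat data variable and `row_size` plays the role of the count variable; lat, dtime and the data variable would be flattened in the same order.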
Should we add something to make it clearer? The data variable is called sla in my case, but this can be adapted (e.g. Calanus_Finmarchicus, in this case the varname parameter of DINCAE.reconstruct_points is "Calanus_Finmarchicus").
ok thanks! I'd seen the docstring but wasn't sure about the ragged array format, now that's clear.
Instead of track I will use the sampleID, which seems to indicate the cruise tracks.
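One way to turn point observations tagged with a sampleID into features could look like the Python sketch below. It assumes each observation carries one sampleID value; the function name `group_by_sample_id` and the toy values are hypothetical, and the real CPR columns may differ. The points are reordered so that each sampleID is contiguous, which the contiguous ragged layout requires, and then counted per feature.

```python
def group_by_sample_id(sample_ids, *columns):
    """Reorder columns so equal sample IDs are adjacent and count points per ID."""
    # sorted() is stable, so the original order is kept within each sample ID.
    order = sorted(range(len(sample_ids)), key=lambda i: sample_ids[i])
    sorted_ids = [sample_ids[i] for i in order]
    sorted_columns = [[col[i] for i in order] for col in columns]
    row_size = []
    previous = None
    for sid in sorted_ids:
        if row_size and sid == previous:
            row_size[-1] += 1   # same feature as the previous point
        else:
            row_size.append(1)  # first point of a new feature
        previous = sid
    return sorted_ids, sorted_columns, row_size

# Toy observations: sample IDs arrive interleaved.
sample_ids = ["B", "A", "B", "C", "A"]
lon = [1.0, 2.0, 3.0, 4.0, 5.0]
sorted_ids, (lon_sorted,), row_size = group_by_sample_id(sample_ids, lon)
```

The resulting `row_size` would be the per-feature count, with one feature per sampleID.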
Describe the bug
Trying to run DINCAE on CPR observations.
Stacktrace
So the problem arises at the reading step, in the function DINCAE.loaddata(filename,varname): https://github.com/gher-uliege/DINCAE.jl/blob/main/src/points.jl#L440-L457, which uses loadragged.
Question
Why does the input have to be written in the contiguous ragged array representation?
To Reproduce
Environment
Input file