Open peterdesmet opened 4 days ago
Looks fun to work on, I'm looking forward to it. I already explored data.table
for reading fixed with files (fwf) earlier this week: https://gist.github.com/PietrH/f13fb98f95b37242e59c92407fde1917
There is also readr::read_fwf()
, it offers more control but requires more setup.
I'm slightly tempted to explore arrow::open_dataset()
as the radar/year partitioning might actually come in handy.
There is both an FTP as well as a HTTP endpoint, for now I'll probably prefer using the HTTP endpoint.
arrow actually looks quite cool I need to look into that!
- Is the metadata header always the same length?
Do you mean: the same amount of rows? Not sure, but they should always start with #
, which can be ignored with readr:: read_fwf(comment = "#")
- Are the columns always the same width and order?
Yes
- Is the metadata header always the same length?
Do you mean: the same amount of rows? Not sure, but they should always start with
#
, which can be ignored withreadr:: read_fwf(comment = "#")
Sadly the header is also commented out, but if we are very certain the columns never change, this shouldn't be an issue.
Source
I suggest the value
rmi
for the parametersource
:rmi
. It is the only VPTS dataset by RMI. I think the alternative valuecrow
would be confusing as a name, since that is also used for the visualization.Scope
Metadata and context can be found here. The dataset covers 10 radars and has data since 2019. More data are added daily.
Data files
Data files are deposited at https://opendata.meteo.be/ftp/observations/radar/vbird/ and organized in radar and year directories. The file names are of the format
<radar>_vpts_<yyyymmdd>.txt
(e.g.behel_vpts_20191015.txt
)Data format
The data format is the default stdout of vol2bird, which is fixed width (example). If you write a parser for that format, I would call it vol2bird_vpts, not rmi_vpts. The CROW visualization has a minimal parser
The format unfortunately does not contain all columns of VPTS CSV. Below is a suggestion how it could be completed.
<date>T<time>00Z
< HGHT>
<u>
<v>
<w>
<ff>
<dd>
<sd_vvp>
<gap>
<eta>
<dens>
<dbz>
<DBZH>
<n>
<dbz>
<n_all>
<n_dbz_all>
11
, but it's not recorded and not a required term2
, but it's not recorded and not a required termradar_latitude
radar_latitude
radar_latitude
# polar volume input: /tmp/20191015143000.rad.behel.pvol.dbzh.scanz.pvol.h5