unicode-org / icu4x

Solving i18n for client-side and resource-constrained environments.
https://icu4x.unicode.org
Other
1.37k stars 176 forks source link

Parse IANA files into ICU4X data formats for TzdbDataProvider #1001

Open nordzilla opened 3 years ago

nordzilla commented 3 years ago

One of the formats we should be able to generate ICU4X TimeZone data from is the human-readable file format provided by the IANA TZDB

There are two ways we can go about this:

1) Build the zic C binary included in the IANA TZDB, and use it to generate TZif files. Then use our TZif capabilities from TODO(NUMBER) to generate the ICU4X data. 2) Write a custom parser to go directly from IANA files to ICU4X data.

This will then match the combined --tz-src-format iana and --tz-src-path options in the datagen crate (see TODO(Update datagen crate ... from file system))

This will also match the --tz-src-version options in the datagen crate.

For more context see

Depends on:

sffc commented 2 years ago

@nordzilla Please assign this to an appropriate 0.6 sprint milestone.

nekevss commented 4 months ago

@sffc Was there any plan to still implement this?

sffc commented 4 months ago

PR https://github.com/unicode-org/icu4x/pull/2913 has been open since December 2022 implementing some of this. I hope @nordzilla would pick it back up, but I believe he wouldn't mind some help to bring it over the finish line.

nekevss commented 4 months ago

I can definitely help out wherever needed on this if no one is picking it back up. Might need a guidance on the ideal direction though.

sffc commented 4 months ago

Ok let's talk about a plan at the next team meeting!

sffc commented 4 months ago

Notes from 2024-06-20