airbnb / chronon

Chronon is a data platform for serving for AI/ML applications.
Apache License 2.0
673 stars 36 forks source link

Documentation on how timestamps are used in Chronon #749

Open kambstreat opened 2 months ago

kambstreat commented 2 months ago

I tried exploring chronon for the past one week with one of the production usecase. Faced few issues in understanding the following

  1. How ts_ds is used in joins ? Looks like this field is created implicitly.
  2. There are many places where date argument is used. Confused on how to use different date args.
  1. Which partition does backfill starts filling. May be documenting different scenarios might be helpful.

Example scenario. I have a groupBy with backfill_start_date : 2024-01-01 and I gave run.py --ds=2024-05-01 . If I have groupBy output for 2024-03-31 (but do not have 2024-01-01 to 2024-03-30) . From where my backfill will start populating partitions.