Only home page data should be written to the YYYY_MM_DD tables (ie pages.2022_06_01_desktop). The new pipeline being developed in https://github.com/HTTPArchive/data-pipeline/pull/75 will handle writing home and secondary page data to the new all dataset.
Only home page data should be written to the YYYY_MM_DD tables (ie
pages.2022_06_01_desktop
). The new pipeline being developed in https://github.com/HTTPArchive/data-pipeline/pull/75 will handle writing home and secondary page data to the newall
dataset.