dfo-pacific-science / data-stewardship-unit

A quarto project that generates that DSU website via Github actions
https://dfo-pacific-science.github.io/data-stewardship-unit/
0 stars 0 forks source link

Turn FOS Dashboard Proof of Concept into MVP #56

Open Br-Johnson opened 2 months ago

Br-Johnson commented 2 months ago

Required:

Desirable:

Br-Johnson commented 2 months ago

@AnnaD-M I think we should delay starting this next phase by two weeks so that you help contribute to some of the broader regional coordination, especially regarding establishing the Pacific Salmon Data Community of Practice.

AnnaD-M commented 1 month ago

Lakehouse shortcuts @alburycatalina Background:

Impacts: we would just have to set the refresh schedules on the artifacts downstream of the shortcut. Sherry/Peter would be responsible for setting the refresh schedule for the initial data extraction process to the lakehouses within the Pacific Salmon Data -Landing workspace.

Steps:

  1. Create a new Lakehouse, e.g., 'shortcut_BRONZE_FOS_PROD'
  2. For ‘Get data in your lakehouse’, use ‘New shortcut’
  3. Select ‘Internal sources: Microsoft OneLake’
  4. Select ‘BRONZE_FOS_PROD’ in the ‘Pacific Salmon Data – Landing’ workspace.
  5. Select the tables you need for your workflow, e.g., fishery, catch, opening_gear.
  6. Replicate Jason's queries using notebooks
  7. Create a new Lakehouse, 'silver_FOS_prod', and direct the tabular outputs of notebooks here.

To follow:

AnnaD-M commented 3 weeks ago

Tested the audience configuration for 'entire organization' and confirmed that you can limit the content, and the app can still be seen by any DFO staff member (e.g., only had the 'about' report visible and someone in the Ecosystem Management Branch could view the app).

AnnaD-M commented 2 weeks ago

Figured out process to display last data update/modification timestamp: For each table in a lakehouse, there is an associated _delta_log directory. Each delta log file corresponds to a change in the table (see screenshot). The ‘nb_bronze_fos_prod_table_last_modified’ notebook reads the delta logs for each table in the BRONZE_FOS_PROD lakehouse and determines the maximum modification time across all tables. It then writes the max modification time to the ‘bronze_fos_prod_max_modification_time’ table in the SILVER_FOS_PROD lakehouse. So, in a roundabout way it captures the timestamp for which the initial data extraction process completed. The 'max modification time' can then be displayed on relevant Power BI reports to that end users know how current the data is. Note that the modification timestamp differs from the refresh timestamp.

AnnaD-M commented 2 days ago

Updates for Nov 18th review with Erika and Jason: • Workaround to dynamically summarise how the data is being filtered. Cannot dynamically update lakehouse tables, and cannot display dynamically generated ‘measures’ in a table visual. Smart narratives and text boxes do not allow you to select, copy and paste text (only the option to ‘share’ the visual). Workaround is to create a DAX table to house the dynamic (filter selection). • Updates/additions to disclaimers, including definition of ‘Protected B’ data. • Workaround to mimic pop up for acknowledgment of disclaimers (using selections and bookmarks). In theory, any DFO computer that has visited the site would have been exposed to the disclaimer and had to click the 'I acknowledge', so this would be a way to see a list of users that have 'acknowledged' the disclaimers. • Added PFMA map. Note that there are some PFMAs that when selected do not return any results, so that list will truncate in the future. • Rearranged selectors/filters to reflect order of importance. • Reordered species column in order of importance based on DFO web content, with bycatch last. • Added tabs for additional information. • Added definition of ‘change over time’. Future updates: • Update data flow diagram to remove lakehouse names and include SQL query replication. Provides end users with a general understanding of how the platform works. • Updates to openings query results (pending completion of Sherry’s review process).

AnnaD-M commented 1 day ago

Updates based on feedback 18 nov 2024

'About' page:

'Catch Insights' and 'Catch Summary' pages:

Unfulfilled requests:

ArcGIS for Power BI plugin: