Open lachlan194924 opened 3 weeks ago
I have created a large MATLAB structure to help with polishing our header files. Each row of a header file is read in.
Row 2 in this struct tells you how many unique entries there currently are in the warehouse (this row is the agency name):
The first column shows the unique name, the next shows how many header files have it, and the third is supposed to give the path of each header file that has it. Right now it doesn't, because I need to rerun the warehouse stats code to include that information and I don't currently have access. For now it shows the data tag instead, so you can at least try to work out which codebase is responsible for producing that header file.
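As a rough illustration of the three-column layout described above (unique name, number of header files, then tags or paths), here is a minimal sketch. It is not the code in WarehouseStatSummary.m: the sample values, the variable names, and the use of `unique` with `accumarray` are all assumptions made for illustration, and it assumes a MATLAB release with string arrays (R2017a or later).

```matlab
% Hypothetical sample data: the value of one header row per header file,
% plus the data tag of the file it came from (both invented for illustration).
agencyNames = ["Agency A"; "Agency B"; "Agency A"; "Agency C"; "Agency A"];
dataTags    = ["tag1"; "tag2"; "tag3"; "tag4"; "tag5"];

% Unique names and, for each file, the index of its unique name.
[uniqueNames, ~, idx] = unique(agencyNames);

% Number of header files carrying each unique name.
counts = accumarray(idx, 1);

% Data tags of the files carrying each unique name, one cell per name.
tagsPerName = accumarray(idx, (1:numel(idx))', [], @(k) {dataTags(sort(k))});

% Three columns: unique name, file count, tags (or paths, once available).
summary = table(uniqueNames, counts, tagsPerName, ...
    'VariableNames', {'UniqueName', 'NumHeaderFiles', 'DataTags'});
disp(summary)
```

Once the file paths are in the summary CSV, the same grouping should work with a path column in place of `dataTags`.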
I have copied the code, which I wrote on my laptop, onto davy. "calculate_warehouse_stats.m" needs to be run first to get the header file paths into the summary CSV, then "WarehouseStatSummary.m" creates the interactive structure. (The struct should then include the header file paths because I added that, but I can't test it without access.) Alternatively, my local copy produces the structure with just the tag, and that works.
The script calculate_warehouse_stats.m, which is in the actions folder, reads all of the header files; each row of a header file becomes a column in the stats document. The purpose is to find synonyms in the document: we want consistent naming, so where several words mean the same thing we only want to use one of them.
[Screenshot]
For example, in the deployment information, "Seafloor" means the same as "seabed", but for consistency we want only one of them to appear. This needs to be the case for every aspect of the header files, and there should be no blank spaces either.
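One possible way to enforce this is a lookup table from each variant to a single preferred term, sketched below. The synonym list, the choice of "Seabed" as the preferred term, and the sample values are all hypothetical. Reading "no blank spaces" as "no leading or trailing whitespace" is also an assumption.

```matlab
% Hypothetical synonym map: each key is a lower-case variant, each value
% the preferred term. Preferring "Seabed" is an assumption for illustration.
synonyms = containers.Map( ...
    {'seafloor', 'sea floor', 'sea bed', 'seabed'}, ...
    {'Seabed',   'Seabed',    'Seabed',  'Seabed'});

% Invented example values for one header row.
values = ["Seafloor"; " seabed "; "Sea Floor"; "Surface"];

cleaned = strtrim(values);              % drop leading/trailing blanks
for i = 1:numel(cleaned)
    key = char(lower(cleaned(i)));      % case-insensitive lookup key
    if isKey(synonyms, key)
        cleaned(i) = synonyms(key);     % replace variant with preferred term
    end
end
disp(cleaned)
```

Values with no entry in the map (such as "Surface" here) are left unchanged apart from trimming, so the map can be grown one synonym group at a time while reviewing the pivot table.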
A summary pivot table can be found in the attached warehouse_stats.xlsx.