scientist-softserv / adventist_knapsack

Apache License 2.0

:gift: Convert dog biscuits/properties to yaml files for Valkyrie Resources #689

Closed ShanaLMoore closed 6 days ago

ShanaLMoore commented 2 months ago

Summary

For Valkyrie objects, Dog Biscuits is replaced by the simple schema approach, with properties defined in YAML files.

SPIKE: https://docs.google.com/document/d/1iSlzEfM0ImPQXM2u-if3icuet52SaqHupNEPErTG-Go/edit?usp=sharing

Models

Steps

Identify all of the terms for each work type and determine which ones are not in the basic and core metadata YAML files. The YAML files are created when we run the generator to create the resources; we'll then update them with our findings.
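The step above can be sketched in plain Ruby. This is only an illustration, assuming the schema files list their terms under a top-level `attributes:` key; the two inline YAML excerpts, the `schema_terms` and `missing_terms` helpers, and the sample term list are all hypothetical stand-ins for the real generated files.

```ruby
require "yaml"

# Hypothetical excerpts standing in for the core and basic metadata YAML files.
# The `attributes:` layout is an assumption about how the schema files are shaped.
CORE_YAML = <<~YAML
  attributes:
    title:
      type: string
      multiple: true
    date_modified:
      type: date_time
    depositor:
      type: string
YAML

BASIC_YAML = <<~YAML
  attributes:
    creator:
      type: string
      multiple: true
    keyword:
      type: string
      multiple: true
YAML

# Returns the attribute names declared under the top-level `attributes` key.
def schema_terms(yaml_text)
  YAML.safe_load(yaml_text).fetch("attributes", {}).keys
end

# Returns the work-type terms that none of the given schemas declare.
def missing_terms(work_type_terms, *yaml_texts)
  covered = yaml_texts.flat_map { |text| schema_terms(text) }
  work_type_terms - covered
end

# A short sample of terms; "extent" is deliberately absent from both sample schemas.
image_terms = %w[title depositor creator keyword extent]
MISSING = missing_terms(image_terms, CORE_YAML, BASIC_YAML)
puts MISSING.inspect
```

Any term reported as missing would be a candidate for the work type's own YAML file.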

Acceptance Criteria

Testing Instructions

Notes

To clarify, we are NOT removing dog biscuits entirely. However, new valkyrie models SHOULD NOT be using dog biscuits.

⚠ We CANNOT remove dog biscuits until the application has been fully migrated. ⚠

laritakr commented 1 month ago

To verify the metadata, I found the attributes for each existing work & collection model on the prod server:

types = ["GenericWork", "Image", "Collection", "Thesis", "ConferenceItem", "Dataset", "ExamPaper", "JournalArticle", "PublishedWork"]

Attributes are found via:

types.map { |type| type.constantize.new.attributes.keys.map(&:to_s) }

I then did the same locally for the new resource models.

Resulting differences are mostly as expected due to new features coming in via Hyrax/Hyku (pdf viewer options) as well as upcoming development (i.e. slugs).
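The "not in resource" and "only in resource" lists in the dumps that follow are set differences between the two attribute lists. A minimal sketch of that comparison, assuming plain arrays of attribute names; the `attribute_diff` helper and the shortened sample lists are hypothetical, and in the application the lists would come from calls like `type.constantize.new.attributes.keys` in a Rails console:

```ruby
# Splits two attribute lists into the names unique to each side.
def attribute_diff(legacy_attrs, resource_attrs)
  {
    not_in_resource: legacy_attrs - resource_attrs,
    only_in_resource: resource_attrs - legacy_attrs
  }
end

# Shortened sample lists, not the full output from either environment.
legacy = %w[id head tail title aark_id slug]
resource = %w[id internal_resource title bulkrax_identifier]

DIFF = attribute_diff(legacy, resource)
puts "not in resource: #{DIFF[:not_in_resource].inspect}"
puts "only in resource: #{DIFF[:only_in_resource].inspect}"
```

Array subtraction preserves the order of the left-hand list, which keeps the output easy to compare against the original attribute order.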

Generic Work
```
GenericWork
["id", "head", "tail", "depositor", "title", "date_uploaded", "date_modified", "state", "proxy_depositor", "on_behalf_of", "arkivo_checksum", "owner", "alternative_title", "label", "relative_path", "import_url", "resource_type", "creator", "contributor", "description", "abstract", "keyword", "license", "rights_notes", "rights_statement", "access_right", "publisher", "date_created", "subject", "language", "identifier", "based_near", "related_url", "bibliographic_citation", "source", "is_child", "split_from_pdf_id", "video_embed", "date_issued", "lat", "long", "alt", "part_of", "place_of_publication", "aark_id", "slug", "slug_for_upgrade", "remote_url", "access_control_id", "representative_id", "thumbnail_id", "rendering_ids", "admin_set_id", "embargo_id", "lease_id"]
GenericWorkResource
["id", "internal_resource", "created_at", "updated_at", "new_record", "alternate_ids", "embargo_id", "lease_id", "title", "date_modified", "date_uploaded", "depositor", "admin_set_id", "member_ids", "member_of_collection_ids", "on_behalf_of", "proxy_depositor", "state", "rendering_ids", "representative_id", "thumbnail_id", "abstract", "access_right", "alternative_title", "arkivo_checksum", "based_near", "bibliographic_citation", "contributor", "creator", "date_created", "description", "identifier", "import_url", "publisher", "label", "language", "license", "relative_path", "related_url", "resource_type", "rights_notes", "rights_statement", "source", "subject", "long", "lat", "alt", "owner", "bulkrax_identifier", "date_issued", "keyword", "place_of_publication", "part_of", "remote_url", "show_pdf_viewer", "show_pdf_download_button", "video_embed", "is_child", "split_from_pdf_id"]
not in resource: ["head", "tail", "aark_id", "slug", "slug_for_upgrade", "access_control_id"]
only in resource: ["internal_resource", "created_at", "updated_at", "new_record", "alternate_ids", "member_ids", "member_of_collection_ids", "bulkrax_identifier", "show_pdf_viewer", "show_pdf_download_button"]
```
Image
```
Image
["id", "head", "tail", "depositor", "title", "date_uploaded", "date_modified", "state", "proxy_depositor", "on_behalf_of", "arkivo_checksum", "owner", "extent", "alternative_title", "label", "relative_path", "import_url", "resource_type", "creator", "contributor", "description", "abstract", "keyword", "license", "rights_notes", "rights_statement", "access_right", "publisher", "date_created", "subject", "language", "identifier", "based_near", "related_url", "bibliographic_citation", "source", "is_child", "split_from_pdf_id", "date_issued", "lat", "long", "alt", "part_of", "place_of_publication", "aark_id", "slug", "slug_for_upgrade", "remote_url", "access_control_id", "representative_id", "thumbnail_id", "rendering_ids", "admin_set_id", "embargo_id", "lease_id"]
ImageResource
["id", "internal_resource", "created_at", "updated_at", "new_record", "alternate_ids", "embargo_id", "lease_id", "title", "date_modified", "date_uploaded", "depositor", "admin_set_id", "member_ids", "member_of_collection_ids", "on_behalf_of", "proxy_depositor", "state", "rendering_ids", "representative_id", "thumbnail_id", "abstract", "access_right", "alternative_title", "arkivo_checksum", "based_near", "bibliographic_citation", "contributor", "creator", "date_created", "description", "identifier", "import_url", "publisher", "label", "language", "license", "relative_path", "related_url", "resource_type", "rights_notes", "rights_statement", "source", "subject", "long", "lat", "alt", "owner", "bulkrax_identifier", "date_issued", "keyword", "extent", "part_of", "place_of_publication", "remote_url", "show_pdf_viewer", "show_pdf_download_button", "video_embed", "is_child", "split_from_pdf_id"]
not in resource: ["head", "tail", "aark_id", "slug", "slug_for_upgrade", "access_control_id"]
only in resource: ["internal_resource", "created_at", "updated_at", "new_record", "alternate_ids", "member_ids", "member_of_collection_ids", "bulkrax_identifier", "show_pdf_viewer", "show_pdf_download_button", "video_embed"]
```
Collection
```
Collection
["id", "depositor", "title", "date_uploaded", "date_modified", "head", "tail", "collection_type_gid", "alternative_title", "label", "relative_path", "import_url", "resource_type", "creator", "contributor", "description", "abstract", "keyword", "license", "rights_notes", "rights_statement", "access_right", "publisher", "date_created", "subject", "language", "identifier", "based_near", "related_url", "bibliographic_citation", "source", "aark_id", "slug", "slug_for_upgrade", "remote_url", "date_available", "date_published", "date_submitted", "date_accepted", "issue_number", "managing_organisation", "official_url", "orcid", "pagination", "part_of", "publication_status", "refereed", "volume_number", "preflabel", "altlabel", "rdfs_label", "rights_holder", "rights_description", "date", "department", "funder", "language_code", "output_of", "doi", "lat", "long", "alt", "location", "note", "former_identifier", "date_issued", "place_of_publication", "access_control_id", "representative_id", "thumbnail_id", "output_of_resource_ids"]
CollectionResource
["id", "internal_resource", "created_at", "updated_at", "new_record", "alternate_ids", "embargo_id", "lease_id", "title", "date_modified", "date_uploaded", "depositor", "collection_type_gid", "member_ids", "member_of_collection_ids", "abstract", "access_right", "alternative_title", "arkivo_checksum", "based_near", "bibliographic_citation", "contributor", "creator", "date_created", "description", "identifier", "import_url", "publisher", "label", "language", "license", "relative_path", "related_url", "resource_type", "rights_notes", "rights_statement", "source", "subject", "long", "lat", "alt", "owner", "on_behalf_of", "state", "proxy_depositor", "bulkrax_identifier", "keyword", "date", "date_available", "date_published", "date_submitted", "date_accepted", "date_issued", "doi", "former_identifier", "funder", "issue_number", "location", "managing_organisation", "note", "official_url", "orcid", "output_of", "pagination", "part_of", "publication_status", "place_of_publication", "refereed", "remote_url", "rights_holder", "rights_description", "preflabel", "altlabel", "volume_number"]
not in resource: ["head", "tail", "aark_id", "slug", "slug_for_upgrade", "rdfs_label", "department", "language_code", "access_control_id", "representative_id", "thumbnail_id", "output_of_resource_ids"]
only in resource: ["internal_resource", "created_at", "updated_at", "new_record", "alternate_ids", "embargo_id", "lease_id", "member_ids", "member_of_collection_ids", "arkivo_checksum", "owner", "on_behalf_of", "state", "proxy_depositor", "bulkrax_identifier"]
```
Thesis
```
Thesis
["id", "head", "tail", "video_embed", "depositor", "title", "date_uploaded", "date_modified", "state", "proxy_depositor", "on_behalf_of", "arkivo_checksum", "owner", "bibliographic_citation", "date_issued", "extent", "lat", "long", "alt", "part_of", "place_of_publication", "remote_url", "alternative_title", "label", "relative_path", "import_url", "resource_type", "creator", "contributor", "description", "abstract", "keyword", "license", "rights_notes", "rights_statement", "access_right", "publisher", "date_created", "subject", "language", "identifier", "based_near", "related_url", "source", "aark_id", "slug", "slug_for_upgrade", "advisor", "awarding_institution", "date_of_award", "department", "orcid", "qualification_name", "qualification_level", "preflabel", "altlabel", "rdfs_label", "rights_holder", "rights_description", "date", "funder", "language_code", "output_of", "doi", "location", "managing_organisation", "note", "former_identifier", "is_child", "split_from_pdf_id", "packaged_by_ids", "access_control_id", "representative_id", "thumbnail_id", "rendering_ids", "admin_set_id", "embargo_id", "lease_id", "mainfile_ids", "output_of_resource_ids"]
ThesisResource
["id", "internal_resource", "created_at", "updated_at", "new_record", "alternate_ids", "embargo_id", "lease_id", "title", "date_modified", "date_uploaded", "depositor", "admin_set_id", "member_ids", "member_of_collection_ids", "on_behalf_of", "proxy_depositor", "state", "rendering_ids", "representative_id", "thumbnail_id", "abstract", "access_right", "alternative_title", "arkivo_checksum", "based_near", "bibliographic_citation", "contributor", "creator", "date_created", "description", "identifier", "import_url", "publisher", "label", "language", "license", "relative_path", "related_url", "resource_type", "rights_notes", "rights_statement", "source", "subject", "long", "lat", "alt", "owner", "funder", "output_of", "doi", "location", "managing_organisation", "note", "former_identifier", "preflabel", "altlabel", "rdfs_label", "date", "rights_description", "department", "language_code", "rights_holder", "date_issued", "keyword", "extent", "part_of", "place_of_publication", "remote_url", "advisor", "orcid", "date_of_award", "qualification_level", "awarding_institution", "qualification_name", "bulkrax_identifier", "show_pdf_viewer", "show_pdf_download_button", "video_embed", "is_child", "split_from_pdf_id"]
not in resource: ["head", "tail", "aark_id", "slug", "slug_for_upgrade", "packaged_by_ids", "access_control_id", "mainfile_ids", "output_of_resource_ids"]
only in resource: ["internal_resource", "created_at", "updated_at", "new_record", "alternate_ids", "member_ids", "member_of_collection_ids", "bulkrax_identifier", "show_pdf_viewer", "show_pdf_download_button"]
```
ConferenceItem
```
ConferenceItem
["id", "head", "tail", "video_embed", "depositor", "title", "date_uploaded", "date_modified", "state", "proxy_depositor", "on_behalf_of", "arkivo_checksum", "owner", "alternative_title", "label", "relative_path", "import_url", "resource_type", "creator", "contributor", "description", "abstract", "keyword", "license", "rights_notes", "rights_statement", "access_right", "publisher", "date_created", "subject", "language", "identifier", "based_near", "related_url", "bibliographic_citation", "source", "aark_id", "slug", "slug_for_upgrade", "date_available", "date_published", "date_submitted", "date_accepted", "editor", "event_date", "isbn", "official_url", "pagination", "part_of", "place_of_publication", "publication_status", "refereed", "presented_at", "preflabel", "altlabel", "rdfs_label", "rights_holder", "rights_description", "date", "department", "funder", "language_code", "output_of", "doi", "lat", "long", "alt", "location", "managing_organisation", "note", "former_identifier", "is_child", "split_from_pdf_id", "packaged_by_ids", "access_control_id", "representative_id", "thumbnail_id", "rendering_ids", "admin_set_id", "embargo_id", "lease_id", "output_of_resource_ids"]
ConferenceItemResource
["id", "internal_resource", "created_at", "updated_at", "new_record", "alternate_ids", "embargo_id", "lease_id", "title", "date_modified", "date_uploaded", "depositor", "admin_set_id", "member_ids", "member_of_collection_ids", "on_behalf_of", "proxy_depositor", "state", "rendering_ids", "representative_id", "thumbnail_id", "abstract", "access_right", "alternative_title", "arkivo_checksum", "based_near", "bibliographic_citation", "contributor", "creator", "date_created", "description", "identifier", "import_url", "publisher", "label", "language", "license", "relative_path", "related_url", "resource_type", "rights_notes", "rights_statement", "source", "subject", "long", "lat", "alt", "owner", "funder", "output_of", "doi", "location", "managing_organisation", "note", "former_identifier", "preflabel", "altlabel", "rdfs_label", "date", "rights_description", "department", "language_code", "rights_holder", "date_published", "keyword", "date_available", "date_submitted", "date_accepted", "editor", "event_date", "isbn", "official_url", "pagination", "refereed", "place_of_publication", "part_of", "publication_status", "presented_at", "bulkrax_identifier", "show_pdf_viewer", "show_pdf_download_button", "video_embed", "is_child", "split_from_pdf_id"]
not in resource: ["head", "tail", "aark_id", "slug", "slug_for_upgrade", "packaged_by_ids", "access_control_id", "output_of_resource_ids"]
only in resource: ["internal_resource", "created_at", "updated_at", "new_record", "alternate_ids", "member_ids", "member_of_collection_ids", "bulkrax_identifier", "show_pdf_viewer", "show_pdf_download_button"]
```
Dataset
```
Dataset
["id", "head", "tail", "video_embed", "depositor", "title", "date_uploaded", "date_modified", "state", "proxy_depositor", "on_behalf_of", "arkivo_checksum", "owner", "alternative_title", "label", "relative_path", "import_url", "resource_type", "creator", "contributor", "description", "abstract", "keyword", "license", "rights_notes", "rights_statement", "access_right", "publisher", "date_created", "subject", "language", "identifier", "based_near", "related_url", "bibliographic_citation", "source", "aark_id", "slug", "slug_for_upgrade", "dc_access_rights", "date_available", "content_version", "date_accepted", "date_collected", "date_copyrighted", "date_issued", "date_published", "date_submitted", "date_updated", "date_valid", "extent", "for_indexing", "dc_format", "funder", "resource_type_general", "has_restriction", "last_access", "number_of_downloads", "readme", "requestor_email", "subtitle", "preflabel", "altlabel", "rdfs_label", "rights_holder", "rights_description", "date", "department", "language_code", "output_of", "doi", "lat", "long", "alt", "location", "managing_organisation", "note", "former_identifier", "is_child", "split_from_pdf_id", "packaged_by_ids", "access_control_id", "representative_id", "thumbnail_id", "rendering_ids", "admin_set_id", "embargo_id", "lease_id", "readmefile_ids", "replaced_by_ids", "replace_ids", "output_of_resource_ids"]
DatasetResource
["id", "internal_resource", "created_at", "updated_at", "new_record", "alternate_ids", "embargo_id", "lease_id", "title", "date_modified", "date_uploaded", "depositor", "admin_set_id", "member_ids", "member_of_collection_ids", "on_behalf_of", "proxy_depositor", "state", "rendering_ids", "representative_id", "thumbnail_id", "abstract", "access_right", "alternative_title", "arkivo_checksum", "based_near", "bibliographic_citation", "contributor", "creator", "date_created", "description", "identifier", "import_url", "publisher", "label", "language", "license", "relative_path", "related_url", "resource_type", "rights_notes", "rights_statement", "source", "subject", "long", "lat", "alt", "owner", "funder", "output_of", "doi", "location", "managing_organisation", "note", "former_identifier", "preflabel", "altlabel", "rdfs_label", "date", "rights_description", "department", "language_code", "rights_holder", "date_published", "resource_type_general", "date_collected", "date_issued", "keyword", "dc_format", "extent", "dc_access_rights", "date_accepted", "date_copyrighted", "content_version", "date_available", "date_submitted", "for_indexing", "has_restriction", "date_valid", "number_of_downloads", "readme", "requestor_email", "subtitle", "date_updated", "last_access", "bulkrax_identifier", "show_pdf_viewer", "show_pdf_download_button", "video_embed", "is_child", "split_from_pdf_id"]
not in resource: ["head", "tail", "aark_id", "slug", "slug_for_upgrade", "packaged_by_ids", "access_control_id", "readmefile_ids", "replaced_by_ids", "replace_ids", "output_of_resource_ids"]
only in resource: ["internal_resource", "created_at", "updated_at", "new_record", "alternate_ids", "member_ids", "member_of_collection_ids", "bulkrax_identifier", "show_pdf_viewer", "show_pdf_download_button"]
```
ExamPaper
```
ExamPaper
["id", "head", "tail", "depositor", "title", "date_uploaded", "date_modified", "state", "proxy_depositor", "on_behalf_of", "arkivo_checksum", "owner", "alternative_title", "label", "relative_path", "import_url", "resource_type", "creator", "contributor", "description", "abstract", "keyword", "license", "rights_notes", "rights_statement", "access_right", "publisher", "date_created", "subject", "language", "identifier", "based_near", "related_url", "bibliographic_citation", "source", "aark_id", "slug", "slug_for_upgrade", "module_code", "qualification_name", "qualification_level", "date_available", "preflabel", "altlabel", "rdfs_label", "rights_holder", "rights_description", "date", "department", "funder", "language_code", "output_of", "doi", "lat", "long", "alt", "location", "managing_organisation", "note", "former_identifier", "is_child", "split_from_pdf_id", "packaged_by_ids", "access_control_id", "representative_id", "thumbnail_id", "rendering_ids", "admin_set_id", "embargo_id", "lease_id", "mainfile_ids", "output_of_resource_ids"]
ExamPaperResource
["id", "internal_resource", "created_at", "updated_at", "new_record", "alternate_ids", "embargo_id", "lease_id", "title", "date_modified", "date_uploaded", "depositor", "admin_set_id", "member_ids", "member_of_collection_ids", "on_behalf_of", "proxy_depositor", "state", "rendering_ids", "representative_id", "thumbnail_id", "abstract", "access_right", "alternative_title", "arkivo_checksum", "based_near", "bibliographic_citation", "contributor", "creator", "date_created", "description", "identifier", "import_url", "publisher", "label", "language", "license", "relative_path", "related_url", "resource_type", "rights_notes", "rights_statement", "source", "subject", "long", "lat", "alt", "owner", "funder", "output_of", "doi", "location", "managing_organisation", "note", "former_identifier", "preflabel", "altlabel", "rdfs_label", "date", "rights_description", "department", "language_code", "rights_holder", "keyword", "module_code", "qualification_name", "qualification_level", "date_available", "bulkrax_identifier", "show_pdf_viewer", "show_pdf_download_button", "video_embed", "is_child", "split_from_pdf_id"]
not in resource: ["head", "tail", "aark_id", "slug", "slug_for_upgrade", "packaged_by_ids", "access_control_id", "mainfile_ids", "output_of_resource_ids"]
only in resource: ["internal_resource", "created_at", "updated_at", "new_record", "alternate_ids", "member_ids", "member_of_collection_ids", "bulkrax_identifier", "show_pdf_viewer", "show_pdf_download_button", "video_embed"]
```
JournalArticle
```
JournalArticle
["id", "head", "tail", "depositor", "title", "date_uploaded", "date_modified", "state", "proxy_depositor", "on_behalf_of", "arkivo_checksum", "owner", "bibliographic_citation", "date_issued", "lat", "long", "alt", "place_of_publication", "remote_url", "alternative_title", "label", "relative_path", "import_url", "resource_type", "creator", "contributor", "description", "abstract", "keyword", "license", "rights_notes", "rights_statement", "access_right", "publisher", "date_created", "subject", "language", "identifier", "based_near", "related_url", "source", "aark_id", "slug", "slug_for_upgrade", "date_available", "date_published", "date_submitted", "date_accepted", "issue_number", "managing_organisation", "official_url", "orcid", "pagination", "part_of", "publication_status", "refereed", "volume_number", "preflabel", "altlabel", "rdfs_label", "rights_holder", "rights_description", "date", "department", "funder", "language_code", "output_of", "doi", "location", "note", "former_identifier", "is_child", "split_from_pdf_id", "packaged_by_ids", "access_control_id", "representative_id", "thumbnail_id", "rendering_ids", "admin_set_id", "embargo_id", "lease_id", "output_of_resource_ids"]
JournalArticleResource
["id", "internal_resource", "created_at", "updated_at", "new_record", "alternate_ids", "embargo_id", "lease_id", "title", "date_modified", "date_uploaded", "depositor", "admin_set_id", "member_ids", "member_of_collection_ids", "on_behalf_of", "proxy_depositor", "state", "rendering_ids", "representative_id", "thumbnail_id", "abstract", "access_right", "alternative_title", "arkivo_checksum", "based_near", "bibliographic_citation", "contributor", "creator", "date_created", "description", "identifier", "import_url", "publisher", "label", "language", "license", "relative_path", "related_url", "resource_type", "rights_notes", "rights_statement", "source", "subject", "long", "lat", "alt", "owner", "funder", "output_of", "doi", "location", "managing_organisation", "note", "former_identifier", "preflabel", "altlabel", "rdfs_label", "date", "rights_description", "department", "language_code", "rights_holder", "date_published", "date_issued", "keyword", "place_of_publication", "remote_url", "date_available", "date_submitted", "date_accepted", "orcid", "part_of", "publication_status", "refereed", "issue_number", "official_url", "pagination", "volume_number", "bulkrax_identifier", "show_pdf_viewer", "show_pdf_download_button", "video_embed", "is_child", "split_from_pdf_id"]
not in resource: ["head", "tail", "aark_id", "slug", "slug_for_upgrade", "packaged_by_ids", "access_control_id", "output_of_resource_ids"]
only in resource: ["internal_resource", "created_at", "updated_at", "new_record", "alternate_ids", "member_ids", "member_of_collection_ids", "bulkrax_identifier", "show_pdf_viewer", "show_pdf_download_button", "video_embed"]
```
PublishedWork
```
PublishedWork
["id", "head", "tail", "video_embed", "depositor", "title", "date_uploaded", "date_modified", "state", "proxy_depositor", "on_behalf_of", "arkivo_checksum", "owner", "bibliographic_citation", "date_issued", "lat", "long", "alt", "extent", "remote_url", "alternative_title", "label", "relative_path", "import_url", "resource_type", "creator", "contributor", "description", "abstract", "keyword", "license", "rights_notes", "rights_statement", "access_right", "publisher", "date_created", "subject", "language", "identifier", "based_near", "related_url", "source", "aark_id", "slug", "slug_for_upgrade", "date_available", "date_published", "date_submitted", "date_accepted", "edition", "editor", "issue_number", "isbn", "official_url", "pagination", "part", "place_of_publication", "publication_status", "refereed", "series", "volume_number", "preflabel", "altlabel", "rdfs_label", "rights_holder", "rights_description", "date", "department", "funder", "language_code", "output_of", "doi", "location", "managing_organisation", "note", "former_identifier", "is_child", "split_from_pdf_id", "packaged_by_ids", "access_control_id", "representative_id", "thumbnail_id", "rendering_ids", "admin_set_id", "embargo_id", "lease_id", "output_of_resource_ids"]
PublishedWorkResource
["id", "internal_resource", "created_at", "updated_at", "new_record", "alternate_ids", "embargo_id", "lease_id", "title", "date_modified", "date_uploaded", "depositor", "admin_set_id", "member_ids", "member_of_collection_ids", "on_behalf_of", "proxy_depositor", "state", "rendering_ids", "representative_id", "thumbnail_id", "abstract", "access_right", "alternative_title", "arkivo_checksum", "based_near", "bibliographic_citation", "contributor", "creator", "date_created", "description", "identifier", "import_url", "publisher", "label", "language", "license", "relative_path", "related_url", "resource_type", "rights_notes", "rights_statement", "source", "subject", "long", "lat", "alt", "owner", "funder", "output_of", "doi", "location", "managing_organisation", "note", "former_identifier", "preflabel", "altlabel", "rdfs_label", "date", "rights_description", "department", "language_code", "rights_holder", "date_published", "date_issued", "keyword", "extent", "remote_url", "date_available", "date_submitted", "issue_number", "edition", "official_url", "pagination", "part", "date_accepted", "publication_status", "refereed", "series", "volume_number", "editor", "isbn", "place_of_publication", "bulkrax_identifier", "show_pdf_viewer", "show_pdf_download_button", "video_embed", "is_child", "split_from_pdf_id"]
not in resource: ["head", "tail", "aark_id", "slug", "slug_for_upgrade", "packaged_by_ids", "access_control_id", "output_of_resource_ids"]
only in resource: ["internal_resource", "created_at", "updated_at", "new_record", "alternate_ids", "member_ids", "member_of_collection_ids", "bulkrax_identifier", "show_pdf_viewer", "show_pdf_download_button"]
```