Open timcosgrove opened 1 year ago
See #10894 for anything useful.
Note that this epic was written with the assumption of a more managed solution to files. We have also proposed 'dumb sync' where anything written by the CMS to its asset file system automatically syncs to S3.
We need to investigate that proposal, write it up, put out an RFC
Background
CMS-managed file assets - images and PDFs primarily, that are added by CMS editors as part of their content - are stored in a file store that is within the VA network, like the CMS itself is. This means that when CMS content data that references these files, the URLs are to locations that are not accessible by regular users.
Content Build solves this problem by downloading every file asset used in VA.gov pages with every single content release. This is time-consuming.
To solve this problem for Next Build, we have set up file sync from the CMS file stores to public S3 buckets that can operate as web servers. Files are synced automatically as they are added to the CMS; the sync typically takes a few minutes.
As pages are built by Next Build, we need to substitute the URL to a given file as the CMS understands it with a URL to the public S3 bucket location. This will need to be done on all text where file references could existn.
Assumptions
Acceptance Criteria