Currently, the bucket name for the embeddings is hardcoded in the Census codebase. This is not ideal: if we need to move the bucket, it will break previous versions.
At least for embeddings, we should use the following approach:
Add an embedding_uri field to the manifest which points to the location of the artifact on S3
Replace the code that uses the hardcoded bucket location with such URI
We should also ensure that the same pattern doesn't get added for indexes.
Currently, the bucket name for the embeddings is hardcoded in the Census codebase. This is not ideal: if we need to move the bucket, it will break previous versions.
At least for embeddings, we should use the following approach:
embedding_uri
field to the manifest which points to the location of the artifact on S3We should also ensure that the same pattern doesn't get added for indexes.