Closed hannes-ucsc closed 3 years ago
The first of the affected subgraphs refers to a sequence_file
input fdfba67d-c25d-4459-a196-f3ef38657cce:
There is no subgraph in the snapshot that defines that input:
and no matching sequence_file
row either:
Here are the queries in text form:
select content from `datarepo-dev-6883f2a5.hca_dev_f48e7c39cc6740559d79bc437892840c__20210830_20210929.links`
where links_id = "208ea59a-7f02-5006-8a79-c25104219109"
SELECT links_id, version, JSON_EXTRACT_SCALAR(link_output, "$.output_id") AS output_id
FROM `datarepo-dev-6883f2a5.hca_dev_f48e7c39cc6740559d79bc437892840c__20210830_20210929.links` AS links
JOIN UNNEST(JSON_EXTRACT_ARRAY(links.content, '$.links')) AS content_links
ON JSON_EXTRACT_SCALAR(content_links, '$.link_type') = 'process_link'
JOIN UNNEST(JSON_EXTRACT_ARRAY(content_links, '$.outputs')) AS link_output
ON JSON_EXTRACT_SCALAR(link_output, "$.output_id") = 'fdfba67d-c25d-4459-a196-f3ef38657cce'
select count(*) from `datarepo-dev-6883f2a5.hca_dev_f48e7c39cc6740559d79bc437892840c__20210830_20210929.sequence_file`
where sequence_file_id = "fdfba67d-c25d-4459-a196-f3ef38657cce"
@hannes-ucsc please see this updated example from the affected project after reimport
Looks good. We'll be ready to index the replacement snapshots.
Confirmed fixed in hca_dev_f48e7c39cc6740559d79bc437892840c__20210830_20211007
.
Affected snapshot:
Affected subgraphs:
CloudWatch Logs Insights
region: us-east-1
log-group-names: /aws/lambda/azul-indexer-hannes-contribute_retry
start-time: -3600s
end-time: 0s
query-string:
datarepo-dev-6883f2a5.hca_dev_f48e7c39cc6740559d79bc437892840c__20210830_20210929.links
AS links\n JOIN UNNEST(JSON_EXTRACT_ARRAY(links.content, '$.links')) AS content_links\n ON JSON_EXTRACT_SCALAR(content_links, '$.link_type') = 'process_link'\n JOIN UNNEST(JSON_EXTRACT_ARRAY(content_links, '$.outputs')) AS link_output\n ON JSON_EXTRACT_SCALAR(link_output, \"$.output_id\") IN UNNEST(['0817dd3d-5796-4888-8117-6653a947488d', 'fdfba67d-c25d-4459-a196-f3ef38657cce'])\n "}datarepo-dev-6883f2a5.hca_dev_f48e7c39cc6740559d79bc437892840c__20210830_20210929.links
AS links\n JOIN UNNEST(JSON_EXTRACT_ARRAY(links.content, \'$.links\')) AS content_links\n ON JSON_EXTRACT_SCALAR(content_links, \'$.link_type\') = \'process_link\'\n JOIN UNNEST(JSON_EXTRACT_ARRAY(content_links, \'$.outputs\')) AS link_output\n ON JSON_EXTRACT_SCALAR(link_output, "$.output_id") IN UNNEST([\'0817dd3d-5796-4888-8117-6653a947488d\', \'fdfba67d-c25d-4459-a196-f3ef38657cce\'])\n 'datarepo-dev-6883f2a5.hca_dev_f48e7c39cc6740559d79bc437892840c__20210830_20210929.links
\n WHERE links_id = '208ea59a-7f02-5006-8a79-c25104219109'\n AND version = TIMESTAMP('2021-09-10T15:13:09.000000Z')\n "}datarepo-dev-6883f2a5.hca_dev_f48e7c39cc6740559d79bc437892840c__20210830_20210929.links
\n WHERE links_id = \'208ea59a-7f02-5006-8a79-c25104219109\'\n AND version = TIMESTAMP(\'2021-09-10T15:13:09.000000Z\')\n '