Closed harsha-kotha closed 11 months ago
Fix details
@harsha-kotha We have given two fixes for this.
Ingestion report before fix: Job_name_1695983869315.pdf
Ingestion report after fix: Job_name_1695983888069.pdf
Blob Ingestion performance was improvised.
In 3.1.5 - 224368 record content count
Ingestion time - 1 hour 26 mins
Before Report -
Job_name_1695983869315 (3).pdf
In 3.1.7 - 224368 records content count
Ingestion time - 12 mins 7 sec
After Report -
Please share the size of Blob added in both runs along with number of blob ingested in each run. If it is 1 single blob, please test with more number of blobs and share ingestion report.
@harsha-kotha We tested with more that one blob and also attached the screenshot for the size & count of blobs ingested.
Count:
Size:
If this is not satisfactory kindly helps us with the scenarios to test
We will try to generate and ingest data that falls under this and submit the report.
Thanks, we will do more rounds from our side. I see scope for improvement in performance on Ingestion.
Environment - Pulte New Dev ADS 3.1.7
https://archon-datastore.platform3solutions.com/pulte/dev/login
App : Recruiting Management Search : Test_Assign Table : CANDIDATE_BLOB
Structured Parquet Size : 9.67 MB Unstructured Size : 12.33 Both above sizes are post ingestion.
12.33 GB ingestion took 6 Hours 29 Minutes.
INGESTION_REPORT_07936521-b624-4559-b7b3-8befe66246b7_1690129087974.pdf