Closed davidgxue closed 8 months ago
Latest commit: |
3142709
|
Status: | ✅ Deploy successful! |
Preview URL: | https://dc48aa3a.ask-astro.pages.dev |
Branch Preview URL: | https://improve-data-ingestion-pt2.ask-astro.pages.dev |
Updated the PR description with test results!!
Code ready for review, evaluations/testing is still in progress
Description
Technical Changes
split.py
file tochunking_utils.py
(multiple files changed due to import naming)stable
version of the docs.Evaluations
data_ingest_comparison_part_2.csv data_ingest_results_part_2.csv
Related Issues
closes #221 closes #258 closes #295 (Reranker has been addressed in GCP environment variables, embedding model change completed in a different PR) closes #285 (This PR prevents empty docs from being ingested)