opensearch-project / neural-search

Plugin that adds dense neural retrieval into the OpenSearch ecosytem
Apache License 2.0
57 stars 58 forks source link

[FEATURE] Support batch ingestion in TextEmbeddingProcessor & SparseEncodingProcessor #744

Closed chishui closed 1 month ago

chishui commented 1 month ago

Description

Add support for batch ingestion in TextEmbeddingProcessor & SparseEncodingProcessor to improve ingestion performance.

https://github.com/opensearch-project/neural-search/issues/743

Issues Resolved

https://github.com/opensearch-project/neural-search/issues/743

Check List

By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license. For more information on following Developer Certificate of Origin and signing off your commits, please check here.

chishui commented 1 month ago

@martin-gaievski do you have other comments? Can I get an approval from you?

opensearch-trigger-bot[bot] commented 1 month ago

The backport to 2.x failed:

The process '/usr/bin/git' failed with exit code 1

To backport manually, run these commands in your terminal:

# Fetch latest updates from GitHub
git fetch
# Create a new working tree
git worktree add .worktrees/backport-2.x 2.x
# Navigate to the new working tree
cd .worktrees/backport-2.x
# Create a new branch
git switch --create backport/backport-744-to-2.x
# Cherry-pick the merged commit of this pull request and resolve the conflicts
git cherry-pick -x --mainline 1 afd1215a930a14cb889ea8751997a8a18cce5d1a
# Push it to GitHub
git push --set-upstream origin backport/backport-744-to-2.x
# Go back to the original working tree
cd ../..
# Delete the working tree
git worktree remove .worktrees/backport-2.x

Then, create a pull request where the base branch is 2.x and the compare/head branch is backport/backport-744-to-2.x.