Closed jparkerweb closed 2 weeks ago
The text input is an array of document objects, each containing the following properties:
text
document_name
document_text: String - The text of the document
document_text
text = [ { document_name: "document1", document_text: "..." }, { document_name: "document2", document_text: "..." }, ... ]
The output is an array of chunks, each containing the following properties:
document_id
number_of_chunks
chunk_number
model_name
is_model_quantized
embedding
returnEmbedding
true
token_length
returnTokenLength
Input
The
text
input is an array of document objects, each containing the following properties:document_name
: String - The name of the document being to be chunkeddocument_text
: String - The text of the documentOutput
The output is an array of chunks, each containing the following properties:
document_id
: Integer - A unique identifier for the document (current timestamp in milliseconds)document_name
: String - The name of the document being chunked (if provided)number_of_chunks
: Integer - The total number of final chunks returned from the input textchunk_number
: Integer - The number of the current chunkmodel_name
: String - The name of the embedding model usedis_model_quantized
: Boolean - Indicates whether the embedding model is quantizedtext
: String - The chunked textembedding
: Array - The embedding vector (ifreturnEmbedding
istrue
)token_length
: Integer - The token length (ifreturnTokenLength
istrue
)