Mintplex-Labs / anything-llm

The all-in-one Desktop & Docker AI application with full RAG and AI Agent capabilities.
https://anythingllm.com
MIT License

[BUG]: Values length is less than the length multiplied by the value size for FixedSizeList; LanceDB embedding insertion failure #1281

Closed frost19k closed 3 months ago

frost19k commented 3 months ago

How are you running AnythingLLM?

Docker (local)

What happened?

There's an error with the LanceDB configuration that sometimes results in vector insertion failure.

Smaller documents seem to succeed:

ollama        | [GIN] 2024/05/04 - 13:06:58 | 200 |  1.324026703s |      172.27.0.3 | POST     "/api/embeddings"
anything-llm  | Inserting vectorized chunks into LanceDB collection.
anything-llm  | Caching vectorized results of custom-documents/Custom-Instructions.odt-64d48e43-e4d4-4bf7-b89f-2bbe4e860e8b.json to prevent duplicated embedding.
anything-llm  | [TELEMETRY SENT] {
anything-llm  |   event: 'documents_embedded_in_workspace',
anything-llm  |   distinctId: '0254fb58-bee5-4553-93bd-1571a82cfcc8',
anything-llm  |   properties: {
anything-llm  |     LLMSelection: 'ollama',
anything-llm  |     Embedder: 'ollama',
anything-llm  |     VectorDbSelection: 'lancedb',
anything-llm  |     runtime: 'docker'
anything-llm  |   }
anything-llm  | }
anything-llm  | [Event Logged] - workspace_documents_added

Larger documents seem to fail. I'm not at all familiar with vector databases. I have no clue.

ollama        | [GIN] 2024/05/04 - 13:09:22 | 200 |   576.84988ms |      172.27.0.3 | POST     "/api/embeddings"
anything-llm  | Inserting vectorized chunks into LanceDB collection.
anything-llm  | addDocumentToNamespace Invalid argument error: Values length 138240 is less than the length (1024) multiplied by the value size (1024) for FixedSizeList(Field { name: "item", data_type: Float32, nullable: true, dict_id: 0, dict_is_ordered: false, metadata: {} }, 1024)
anything-llm  | Failed to vectorize Zed Attack Proxy Cookbook.pdf
anything-llm  | [TELEMETRY SENT] {
anything-llm  |   event: 'documents_embedded_in_workspace',
anything-llm  |   distinctId: '0254fb58-bee5-4553-93bd-1571a82cfcc8',
anything-llm  |   properties: {
anything-llm  |     LLMSelection: 'ollama',
anything-llm  |     Embedder: 'ollama',
anything-llm  |     VectorDbSelection: 'lancedb',
anything-llm  |     runtime: 'docker'
anything-llm  |   }
anything-llm  | }

I did run a minimal Weaviate instance to confirm that the issue is with LanceDB and not the Ollama embedding model.

ollama        | [GIN] 2024/05/04 - 13:13:01 | 200 |  1.782602796s |      172.27.0.4 | POST     "/api/embeddings"
weaviate      | {"level":"info","msg":"Created shard csecstudy_Tq917rNC6TUp in 625.879µs","time":"2024-05-04T13:13:01Z"}
weaviate      | {"action":"hnsw_vector_cache_prefill","count":1000,"index_id":"main","level":"info","limit":1000000000000,"msg":"prefilled vector cache","time":"2024-05-04T13:13:01Z","took":59550}
anything-llm  | Inserting vectorized chunks into Weaviate collection.
anything-llm  | Caching vectorized results of custom-documents/Zed-Attack-Proxy-Cookbook.pdf-c996f078-921a-4ed7-aba5-83524c8c6613.json to prevent duplicated embedding.
anything-llm  | [TELEMETRY SENT] {
anything-llm  |   event: 'documents_embedded_in_workspace',
anything-llm  |   distinctId: '0254fb58-bee5-4553-93bd-1571a82cfcc8',
anything-llm  |   properties: {
anything-llm  |     LLMSelection: 'ollama',
anything-llm  |     Embedder: 'ollama',
anything-llm  |     VectorDbSelection: 'weaviate',
anything-llm  |     runtime: 'docker'
anything-llm  |   }
anything-llm  | }

Having said that, the RAG results from the built-in database do seem to be better than Weaviate. Maybe because I didn't set up reranking? I really don't wanna bother with all that. Your built-in DB is super... when it works.

Error occurs with mxbai-embed-large and nomic-embed-text on Ollama. But nomic-embed-text running on LM Studio works properly.

Are there known steps to reproduce?

No response

frost19k commented 3 months ago

ChromaDB also errors

ollama        | {"function":"launch_slot_with_data","level":"INFO","line":830,"msg":"slot is processing task","slot_id":0,"task_id":723,"tid":"137462474399744","timestamp":1714922166}
ollama        | {"function":"update_slots","level":"INFO","line":1839,"msg":"kv cache rm [p0, end)","p0":0,"slot_id":0,"task_id":723,"tid":"137462474399744","timestamp":1714922166}
ollama        | {"function":"update_slots","level":"INFO","line":1643,"msg":"slot released","n_cache_tokens":395,"n_ctx":512,"n_past":395,"n_system_tokens":0,"slot_id":0,"task_id":723,"tid":"137462474399744","timestamp":1714922166,"truncated":true}
ollama        | {"function":"log_server_request","level":"INFO","line":2737,"method":"POST","msg":"request","params":{},"path":"/embedding","remote_addr":"127.0.0.1","remote_port":38458,"status":200,"tid":"137461086814208","timestamp":1714922166}
ollama        | [GIN] 2024/05/05 - 15:16:06 | 200 |  604.422732ms |   192.168.208.4 | POST     "/api/embeddings"
chromadb      | INFO:     [05-05-2024 15:16:06] 192.168.208.4:52078 - "GET /api/v1/heartbeat HTTP/1.1" 200
chromadb      | INFO:     [05-05-2024 15:16:06] Collection csec-study is not created.
chromadb      | INFO:     [05-05-2024 15:16:06] 192.168.208.4:52088 - "POST /api/v1/collections?tenant=default_tenant&database=default_database HTTP/1.1" 200
anything-llm  | Inserting vectorized chunks into Chroma collection.
chromadb      | ERROR:    [05-05-2024 15:16:06] Exception occurred invoking consumer for subscription a5d0da3d8603469d821093579c3c2857to topic persistent://default/default/daa9985c-f3b2-4540-b5d1-eee1384a3f3b Dimensionality of (0) does not match indexdimensionality (1024)
chromadb      | INFO:     [05-05-2024 15:16:06] 192.168.208.4:52078 - "POST /api/v1/collections/daa9985c-f3b2-4540-b5d1-eee1384a3f3b/add HTTP/1.1" 201
anything-llm  | Caching vectorized results of custom-documents/Zed-Attack-Proxy-Cookbook.pdf-6443e222-d4f4-4407-993a-4feec15f66fa.json to prevent duplicated embedding.
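
The `Dimensionality of (0)` message above would be consistent with at least one embedding arriving empty, though that is a guess from the log and not something confirmed in this thread. A minimal sketch of a pre-insert check follows; `find_bad_embeddings` is a hypothetical helper, not part of AnythingLLM, ChromaDB or LanceDB.

```python
from typing import List, Sequence


def find_bad_embeddings(vectors: Sequence[Sequence[float]], expected_dim: int) -> List[int]:
    """Return the indexes of vectors whose length differs from expected_dim."""
    return [i for i, vec in enumerate(vectors) if len(vec) != expected_dim]


# Made-up data: the middle vector is empty, as a failed embedding call might leave it.
vectors = [[0.1] * 1024, [], [0.2] * 1024]
bad = find_bad_embeddings(vectors, expected_dim=1024)
print(bad)
```

Running a check like this before the insert would at least point at which chunks came back without a usable vector.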

frost19k commented 3 months ago

Fixed! Sort of...

I believe the issue was related to changing my embedding model from one with 768 dimensions (the default) to one with 1024. I deleted my persistence volume but retained my .env file, which allowed me to "initialize" AnythingLLM with an Ollama embedding model that has 1024 dimensions.

I got the idea here

frost19k commented 3 months ago

Apologies, I spoke too soon. Spent the whole day on this and then got excited.

The embeddings seem to have been inserted into LanceDB successfully, but now I get LanceDBError: No vector column found to create index.

ollama        | [GIN] 2024/05/05 - 15:59:23 | 200 |  655.615857ms |      172.20.0.3 | POST     "/api/embeddings"
anything-llm  | Inserting vectorized chunks into LanceDB collection.
anything-llm  | Caching vectorized results of custom-documents/Zed-Attack-Proxy-Cookbook.pdf-899b528b-c0c4-421c-85bb-a228af485234.json to prevent duplicated embedding.

Screenshot from 2024-05-05 21-32-07

timothycarambat commented 3 months ago

That error results if you have embedded a file with some known embedder, then later on swap embedders again and try to send a chat or embed a new document. The models are not the same so it fails because of dimension mismatches.

The embedder model should not change once you select it. If you do change it, you basically have to reset everything to prevent these bizarre errors: delete documents fully, delete workspaces, and make sure nothing is embedded.
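
To illustrate the mismatch described above, here is a toy sketch of a store that fixes its vector width on the first insert and rejects anything else afterwards. `FixedDimStore` is a hypothetical class written for this example; it is not how LanceDB or AnythingLLM is implemented, only the general idea behind a fixed-size vector column.

```python
class FixedDimStore:
    """Toy vector store: the first inserted vector fixes the allowed dimension."""

    def __init__(self):
        self.dim = None  # unknown until the first insert
        self.rows = []

    def add(self, vector):
        if self.dim is None:
            self.dim = len(vector)
        elif len(vector) != self.dim:
            raise ValueError(f"expected {self.dim} dimensions, got {len(vector)}")
        self.rows.append(list(vector))


store = FixedDimStore()
store.add([0.0] * 768)  # first embedder: 768 dimensions

try:
    store.add([0.0] * 1024)  # swapped embedder: 1024 dimensions
    mismatch_error = None
except ValueError as exc:
    mismatch_error = str(exc)

print(mismatch_error)
```

In this sketch the only way to accept 1024-dimension vectors is to start from an empty store, which mirrors the "reset everything" advice.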

frost19k commented 3 months ago

I understand. I deleted the AnythingLLM docker volume and was working with a fresh installation when the No vector column error occurred.

I realised that, since AnythingLLM does not allow for embedder configuration during onboarding, I have to use a preconfigured .env to make sure the DB is configured for a 1024 dimension model on "first boot".

When I do that, ChromaDB and Weaviate work properly but LanceDB does not - throws the No vector column error.

Interestingly, Weaviate does not seem to care about the dimensionality of a preconfigured embedder model. ChromaDB needs the pre-configuration to be proper, I cannot switch dimensionality. LanceDB is the most stubborn - simply refuses to work with Ollama embedders (1024 or 768 dimensions).
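
One way to confirm what dimension an embedder actually returns is to look at the length of the vector in a single response. The sketch below assumes Ollama's `/api/embeddings` endpoint accepts a JSON body with `model` and `prompt` and answers with an `embedding` array; `embedding_dim` and `probe_ollama` are hypothetical helper names, and the base URL and model would come from your own setup (for example `http://ollama:11434` and `mxbai-embed-large:latest` from the .env in this thread).

```python
import json
import urllib.request


def embedding_dim(response_body: str) -> int:
    """Length of the "embedding" array in a JSON response body (0 if absent)."""
    return len(json.loads(response_body).get("embedding", []))


def probe_ollama(base_url: str, model: str, prompt: str = "dimension probe") -> int:
    """POST one prompt to the embeddings endpoint and return the vector length.

    Needs a running Ollama server; the request shape is an assumption.
    """
    payload = json.dumps({"model": model, "prompt": prompt}).encode("utf-8")
    request = urllib.request.Request(
        f"{base_url}/api/embeddings",
        data=payload,
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(request) as response:
        return embedding_dim(response.read().decode("utf-8"))


# Offline example with a made-up response body, so no server is needed here.
sample = json.dumps({"embedding": [0.0] * 1024})
print(embedding_dim(sample))
```

A result of 0 from a real server would suggest the embedder returned nothing usable for that prompt.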

If you wish to diagnose this further, I would prefer to use LanceDB. If not, I am comfortable with ChromaDB for my use case.

frost19k commented 3 months ago

I am attaching the full logs for AnythingLLM from a test run to demonstrate the issue.

Dockerfile

```yaml
version: '3.9'

services:
  ollama:
    image: ollama/ollama
    container_name: ollama
    shm_size: 4gb
    volumes:
      - ./ollama:/root/.ollama
    restart: on-failure
    runtime: nvidia
    deploy:
      resources:
        reservations:
          devices:
            - driver: nvidia
              device_ids: ['0']
              capabilities: [gpu]

  anything-llm:
    image: mintplexlabs/anythingllm
    container_name: anything-llm
    userns_mode: "host"
    depends_on:
      - ollama
    cap_add:
      - SYS_ADMIN
    environment:
      - STORAGE_DIR=/app/server/storage
    volumes:
      - app-data:/app/server/storage
      - ./app/dotenv:/app/server/.env
    ports:
      - 3001:3001

networks:
  default:
    name: anything-llm

volumes:
  app-data:
    name: anything-llm
    driver: local
```

AnythingLLM .env file

```text
# Auto-dump ENV from system call on 03:41:37 GMT+0000 (Coordinated Universal Time)
LLM_PROVIDER='ollama'
EMBEDDING_MODEL_PREF='mxbai-embed-large:latest'
OLLAMA_BASE_PATH='http://ollama:11434'
OLLAMA_MODEL_PREF='dolphin-mistral:7b-v2.8-fp16'
OLLAMA_MODEL_TOKEN_LIMIT='32768'
EMBEDDING_ENGINE='ollama'
EMBEDDING_BASE_PATH='http://ollama:11434'
EMBEDDING_MODEL_MAX_CHUNK_LENGTH='2048'
VECTOR_DB='lancedb'
STORAGE_DIR='/app/server/storage'
```

AnythingLLM logs

```bash
Collector hot directory and tmp storage wiped!
Document processor app listening on port 8888
Environment variables loaded from .env
Prisma schema loaded from prisma/schema.prisma
✔ Generated Prisma Client (v5.3.1) to ./node_modules/@prisma/client in 174ms
Start using Prisma Client in Node.js (See: https://pris.ly/d/client)
import { PrismaClient } from '@prisma/client'
const prisma = new PrismaClient()
or start using Prisma Client at the edge (See: https://pris.ly/d/accelerate)
import { PrismaClient } from '@prisma/client/edge'
const prisma = new PrismaClient()
See other ways of importing Prisma Client: http://pris.ly/d/importing-client
Environment variables loaded from .env
Prisma schema loaded from prisma/schema.prisma
Datasource "db": SQLite database "anythingllm.db" at "file:../storage/anythingllm.db"
19 migrations found in prisma/migrations
No pending migrations to apply.
[TELEMETRY ENABLED] Anonymous Telemetry enabled. Telemetry helps Mintplex Labs Inc improve AnythingLLM.
prisma:info Starting a sqlite pool with 17 connections.
fatal: not a git repository (or any of the parent directories): .git
getGitVersion Command failed: git rev-parse HEAD
fatal: not a git repository (or any of the parent directories): .git
[TELEMETRY SENT] { event: 'server_boot', distinctId: '155f24da-8878-4e14-aa0b-cc2060d6a1d4', properties: { commit: '--', runtime: 'docker' } }
[CommunicationKey] RSA key pair generated for signed payloads within AnythingLLM services.
Primary server in HTTP mode listening on port 3001
[Event Logged] - update_embedding_engine
[TELEMETRY SENT] { event: 'workspace_created', distinctId: '155f24da-8878-4e14-aa0b-cc2060d6a1d4', properties: { multiUserMode: false, LLMSelection: 'ollama', Embedder: 'ollama', VectorDbSelection: 'lancedb', runtime: 'docker' } }
[Event Logged] - workspace_created
-- Working The Web Application Hacker's Handbook.pdf --
[...]
[SUCCESS]: The Web Application Hacker's Handbook.pdf converted & ready for embedding.
[CollectorApi] Document The Web Application Hacker's Handbook.pdf uploaded processed and successfully. It is now available in documents.
[TELEMETRY SENT] { event: 'document_uploaded', distinctId: '155f24da-8878-4e14-aa0b-cc2060d6a1d4', properties: { runtime: 'docker' } }
[Event Logged] - document_uploaded
Adding new vectorized document into namespace test
[RecursiveSplitter] Will split with { chunkSize: 2048, chunkOverlap: 512 }
Chunks created from document: 1268
[OllamaEmbedder] Embedding 1268 chunks of text with mxbai-embed-large:latest.
Inserting vectorized chunks into LanceDB collection.
Caching vectorized results of custom-documents/The-Web-Application-Hacker's-Handbook.pdf-32128822-2b6a-40af-b0e2-9e2d32af453c.json to prevent duplicated embedding.
[TELEMETRY SENT] { event: 'documents_embedded_in_workspace', distinctId: '155f24da-8878-4e14-aa0b-cc2060d6a1d4', properties: { LLMSelection: 'ollama', Embedder: 'ollama', VectorDbSelection: 'lancedb', runtime: 'docker' } }
[Event Logged] - workspace_documents_added
[OllamaEmbedder] Embedding 1 chunks of text with mxbai-embed-large:latest.
[Error: LanceDBError: No vector column found to create index]
[Event Logged] - workspace_deleted
[TELEMETRY SENT] { event: 'workspace_created', distinctId: '155f24da-8878-4e14-aa0b-cc2060d6a1d4', properties: { multiUserMode: false, LLMSelection: 'ollama', Embedder: 'ollama', VectorDbSelection: 'lancedb', runtime: 'docker' } }
[Event Logged] - workspace_created
Adding new vectorized document into namespace test
Cached vectorized results of custom-documents/The-Web-Application-Hacker's-Handbook.pdf-32128822-2b6a-40af-b0e2-9e2d32af453c.json found! Using cached data to save on embed costs.
[TELEMETRY SENT] { event: 'documents_embedded_in_workspace', distinctId: '155f24da-8878-4e14-aa0b-cc2060d6a1d4', properties: { LLMSelection: 'ollama', Embedder: 'ollama', VectorDbSelection: 'lancedb', runtime: 'docker' } }
[Event Logged] - workspace_documents_added
[OllamaEmbedder] Embedding 1 chunks of text with mxbai-embed-large:latest.
[Error: LanceDBError: No vector column found to create index]
[Event Logged] - workspace_deleted
[TELEMETRY SENT] { event: 'workspace_created', distinctId: '155f24da-8878-4e14-aa0b-cc2060d6a1d4', properties: { multiUserMode: false, LLMSelection: 'ollama', Embedder: 'ollama', VectorDbSelection: 'lancedb', runtime: 'docker' } }
[Event Logged] - workspace_created
-- Working Zed Attack Proxy Cookbook.pdf --
[...]
[SUCCESS]: Zed Attack Proxy Cookbook.pdf converted & ready for embedding.
[CollectorApi] Document Zed Attack Proxy Cookbook.pdf uploaded processed and successfully. It is now available in documents.
[TELEMETRY SENT] { event: 'document_uploaded', distinctId: '155f24da-8878-4e14-aa0b-cc2060d6a1d4', properties: { runtime: 'docker' } }
[Event Logged] - document_uploaded
Adding new vectorized document into namespace test
[RecursiveSplitter] Will split with { chunkSize: 2048, chunkOverlap: 512 }
Chunks created from document: 187
[OllamaEmbedder] Embedding 187 chunks of text with mxbai-embed-large:latest.
Inserting vectorized chunks into LanceDB collection.
addDocumentToNamespace Invalid argument error: Values length 134144 is less than the length (1024) multiplied by the value size (1024) for FixedSizeList(Field { name: "item", data_type: Float32, nullable: true, dict_id: 0, dict_is_ordered: false, metadata: {} }, 1024)
Failed to vectorize Zed Attack Proxy Cookbook.pdf
[TELEMETRY SENT] { event: 'documents_embedded_in_workspace', distinctId: '155f24da-8878-4e14-aa0b-cc2060d6a1d4', properties: { LLMSelection: 'ollama', Embedder: 'ollama', VectorDbSelection: 'lancedb', runtime: 'docker' } }
[Event Logged] - workspace_documents_added
```
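
A quick arithmetic check on the numbers in the AnythingLLM log for Zed Attack Proxy Cookbook.pdf, as a sketch. The three inputs are copied from the log (187 chunks, 1024-wide vectors, a reported values length of 134144); reading the shortfall as "some chunks came back without a full vector" is my assumption, not something the log states.

```python
# Numbers copied from the AnythingLLM log for Zed Attack Proxy Cookbook.pdf.
chunks_created = 187    # "Chunks created from document: 187"
vector_dim = 1024       # FixedSizeList(..., 1024)
values_length = 134144  # "Values length 134144"

# How many float values 187 full vectors would need.
expected_values = chunks_created * vector_dim

# How many complete 1024-wide vectors the reported values length amounts to.
complete_vectors, leftover = divmod(values_length, vector_dim)

# Assumption: the difference corresponds to chunks with no (or an empty) vector.
missing_vectors = chunks_created - complete_vectors

print(expected_values, complete_vectors, leftover, missing_vectors)
```

The reported length divides evenly by 1024, so the data looks like whole vectors with some absent, not like vectors of the wrong width.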

Ollama logs

```bash
time=2024-05-06T03:41:19.868Z level=INFO source=images.go:828 msg="total blobs: 14"
time=2024-05-06T03:41:19.868Z level=INFO source=images.go:835 msg="total unused blobs removed: 0"
time=2024-05-06T03:41:19.869Z level=INFO source=routes.go:1071 msg="Listening on [::]:11434 (version 0.1.33)"
time=2024-05-06T03:41:19.869Z level=INFO source=payload.go:30 msg="extracting embedded files" dir=/tmp/ollama2573039759/runners
time=2024-05-06T03:41:22.433Z level=INFO source=payload.go:44 msg="Dynamic LLM libraries [cpu cpu_avx cpu_avx2 cuda_v11 rocm_v60002]"
time=2024-05-06T03:41:22.433Z level=INFO source=gpu.go:96 msg="Detecting GPUs"
time=2024-05-06T03:41:22.451Z level=INFO source=gpu.go:101 msg="detected GPUs" library=/tmp/ollama2573039759/runners/cuda_v11/libcudart.so.11.0 count=1
time=2024-05-06T03:41:22.451Z level=INFO source=cpu_common.go:11 msg="CPU has AVX2"
[GIN] 2024/05/06 - 03:41:28 | 200 | 987.321µs | 172.31.0.3 | GET "/api/tags"
[GIN] 2024/05/06 - 03:42:08 | 200 | 26.46µs | 172.31.0.3 | HEAD "/"
time=2024-05-06T03:42:09.454Z level=INFO source=gpu.go:96 msg="Detecting GPUs"
time=2024-05-06T03:42:09.455Z level=INFO source=gpu.go:101 msg="detected GPUs" library=/tmp/ollama2573039759/runners/cuda_v11/libcudart.so.11.0 count=1
time=2024-05-06T03:42:09.455Z level=INFO source=cpu_common.go:11 msg="CPU has AVX2"
[GIN] 2024/05/06 - 03:42:09 | 500 | 8.230029ms | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 10.10913ms | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 10.749417ms | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 10.584035ms | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 10.605395ms | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 8.460011ms | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 12.357654ms | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 9.700255ms | 
172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 14.556828ms | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 10.923489ms | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 11.02383ms | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 8.948937ms | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 12.958241ms | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 9.611174ms | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 13.943531ms | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 14.973492ms | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 9.16863ms | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 13.622578ms | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 13.919721ms | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 12.753208ms | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 9.541804ms | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 10.020909ms | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 9.947398ms | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 10.506104ms | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 15.62303ms | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 15.928193ms | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 11.441054ms | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 10.681876ms | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 10.864588ms | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 11.03065ms | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 14.640449ms | 172.31.0.3 | POST 
"/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 12.651588ms | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 11.251452ms | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 13.212543ms | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 12.967791ms | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 11.02583ms | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 15.718351ms | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 13.630348ms | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 12.88933ms | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 250.563µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 10.009018ms | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 232.693µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 11.502385ms | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 245.902µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 253.753µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 236.622µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 235.092µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 9.975038ms | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 13.177474ms | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 561.456µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 15.572489ms | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 582.626µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 464.075µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 221.273µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 
03:42:09 | 500 | 233.212µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 683.538µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 232.653µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 219.832µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 259.293µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 580.677µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 692.468µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 673.767µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 222.372µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 252.843µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 251.503µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 238.452µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 242.942µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 652.617µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 260.383µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 277.993µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 439.005µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 984.34µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 816.188µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 648.887µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 1.330125ms | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 636.807µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 686.968µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 619.467µs | 172.31.0.3 | POST "/api/embeddings" 
[GIN] 2024/05/06 - 03:42:09 | 500 | 540.676µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 471.395µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 252.843µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 1.296454ms | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 1.635488ms | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 1.480786ms | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 878.16µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 1.754529ms | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 908.44µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 1.595487ms | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 1.86153ms | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 890.3µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 624.357µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 2.418407ms | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 9.667215ms | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 986.871µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 207.442µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 3.952722ms | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 262.813µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 214.743µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 263.193µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 266.093µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 253.553µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 1.466095ms | 172.31.0.3 | 
POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 3.892642ms | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 203.302µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 4.389467ms | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 271.003µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 314.734µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 182.091µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 3.76888ms | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 234.573µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 4.6423ms | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 276.903µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 3.979223ms | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 231.462µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 3.099563ms | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 206.312µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 185.382µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 5.185396ms | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 5.216147ms | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 240.133µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 248.173µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 3.652139ms | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 4.074614ms | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 256.633µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 226.752µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 
4.294386ms | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 4.672671ms | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 228.393µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 4.402437ms | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 255.933µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 240.402µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 2.491716ms | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 237.683µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 259.342µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 247.593µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 4.311127ms | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 4.60346ms | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 236.523µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 211.873µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 234.602µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 234.972µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 237.942µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 243.932µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 224.063µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 5.032245ms | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 4.809062ms | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 4.429037ms | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 2.328495ms | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 229.123µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 
2024/05/06 - 03:42:09 | 500 | 225.262µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 205.542µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 249.923µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 4.996274ms | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 2.625498ms | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 233.383µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 243.553µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 5.037185ms | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 324.014µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 1.155643ms | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 222.452µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 233.853µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 225.182µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 237.053µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 219.592µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 209.382µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 199.022µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 224.922µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 246.113µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 199.722µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 234.312µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 202.502µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 280.463µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 236.402µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 201.652µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 233.443µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 231.353µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 202.002µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 210.762µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 199.662µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 187.272µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 229.683µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 223.222µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 222.452µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 222.783µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 242.383µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 172.362µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 211.383µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 176.451µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 335.254µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 247.083µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 186.083µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 253.233µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 176.582µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 220.912µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 230.862µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 198.242µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 206.943µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 183.562µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 212.112µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 176.132µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 197.913µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 176.892µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 210.522µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 275.583µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 199.942µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 238.832µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 249.183µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 186.192µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 209.142µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 200.092µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 240.613µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 211.542µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 207.982µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 210.432µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 253.593µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 436.604µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 431.545µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 312.113µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 296.834µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 223.612µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 488.325µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 463.765µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 266.023µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 327.453µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 524.315µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 304.403µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 328.674µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 267.253µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 285.043µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 281.983µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 279.743µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 292.903µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 431.665µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 329.803µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 325.023µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 527.225µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 496.526µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 219.883µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 196.732µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 235.863µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 204.892µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 214.072µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 202.263µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 193.132µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 173.842µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 197.773µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 182.662µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 187.482µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 312.954µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 256.382µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 272.753µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 232.112µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 285.163µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 236.473µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 266.693µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 231.703µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 209.133µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 253.313µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 210.203µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 190.463µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 203.412µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 217.803µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 192.322µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 174.192µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 210.563µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 185.812µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 177.422µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 203.492µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 162.082µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 153.102µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 196.862µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 178.842µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 218.352µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 213.502µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 218.983µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 220.233µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 190.492µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 171.202µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 217.633µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 169.292µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 199.333µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 226.092µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 176.842µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 206.962µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 205.343µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 216.022µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 203.172µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 204.792µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 208.242µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 204.882µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 174.092µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 194.193µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 210.532µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 211.633µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 192.523µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 166.742µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 164.272µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 215.533µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 178.262µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 162.982µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 168.232µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 218.633µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 170.251µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 176.012µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 164.672µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 210.452µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 165.062µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 169.782µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 176.342µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 218.712µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 200.973µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 211.662µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 183.362µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 207.512µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 209.942µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 206.602µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 191.482µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 200.613µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 162.572µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 210.843µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 160.011µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 181.682µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 203.982µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 219.212µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 170.842µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 233.922µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 172.592µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 198.393µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 204.932µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 218.052µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 219.292µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 212.342µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 163.352µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 180.922µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 192.252µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 313.894µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 202.942µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 194.262µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 209.463µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 215.042µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 164.332µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 185.192µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 318.393µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 173.822µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 181.492µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 183.062µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 196.943µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 196.452µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 516.256µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 1.105302ms | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 480.945µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 336.164µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 249.923µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 491.285µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 419.905µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 228.602µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 214.882µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 243.932µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 331.073µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 688.957µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 214.973µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 315.614µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 244.293µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 195.082µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 224.733µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 180.982µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 190.832µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 207.692µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 175.862µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 188.203µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 278.693µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 200.303µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 219.803µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 253.752µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 186.222µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 174.122µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 407.074µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 310.463µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 312.093µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 251.112µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 355.004µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 270.233µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 290.884µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 293.173µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 444.335µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 285.093µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 515.515µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 804.039µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 359.204µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 305.583µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 1.673598ms | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 353.534µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 946.38µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 747.438µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 555.396µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 606.906µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 257.552µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 535.535µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 243.703µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 239.083µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 204.122µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 207.682µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 178.642µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 181.462µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 190.472µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 169.452µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 184.302µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 209.092µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 200.222µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 190.452µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 207.722µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 251.392µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 170.502µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 212.322µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 168.002µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 212.963µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 208.802µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 182.782µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 173.492µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 190.482µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 239.053µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 164.142µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 211.622µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 205.452µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 230.083µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 194.132µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 206.013µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 221.462µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 184.062µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 198.012µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 213.982µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 176.722µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 210.942µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 188.292µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 195.442µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 220.203µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 183.602µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 205.552µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 186.542µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 217.723µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 188.242µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 226.352µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 259.732µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 190.132µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 185.932µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 252.723µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 189.342µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 195.842µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 178.712µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 192.022µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 186.202µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 209.422µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 210.032µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 258.263µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 179.102µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 198.992µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 260.123µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 214.302µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 211.283µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 199.092µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 209.163µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 239.302µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 205.393µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 221.792µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 202.472µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 187.642µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 250.143µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 192.882µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 209.382µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 194.302µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 174.492µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 204.293µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 208.912µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 166.522µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 226.193µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 169.381µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 203.492µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 195.662µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 169.712µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 199.143µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 193.372µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 204.082µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 178.922µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 190.112µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 159.701µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 220.632µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 170.051µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 162.182µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 240.093µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 167.572µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 177.081µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 174.602µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 239.493µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 169.952µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 201.522µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 162.132µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 178.892µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 179.182µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 180.132µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 187.512µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 209.872µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 182.812µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 233.043µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 194.482µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 167.871µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 165.882µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 170.012µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 184.932µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 241.762µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 152.812µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 165.112µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 169.412µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 230.433µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 165.462µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 172.861µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 196.132µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 167.412µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 173.922µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 234.852µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 180.982µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 188.972µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 230.433µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 183.952µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 162.012µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 159.892µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 167.152µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 197.602µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 205.503µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 160.011µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 175.132µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 171.252µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 180.102µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 471.095µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 282.523µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 746.319µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 281.703µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 341.264µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 523.715µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 1.305104ms | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 278.393µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 638.527µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 512.265µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 295.443µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 280.113µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 275.073µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 283.753µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 287.574µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 294.764µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 303.464µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 336.764µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 399.294µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 263.813µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 533.826µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 492.575µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 416.095µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 200.162µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 256.913µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 230.363µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 219.962µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 180.882µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 249.113µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 213.752µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 211.363µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 217.162µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 209.002µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 213.623µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 191.193µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 195.322µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 236.403µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 181.312µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 217.912µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 176.482µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 193.432µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 210.072µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 179.892µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 178.082µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 183.092µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 206.252µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 190.812µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 190.032µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 235.363µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 183.822µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 219.542µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 243.733µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 228.633µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 185.252µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 240.903µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 184.002µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 209.922µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 213.493µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 280.413µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 536.326µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 260.562µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 197.422µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 555.256µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 686.718µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 538.376µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 488.695µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 357.914µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 297.763µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 671.237µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 337.674µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 448.735µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 324.943µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 631.497µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 309.393µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 305.854µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 420.315µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 1.468256ms | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 392.484µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 355.134µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 249.773µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 424.474µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 262.952µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 716.508µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 629.597µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 718.738µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 553.596µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 253.253µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 250.683µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 337.294µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 463.465µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 198.082µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 356.604µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 233.633µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 196.942µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 418.585µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 264.282µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 171.962µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 317.943µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 175.372µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 219.302µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 202.922µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 162.522µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 185.302µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 203.352µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 177.432µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 175.191µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 193.772µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 175.472µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 185.002µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 432.794µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 172.102µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 216.173µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 190.912µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 185.282µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 261.463µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 203.193µs | 172.31.0.3 | POST
"/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 218.272µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 186.532µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 423.534µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 259.343µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 186.292µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 175.572µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 307.593µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 317.014µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 209.902µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 217.492µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 220.132µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 271.703µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 308.994µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 316.724µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 217.352µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 239.043µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 256.522µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 180.562µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 351.854µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 257.493µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 175.302µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 424.174µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 323.044µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 326.923µs | 
172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 318.054µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 233.473µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 238.193µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 249.632µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 226.352µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 297.133µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 234.413µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 227.222µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 216.462µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 195.702µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 341.004µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 214.022µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 211.823µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 308.553µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 206.712µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 225.432µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 167.592µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 412.374µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 270.583µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 179.152µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 202.792µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 316.473µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 284.383µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 
500 | 184.272µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 180.452µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 289.593µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 180.392µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 326.964µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 178.322µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 258.333µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 213.692µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 200.202µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 315.004µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 249.922µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 212.673µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 238.043µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 202.012µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 240.663µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 187.302µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 209.852µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 193.672µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 181.942µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 175.632µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 299.953µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 188.142µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 229.323µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 185.452µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 
2024/05/06 - 03:42:09 | 500 | 199.412µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 290.583µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 185.122µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 201.272µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 185.802µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 264.933µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 181.872µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 194.752µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 175.012µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 242.403µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 181.652µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 179.282µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 240.033µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 172.772µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 174.572µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 175.392µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 279.743µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 165.972µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 202.312µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 182.852µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 249.162µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 200.632µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 197.772µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 313.693µs | 172.31.0.3 | POST 
"/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 220.253µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 202.253µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 193.563µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 319.763µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 227.803µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 340.804µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 236.863µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 229.682µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 201.972µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 394.924µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 214.853µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 359.564µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 491.735µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 299.434µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 312.834µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 421.274µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 362.344µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 297.133µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 416.064µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 290.833µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 381.624µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 288.163µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 588.806µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 1.195453ms | 
172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 644.217µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 639.957µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 847.919µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 526.796µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 600.707µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 529.516µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 475.475µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 659.267µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 376.654µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 768.628µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 568.076µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 848.449µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 1.235403ms | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 660.447µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 555.386µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 502.995µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 995.511µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 611.097µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 536.496µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 610.657µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 612.316µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 454.254µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 598.737µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 
500 | 669.237µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 624.067µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 664.228µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 857.769µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 893.87µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 685.747µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 600.127µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 280.183µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 294.963µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 388.974µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 281.483µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 344.914µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 355.914µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 264.703µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 269.573µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 271.763µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 333.443µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 270.873µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 226.962µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 232.632µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 260.263µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 266.753µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 213.413µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 308.013µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 
2024/05/06 - 03:42:09 | 500 | 284.054µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 221.152µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 245.143µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 211.292µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 207.452µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 296.093µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 207.692µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 208.882µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 220.083µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 200.442µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 325.594µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 267.792µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 193.542µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 223.123µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 391.804µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 343.153µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 273.803µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 210.942µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 173.152µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 192.592µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 217.192µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 204.552µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 206.182µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 249.583µs | 172.31.0.3 | POST 
"/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 211.462µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 180.012µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 254.343µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 215.853µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 220.712µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 234.743µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 232.562µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 190.212µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 272.323µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 188.352µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 210.143µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 250.423µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 187.552µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 173.582µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 174.392µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 222.142µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 262.343µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 178.762µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 214.712µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 249.272µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 203.642µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 165.141µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 184.812µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 191.763µs | 
172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 176.152µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 183.312µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 173.052µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 171.702µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 184.002µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 172.092µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 172.802µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 203.652µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 173.272µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 190.092µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 177.442µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 244.813µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 181.812µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 231.452µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 190.032µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 202.493µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 250.773µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 227.783µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 220.792µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 224.702µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 201.152µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 220.312µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 204.302µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 
500 | 192.592µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 232.543µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 213.422µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 191.622µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 183.682µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 195.442µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 183.432µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 180.052µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 238.682µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 194.952µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 212.412µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 221.843µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 195.642µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 167.412µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 217.062µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 215.243µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 230.482µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 180.952µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 204.422µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 209.622µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 220.612µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 171.142µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 189.492µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 185.142µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 
2024/05/06 - 03:42:09 | 500 | 169.452µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 193.382µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 200.802µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 168.142µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 180.892µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 221.532µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 163.822µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 194.552µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 223.653µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 181.652µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 215.702µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 222.472µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 221.492µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 206.512µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 212.943µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 175.112µs | 172.31.0.3 | POST "/api/embeddings"
time=2024-05-06T03:42:09.736Z level=INFO source=memory.go:152 msg="offload to gpu" layers.real=-1 layers.estimate=25 memory.available="15943.7 MiB" memory.required.full="1104.8 MiB" memory.required.partial="1104.8 MiB" memory.required.kv="3.0 MiB" memory.weights.total="636.8 MiB" memory.weights.repeating="577.2 MiB" memory.weights.nonrepeating="59.6 MiB" memory.graph.full="8.0 MiB" memory.graph.partial="8.0 MiB"
[GIN] 2024/05/06 - 03:42:09 | 500 | 181.402µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 184.762µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 188.542µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 184.112µs | 172.31.0.3 | POST "/api/embeddings"
time=2024-05-06T03:42:09.737Z level=INFO source=memory.go:152 msg="offload to gpu" layers.real=-1 layers.estimate=25 memory.available="15943.7 MiB" memory.required.full="1104.8 MiB" memory.required.partial="1104.8 MiB" memory.required.kv="3.0 MiB" memory.weights.total="636.8 MiB" memory.weights.repeating="577.2 MiB" memory.weights.nonrepeating="59.6 MiB" memory.graph.full="8.0 MiB" memory.graph.partial="8.0 MiB"
time=2024-05-06T03:42:09.737Z level=INFO source=cpu_common.go:11 msg="CPU has AVX2"
[GIN] 2024/05/06 - 03:42:09 | 500 | 175.282µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 179.502µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 190.752µs | 172.31.0.3 | POST "/api/embeddings"
time=2024-05-06T03:42:09.737Z level=INFO source=server.go:289 msg="starting llama server" cmd="/tmp/ollama2573039759/runners/cuda_v11/ollama_llama_server --model /root/.ollama/models/blobs/sha256-819c2adf5ce6df2b6bd2ae4ca90d2a69f060afeb438d0c171db57daa02e39c3d --ctx-size 512 --batch-size 512 --embedding --log-disable --n-gpu-layers 25 --parallel 1 --port 42805"
[GIN] 2024/05/06 - 03:42:09 | 500 | 474.546µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 294.263µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 287.983µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 414.444µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 280.913µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:42:09 | 500 | 493.036µs | 172.31.0.3 | POST "/api/embeddings"
time=2024-05-06T03:42:09.738Z level=INFO source=sched.go:340 msg="loaded runners" count=1
time=2024-05-06T03:42:09.738Z level=INFO source=server.go:432 msg="waiting for llama runner to start responding"
[GIN] 2024/05/06 - 03:42:09 | 500 | 387.854µs | 172.31.0.3 | POST
"/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 274.683µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 342.984µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 339.554µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 365.454µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 539.396µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 668.978µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 397.824µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 1.114012ms | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 322.673µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 572.246µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 424.065µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 487.205µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 331.803µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 691.488µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 587.126µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 254.773µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 718.478µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 640.077µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 1.071441ms | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 599.956µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 946.04µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 576.916µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 212.233µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 230.943µs | 
172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 267.773µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 208.052µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 193.472µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 193.082µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 197.732µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 229.563µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 244.932µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 289.123µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 187.942µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 187.042µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 199.162µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 254.752µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 186.482µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 213.642µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 227.353µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 216.783µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 187.132µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 246.483µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 176.171µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 184.082µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 209.462µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 202.992µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 214.613µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 
500 | 189.272µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 205.312µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 242.483µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 187.112µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 187.212µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 180.152µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 224.212µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 181.582µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 187.562µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 238.812µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 194.882µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 186.792µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 199.582µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 219.933µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 184.912µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 187.132µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 207.503µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 180.892µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 189.902µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 211.902µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 225.913µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 199.992µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 190.292µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 254.173µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 
2024/05/06 - 03:42:09 | 500 | 218.802µs | 172.31.0.3 | POST "/api/embeddings" {"function":"server_params_parse","level":"INFO","line":2606,"msg":"logging to file is disabled.","tid":"124309595705344","timestamp":1714966929} [GIN] 2024/05/06 - 03:42:09 | 500 | 185.222µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 218.722µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 240.213µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 168.621µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 212.202µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 166.551µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 194.462µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 263.053µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 217.503µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 211.022µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 194.033µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 162.742µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 198.582µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 241.843µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 229.602µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 195.072µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 184.122µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 264.273µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 228.962µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 207.082µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 189.562µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 
2024/05/06 - 03:42:09 | 500 | 196.312µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 200.142µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 204.342µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 177.272µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 220.682µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 185.752µs | 172.31.0.3 | POST "/api/embeddings" {"build":1,"commit":"952d03d","function":"main","level":"INFO","line":2822,"msg":"build info","tid":"124309595705344","timestamp":1714966929} {"function":"main","level":"INFO","line":2825,"msg":"system info","n_threads":8,"n_threads_batch":-1,"system_info":"AVX = 1 | AVX_VNNI = 0 | AVX2 = 0 | AVX512 = 0 | AVX512_VBMI = 0 | AVX512_VNNI = 0 | FMA = 0 | NEON = 0 | ARM_FMA = 0 | F16C = 0 | FP16_VA = 0 | WASM_SIMD = 0 | BLAS = 1 | SSE3 = 1 | SSSE3 = 1 | VSX = 0 | MATMUL_INT8 = 0 | LLAMAFILE = 1 | ","tid":"124309595705344","timestamp":1714966929,"total_threads":16} [GIN] 2024/05/06 - 03:42:09 | 500 | 199.382µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 176.762µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 211.462µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 233.193µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 210.423µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 210.252µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 198.972µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 211.072µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 220.642µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 198.032µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 199.242µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 
2024/05/06 - 03:42:09 | 500 | 192.572µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 190.112µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 192.122µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 189.592µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 192.792µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 173.202µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 238.972µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 172.912µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 192.262µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 552.546µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 182.242µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 207.402µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 193.772µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 201.682µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 217.492µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 206.923µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 205.852µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 224.682µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 214.043µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 215.442µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 210.672µs | 172.31.0.3 | POST "/api/embeddings" llama_model_loader: loaded meta data with 23 key-value pairs and 389 tensors from /root/.ollama/models/blobs/sha256-819c2adf5ce6df2b6bd2ae4ca90d2a69f060afeb438d0c171db57daa02e39c3d (version GGUF V3 (latest)) llama_model_loader: Dumping 
metadata keys/values. Note: KV overrides do not apply in this output. llama_model_loader: - kv 0: general.architecture str = bert llama_model_loader: - kv 1: general.name str = mxbai-embed-large-v1 llama_model_loader: - kv 2: bert.block_count u32 = 24 llama_model_loader: - kv 3: bert.context_length u32 = 512 llama_model_loader: - kv 4: bert.embedding_length u32 = 1024 llama_model_loader: - kv 5: bert.feed_forward_length u32 = 4096 [GIN] 2024/05/06 - 03:42:09 | 500 | 217.482µs | 172.31.0.3 | POST "/api/embeddings" llama_model_loader: - kv 6: bert.attention.head_count u32 = 16 llama_model_loader: - kv 7: bert.attention.layer_norm_epsilon f32 = 0.000000 llama_model_loader: - kv 8: general.file_type u32 = 1 llama_model_loader: - kv 9: bert.attention.causal bool = false llama_model_loader: - kv 10: bert.pooling_type u32 = 2 llama_model_loader: - kv 11: tokenizer.ggml.token_type_count u32 = 2 llama_model_loader: - kv 12: tokenizer.ggml.bos_token_id u32 = 101 llama_model_loader: - kv 13: tokenizer.ggml.eos_token_id u32 = 102 llama_model_loader: - kv 14: tokenizer.ggml.model str = bert [GIN] 2024/05/06 - 03:42:09 | 500 | 209.412µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 190.372µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 208.612µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 208.803µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 221.772µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 189.862µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 178.082µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 300.663µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 224.922µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 182.292µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 195.282µs | 172.31.0.3 | 
POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 177.432µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 224.293µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 174.572µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 195.832µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 205.592µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 185.632µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 477.816µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 473.465µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 314.513µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 489.285µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 303.623µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 336.723µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 908.81µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 344.074µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 312.533µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 310.673µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 307.384µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 305.363µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 650.687µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 573.626µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 544.506µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 245.352µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 528.356µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 235.733µs 
| 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 260.013µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 266.833µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 217.602µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 186.772µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 228.402µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 188.162µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 200.862µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 201.152µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 205.032µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 226.683µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 253.822µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 217.792µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 194.772µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 189.322µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 234.482µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 213.002µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 221.452µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 244.653µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 252.572µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 215.423µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 207.072µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 186.832µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 173.262µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 
500 | 184.452µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 179.152µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 182.142µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 180.452µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 178.312µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 208.832µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 225.152µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 168.072µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 195.692µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 176.692µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 174.751µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 184.822µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 171.922µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 179.362µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 180.942µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 231.842µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 201.442µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 189.462µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 211.563µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 192.782µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 181.042µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 216.532µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 192.113µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 179.602µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 
2024/05/06 - 03:42:09 | 500 | 219.642µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 197.012µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 233.273µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 181.871µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 220.062µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 176.121µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 170.302µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 172.302µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 193.732µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 232.623µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 220.982µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 176.982µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 174.072µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 223.532µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 162.902µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 172.832µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 179.801µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 171.782µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 225.862µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 173.562µs | 172.31.0.3 | POST "/api/embeddings" llama_model_loader: - kv 15: tokenizer.ggml.tokens arr[str,30522] = ["[PAD]", "[unused0]", "[unused1]", "... 
[GIN] 2024/05/06 - 03:42:09 | 500 | 169.612µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 189.002µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 208.782µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 171.712µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 177.052µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 170.392µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 200.532µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 207.822µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 273.653µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 216.052µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 253.503µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 262.513µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 239.542µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 200.432µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 226.542µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 178.342µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 166.942µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 214.082µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 184.522µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 201.042µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 208.312µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 167.341µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 194.462µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 260.853µs | 172.31.0.3 | POST 
"/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 175.911µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 254.103µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 210.852µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 207.382µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 224.923µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 236.642µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 181.492µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 232.913µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 545.376µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 205.932µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 226.453µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 346.894µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 234.463µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 229.292µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 235.063µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 246.773µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 202.803µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 209.312µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 225.593µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 215.263µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 210.363µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 209.102µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 199.343µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 219.982µs | 
172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 240.052µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 225.713µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 183.022µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 438.605µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 458.035µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 433.355µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 293.383µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 286.973µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 310.264µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 291.593µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 337.873µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 611.196µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 324.454µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 336.573µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 311.674µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 264.543µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 412.795µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 348.944µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 299.673µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 281.263µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 456.134µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 289.073µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 285.153µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 
500 | 1.132323ms | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 927.47µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 455.635µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 608.037µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 538.526µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 501.565µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 193.012µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 450.795µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 234.822µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 251.193µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 196.342µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:09 | 500 | 217.922µs | 172.31.0.3 | POST "/api/embeddings" llama_model_loader: - kv 16: tokenizer.ggml.scores arr[f32,30522] = [-1000.000000, -1000.000000, -1000.00... llama_model_loader: - kv 17: tokenizer.ggml.token_type arr[i32,30522] = [3, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, ... llama_model_loader: - kv 18: tokenizer.ggml.unknown_token_id u32 = 100 llama_model_loader: - kv 19: tokenizer.ggml.seperator_token_id u32 = 102 llama_model_loader: - kv 20: tokenizer.ggml.padding_token_id u32 = 0 llama_model_loader: - kv 21: tokenizer.ggml.cls_token_id u32 = 101 llama_model_loader: - kv 22: tokenizer.ggml.mask_token_id u32 = 103 llama_model_loader: - type f32: 243 tensors llama_model_loader: - type f16: 146 tensors llm_load_vocab: mismatch in special tokens definition ( 7104/30522 vs 5/30522 ). 
llm_load_print_meta: format = GGUF V3 (latest)
llm_load_print_meta: arch = bert
llm_load_print_meta: vocab type = WPM
llm_load_print_meta: n_vocab = 30522
llm_load_print_meta: n_merges = 0
llm_load_print_meta: n_ctx_train = 512
llm_load_print_meta: n_embd = 1024
llm_load_print_meta: n_head = 16
llm_load_print_meta: n_head_kv = 16
llm_load_print_meta: n_layer = 24
llm_load_print_meta: n_rot = 64
llm_load_print_meta: n_embd_head_k = 64
llm_load_print_meta: n_embd_head_v = 64
llm_load_print_meta: n_gqa = 1
llm_load_print_meta: n_embd_k_gqa = 1024
llm_load_print_meta: n_embd_v_gqa = 1024
llm_load_print_meta: f_norm_eps = 1.0e-12
llm_load_print_meta: f_norm_rms_eps = 0.0e+00
llm_load_print_meta: f_clamp_kqv = 0.0e+00
llm_load_print_meta: f_max_alibi_bias = 0.0e+00
llm_load_print_meta: f_logit_scale = 0.0e+00
llm_load_print_meta: n_ff = 4096
llm_load_print_meta: n_expert = 0
llm_load_print_meta: n_expert_used = 0
llm_load_print_meta: causal attn = 0
llm_load_print_meta: pooling type = 2
llm_load_print_meta: rope type = 2
llm_load_print_meta: rope scaling = linear
llm_load_print_meta: freq_base_train = 10000.0
llm_load_print_meta: freq_scale_train = 1
llm_load_print_meta: n_yarn_orig_ctx = 512
llm_load_print_meta: rope_finetuned = unknown
llm_load_print_meta: ssm_d_conv = 0
llm_load_print_meta: ssm_d_inner = 0
llm_load_print_meta: ssm_d_state = 0
llm_load_print_meta: ssm_dt_rank = 0
llm_load_print_meta: model type = ?B
llm_load_print_meta: model ftype = F16
llm_load_print_meta: model params = 334.09 M
llm_load_print_meta: model size = 637.85 MiB (16.02 BPW)
llm_load_print_meta: general.name = mxbai-embed-large-v1
llm_load_print_meta: BOS token = 101 '[CLS]'
llm_load_print_meta: EOS token = 102 '[SEP]'
llm_load_print_meta: UNK token = 100 '[UNK]'
llm_load_print_meta: SEP token = 102 '[SEP]'
llm_load_print_meta: PAD token = 0 '[PAD]'
llm_load_print_meta: CLS token = 101 '[CLS]'
llm_load_print_meta: MASK token = 103 '[MASK]'
llm_load_print_meta: LF token = 0 '[PAD]'
ggml_cuda_init: GGML_CUDA_FORCE_MMQ: yes ggml_cuda_init: CUDA_USE_TENSOR_CORES: no ggml_cuda_init: found 1 CUDA devices: Device 0: NVIDIA GeForce RTX 4060 Ti, compute capability 8.9, VMM: yes llm_load_tensors: ggml ctx size = 0.35 MiB llm_load_tensors: offloading 24 repeating layers to GPU llm_load_tensors: offloading non-repeating layers to GPU llm_load_tensors: offloaded 25/25 layers to GPU llm_load_tensors: CPU buffer size = 60.62 MiB llm_load_tensors: CUDA0 buffer size = 577.23 MiB ................................................................................ llama_new_context_with_model: n_ctx = 512 llama_new_context_with_model: n_batch = 512 llama_new_context_with_model: n_ubatch = 512 llama_new_context_with_model: freq_base = 10000.0 llama_new_context_with_model: freq_scale = 1 llama_kv_cache_init: CUDA0 KV buffer size = 48.00 MiB llama_new_context_with_model: KV self size = 48.00 MiB, K (f16): 24.00 MiB, V (f16): 24.00 MiB llama_new_context_with_model: CPU output buffer size = 0.00 MiB llama_new_context_with_model: CUDA0 compute buffer size = 25.01 MiB llama_new_context_with_model: CUDA_Host compute buffer size = 5.01 MiB llama_new_context_with_model: graph nodes = 849 llama_new_context_with_model: graph splits = 2 {"function":"initialize","level":"INFO","line":448,"msg":"initializing slots","n_slots":1,"tid":"124309595705344","timestamp":1714966930} {"function":"initialize","level":"INFO","line":457,"msg":"new slot","n_ctx_slot":512,"slot_id":0,"tid":"124309595705344","timestamp":1714966930} {"function":"main","level":"INFO","line":3067,"msg":"model loaded","tid":"124309595705344","timestamp":1714966930} {"function":"main","hostname":"127.0.0.1","level":"INFO","line":3270,"msg":"HTTP server listening","n_threads_http":"15","port":"42805","tid":"124309595705344","timestamp":1714966930} {"function":"update_slots","level":"INFO","line":1581,"msg":"all slots are idle and system prompt is empty, clear the KV 
cache","tid":"124309595705344","timestamp":1714966930} {"function":"process_single_task","level":"INFO","line":1509,"msg":"slot data","n_idle_slots":1,"n_processing_slots":0,"task_id":0,"tid":"124309595705344","timestamp":1714966930} {"function":"process_single_task","level":"INFO","line":1509,"msg":"slot data","n_idle_slots":1,"n_processing_slots":0,"task_id":1,"tid":"124309595705344","timestamp":1714966930} {"function":"log_server_request","level":"INFO","line":2737,"method":"GET","msg":"request","params":{},"path":"/health","remote_addr":"127.0.0.1","remote_port":58234,"status":200,"tid":"124308353449984","timestamp":1714966930} {"function":"process_single_task","level":"INFO","line":1509,"msg":"slot data","n_idle_slots":1,"n_processing_slots":0,"task_id":2,"tid":"124309595705344","timestamp":1714966930} {"function":"log_server_request","level":"INFO","line":2737,"method":"GET","msg":"request","params":{},"path":"/health","remote_addr":"127.0.0.1","remote_port":58230,"status":200,"tid":"124308363935744","timestamp":1714966930} {"function":"process_single_task","level":"INFO","line":1509,"msg":"slot data","n_idle_slots":1,"n_processing_slots":0,"task_id":3,"tid":"124309595705344","timestamp":1714966930} {"function":"log_server_request","level":"INFO","line":2737,"method":"GET","msg":"request","params":{},"path":"/health","remote_addr":"127.0.0.1","remote_port":58254,"status":200,"tid":"124308219232256","timestamp":1714966930} {"function":"process_single_task","level":"INFO","line":1509,"msg":"slot data","n_idle_slots":1,"n_processing_slots":0,"task_id":4,"tid":"124309595705344","timestamp":1714966930} {"function":"log_server_request","level":"INFO","line":2737,"method":"GET","msg":"request","params":{},"path":"/health","remote_addr":"127.0.0.1","remote_port":58240,"status":200,"tid":"124308342964224","timestamp":1714966930} 
{"function":"log_server_request","level":"INFO","line":2737,"method":"GET","msg":"request","params":{},"path":"/health","remote_addr":"127.0.0.1","remote_port":58270,"status":200,"tid":"124308229718016","timestamp":1714966930} {"function":"process_single_task","level":"INFO","line":1509,"msg":"slot data","n_idle_slots":1,"n_processing_slots":0,"task_id":5,"tid":"124309595705344","timestamp":1714966930} {"function":"log_server_request","level":"INFO","line":2737,"method":"GET","msg":"request","params":{},"path":"/health","remote_addr":"127.0.0.1","remote_port":58270,"status":200,"tid":"124308229718016","timestamp":1714966930} {"function":"process_single_task","level":"INFO","line":1509,"msg":"slot data","n_idle_slots":1,"n_processing_slots":0,"task_id":6,"tid":"124309595705344","timestamp":1714966930} {"function":"log_server_request","level":"INFO","line":2737,"method":"GET","msg":"request","params":{},"path":"/health","remote_addr":"127.0.0.1","remote_port":58280,"status":200,"tid":"124308208746496","timestamp":1714966930} {"function":"process_single_task","level":"INFO","line":1509,"msg":"slot data","n_idle_slots":1,"n_processing_slots":0,"task_id":7,"tid":"124309595705344","timestamp":1714966930} {"function":"log_server_request","level":"INFO","line":2737,"method":"GET","msg":"request","params":{},"path":"/health","remote_addr":"127.0.0.1","remote_port":58280,"status":200,"tid":"124308208746496","timestamp":1714966930} {"function":"process_single_task","level":"INFO","line":1509,"msg":"slot data","n_idle_slots":1,"n_processing_slots":0,"task_id":8,"tid":"124309595705344","timestamp":1714966930} {"function":"launch_slot_with_data","level":"INFO","line":830,"msg":"slot is processing task","slot_id":0,"task_id":9,"tid":"124309595705344","timestamp":1714966930} 
{"function":"log_server_request","level":"INFO","line":2737,"method":"GET","msg":"request","params":{},"path":"/health","remote_addr":"127.0.0.1","remote_port":58270,"status":200,"tid":"124308229718016","timestamp":1714966930} {"function":"update_slots","level":"INFO","line":1839,"msg":"kv cache rm [p0, end)","p0":0,"slot_id":0,"task_id":9,"tid":"124309595705344","timestamp":1714966930} {"function":"process_single_task","level":"INFO","line":1509,"msg":"slot data","n_idle_slots":0,"n_processing_slots":1,"task_id":11,"tid":"124309595705344","timestamp":1714966930} {"function":"update_slots","level":"INFO","line":1643,"msg":"slot released","n_cache_tokens":414,"n_ctx":512,"n_past":414,"n_system_tokens":0,"slot_id":0,"task_id":9,"tid":"124309595705344","timestamp":1714966930,"truncated":false} {"function":"log_server_request","level":"INFO","line":2737,"method":"GET","msg":"request","params":{},"path":"/health","remote_addr":"127.0.0.1","remote_port":58270,"status":200,"tid":"124308229718016","timestamp":1714966930} {"function":"log_server_request","level":"INFO","line":2737,"method":"POST","msg":"request","params":{},"path":"/embedding","remote_addr":"127.0.0.1","remote_port":58280,"status":200,"tid":"124308208746496","timestamp":1714966930} {"function":"process_single_task","level":"INFO","line":1509,"msg":"slot data","n_idle_slots":1,"n_processing_slots":0,"task_id":13,"tid":"124309595705344","timestamp":1714966930} [GIN] 2024/05/06 - 03:42:10 | 200 | 1.416174181s | 172.31.0.3 | POST "/api/embeddings" {"function":"log_server_request","level":"INFO","line":2737,"method":"GET","msg":"request","params":{},"path":"/health","remote_addr":"127.0.0.1","remote_port":58280,"status":200,"tid":"124308208746496","timestamp":1714966930} {"function":"process_single_task","level":"INFO","line":1509,"msg":"slot data","n_idle_slots":1,"n_processing_slots":0,"task_id":14,"tid":"124309595705344","timestamp":1714966930} 
{"function":"launch_slot_with_data","level":"INFO","line":830,"msg":"slot is processing task","slot_id":0,"task_id":15,"tid":"124309595705344","timestamp":1714966930} {"function":"log_server_request","level":"INFO","line":2737,"method":"GET","msg":"request","params":{},"path":"/health","remote_addr":"127.0.0.1","remote_port":58280,"status":200,"tid":"124308208746496","timestamp":1714966930} {"function":"update_slots","level":"INFO","line":1839,"msg":"kv cache rm [p0, end)","p0":0,"slot_id":0,"task_id":15,"tid":"124309595705344","timestamp":1714966930} {"function":"process_single_task","level":"INFO","line":1509,"msg":"slot data","n_idle_slots":0,"n_processing_slots":1,"task_id":17,"tid":"124309595705344","timestamp":1714966930} {"function":"update_slots","level":"INFO","line":1643,"msg":"slot released","n_cache_tokens":401,"n_ctx":512,"n_past":401,"n_system_tokens":0,"slot_id":0,"task_id":15,"tid":"124309595705344","timestamp":1714966930,"truncated":false} {"function":"log_server_request","level":"INFO","line":2737,"method":"POST","msg":"request","params":{},"path":"/embedding","remote_addr":"127.0.0.1","remote_port":58270,"status":200,"tid":"124308229718016","timestamp":1714966930} {"function":"log_server_request","level":"INFO","line":2737,"method":"GET","msg":"request","params":{},"path":"/health","remote_addr":"127.0.0.1","remote_port":58294,"status":200,"tid":"124308129054720","timestamp":1714966930} {"function":"process_single_task","level":"INFO","line":1509,"msg":"slot data","n_idle_slots":1,"n_processing_slots":0,"task_id":19,"tid":"124309595705344","timestamp":1714966930} {"function":"log_server_request","level":"INFO","line":2737,"method":"GET","msg":"request","params":{},"path":"/health","remote_addr":"127.0.0.1","remote_port":58294,"status":200,"tid":"124308129054720","timestamp":1714966930} {"function":"process_single_task","level":"INFO","line":1509,"msg":"slot 
data","n_idle_slots":1,"n_processing_slots":0,"task_id":20,"tid":"124309595705344","timestamp":1714966930} [GIN] 2024/05/06 - 03:42:10 | 200 | 1.484750624s | 172.31.0.3 | POST "/api/embeddings" {"function":"log_server_request","level":"INFO","line":2737,"method":"GET","msg":"request","params":{},"path":"/health","remote_addr":"127.0.0.1","remote_port":58310,"status":200,"tid":"124308118568960","timestamp":1714966930} {"function":"launch_slot_with_data","level":"INFO","line":830,"msg":"slot is processing task","slot_id":0,"task_id":21,"tid":"124309595705344","timestamp":1714966930} {"function":"update_slots","level":"INFO","line":1839,"msg":"kv cache rm [p0, end)","p0":0,"slot_id":0,"task_id":21,"tid":"124309595705344","timestamp":1714966930} {"function":"update_slots","level":"INFO","line":1643,"msg":"slot released","n_cache_tokens":419,"n_ctx":512,"n_past":419,"n_system_tokens":0,"slot_id":0,"task_id":21,"tid":"124309595705344","timestamp":1714966930,"truncated":false} {"function":"log_server_request","level":"INFO","line":2737,"method":"POST","msg":"request","params":{},"path":"/embedding","remote_addr":"127.0.0.1","remote_port":58310,"status":200,"tid":"124308118568960","timestamp":1714966930} {"function":"process_single_task","level":"INFO","line":1509,"msg":"slot data","n_idle_slots":1,"n_processing_slots":0,"task_id":24,"tid":"124309595705344","timestamp":1714966930} {"function":"log_server_request","level":"INFO","line":2737,"method":"GET","msg":"request","params":{},"path":"/health","remote_addr":"127.0.0.1","remote_port":58310,"status":200,"tid":"124308118568960","timestamp":1714966930} [GIN] 2024/05/06 - 03:42:10 | 200 | 1.507305959s | 172.31.0.3 | POST "/api/embeddings" {"function":"process_single_task","level":"INFO","line":1509,"msg":"slot data","n_idle_slots":1,"n_processing_slots":0,"task_id":25,"tid":"124309595705344","timestamp":1714966930} 
{"function":"log_server_request","level":"INFO","line":2737,"method":"GET","msg":"request","params":{},"path":"/health","remote_addr":"127.0.0.1","remote_port":58294,"status":200,"tid":"124308129054720","timestamp":1714966930} {"function":"launch_slot_with_data","level":"INFO","line":830,"msg":"slot is processing task","slot_id":0,"task_id":26,"tid":"124309595705344","timestamp":1714966930} {"function":"update_slots","level":"INFO","line":1839,"msg":"kv cache rm [p0, end)","p0":0,"slot_id":0,"task_id":26,"tid":"124309595705344","timestamp":1714966930} {"function":"process_single_task","level":"INFO","line":1509,"msg":"slot data","n_idle_slots":0,"n_processing_slots":1,"task_id":28,"tid":"124309595705344","timestamp":1714966931} {"function":"update_slots","level":"INFO","line":1643,"msg":"slot released","n_cache_tokens":394,"n_ctx":512,"n_past":394,"n_system_tokens":0,"slot_id":0,"task_id":26,"tid":"124309595705344","timestamp":1714966931,"truncated":false} {"function":"log_server_request","level":"INFO","line":2737,"method":"GET","msg":"request","params":{},"path":"/health","remote_addr":"127.0.0.1","remote_port":58294,"status":200,"tid":"124308129054720","timestamp":1714966931} {"function":"log_server_request","level":"INFO","line":2737,"method":"POST","msg":"request","params":{},"path":"/embedding","remote_addr":"127.0.0.1","remote_port":58310,"status":200,"tid":"124308118568960","timestamp":1714966931} {"function":"process_single_task","level":"INFO","line":1509,"msg":"slot data","n_idle_slots":1,"n_processing_slots":0,"task_id":30,"tid":"124309595705344","timestamp":1714966931} {"function":"log_server_request","level":"INFO","line":2737,"method":"GET","msg":"request","params":{},"path":"/health","remote_addr":"127.0.0.1","remote_port":58310,"status":200,"tid":"124308118568960","timestamp":1714966931} [GIN] 2024/05/06 - 03:42:11 | 200 | 1.568673952s | 172.31.0.3 | POST "/api/embeddings" {"function":"launch_slot_with_data","level":"INFO","line":830,"msg":"slot is 
processing task","slot_id":0,"task_id":31,"tid":"124309595705344","timestamp":1714966931} {"function":"update_slots","level":"INFO","line":1839,"msg":"kv cache rm [p0, end)","p0":0,"slot_id":0,"task_id":31,"tid":"124309595705344","timestamp":1714966931} {"function":"update_slots","level":"INFO","line":1643,"msg":"slot released","n_cache_tokens":384,"n_ctx":512,"n_past":384,"n_system_tokens":0,"slot_id":0,"task_id":31,"tid":"124309595705344","timestamp":1714966931,"truncated":false} {"function":"log_server_request","level":"INFO","line":2737,"method":"POST","msg":"request","params":{},"path":"/embedding","remote_addr":"127.0.0.1","remote_port":58326,"status":200,"tid":"124308108083200","timestamp":1714966931} {"function":"process_single_task","level":"INFO","line":1509,"msg":"slot data","n_idle_slots":1,"n_processing_slots":0,"task_id":34,"tid":"124309595705344","timestamp":1714966931} {"function":"log_server_request","level":"INFO","line":2737,"method":"GET","msg":"request","params":{},"path":"/health","remote_addr":"127.0.0.1","remote_port":58326,"status":200,"tid":"124308108083200","timestamp":1714966931} [GIN] 2024/05/06 - 03:42:11 | 200 | 1.592941645s | 172.31.0.3 | POST "/api/embeddings" {"function":"process_single_task","level":"INFO","line":1509,"msg":"slot data","n_idle_slots":1,"n_processing_slots":0,"task_id":35,"tid":"124309595705344","timestamp":1714966931} {"function":"log_server_request","level":"INFO","line":2737,"method":"GET","msg":"request","params":{},"path":"/health","remote_addr":"127.0.0.1","remote_port":58294,"status":200,"tid":"124308129054720","timestamp":1714966931} {"function":"process_single_task","level":"INFO","line":1509,"msg":"slot data","n_idle_slots":1,"n_processing_slots":0,"task_id":36,"tid":"124309595705344","timestamp":1714966931} 
{"function":"log_server_request","level":"INFO","line":2737,"method":"GET","msg":"request","params":{},"path":"/health","remote_addr":"127.0.0.1","remote_port":58332,"status":200,"tid":"124308061945856","timestamp":1714966931} {"function":"launch_slot_with_data","level":"INFO","line":830,"msg":"slot is processing task","slot_id":0,"task_id":37,"tid":"124309595705344","timestamp":1714966931} {"function":"update_slots","level":"INFO","line":1839,"msg":"kv cache rm [p0, end)","p0":0,"slot_id":0,"task_id":37,"tid":"124309595705344","timestamp":1714966931} {"function":"update_slots","level":"INFO","line":1643,"msg":"slot released","n_cache_tokens":376,"n_ctx":512,"n_past":376,"n_system_tokens":0,"slot_id":0,"task_id":37,"tid":"124309595705344","timestamp":1714966931,"truncated":false} {"function":"log_server_request","level":"INFO","line":2737,"method":"POST","msg":"request","params":{},"path":"/embedding","remote_addr":"127.0.0.1","remote_port":58326,"status":200,"tid":"124308108083200","timestamp":1714966931} {"function":"process_single_task","level":"INFO","line":1509,"msg":"slot data","n_idle_slots":1,"n_processing_slots":0,"task_id":40,"tid":"124309595705344","timestamp":1714966931} {"function":"log_server_request","level":"INFO","line":2737,"method":"GET","msg":"request","params":{},"path":"/health","remote_addr":"127.0.0.1","remote_port":58326,"status":200,"tid":"124308108083200","timestamp":1714966931} [GIN] 2024/05/06 - 03:42:11 | 200 | 1.657263601s | 172.31.0.3 | POST "/api/embeddings" {"function":"launch_slot_with_data","level":"INFO","line":830,"msg":"slot is processing task","slot_id":0,"task_id":41,"tid":"124309595705344","timestamp":1714966931} {"function":"update_slots","level":"INFO","line":1839,"msg":"kv cache rm [p0, end)","p0":0,"slot_id":0,"task_id":41,"tid":"124309595705344","timestamp":1714966931} {"function":"update_slots","level":"INFO","line":1643,"msg":"slot 
released","n_cache_tokens":415,"n_ctx":512,"n_past":415,"n_system_tokens":0,"slot_id":0,"task_id":41,"tid":"124309595705344","timestamp":1714966931,"truncated":false} {"function":"log_server_request","level":"INFO","line":2737,"method":"POST","msg":"request","params":{},"path":"/embedding","remote_addr":"127.0.0.1","remote_port":58326,"status":200,"tid":"124308108083200","timestamp":1714966931} {"function":"process_single_task","level":"INFO","line":1509,"msg":"slot data","n_idle_slots":1,"n_processing_slots":0,"task_id":44,"tid":"124309595705344","timestamp":1714966931} {"function":"log_server_request","level":"INFO","line":2737,"method":"GET","msg":"request","params":{},"path":"/health","remote_addr":"127.0.0.1","remote_port":58332,"status":200,"tid":"124308061945856","timestamp":1714966931} [GIN] 2024/05/06 - 03:42:11 | 200 | 1.722957692s | 172.31.0.3 | POST "/api/embeddings" {"function":"launch_slot_with_data","level":"INFO","line":830,"msg":"slot is processing task","slot_id":0,"task_id":45,"tid":"124309595705344","timestamp":1714966931} {"function":"update_slots","level":"INFO","line":1839,"msg":"kv cache rm [p0, end)","p0":0,"slot_id":0,"task_id":45,"tid":"124309595705344","timestamp":1714966931} {"function":"update_slots","level":"INFO","line":1643,"msg":"slot released","n_cache_tokens":398,"n_ctx":512,"n_past":398,"n_system_tokens":0,"slot_id":0,"task_id":45,"tid":"124309595705344","timestamp":1714966931,"truncated":false} {"function":"log_server_request","level":"INFO","line":2737,"method":"POST","msg":"request","params":{},"path":"/embedding","remote_addr":"127.0.0.1","remote_port":58332,"status":200,"tid":"124308061945856","timestamp":1714966931} {"function":"process_single_task","level":"INFO","line":1509,"msg":"slot data","n_idle_slots":1,"n_processing_slots":0,"task_id":48,"tid":"124309595705344","timestamp":1714966931} 
{"function":"log_server_request","level":"INFO","line":2737,"method":"GET","msg":"request","params":{},"path":"/health","remote_addr":"127.0.0.1","remote_port":58332,"status":200,"tid":"124308061945856","timestamp":1714966931} [GIN] 2024/05/06 - 03:42:11 | 200 | 1.746344556s | 172.31.0.3 | POST "/api/embeddings" {"function":"launch_slot_with_data","level":"INFO","line":830,"msg":"slot is processing task","slot_id":0,"task_id":49,"tid":"124309595705344","timestamp":1714966931} {"function":"update_slots","level":"INFO","line":1839,"msg":"kv cache rm [p0, end)","p0":0,"slot_id":0,"task_id":49,"tid":"124309595705344","timestamp":1714966931} {"function":"update_slots","level":"INFO","line":1643,"msg":"slot released","n_cache_tokens":423,"n_ctx":512,"n_past":423,"n_system_tokens":0,"slot_id":0,"task_id":49,"tid":"124309595705344","timestamp":1714966931,"truncated":false} {"function":"log_server_request","level":"INFO","line":2737,"method":"POST","msg":"request","params":{},"path":"/embedding","remote_addr":"127.0.0.1","remote_port":58332,"status":200,"tid":"124308061945856","timestamp":1714966931} {"function":"process_single_task","level":"INFO","line":1509,"msg":"slot data","n_idle_slots":1,"n_processing_slots":0,"task_id":52,"tid":"124309595705344","timestamp":1714966931} {"function":"log_server_request","level":"INFO","line":2737,"method":"GET","msg":"request","params":{},"path":"/health","remote_addr":"127.0.0.1","remote_port":58334,"status":200,"tid":"124308051460096","timestamp":1714966931} [GIN] 2024/05/06 - 03:42:11 | 200 | 1.811759743s | 172.31.0.3 | POST "/api/embeddings" {"function":"launch_slot_with_data","level":"INFO","line":830,"msg":"slot is processing task","slot_id":0,"task_id":53,"tid":"124309595705344","timestamp":1714966931} {"function":"update_slots","level":"INFO","line":1839,"msg":"kv cache rm [p0, end)","p0":0,"slot_id":0,"task_id":53,"tid":"124309595705344","timestamp":1714966931} {"function":"update_slots","level":"INFO","line":1643,"msg":"slot 
released","n_cache_tokens":390,"n_ctx":512,"n_past":390,"n_system_tokens":0,"slot_id":0,"task_id":53,"tid":"124309595705344","timestamp":1714966931,"truncated":false} {"function":"log_server_request","level":"INFO","line":2737,"method":"POST","msg":"request","params":{},"path":"/embedding","remote_addr":"127.0.0.1","remote_port":58334,"status":200,"tid":"124308051460096","timestamp":1714966931} {"function":"process_single_task","level":"INFO","line":1509,"msg":"slot data","n_idle_slots":1,"n_processing_slots":0,"task_id":56,"tid":"124309595705344","timestamp":1714966931} [GIN] 2024/05/06 - 03:42:11 | 200 | 1.834057454s | 172.31.0.3 | POST "/api/embeddings" {"function":"log_server_request","level":"INFO","line":2737,"method":"GET","msg":"request","params":{},"path":"/health","remote_addr":"127.0.0.1","remote_port":58334,"status":200,"tid":"124308051460096","timestamp":1714966931} {"function":"launch_slot_with_data","level":"INFO","line":830,"msg":"slot is processing task","slot_id":0,"task_id":57,"tid":"124309595705344","timestamp":1714966931} {"function":"update_slots","level":"INFO","line":1839,"msg":"kv cache rm [p0, end)","p0":0,"slot_id":0,"task_id":57,"tid":"124309595705344","timestamp":1714966931} {"function":"update_slots","level":"INFO","line":1643,"msg":"slot released","n_cache_tokens":391,"n_ctx":512,"n_past":391,"n_system_tokens":0,"slot_id":0,"task_id":57,"tid":"124309595705344","timestamp":1714966931,"truncated":false} {"function":"log_server_request","level":"INFO","line":2737,"method":"POST","msg":"request","params":{},"path":"/embedding","remote_addr":"127.0.0.1","remote_port":58334,"status":200,"tid":"124308051460096","timestamp":1714966931} [GIN] 2024/05/06 - 03:42:11 | 200 | 1.901208131s | 172.31.0.3 | POST "/api/embeddings" {"function":"process_single_task","level":"INFO","line":1509,"msg":"slot data","n_idle_slots":1,"n_processing_slots":0,"task_id":60,"tid":"124309595705344","timestamp":1714966931} 
{"function":"log_server_request","level":"INFO","line":2737,"method":"GET","msg":"request","params":{},"path":"/health","remote_addr":"127.0.0.1","remote_port":58334,"status":200,"tid":"124308051460096","timestamp":1714966931} {"function":"launch_slot_with_data","level":"INFO","line":830,"msg":"slot is processing task","slot_id":0,"task_id":61,"tid":"124309595705344","timestamp":1714966931} {"function":"update_slots","level":"INFO","line":1839,"msg":"kv cache rm [p0, end)","p0":0,"slot_id":0,"task_id":61,"tid":"124309595705344","timestamp":1714966931} {"function":"update_slots","level":"INFO","line":1643,"msg":"slot released","n_cache_tokens":305,"n_ctx":512,"n_past":305,"n_system_tokens":0,"slot_id":0,"task_id":61,"tid":"124309595705344","timestamp":1714966931,"truncated":true} {"function":"log_server_request","level":"INFO","line":2737,"method":"POST","msg":"request","params":{},"path":"/embedding","remote_addr":"127.0.0.1","remote_port":58350,"status":200,"tid":"124308040974336","timestamp":1714966931} [GIN] 2024/05/06 - 03:42:11 | 200 | 1.630656389s | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:19 | 200 | 25.58µs | 172.31.0.3 | HEAD "/" {"function":"process_single_task","level":"INFO","line":1509,"msg":"slot data","n_idle_slots":1,"n_processing_slots":0,"task_id":64,"tid":"124309595705344","timestamp":1714966939} {"function":"log_server_request","level":"INFO","line":2737,"method":"GET","msg":"request","params":{},"path":"/health","remote_addr":"127.0.0.1","remote_port":52140,"status":200,"tid":"124307994836992","timestamp":1714966939} {"function":"process_single_task","level":"INFO","line":1509,"msg":"slot data","n_idle_slots":1,"n_processing_slots":0,"task_id":65,"tid":"124309595705344","timestamp":1714966939} {"function":"log_server_request","level":"INFO","line":2737,"method":"GET","msg":"request","params":{},"path":"/health","remote_addr":"127.0.0.1","remote_port":52140,"status":200,"tid":"124307994836992","timestamp":1714966939} 
{"function":"launch_slot_with_data","level":"INFO","line":830,"msg":"slot is processing task","slot_id":0,"task_id":66,"tid":"124309595705344","timestamp":1714966939} {"function":"update_slots","level":"INFO","line":1839,"msg":"kv cache rm [p0, end)","p0":0,"slot_id":0,"task_id":66,"tid":"124309595705344","timestamp":1714966939} {"function":"update_slots","level":"INFO","line":1643,"msg":"slot released","n_cache_tokens":3,"n_ctx":512,"n_past":3,"n_system_tokens":0,"slot_id":0,"task_id":66,"tid":"124309595705344","timestamp":1714966939,"truncated":false} {"function":"log_server_request","level":"INFO","line":2737,"method":"POST","msg":"request","params":{},"path":"/embedding","remote_addr":"127.0.0.1","remote_port":52140,"status":200,"tid":"124307994836992","timestamp":1714966939} [GIN] 2024/05/06 - 03:42:19 | 200 | 60.856905ms | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:42:39 | 200 | 19.72µs | 172.31.0.3 | HEAD "/" {"function":"process_single_task","level":"INFO","line":1509,"msg":"slot data","n_idle_slots":1,"n_processing_slots":0,"task_id":69,"tid":"124309595705344","timestamp":1714966959} {"function":"log_server_request","level":"INFO","line":2737,"method":"GET","msg":"request","params":{},"path":"/health","remote_addr":"127.0.0.1","remote_port":58154,"status":200,"tid":"124307984351232","timestamp":1714966959} {"function":"process_single_task","level":"INFO","line":1509,"msg":"slot data","n_idle_slots":1,"n_processing_slots":0,"task_id":70,"tid":"124309595705344","timestamp":1714966959} {"function":"log_server_request","level":"INFO","line":2737,"method":"GET","msg":"request","params":{},"path":"/health","remote_addr":"127.0.0.1","remote_port":58154,"status":200,"tid":"124307984351232","timestamp":1714966959} {"function":"launch_slot_with_data","level":"INFO","line":830,"msg":"slot is processing task","slot_id":0,"task_id":71,"tid":"124309595705344","timestamp":1714966959} {"function":"update_slots","level":"INFO","line":1839,"msg":"kv cache rm 
[p0, end)","p0":0,"slot_id":0,"task_id":71,"tid":"124309595705344","timestamp":1714966959} {"function":"update_slots","level":"INFO","line":1643,"msg":"slot released","n_cache_tokens":3,"n_ctx":512,"n_past":3,"n_system_tokens":0,"slot_id":0,"task_id":71,"tid":"124309595705344","timestamp":1714966959,"truncated":false} {"function":"log_server_request","level":"INFO","line":2737,"method":"POST","msg":"request","params":{},"path":"/embedding","remote_addr":"127.0.0.1","remote_port":58154,"status":200,"tid":"124307984351232","timestamp":1714966959} [GIN] 2024/05/06 - 03:42:39 | 200 | 68.538657ms | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:43:07 | 200 | 36.67µs | 172.31.0.3 | HEAD "/" {"function":"process_single_task","level":"INFO","line":1509,"msg":"slot data","n_idle_slots":1,"n_processing_slots":0,"task_id":74,"tid":"124309595705344","timestamp":1714966987} {"function":"log_server_request","level":"INFO","line":2737,"method":"GET","msg":"request","params":{},"path":"/health","remote_addr":"127.0.0.1","remote_port":44828,"status":200,"tid":"124307973865472","timestamp":1714966987} {"function":"process_single_task","level":"INFO","line":1509,"msg":"slot data","n_idle_slots":1,"n_processing_slots":0,"task_id":75,"tid":"124309595705344","timestamp":1714966987} {"function":"log_server_request","level":"INFO","line":2737,"method":"GET","msg":"request","params":{},"path":"/health","remote_addr":"127.0.0.1","remote_port":44828,"status":200,"tid":"124307973865472","timestamp":1714966987} {"function":"process_single_task","level":"INFO","line":1509,"msg":"slot data","n_idle_slots":1,"n_processing_slots":0,"task_id":76,"tid":"124309595705344","timestamp":1714966987} {"function":"log_server_request","level":"INFO","line":2737,"method":"GET","msg":"request","params":{},"path":"/health","remote_addr":"127.0.0.1","remote_port":44834,"status":200,"tid":"124308353449984","timestamp":1714966987} {"function":"launch_slot_with_data","level":"INFO","line":830,"msg":"slot 
is processing task","slot_id":0,"task_id":77,"tid":"124309595705344","timestamp":1714966987} [GIN] 2024/05/06 - 03:43:07 | 500 | 189.272µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:43:07 | 500 | 221.382µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:43:07 | 500 | 230.052µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:43:07 | 500 | 180.162µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:43:07 | 500 | 424.434µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:43:07 | 500 | 196.073µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:43:07 | 500 | 242.793µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:43:07 | 500 | 424.915µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:43:07 | 500 | 216.242µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:43:07 | 500 | 222.382µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:43:07 | 500 | 175.602µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:43:07 | 500 | 211.142µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:43:07 | 500 | 344.203µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:43:07 | 500 | 345.484µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:43:07 | 500 | 320.734µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:43:07 | 500 | 221.023µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:43:07 | 500 | 211.502µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:43:07 | 500 | 215.963µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:43:07 | 500 | 192.682µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:43:07 | 500 | 209.793µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:43:07 | 500 | 218.192µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:43:07 | 500 | 205.972µs | 172.31.0.3 | POST "/api/embeddings" 
{"function":"update_slots","level":"INFO","line":1839,"msg":"kv cache rm [p0, end)","p0":0,"slot_id":0,"task_id":77,"tid":"124309595705344","timestamp":1714966987} [GIN] 2024/05/06 - 03:43:07 | 500 | 180.181µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:43:07 | 500 | 200.752µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:43:07 | 500 | 206.912µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:43:07 | 500 | 210.882µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:43:07 | 500 | 203.673µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:43:07 | 500 | 193.522µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:43:07 | 500 | 212.003µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:43:07 | 500 | 211.442µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:43:07 | 500 | 247.693µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:43:07 | 500 | 220.372µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:43:07 | 500 | 191.022µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:43:07 | 500 | 445.774µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:43:07 | 500 | 185.711µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:43:07 | 500 | 248.583µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:43:07 | 500 | 291.343µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:43:07 | 500 | 209.552µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:43:07 | 500 | 326.093µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:43:07 | 500 | 354.714µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:43:07 | 500 | 383.884µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:43:07 | 500 | 215.283µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:43:07 | 500 | 377.144µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:43:07 | 500 | 226.212µs | 172.31.0.3 | POST 
"/api/embeddings" [GIN] 2024/05/06 - 03:43:07 | 500 | 209.172µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:43:07 | 500 | 359.214µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:43:07 | 500 | 197.642µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:43:07 | 500 | 366.904µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:43:07 | 500 | 226.063µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:43:07 | 500 | 343.743µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:43:07 | 500 | 178.901µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:43:07 | 500 | 356.484µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:43:07 | 500 | 183.552µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:43:07 | 500 | 304.964µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:43:07 | 500 | 166.022µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:43:07 | 500 | 172.302µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:43:07 | 500 | 313.883µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:43:07 | 500 | 181.722µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:43:07 | 500 | 241.893µs | 172.31.0.3 | POST "/api/embeddings" {"function":"update_slots","level":"INFO","line":1643,"msg":"slot released","n_cache_tokens":426,"n_ctx":512,"n_past":426,"n_system_tokens":0,"slot_id":0,"task_id":77,"tid":"124309595705344","timestamp":1714966987,"truncated":false} {"function":"log_server_request","level":"INFO","line":2737,"method":"POST","msg":"request","params":{},"path":"/embedding","remote_addr":"127.0.0.1","remote_port":44834,"status":200,"tid":"124308353449984","timestamp":1714966987} [GIN] 2024/05/06 - 03:43:07 | 200 | 39.156838ms | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:43:07 | 500 | 189.072µs | 172.31.0.3 | POST "/api/embeddings" [GIN] 2024/05/06 - 03:43:07 | 500 | 439.555µs | 172.31.0.3 | POST "/api/embeddings" 
[GIN] 2024/05/06 - 03:43:07 | 500 | 203.292µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:43:07 | 500 | 382.594µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:43:07 | 500 | 214.412µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:43:07 | 500 | 206.842µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:43:07 | 500 | 193.642µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:43:07 | 500 | 219.452µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:43:07 | 500 | 210.352µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:43:07 | 500 | 369.264µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:43:07 | 500 | 491.905µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:43:07 | 500 | 410.365µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:43:07 | 500 | 290.883µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:43:07 | 500 | 512.915µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:43:07 | 500 | 456.965µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:43:07 | 500 | 625.817µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:43:07 | 500 | 452.155µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:43:07 | 500 | 277.033µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:43:07 | 500 | 841.869µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:43:07 | 500 | 591.496µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:43:07 | 500 | 669.567µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:43:07 | 500 | 197.962µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:43:07 | 500 | 547.676µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:43:07 | 500 | 199.212µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:43:07 | 500 | 360.473µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:43:07 | 500 | 383.544µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:43:07 | 500 | 209.533µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:43:07 | 500 | 208.352µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:43:07 | 500 | 252.123µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:43:07 | 500 | 186.312µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:43:07 | 500 | 355.553µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:43:07 | 500 | 222.563µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:43:07 | 500 | 214.773µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:43:07 | 500 | 318.743µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:43:07 | 500 | 240.343µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:43:07 | 500 | 222.782µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:43:07 | 500 | 316.903µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:43:07 | 500 | 239.182µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:43:07 | 500 | 361.123µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:43:07 | 500 | 224.982µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:43:07 | 500 | 228.983µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:43:07 | 500 | 262.653µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:43:07 | 500 | 174.242µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:43:07 | 500 | 195.202µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:43:07 | 500 | 182.172µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:43:07 | 500 | 177.062µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:43:07 | 500 | 187.412µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:43:07 | 500 | 205.192µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:43:07 | 500 | 212.802µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:43:07 | 500 | 265.633µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:43:07 | 500 | 188.032µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:43:07 | 500 | 197.172µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:43:07 | 500 | 194.492µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:43:07 | 500 | 218.942µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:43:07 | 500 | 274.823µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:43:07 | 500 | 197.512µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:43:07 | 500 | 204.422µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:43:07 | 500 | 211.522µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:43:07 | 500 | 203.123µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:43:07 | 500 | 214.122µs | 172.31.0.3 | POST "/api/embeddings"
{"function":"process_single_task","level":"INFO","line":1509,"msg":"slot data","n_idle_slots":1,"n_processing_slots":0,"task_id":80,"tid":"124309595705344","timestamp":1714966987}
[GIN] 2024/05/06 - 03:43:07 | 500 | 203.572µs | 172.31.0.3 | POST "/api/embeddings"
{"function":"process_single_task","level":"INFO","line":1509,"msg":"slot data","n_idle_slots":1,"n_processing_slots":0,"task_id":81,"tid":"124309595705344","timestamp":1714966987}
{"function":"log_server_request","level":"INFO","line":2737,"method":"GET","msg":"request","params":{},"path":"/health","remote_addr":"127.0.0.1","remote_port":44828,"status":200,"tid":"124307973865472","timestamp":1714966987}
[GIN] 2024/05/06 - 03:43:07 | 500 | 219.033µs | 172.31.0.3 | POST "/api/embeddings"
{"function":"log_server_request","level":"INFO","line":2737,"method":"GET","msg":"request","params":{},"path":"/health","remote_addr":"127.0.0.1","remote_port":44834,"status":200,"tid":"124308353449984","timestamp":1714966987}
[GIN] 2024/05/06 - 03:43:07 | 500 | 216.872µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:43:07 | 500 | 168.772µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:43:07 | 500 | 183.562µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:43:07 | 500 | 172.232µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:43:07 | 500 | 165.652µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:43:07 | 500 | 183.332µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:43:07 | 500 | 182.042µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:43:07 | 500 | 173.972µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:43:07 | 500 | 174.141µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:43:07 | 500 | 186.652µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:43:07 | 500 | 348.564µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:43:07 | 500 | 182.282µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:43:07 | 500 | 215.952µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:43:07 | 500 | 194.602µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:43:07 | 500 | 183.892µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:43:07 | 500 | 205.322µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:43:07 | 500 | 184.992µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:43:07 | 500 | 176.202µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:43:07 | 500 | 168.962µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:43:07 | 500 | 184.072µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:43:07 | 500 | 404.814µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:43:07 | 500 | 273.503µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:43:07 | 500 | 403.664µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:43:07 | 500 | 401.604µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:43:07 | 500 | 327.303µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:43:07 | 500 | 492.635µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:43:07 | 500 | 277.773µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:43:07 | 500 | 225.092µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:43:07 | 500 | 464.345µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:43:07 | 500 | 319.644µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:43:07 | 500 | 338.024µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:43:07 | 500 | 284.813µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:43:07 | 500 | 261.602µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:43:07 | 500 | 388.145µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:43:07 | 500 | 396.505µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:43:07 | 500 | 371.764µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:43:07 | 500 | 377.734µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:43:07 | 500 | 401.104µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:43:07 | 500 | 369.554µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:43:07 | 500 | 441.954µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:43:07 | 500 | 210.972µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:43:07 | 500 | 378.383µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:43:07 | 500 | 225.502µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:43:07 | 500 | 178.741µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:43:07 | 500 | 405.304µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:43:07 | 500 | 353.684µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:43:07 | 500 | 195.032µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:43:07 | 500 | 314.843µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:43:07 | 500 | 379.184µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:43:07 | 500 | 369.823µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:43:07 | 500 | 349.924µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:43:07 | 500 | 293.513µs | 172.31.0.3 | POST "/api/embeddings"
[GIN] 2024/05/06 - 03:43:07 | 500 | 410.964µs | 172.31.0.3 | POST "/api/embeddings"
{"function":"process_single_task","level":"INFO","line":1509,"msg":"slot data","n_idle_slots":1,"n_processing_slots":0,"task_id":82,"tid":"124309595705344","timestamp":1714966987}
{"function":"launch_slot_with_data","level":"INFO","line":830,"msg":"slot is processing task","slot_id":0,"task_id":83,"tid":"124309595705344","timestamp":1714966987}
{"function":"log_server_request","level":"INFO","line":2737,"method":"GET","msg":"request","params":{},"path":"/health","remote_addr":"127.0.0.1","remote_port":44834,"status":200,"tid":"124308353449984","timestamp":1714966987}
{"function":"update_slots","level":"INFO","line":1839,"msg":"kv cache rm [p0, end)","p0":0,"slot_id":0,"task_id":83,"tid":"124309595705344","timestamp":1714966987}
{"function":"update_slots","level":"INFO","line":1643,"msg":"slot released","n_cache_tokens":432,"n_ctx":512,"n_past":432,"n_system_tokens":0,"slot_id":0,"task_id":83,"tid":"124309595705344","timestamp":1714966987,"truncated":false}
{"function":"log_server_request","level":"INFO","line":2737,"method":"POST","msg":"request","params":{},"path":"/embedding","remote_addr":"127.0.0.1","remote_port":44828,"status":200,"tid":"124307973865472","timestamp":1714966987}
{"function":"process_single_task","level":"INFO","line":1509,"msg":"slot data","n_idle_slots":1,"n_processing_slots":0,"task_id":86,"tid":"124309595705344","timestamp":1714966987}
{"function":"log_server_request","level":"INFO","line":2737,"method":"GET","msg":"request","params":{},"path":"/health","remote_addr":"127.0.0.1","remote_port":44828,"status":200,"tid":"124307973865472","timestamp":1714966987}
[GIN] 2024/05/06 - 03:43:07 | 200 | 117.750017ms | 172.31.0.3 | POST "/api/embeddings"
{"function":"launch_slot_with_data","level":"INFO","line":830,"msg":"slot is processing task","slot_id":0,"task_id":87,"tid":"124309595705344","timestamp":1714966987}
{"function":"update_slots","level":"INFO","line":1839,"msg":"kv cache rm [p0, end)","p0":0,"slot_id":0,"task_id":87,"tid":"124309595705344","timestamp":1714966987}
{"function":"process_single_task","level":"INFO","line":1509,"msg":"slot data","n_idle_slots":0,"n_processing_slots":1,"task_id":89,"tid":"124309595705344","timestamp":1714966987}
{"function":"update_slots","level":"INFO","line":1643,"msg":"slot released","n_cache_tokens":473,"n_ctx":512,"n_past":473,"n_system_tokens":0,"slot_id":0,"task_id":87,"tid":"124309595705344","timestamp":1714966987,"truncated":false}
{"function":"log_server_request","level":"INFO","line":2737,"method":"GET","msg":"request","params":{},"path":"/health","remote_addr":"127.0.0.1","remote_port":44834,"status":200,"tid":"124308353449984","timestamp":1714966987}
{"function":"log_server_request","level":"INFO","line":2737,"method":"POST","msg":"request","params":{},"path":"/embedding","remote_addr":"127.0.0.1","remote_port":44842,"status":200,"tid":"124308363935744","timestamp":1714966987}
{"function":"process_single_task","level":"INFO","line":1509,"msg":"slot data","n_idle_slots":1,"n_processing_slots":0,"task_id":91,"tid":"124309595705344","timestamp":1714966987}
{"function":"log_server_request","level":"INFO","line":2737,"method":"GET","msg":"request","params":{},"path":"/health","remote_addr":"127.0.0.1","remote_port":44842,"status":200,"tid":"124308363935744","timestamp":1714966987}
{"function":"process_single_task","level":"INFO","line":1509,"msg":"slot data","n_idle_slots":1,"n_processing_slots":0,"task_id":92,"tid":"124309595705344","timestamp":1714966987}
{"function":"log_server_request","level":"INFO","line":2737,"method":"GET","msg":"request","params":{},"path":"/health","remote_addr":"127.0.0.1","remote_port":44854,"status":200,"tid":"124308219232256","timestamp":1714966987}
[GIN] 2024/05/06 - 03:43:07 | 200 | 148.73527ms | 172.31.0.3 | POST "/api/embeddings"
{"function":"launch_slot_with_data","level":"INFO","line":830,"msg":"slot is processing task","slot_id":0,"task_id":93,"tid":"124309595705344","timestamp":1714966987}
{"function":"update_slots","level":"INFO","line":1839,"msg":"kv cache rm [p0, end)","p0":0,"slot_id":0,"task_id":93,"tid":"124309595705344","timestamp":1714966987}
{"function":"update_slots","level":"INFO","line":1643,"msg":"slot released","n_cache_tokens":450,"n_ctx":512,"n_past":450,"n_system_tokens":0,"slot_id":0,"task_id":93,"tid":"124309595705344","timestamp":1714966987,"truncated":false}
{"function":"log_server_request","level":"INFO","line":2737,"method":"POST","msg":"request","params":{},"path":"/embedding","remote_addr":"127.0.0.1","remote_port":44854,"status":200,"tid":"124308219232256","timestamp":1714966987}
{"function":"process_single_task","level":"INFO","line":1509,"msg":"slot data","n_idle_slots":1,"n_processing_slots":0,"task_id":96,"tid":"124309595705344","timestamp":1714966987}
{"function":"log_server_request","level":"INFO","line":2737,"method":"GET","msg":"request","params":{},"path":"/health","remote_addr":"127.0.0.1","remote_port":44854,"status":200,"tid":"124308219232256","timestamp":1714966987}
[GIN] 2024/05/06 - 03:43:07 | 200 | 176.759092ms | 172.31.0.3 | POST "/api/embeddings"
{"function":"process_single_task","level":"INFO","line":1509,"msg":"slot data","n_idle_slots":1,"n_processing_slots":0,"task_id":97,"tid":"124309595705344","timestamp":1714966987}
{"function":"log_server_request","level":"INFO","line":2737,"method":"GET","msg":"request","params":{},"path":"/health","remote_addr":"127.0.0.1","remote_port":44842,"status":200,"tid":"124308363935744","timestamp":1714966987}
{"function":"launch_slot_with_data","level":"INFO","line":830,"msg":"slot is processing task","slot_id":0,"task_id":98,"tid":"124309595705344","timestamp":1714966987}
{"function":"update_slots","level":"INFO","line":1839,"msg":"kv cache rm [p0, end)","p0":0,"slot_id":0,"task_id":98,"tid":"124309595705344","timestamp":1714966987}
{"function":"process_single_task","level":"INFO","line":1509,"msg":"slot data","n_idle_slots":0,"n_processing_slots":1,"task_id":100,"tid":"124309595705344","timestamp":1714966987}
{"function":"update_slots","level":"INFO","line":1643,"msg":"slot released","n_cache_tokens":471,"n_ctx":512,"n_past":471,"n_system_tokens":0,"slot_id":0,"task_id":98,"tid":"124309595705344","timestamp":1714966987,"truncated":false}
{"function":"log_server_request","level":"INFO","line":2737,"method":"GET","msg":"request","params":{},"path":"/health","remote_addr":"127.0.0.1","remote_port":44842,"status":200,"tid":"124308363935744","timestamp":1714966987}
{"function":"log_server_request","level":"INFO","line":2737,"method":"POST","msg":"request","params":{},"path":"/embedding","remote_addr":"127.0.0.1","remote_port":44854,"status":200,"tid":"124308219232256","timestamp":1714966987}
{"function":"process_single_task","level":"INFO","line":1509,"msg":"slot data","n_idle_slots":1,"n_processing_slots":0,"task_id":102,"tid":"124309595705344","timestamp":1714966987}
[GIN] 2024/05/06 - 03:43:07 | 200 | 248.222196ms | 172.31.0.3 | POST "/api/embeddings"
{"function":"log_server_request","level":"INFO","line":2737,"method":"GET","msg":"request","params":{},"path":"/health","remote_addr":"127.0.0.1","remote_port":44854,"status":200,"tid":"124308219232256","timestamp":1714966987}
{"function":"launch_slot_with_data","level":"INFO","line":830,"msg":"slot is processing task","slot_id":0,"task_id":103,"tid":"124309595705344","timestamp":1714966987}
{"function":"update_slots","level":"INFO","line":1839,"msg":"kv cache rm [p0, end)","p0":0,"slot_id":0,"task_id":103,"tid":"124309595705344","timestamp":1714966987}
{"function":"update_slots","level":"INFO","line":1643,"msg":"slot released","n_cache_tokens":464,"n_ctx":512,"n_past":464,"n_system_tokens":0,"slot_id":0,"task_id":103,"tid":"124309595705344","timestamp":1714966987,"truncated":false}
{"function":"log_server_request","level":"INFO","line":2737,"method":"POST","msg":"request","params":{},"path":"/embedding","remote_addr":"127.0.0.1","remote_port":44858,"status":200,"tid":"124308342964224","timestamp":1714966987}
{"function":"process_single_task","level":"INFO","line":1509,"msg":"slot data","n_idle_slots":1,"n_processing_slots":0,"task_id":106,"tid":"124309595705344","timestamp":1714966987}
{"function":"log_server_request","level":"INFO","line":2737,"method":"GET","msg":"request","params":{},"path":"/health","remote_addr":"127.0.0.1","remote_port":44858,"status":200,"tid":"124308342964224","timestamp":1714966987}
[GIN] 2024/05/06 - 03:43:07 | 200 | 277.137587ms | 172.31.0.3 | POST "/api/embeddings"
{"function":"process_single_task","level":"INFO","line":1509,"msg":"slot data","n_idle_slots":1,"n_processing_slots":0,"task_id":107,"tid":"124309595705344","timestamp":1714966987}
{"function":"log_server_request","level":"INFO","line":2737,"method":"GET","msg":"request","params":{},"path":"/health","remote_addr":"127.0.0.1","remote_port":44842,"status":200,"tid":"124308363935744","timestamp":1714966987}
{"function":"process_single_task","level":"INFO","line":1509,"msg":"slot data","n_idle_slots":1,"n_processing_slots":0,"task_id":108,"tid":"124309595705344","timestamp":1714966987}
{"function":"log_server_request","level":"INFO","line":2737,"method":"GET","msg":"request","params":{},"path":"/health","remote_addr":"127.0.0.1","remote_port":44870,"status":200,"tid":"124308208746496","timestamp":1714966987}
{"function":"process_single_task","level":"INFO","line":1509,"msg":"slot data","n_idle_slots":1,"n_processing_slots":0,"task_id":109,"tid":"124309595705344","timestamp":1714966987}
{"function":"log_server_request","level":"INFO","line":2737,"method":"GET","msg":"request","params":{},"path":"/health","remote_addr":"127.0.0.1","remote_port":44870,"status":200,"tid":"124308208746496","timestamp":1714966987}
{"function":"launch_slot_with_data","level":"INFO","line":830,"msg":"slot is processing task","slot_id":0,"task_id":110,"tid":"124309595705344","timestamp":1714966987}
{"function":"update_slots","level":"INFO","line":1839,"msg":"kv cache rm [p0, end)","p0":0,"slot_id":0,"task_id":110,"tid":"124309595705344","timestamp":1714966987}
{"function":"process_single_task","level":"INFO","line":1509,"msg":"slot data","n_idle_slots":0,"n_processing_slots":1,"task_id":112,"tid":"124309595705344","timestamp":1714966987}
{"function":"update_slots","level":"INFO","line":1643,"msg":"slot released","n_cache_tokens":467,"n_ctx":512,"n_past":467,"n_system_tokens":0,"slot_id":0,"task_id":110,"tid":"124309595705344","timestamp":1714966987,"truncated":false}
{"function":"log_server_request","level":"INFO","line":2737,"method":"GET","msg":"request","params":{},"path":"/health","remote_addr":"127.0.0.1","remote_port":44870,"status":200,"tid":"124308208746496","timestamp":1714966987}
{"function":"log_server_request","level":"INFO","line":2737,"method":"POST","msg":"request","params":{},"path":"/embedding","remote_addr":"127.0.0.1","remote_port":44858,"status":200,"tid":"124308342964224","timestamp":1714966987}
{"function":"process_single_task","level":"INFO","line":1509,"msg":"slot data","n_idle_slots":1,"n_processing_slots":0,"task_id":114,"tid":"124309595705344","timestamp":1714966987}
{"function":"log_server_request","level":"INFO","line":2737,"method":"GET","msg":"request","params":{},"path":"/health","remote_addr":"127.0.0.1","remote_port":44858,"status":200,"tid":"124308342964224","timestamp":1714966987}
[GIN] 2024/05/06 - 03:43:07 | 200 | 346.890174ms | 172.31.0.3 | POST "/api/embeddings"
{"function":"process_single_task","level":"INFO","line":1509,"msg":"slot data","n_idle_slots":1,"n_processing_slots":0,"task_id":115,"tid":"124309595705344","timestamp":1714966987}
{"function":"log_server_request","level":"INFO","line":2737,"method":"GET","msg":"request","params":{},"path":"/health","remote_addr":"127.0.0.1","remote_port":44870,"status":200,"tid":"124308208746496","timestamp":1714966987}
{"function":"launch_slot_with_data","level":"INFO","line":830,"msg":"slot is processing task","slot_id":0,"task_id":116,"tid":"124309595705344","timestamp":1714966987}
{"function":"update_slots","level":"INFO","line":1839,"msg":"kv cache rm [p0, end)","p0":0,"slot_id":0,"task_id":116,"tid":"124309595705344","timestamp":1714966987}
{"function":"update_slots","level":"INFO","line":1643,"msg":"slot released","n_cache_tokens":479,"n_ctx":512,"n_past":479,"n_system_tokens":0,"slot_id":0,"task_id":116,"tid":"124309595705344","timestamp":1714966987,"truncated":false}
{"function":"log_server_request","level":"INFO","line":2737,"method":"POST","msg":"request","params":{},"path":"/embedding","remote_addr":"127.0.0.1","remote_port":44858,"status":200,"tid":"124308342964224","timestamp":1714966987}
[GIN] 2024/05/06 - 03:43:07 | 200 | 417.137156ms | 172.31.0.3 | POST "/api/embeddings"
{"function":"process_single_task","level":"INFO","line":1509,"msg":"slot data","n_idle_slots":1,"n_processing_slots":0,"task_id":119,"tid":"124309595705344","timestamp":1714966987}
{"function":"log_server_request","level":"INFO","line":2737,"method":"GET","msg":"request","params":{},"path":"/health","remote_addr":"127.0.0.1","remote_port":44872,"status":200,"tid":"124308229718016","timestamp":1714966987}
{"function":"launch_slot_with_data","level":"INFO","line":830,"msg":"slot is processing task","slot_id":0,"task_id":120,"tid":"124309595705344","timestamp":1714966987}
{"function":"update_slots","level":"INFO","line":1839,"msg":"kv cache rm [p0, end)","p0":0,"slot_id":0,"task_id":120,"tid":"124309595705344","timestamp":1714966987}
{"function":"update_slots","level":"INFO","line":1643,"msg":"slot released","n_cache_tokens":475,"n_ctx":512,"n_past":475,"n_system_tokens":0,"slot_id":0,"task_id":120,"tid":"124309595705344","timestamp":1714966987,"truncated":false}
{"function":"log_server_request","level":"INFO","line":2737,"method":"POST","msg":"request","params":{},"path":"/embedding","remote_addr":"127.0.0.1","remote_port":44872,"status":200,"tid":"124308229718016","timestamp":1714966987}
{"function":"process_single_task","level":"INFO","line":1509,"msg":"slot data","n_idle_slots":1,"n_processing_slots":0,"task_id":123,"tid":"124309595705344","timestamp":1714966987}
{"function":"log_server_request","level":"INFO","line":2737,"method":"GET","msg":"request","params":{},"path":"/health","remote_addr":"127.0.0.1","remote_port":44872,"status":200,"tid":"124308229718016","timestamp":1714966987}
[GIN] 2024/05/06 - 03:43:07 | 200 | 445.244369ms | 172.31.0.3 | POST "/api/embeddings"
{"function":"launch_slot_with_data","level":"INFO","line":830,"msg":"slot is processing task","slot_id":0,"task_id":124,"tid":"124309595705344","timestamp":1714966987}
{"function":"update_slots","level":"INFO","line":1839,"msg":"kv cache rm [p0, end)","p0":0,"slot_id":0,"task_id":124,"tid":"124309595705344","timestamp":1714966987}
{"function":"update_slots","level":"INFO","line":1643,"msg":"slot released","n_cache_tokens":447,"n_ctx":512,"n_past":447,"n_system_tokens":0,"slot_id":0,"task_id":124,"tid":"124309595705344","timestamp":1714966987,"truncated":false}
{"function":"log_server_request","level":"INFO","line":2737,"method":"POST","msg":"request","params":{},"path":"/embedding","remote_addr":"127.0.0.1","remote_port":44872,"status":200,"tid":"124308229718016","timestamp":1714966987}
{"function":"process_single_task","level":"INFO","line":1509,"msg":"slot data","n_idle_slots":1,"n_processing_slots":0,"task_id":127,"tid":"124309595705344","timestamp":1714966987}
{"function":"log_server_request","level":"INFO","line":2737,"method":"GET","msg":"request","params":{},"path":"/health","remote_addr":"127.0.0.1","remote_port":44872,"status":200,"tid":"124308229718016","timestamp":1714966987}
[GIN] 2024/05/06 - 03:43:07 | 200 | 510.675891ms | 172.31.0.3 | POST "/api/embeddings"
{"function":"launch_slot_with_data","level":"INFO","line":830,"msg":"slot is processing task","slot_id":0,"task_id":128,"tid":"124309595705344","timestamp":1714966987}
{"function":"update_slots","level":"INFO","line":1839,"msg":"kv cache rm [p0, end)","p0":0,"slot_id":0,"task_id":128,"tid":"124309595705344","timestamp":1714966987}
{"function":"update_slots","level":"INFO","line":1643,"msg":"slot released","n_cache_tokens":432,"n_ctx":512,"n_past":432,"n_system_tokens":0,"slot_id":0,"task_id":128,"tid":"124309595705344","timestamp":1714966987,"truncated":false}
{"function":"log_server_request","level":"INFO","line":2737,"method":"POST","msg":"request","params":{},"path":"/embedding","remote_addr":"127.0.0.1","remote_port":44870,"status":200,"tid":"124308208746496","timestamp":1714966987}
[GIN] 2024/05/06 - 03:43:07 | 200 | 534.155886ms | 172.31.0.3 | POST "/api/embeddings"
{"function":"process_single_task","level":"INFO","line":1509,"msg":"slot data","n_idle_slots":1,"n_processing_slots":0,"task_id":131,"tid":"124309595705344","timestamp":1714966987}
{"function":"log_server_request","level":"INFO","line":2737,"method":"GET","msg":"request","params":{},"path":"/health","remote_addr":"127.0.0.1","remote_port":44874,"status":200,"tid":"124308118568960","timestamp":1714966987}
{"function":"launch_slot_with_data","level":"INFO","line":830,"msg":"slot is processing task","slot_id":0,"task_id":132,"tid":"124309595705344","timestamp":1714966987}
{"function":"update_slots","level":"INFO","line":1839,"msg":"kv cache rm [p0, end)","p0":0,"slot_id":0,"task_id":132,"tid":"124309595705344","timestamp":1714966987}
{"function":"update_slots","level":"INFO","line":1643,"msg":"slot released","n_cache_tokens":444,"n_ctx":512,"n_past":444,"n_system_tokens":0,"slot_id":0,"task_id":132,"tid":"124309595705344","timestamp":1714966987,"truncated":false}
{"function":"log_server_request","level":"INFO","line":2737,"method":"POST","msg":"request","params":{},"path":"/embedding","remote_addr":"127.0.0.1","remote_port":44874,"status":200,"tid":"124308118568960","timestamp":1714966987}
{"function":"process_single_task","level":"INFO","line":1509,"msg":"slot data","n_idle_slots":1,"n_processing_slots":0,"task_id":135,"tid":"124309595705344","timestamp":1714966987}
[GIN] 2024/05/06 - 03:43:07 | 200 | 557.476418ms | 172.31.0.3 | POST "/api/embeddings"
{"function":"log_server_request","level":"INFO","line":2737,"method":"GET","msg":"request","params":{},"path":"/health","remote_addr":"127.0.0.1","remote_port":44874,"status":200,"tid":"124308118568960","timestamp":1714966987}
{"function":"launch_slot_with_data","level":"INFO","line":830,"msg":"slot is processing task","slot_id":0,"task_id":136,"tid":"124309595705344","timestamp":1714966987}
{"function":"update_slots","level":"INFO","line":1839,"msg":"kv cache rm [p0, end)","p0":0,"slot_id":0,"task_id":136,"tid":"124309595705344","timestamp":1714966987}
{"function":"update_slots","level":"INFO","line":1643,"msg":"slot released","n_cache_tokens":455,"n_ctx":512,"n_past":455,"n_system_tokens":0,"slot_id":0,"task_id":136,"tid":"124309595705344","timestamp":1714966987,"truncated":false}
{"function":"log_server_request","level":"INFO","line":2737,"method":"POST","msg":"request","params":{},"path":"/embedding","remote_addr":"127.0.0.1","remote_port":44874,"status":200,"tid":"124308118568960","timestamp":1714966987}
[GIN] 2024/05/06 - 03:43:07 | 200 | 580.311046ms | 172.31.0.3 | POST "/api/embeddings"
```