Closed gropoli closed 2 years ago
Could you try to repro in Python? this irons out much of client-side errors that may occur. OPQ8_8 does not make much sense because it means there is a single dimension per input dimension. Better use OPQ8_32 or OPQ8_64
Could you try to repro in Python? this irons out much of client-side errors that may occur. OPQ8_8 does not make much sense because it means there is a single dimension per input dimension. Better use OPQ8_32 or OPQ8_64
Thanks @mdouze I'll try doing that, I'm just having a hard time right now trying to build the faiss package for python from the same source version I built the c++ lib I'll let you know when I could repro You're right for OPQ8_8, for now I was just naively exploring the different parameters from a benchmark script I wrote (in Python calling my c++ test program 🙃)
Summary
Hi,
Following the Guidelines to choose an index I am trying this index:
OPQ8_8,IVF65536_HNSW32,PQ8x4fsr
But while in the training phase I am running into an Access violation reading location 0x00000000000002D0 which very looks like a memory issue or a bug somewhere.
The call stack of my main thread:
and of the faulting thread:
I am training the index with a set of 1,768,364 vectors of dimension 256 (20% of MS MARCO set for passage ranking). MS MARCO makes a 8.8 GB index when all its vectors are added to a Flat index, and I have 288 GB RAM installed, so I am definitely not running out of memory.
I wouldn't assume MKL LAPACK code is buggy but who knows... or maybe there is something I am doing with Faiss at a higher level that is wrong...?
When verbose is on, all I get before it crashes is:
What other information can I produce to help you help me? :)
Thanks!
Platform
OS: Windows
Faiss version: 1.7.0
Installed from: compiled by myself
Running on:
Interface: