castorini / pyserini

Pyserini is a Python toolkit for reproducible information retrieval research with sparse and dense representations.
http://pyserini.io/
Apache License 2.0
1.57k stars 349 forks source link

Add Reproduction logs --Yiran Sun #1925

Closed Feng-12138 closed 1 week ago

Feng-12138 commented 1 month ago

OS: MacBook Pro M2 8GB Python: Python 3.10.14 Java: openjdk 21.0.3

Encounter issues when doing Contriever Baseline for NFCorpus

Command: python -m pyserini.search.faiss \ --encoder-class auto --encoder BAAI/bge-base-en-v1.5 --l2-norm \ --pooling mean \ --index indexes/nfcorpus.bge-base-en-v1.5 \ --topics collections/nfcorpus/queries.tsv \ --output runs/run.beir.bge-base-en-v1.5.nfcorpus.txt \ --batch 64 --threads 8 \ --hits 1000

Error:

# A fatal error has been detected by the Java Runtime Environment:
#
#  SIGSEGV (0xb) at pc=0x0000000103842f94, pid=19135, tid=24323
#
# JRE version: OpenJDK Runtime Environment Zulu21.34+19-CA (21.0.3+9) (build 21.0.3+9-LTS)
# Java VM: OpenJDK 64-Bit Server VM Zulu21.34+19-CA (21.0.3+9-LTS, mixed mode, sharing, tiered, compressed oops, compressed class ptrs, g1 gc, bsd-aarch64)
lintool commented 2 weeks ago

What happened to the SIGSEGV error? How did you resolve?

Please fix conflicts.

Feng-12138 commented 2 weeks ago

What happened to the SIGSEGV error? How did you resolve?

Please fix conflicts.

I reset up the pyserini env, and then everything moved smoothly. I might do something wrong during my first attempt.

Yes I will fix the conflicts