-
Notifications
You must be signed in to change notification settings - Fork 1
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Faiss indexing file 'pyserini/indexes/dindex-wikipedia-dpr_multi-bf-20200127-f403c3.29eb39fe0b00a03c36c0eeae4c24f775' not found #2
Comments
BTW, a large memory burden seems to exist. What's the lowest requirement of executing this project? |
Thanks for your interests in our work! Pyserini seems have updated their index since we released our initial code base so the index names might have been changed. I have checked their updated index and seems that the one you are referring to Also the memory requirement is indeed a bit large, I think for our setup, we uses a host memory size ~500GB to store the index files and for the knn experiments the memory requirement might be larger. |
Thanks for your reply. Can you please provide a file, like
|
Sry, as we don't actively maintain the code base right now so it's hard to provide a compatible requirements.txt. But can you verify that the |
I have checked the pyserini commit a year ago, the ssearcher attribute is not recently added and is initialized from the |
Hi, really brilliant work to accelerate retrieval process in retrieval augumented language models!
I've been working on running through your open-source work so far, but come across some problems recently. It seems before runnning the commands listed in Readme.md, indexing files should be prebuild. Herein, I look into the build_hnsw_index.py file, which shows a prebuilt 'pyserini/indexes/dindex-wikipedia-dpr_multi-bf-20200127-f403c3.29eb39fe0b00a03c36c0eeae4c24f775' is needed.
Whereas, as a fresh man in this field, I haven't found any prebuild version named that in https://github.com/castorini/pyserini/blob/master/docs/usage-search.md#learned-dense-retrieval-models. The most relevant one is faiss.wikipedia-dpr-100w.dpr_multi.20200127.f403c3.tar.gz, but not so sure. Can you give a more specific description of that? About the prebuilt index, and the whole workflow to execute this project? Thanks a lot!
The text was updated successfully, but these errors were encountered: