Skip to content

Latest commit

 

History

History
74 lines (60 loc) · 2.75 KB

README.md

File metadata and controls

74 lines (60 loc) · 2.75 KB

Learning to Plan for Retrieval-Augmented Large Language Models from Knowledge Graphs

🌈 Overview of LPKG Framework

image.png

💻 How to start

git clone https://github.com/zjukg/LPKG.git

1.Finetuning on KG-sourced planning data

The code of fine-tuning is constructed based on open-sourced repo LLaMA-Factory.

  1. Download our KG-sourced planning data from our Huggingface repo.
  2. Put the downloaded kg_supervise.json file under fine-tuning/data/ directory.
  3. Make sure you have downloaded the base model (Llama-3-8B-Instruct or CodeQwen1.5-7B-Chat). Fill in your base model directory BASE_MODEL_DIR, output directory OUTPUT_DIR in the fine-tuning/run_exp_llama.sh,fine-tuning/run_exp_qwen.sh.

Finetune Llama3:

cd fine-tuning
sh run_exp_llama.sh

Finetune CodeQwen:

cd finetuning
sh run_exp_qwen.sh

2.Predict the Plan on Downstream QA Datasets

Before running script, fill in your checkpoint directory CKPT_PATH, base model directory BASE_MODEL_DIR, output directory PRED_PATH, and the name of the dataset to be predicted DATASET_NAME in fine-tuning/run_predict_llama.sh, fine-tuning/run_predict_qwen.sh.

it should be note that the output in *_planning.json file is not the true output of planning LLM. They are just the final answers to questions.

Infer Llama3:

sh run_predict_llama.sh

Infer CodeQwen:

sh run_predict_qwen.sh

3.Parse Result

  1. Download the wikipedia dump and put them into /wikidump/.
cd parser/wikidump
wget https://dl.fbaipublicfiles.com/dpr/wikipedia_split/psgs_w100.tsv.gz
wget https://dl.fbaipublicfiles.com/contriever/embeddings/contriever-msmarco/wikipedia_embeddings.tar
  1. Download retriever model(Contriever-MSMARCO) and put it into /contriever_ms/
  2. Fill in the planning result directory and output directory in parse_result.py
  3. Fill in your OpenAI key in gpt/call_gpt.py
  4. Run parser. Make sure you have enough GPU memory to load wikipedia embedding(we use 2*80G A100 in our experiments):
cd parser
python parse_result.py

🤝 Cite:

Please consider citing this paper if you find our work useful.


@misc{wang2024learning,
      title={Learning to Plan for Retrieval-Augmented Large Language Models from Knowledge Graphs}, 
      author={Junjie Wang and Mingyang Chen and Binbin Hu and Dan Yang and Ziqi Liu and Yue Shen and Peng Wei and Zhiqiang Zhang and Jinjie Gu and Jun Zhou and Jeff Z. Pan and Wen Zhang and Huajun Chen},
      year={2024},
      eprint={2406.14282},
      archivePrefix={arXiv}
}