Skip to content

Latest commit

 

History

History
14 lines (12 loc) · 268 Bytes

README.md

File metadata and controls

14 lines (12 loc) · 268 Bytes

Installation

Install fleece-worker

pip install -e .

Run benchmarking

CUDA_VISIBLE_DEVICES=0 python bench.py -m llama-2-7b-chat-slice > results7b.json
CUDA_VISIBLE_DEVICES=0 python bench.py -m llama-2-70b-chat-slice > results70b.json