
model weights are always downloaded repeatedly. #1236

Closed
1193749292 opened this issue Nov 30, 2023 · 7 comments

Comments

@1193749292

Take facebook/opt-6.7b as an example. Every time I run it, the following message is displayed:
/path/model/facebook/opt-6.7b' model weights not found in cache or outdated. Downloading from huggingface.co

llm = ff.LLM("/path/model/model/facebook/opt-6.7b", cache_path="/path/model/model/facebook/opt-6.7b/half-precision")

When constructing the LLM, there are no weights_path or clean_cache parameters. Is there a parameter, or some other method, that avoids this repeated download?

@jiazhihao
Collaborator

@goliaro Can you take a look at this issue?

@goliaro
Collaborator

goliaro commented Dec 1, 2023

Let me take a look!

@goliaro
Collaborator

goliaro commented Dec 8, 2023

@1193749292 I just double-checked, but I'm currently not able to reproduce the issue on my machine. If you are still running into it, could you post the script you are using to run the LLM, so I can help you debug?

Btw, if you are using the instructions below:

llm = ff.LLM("/path/model/model/facebook/opt-6.7b", cache_path="/path/model/model/facebook/opt-6.7b/half-precision")

two things to note are that:

  • For the model name: unless you have previously downloaded the opt-6.7b model from Hugging Face and explicitly want to load it from that local folder, you should use facebook/opt-6.7b rather than /path/model/model/facebook/opt-6.7b. After the first run, FlexFlow will automatically detect that the model is in the cache and won't re-download it.
  • For the cache folder: you don't need to specify a path unless you don't want FlexFlow to use the standard caching path ~/.cache/flexflow. If you do pass a cache_path, note that it should be the path where all FlexFlow models (weights, tokenizers, configs, etc.) are stored; FlexFlow will automatically create subfolders for the current model. So if you pass cache_path="/path/model/model/facebook/opt-6.7b/half-precision", FlexFlow will look for the opt-6.7b weights in /path/model/model/facebook/opt-6.7b/half-precision/weights/facebook/opt-6.7b/half-precision/, for example. Since that folder does not exist, it will proceed to download and convert the weights again.
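To make the second bullet concrete, here is a pure-Python sketch (a hypothetical reconstruction of the layout described above, not FlexFlow's actual code) of how the weights folder is derived from cache_path, showing why passing a per-model folder as cache_path doubles up the model name in the lookup path:

```python
import os

def flexflow_weights_dir(cache_path, model_name, precision="half-precision"):
    # Hypothetical reconstruction of the layout described above: the cache
    # root gets a weights/<model-name>/<precision> subfolder per model.
    return os.path.join(cache_path, "weights", model_name, precision)

# With the default cache root, the layout is what you would expect:
print(flexflow_weights_dir(os.path.expanduser("~/.cache/flexflow"),
                           "facebook/opt-6.7b"))

# Passing a per-model folder as cache_path doubles up the model name,
# so the lookup misses and the weights are downloaded again:
print(flexflow_weights_dir("/path/model/model/facebook/opt-6.7b/half-precision",
                           "facebook/opt-6.7b"))
# -> /path/model/model/facebook/opt-6.7b/half-precision/weights/facebook/opt-6.7b/half-precision
```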

In conclusion, my recommendation is to use:

llm = ff.LLM("facebook/opt-6.7b")

leaving the cache_path blank. Let me know if it still doesn't work!
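The resolution order described in this thread can be summarized with a small sketch (hypothetical logic, not FlexFlow's actual implementation): an existing local directory is loaded directly, a previously cached model is reused, and anything else is treated as a Hugging Face model id to be downloaded once:

```python
import os

def resolve_model(name, cache_root="~/.cache/flexflow"):
    # Hypothetical sketch of the behavior described in this thread.
    cache_root = os.path.expanduser(cache_root)
    if os.path.isdir(name):
        # An existing local folder: load the weights straight from disk.
        return "local:" + name
    weights = os.path.join(cache_root, "weights", name, "half-precision")
    if os.path.isdir(weights):
        # Cached by a previous run: no re-download needed.
        return "cache:" + weights
    # Otherwise treat the string as a Hugging Face model id and download.
    return "download:" + name

print(resolve_model("facebook/opt-6.7b"))
```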

@1193749292
Author

@goliaro Thank you very much for your answer. I executed the script in the same directory as flexflow and it worked.

A follow-up question: is there a way to suppress the unnecessary output during execution?

@goliaro
Collaborator

goliaro commented Dec 13, 2023

@1193749292 We'll add a non-verbose inference mode soon. In the meantime, feel free to comment out the print statements that you don't need
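Until a non-verbose mode lands, one workaround for Python-level prints (this captures only Python's stdout, not any logs emitted by FlexFlow's C++ runtime) is the standard library's contextlib.redirect_stdout:

```python
import contextlib
import io

buf = io.StringIO()
with contextlib.redirect_stdout(buf):
    # Stand-in for a chatty call such as llm.generate(...);
    # anything printed inside this block is captured, not shown.
    print("model weights not found in cache or outdated. Downloading...")

print("suppressed %d characters of output" % len(buf.getvalue()))
```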

@xinlong-yang

> @1193749292 We'll add a non-verbose inference mode soon. In the meantime, feel free to comment out the print statements that you don't need

Hi, how can I modify the cache path? I don't want to cache the model's weights in ~/.cache/flexflow. Thanks!

@lockshaw
Collaborator

> hi, I wonder how to modify the cache path, I don't want cache model's weights in ~/.cache/flexflow, thanks!

Moved to flexflow/flexflow-serve#19.
