Create a model file from the generated training_state.dat #22

manojmanivannan · 2024-07-16T09:52:30Z

Apologies if my question is stupid, is it at all possible to create a model so we can run this generated model on , say, ollama ?

keyvank · 2024-07-16T09:57:51Z

Unfortunately, the file format femtoGPT is generating is something special to femtoGPT and not a standardized one, so no, you can't directly load training_state.dat into ollama. Although, maybe in the future, we can add the ability to generate standard model formats to femtoGPT :)

manojmanivannan · 2024-07-16T10:13:31Z

thanks @keyvank for the quick response. So how can i run the model in inference mode ?

keyvank · 2024-07-17T15:16:12Z

@manojmanivannan Just change the main.rs file and keep this:

let inference = gpt.infer(
    &mut rng,
    &tokenizer.tokenize("YOUR INPUT TO THE MODEL"),
    100,
    inference_temperature,
    |_ch| {},
)?;

// Generate 100 character with the currently trained model before
// starting the training loop.
println!("{}", tokenizer.untokenize(&inference));

nitirajrathore · 2024-09-06T20:01:01Z

@keyvank : Thanks for developing such a nice project. Can you also help by writing the full code for this inference in the project itself. I am a coder, but don't know rust. I tried the snippet you gave but it is giving lots of error that I don't understand. Now just to test the model generated, I will have learn rust.
Can you please complete the inference part as well so that newbies can directly run the generated model.

keyvank · 2024-09-06T21:03:07Z

@nitirajrathore Please check my last commit

keyvank closed this as completed in 1af3258 Sep 6, 2024

keyvank reopened this Sep 6, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Create a model file from the generated training_state.dat #22

Create a model file from the generated training_state.dat #22

manojmanivannan commented Jul 16, 2024

keyvank commented Jul 16, 2024

manojmanivannan commented Jul 16, 2024

keyvank commented Jul 17, 2024

nitirajrathore commented Sep 6, 2024

keyvank commented Sep 6, 2024

Create a model file from the generated training_state.dat #22

Create a model file from the generated training_state.dat #22

Comments

manojmanivannan commented Jul 16, 2024

keyvank commented Jul 16, 2024

manojmanivannan commented Jul 16, 2024

keyvank commented Jul 17, 2024

nitirajrathore commented Sep 6, 2024

keyvank commented Sep 6, 2024