Simple Version of testing Code Llama inference in Google Colab environment.

The only difference between original repo (https://github.com/facebookresearch/codellama) is modifying example_completion.py -> example_completion_colab.py

Because torchrun doens't work well in basic colab environment (in my case), I used torch.distributed.launch instead of torchrun.

This is my implementation on colab notebook. (instructions are written in Korean. But almost same as original repo.)

Requirements

Google colab pro plan for High system RAM. (If you do hyperparameter tuning, you might be able to do it in the free version, but I'm not sure.)
at least 12.5GB of free space on your Google Drive. (7b models = 12.5 GB)
7b models can be run with a T4 GPU (Free).

Name		Name	Last commit message	Last commit date
Latest commit History 16 Commits
llama		llama
.gitignore		.gitignore
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
MODEL_CARD.md		MODEL_CARD.md
README.md		README.md
USE_POLICY.md		USE_POLICY.md
dev-requirements.txt		dev-requirements.txt
download.sh		download.sh
example_completion.py		example_completion.py
example_completion_colab.py		example_completion_colab.py
example_infilling.py		example_infilling.py
example_instructions.py		example_instructions.py
requirements.txt		requirements.txt
setup.py		setup.py

Provide feedback