smolGPT 🐾

smol but mighty.

You might have even seen jaymody/picoGPT!

But you have never seen smolGPT!!!

What is smolGPT?

smolGPT is a small-scale version of the GPT architecture, built for research and learning purposes. This is a research project focused on understanding and experimenting with the inner workings of GPT-like architectures.

What makes smolGPT special? It’s built using pycandle-2, a lightweight machine learning library that I’ve created from scratch. By using NumPy as the foundation, the goal was to implement a minimalist GPT model without relying on large, heavyweight frameworks like TensorFlow or PyTorch. This library allows you to run, train, and experiment with machine learning models while keeping things easy to understand.

picoGPT features:

Fast? ❌ smolPGT is supaSLOW 🐌 We say 🚫 to CV-cache, Quantization and Distillation
Training code? ✅ Yes, but it may cause you 💢!
top-p sampling? ❌ top-k? ✅ temperature? ❌ categorical sampling?! ❌ greedy? ✅
Self-made??? ✅✅ YESS!!! I made it completely from scratch in numpy😲😲😲
Scalable? (੭˃ᴗ˂)੭ You may build whatever architecture you want with PyCandle.

GPT2-3-4?, Llama 1-2-3? 😎👌🔥 just provide model weights🤔

📦 Installation

To install smolGPT with pycandle-2, follow these simple steps:

Clone the repository:

git clone https://github.com/TimaGitHub/smolGPT.git
cd smolGPT

Install the required dependencies:
```
pip install -r requirements.txt
```
Run the model:
```
python main.py
```

And that’s it! You’re ready to start generating text with smolGPT. 😅

🚀 Example Usage

With smolGPT, you can quickly generate text. Here’s an example:

python main.py --prompt "Hello! i am a language model," --max_new_tokens 30 --model 124M --device gpu --topk 30

output 😅

Hello! i am a language model, I have no language background and this is not a problem

Anonymous 01/11/15 (Thu) 04:09:19 AM No.

The model’s lighthearted and playful nature makes it a fun tool for experimenting with GPT-like architectures (124M, 345M, 762M, 1542M) 😆

Some fun generations

124M Parameters

prompt = "I will prove this mathematical theorem: "

345M Parameters

prompt = "I will prove this mathematical theorem: "

762M Parameters

prompt = "for i in range(5):"

762M Parameters

prompt = "a = [10, 35, 20, -40]\nsorted(a)\n>>>"

762M Parameters

prompt = "one small step for a man"

💡 Why smolGPT?

Learning-Focused: Built to explore and experiment with GPT architecture, not necessarily to run on embedded or resource-limited systems.
Small Size: Perfect for local experiments and educational purposes.
Built with pycandle-2: Leverages a custom NumPy-based machine learning library for simplicity and efficiency.
Lightweight: While small, it retains the core principles of GPT models.

🤖 Contributing

If you’d like to improve smolGPT or contribute to pycandle-2, feel free to fork the repository, make your changes, and submit a pull request. New ideas and contributions are always welcome!

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
pycandle-2 @ 79cd0cd		pycandle-2 @ 79cd0cd
.gitignore		.gitignore
.gitmodules		.gitmodules
LICENSE		LICENSE
README.md		README.md
main.py		main.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

smolGPT 🐾

What is smolGPT?

📦 Installation

🚀 Example Usage

output 😅

Some fun generations

124M Parameters

345M Parameters

762M Parameters

762M Parameters

762M Parameters

💡 Why smolGPT?

🤖 Contributing

About

Releases

Packages

Languages

License

TimaGitHub/smolGPT

Folders and files

Latest commit

History

Repository files navigation

smolGPT 🐾

What is smolGPT?

📦 Installation

🚀 Example Usage

output 😅

Some fun generations

124M Parameters

345M Parameters

762M Parameters

762M Parameters

762M Parameters

💡 Why smolGPT?

🤖 Contributing

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages