The goal of this project is to use a transformer to predict high-resolution MIDI data from low-resolution, quantized MIDI notes.
Overview

Velocity prediction:

flowchart TD
    A[MIDI Sequence] --> B(quantized piece)
    B --> |transformers| E(encoded_decoded)
    A --> C(velocity)
    E --> |generator| F[generated velocity]
    F --> G(loss)
    C --> G
Dstart prediction:

flowchart TD
    A[MIDI Sequence] --> B(quantized piece)
    B --> |transformers| E(encoded_decoded)
    A --> C(high-resolution dstart bins)
    E --> |generator| F[generated high-resolution dstart bins]
    F --> G(loss)
    C --> G
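As a rough, hedged PyTorch sketch of what both diagrams describe (module names, vocabulary sizes, and the use of nn.Transformer here are illustrative assumptions, not the repository's actual code): the quantized piece is encoded and decoded by a transformer, a generator head projects the decoder output onto the target bins, and the loss compares that output with the velocities (or dstart bins) taken from the original MIDI sequence.

import torch
import torch.nn as nn

# Illustrative sizes only - the real vocabularies depend on the chosen quantization.
SRC_VOCAB = 1000   # tokens of the quantized piece
TGT_VOCAB = 128    # velocity values (or dstart bins) to predict
D_MODEL = 512

class QuantizedToVelocity(nn.Module):
    """Hypothetical encoder-decoder with a generator head, mirroring the flowcharts."""

    def __init__(self):
        super().__init__()
        self.src_emb = nn.Embedding(SRC_VOCAB, D_MODEL)
        self.tgt_emb = nn.Embedding(TGT_VOCAB, D_MODEL)
        self.transformer = nn.Transformer(
            d_model=D_MODEL, nhead=8, num_encoder_layers=6, num_decoder_layers=6,
            dim_feedforward=2048, dropout=0.1, batch_first=True,
        )
        self.generator = nn.Linear(D_MODEL, TGT_VOCAB)  # the "generator" node above

    def forward(self, src_tokens, tgt_tokens):
        tgt_mask = self.transformer.generate_square_subsequent_mask(tgt_tokens.size(1))
        # encoded_decoded: contextual representation of the quantized piece
        encoded_decoded = self.transformer(
            self.src_emb(src_tokens), self.tgt_emb(tgt_tokens), tgt_mask=tgt_mask
        )
        return self.generator(encoded_decoded)  # generated velocity logits

model = QuantizedToVelocity()
src = torch.randint(0, SRC_VOCAB, (4, 128))  # quantized piece
tgt = torch.randint(0, TGT_VOCAB, (4, 128))  # velocities from the original MIDI sequence
logits = model(src, tgt[:, :-1])             # teacher forcing on shifted targets
loss = nn.functional.cross_entropy(
    logits.reshape(-1, TGT_VOCAB), tgt[:, 1:].reshape(-1), label_smoothing=0.1
)
loss.backward()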
You can train the model to predict dstart:
python train.py --config-name=dstart
or velocity:
python train.py
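Both commands go through Hydra: --config-name selects which YAML config gets composed before training starts. A minimal sketch of such an entrypoint follows (the config_path, config_name, and body are assumptions for illustration, not the repository's actual train.py):

import hydra
from omegaconf import DictConfig, OmegaConf

@hydra.main(config_path="configs", config_name="config")
def main(cfg: DictConfig) -> None:
    # cfg holds the composed config, including command-line overrides
    # such as dataset.quantization.dstart=5
    print(OmegaConf.to_yaml(cfg))
    # ... build the dataset, model, and training loop from cfg ...

if __name__ == "__main__":
    main()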
You can train the model on data quantized into up to 10 bins per feature.
For example, to use 5 bins for dstart, 4 for duration, and 1 for velocity, specify the dataset.quantization hyperparameters:
python train.py dataset.quantization.dstart=5 dataset.quantization.duration=4 dataset.quantization.velocity=1
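These hyperparameters control how many discrete bins each note feature (dstart, duration, velocity) is squeezed into before the model sees it. A hedged illustration of the idea - the uniform binning over fixed ranges below is an assumption for the example, not necessarily the dataset's actual scheme:

import numpy as np

def quantize(values: np.ndarray, low: float, high: float, n_bins: int) -> np.ndarray:
    """Map continuous values to integer bin indices in [0, n_bins - 1]."""
    edges = np.linspace(low, high, n_bins + 1)[1:-1]  # inner bin edges
    return np.digitize(values, edges)

# MIDI velocity lives in [0, 127]; dstart (assumed here) is the time in seconds
# between consecutive note starts.
velocity = np.array([12, 64, 100, 127])
dstart = np.array([0.0, 0.05, 0.4, 1.2])

print(quantize(velocity, 0, 127, n_bins=3))   # [0 1 2 2]
print(quantize(dstart, 0.0, 2.0, n_bins=5))   # finer bins keep more timing detail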
Training configuration:

train:
  num_epochs: 5
  accum_iter: 10
  base_lr: 1.0
  batch_size: 16
  distributed: False
  label_smoothing: 0.1
dataset_name: 'roszcz/maestro-v1-sustain'
dataset:
  sequence_len: 128
  sequence_step: 42
  quantization:
    duration: 3
    dstart: 3
    velocity: 3
device: "cuda:0"
warmup: 3000
log_frequency: 10
file_prefix: "to-vel"
run_name: midi-transformer-${now:%Y-%m-%d-%H-%M}
project: "midi-transformer"
model:
  n: 6
  d_model: 512
  d_ff: 2048
  h: 8
  dropout: 0.1
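train.py reads these values from the composed config. An illustrative way to inspect the same structure with OmegaConf (the snippet builds a small config inline instead of loading the repository's actual file):

from omegaconf import OmegaConf

cfg = OmegaConf.create(
    {
        "dataset": {
            "sequence_len": 128,
            "quantization": {"duration": 3, "dstart": 3, "velocity": 3},
        },
        "model": {"n": 6, "d_model": 512, "d_ff": 2048, "h": 8, "dropout": 0.1},
    }
)

# Dotted access mirrors the command-line override syntax,
# e.g. dataset.quantization.dstart=5
print(cfg.dataset.quantization.dstart)  # 3
print(cfg.model.d_model)                # 512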
With the parameters above, the velocity prediction model was trained for ~7.5 h on a GTX 960M and reached a loss of ~2.6 on the validation split of the maestro-v1 dataset, as well as on giant-midi-sustain.
To start the velocity prediction dashboard, run Streamlit:

# Streamlit needs the repository root on the import path, hence PYTHONPATH=.
PYTHONPATH=. streamlit run --server.port 4466 dashboard/streamlit/velocity/main.py
For dstart prediction run:
PYTHONPATH=. streamlit run --server.port 4466 dashboard/streamlit/dstart/main.py
You can try out different tokenization methods, i.e. different numbers of bins, by choosing the "Tokenization review" option from the "Display" selectbox.
When you have trained your model, you can listen to and compare its predictions with the original and target pieces. Run the same command and choose the "Model predictions" option.
You can choose a model to predict velocities or dstart for any piece from the test dataset.
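Under the hood, producing such a prediction amounts to running the trained model over a quantized piece and decoding target tokens step by step. A hedged sketch of greedy decoding, assuming a trained encoder-decoder with the interface model(src, tgt) -> logits of shape (batch, tgt_len, vocab); the repository's actual prediction code may differ:

import torch

@torch.no_grad()
def greedy_decode(model, src_tokens: torch.Tensor, seq_len: int, start_token: int = 0) -> torch.Tensor:
    """Predict target tokens (e.g. velocity bins) one step at a time."""
    tgt = torch.full((src_tokens.size(0), 1), start_token, dtype=torch.long)
    for _ in range(seq_len):
        logits = model(src_tokens, tgt)                        # (batch, tgt_len, vocab)
        next_token = logits[:, -1].argmax(dim=-1, keepdim=True)
        tgt = torch.cat([tgt, next_token], dim=1)
    return tgt[:, 1:]  # drop the start token -> predicted bins for the piece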
This repository uses pre-commit hooks that enforce Python formatting (black, flake8, and isort):
pip install pre-commit
pre-commit install
Whenever you execute git commit, the files altered or added within the commit will be checked and corrected. black and isort can modify files locally - if that happens you have to git add them again. You might also be prompted to introduce some fixes manually.

To run the hooks against all files without running git commit:
pre-commit run --all-files