-
Notifications
You must be signed in to change notification settings - Fork 92
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Merge pull request #85 from shinning0821/main
Update LoRa
- Loading branch information
Showing
45 changed files
with
1,885 additions
and
382 deletions.
There are no files selected for viewing
Binary file not shown.
Binary file not shown.
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,108 @@ | ||
{ | ||
"cells": [ | ||
{ | ||
"cell_type": "markdown", | ||
"metadata": {}, | ||
"source": [ | ||
"## Use Lora for Adaption" | ||
] | ||
}, | ||
{ | ||
"cell_type": "markdown", | ||
"metadata": {}, | ||
"source": [ | ||
"### A Quick Overview\n", | ||
"\n", | ||
"#### LoRa(ICLR2021)\n", | ||
"<img width=\"380\" height=\"380\" src=\"../figs/lora/lora.png\">\n", | ||
"\n", | ||
"Low-Rank Adaptation, or [LoRA](https://github.com/microsoft/LoRA), freezes the pretrained model weights and injects trainable rank decomposition matrices into each layer of the Transformer architecture, greatly reducing the number of trainable parameters for downstream tasks. Compared to GPT-3 175B fine-tuned with Adam, LoRA can reduce the number of trainable parameters by 10,000 times and the GPU memory requirement by 3 times.\n", | ||
"\n", | ||
"#### AdaLoRa(ICLR2023)\n", | ||
"[AdaLoRA](https://github.com/QingruZhang/AdaLoRA) adaptively allocates the parameter budget among weight matrices according to their importance score. In particular, AdaLoRA parameterizes the incremental updates in the form of singular value decomposition. Such a novel approach allows for the effective pruning of the singular values of unimportant updates, which is essential to reduce the parameter budget but circumvent intensive exact SVD computations.\n", | ||
"\n", | ||
"### Application in our framework\n", | ||
"For each AttentionBlock in ImageEncoder, we replace the two Linear Layers in Attention and Mlp with LoRa Linear Layer and AdaLoRa Layer, i.e., one example [here](../models/ImageEncoder/vit/lora_block.py)" | ||
] | ||
}, | ||
{ | ||
"cell_type": "markdown", | ||
"metadata": {}, | ||
"source": [ | ||
"### Training\n", | ||
"In SAM, EfficientSAM, and MobileSAM, the adjustment of models using Lora is supported. The -mod option can be used to specify the fine-tuning method:\n", | ||
"``python train.py -net mobile_sam -dataset REFUGE -data_path data/REFUGE -sam_ckpt checkpoint/mobile_sam/mobile_sam.pt -image_size 256 -vis 100 -exp_name tiny-mobile-isic-256 -encoder tiny_vit -mod sam_lora``\n", | ||
"\n" | ||
] | ||
}, | ||
{ | ||
"cell_type": "markdown", | ||
"metadata": {}, | ||
"source": [ | ||
"### Performance VS Adapter\n", | ||
"#### REFUGE\n", | ||
"| Baseline | Backbone | mode | DICE | mIou | Memory |\n", | ||
"| ------------ | --------- | ------ | ---- | ------- | ------------ |\n", | ||
"| EfficientSAM | VIT-Small | Adapter | 0.8691 | 0.7915 | 21275 M |\n", | ||
"| EfficientSAM | VIT-Small | Lora | 0.8573 | 0.7703 | 22777 M |\n", | ||
"| EfficientSAM | VIT-Small | AdaLora | 0.8558 | 0.7596 | 22779 M |\n", | ||
"| MobileSAM | Tiny-Vit | Adapter | 0.9330 | 0.8812 | 10255M |\n", | ||
"| MobileSAM | Tiny-Vit | Lora | 0.9107 | 0.8436 | 10401M |\n", | ||
"| MobileSAM | Tiny-Vit | AdaLora | 0.8863 | 0.8031 | 10401M |" | ||
] | ||
}, | ||
{ | ||
"cell_type": "markdown", | ||
"metadata": {}, | ||
"source": [ | ||
"### Curve of loss and performance\n", | ||
"**based on MobileSAM(Tiny-Vit) model and REFUGE dataset**\n", | ||
"\n", | ||
"#### Adapter\n", | ||
"<p float=\"left\">\n", | ||
" <img src=\"../figs/lora/MobileSAM-Ti (Adapter)_loss.png\" width=\"400\" />\n", | ||
" \n", | ||
" <img src=\"../figs/lora/MobileSAM-Ti (Adapter)_performance.png\" width=\"400\" /> \n", | ||
" \n", | ||
"</p>\n", | ||
"\n", | ||
"#### LoRa\n", | ||
"<p float=\"left\">\n", | ||
" <img src=\"../figs/lora/MobileSAM-Ti (Lora)_loss.png\" width=\"400\" />\n", | ||
" \n", | ||
" <img src=\"../figs/lora/MobileSAM-Ti (Lora)_performance.png\" width=\"400\" /> \n", | ||
" \n", | ||
"</p>\n", | ||
"\n", | ||
"#### AdaLoRa\n", | ||
"<p float=\"left\">\n", | ||
" <img src=\"../figs/lora/MobileSAM-Ti (AdaLora)_loss.png\" width=\"400\" />\n", | ||
" \n", | ||
" <img src=\"../figs/lora/MobileSAM-Ti (AdaLora)_performance.png\" width=\"400\" /> \n", | ||
" \n", | ||
"</p>\n", | ||
"\n", | ||
"It can be seen that the training method using Adapter is more stable." | ||
] | ||
} | ||
], | ||
"metadata": { | ||
"kernelspec": { | ||
"display_name": "Python 3.7.16 ('general')", | ||
"language": "python", | ||
"name": "python3" | ||
}, | ||
"language_info": { | ||
"name": "python", | ||
"version": "3.7.16" | ||
}, | ||
"orig_nbformat": 4, | ||
"vscode": { | ||
"interpreter": { | ||
"hash": "e7f99538a81e8449c1b1a4a7141984025c678b5d9c33981aa2a3c129d8e1c90d" | ||
} | ||
} | ||
}, | ||
"nbformat": 4, | ||
"nbformat_minor": 2 | ||
} |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,2 +1,2 @@ | ||
from .tinyvit.tiny_vit import TinyViT | ||
from .vit import AdapterBlock, Block | ||
from .vit import AdaloraBlock, AdapterBlock, Block, LoraBlock |
Binary file not shown.
Binary file not shown.
Binary file modified
BIN
+0 Bytes
(100%)
models/ImageEncoder/tinyvit/__pycache__/adapter_block.cpython-37.pyc
Binary file not shown.
Binary file not shown.
Binary file modified
BIN
+181 Bytes
(100%)
models/ImageEncoder/tinyvit/__pycache__/tiny_vit.cpython-37.pyc
Binary file not shown.
Oops, something went wrong.