This repository contains a project focused on training custom YOLO models for gun action recognition and utilizing EigenCAM for explainable AI evaluation of the results. The project includes data preparation, training, and visualization steps to ensure accurate and interpretable results.
- Introduction
- Installation
- Dataset Preparation
- Training the Model
- Using EigenCAM for Explainable AI
- Results and Evaluation
- Contributing
- License
The goal of this project is to recognize gun actions using custom-trained YOLO models and to evaluate the results using EigenCAM for explainable AI. YOLO (You Only Look Once) is a state-of-the-art, real-time object detection system. EigenCAM is used to provide visual explanations for the model's decisions, making the results more interpretable.
To get started, clone this repository and install the necessary dependencies:
git clone https://github.com/Andreavisi1/Eyes_on_Guns_w_YOLO
cd Eyes_on_Guns_w_YOLO
The dataset consists of videos categorized into Handgun
, Machine_Gun
, and No_Gun
. Each category contains videos from which frames are extracted and labeled. The dataset structure should be as follows:
Gun_Action_Recognition_Dataset/
|
|--- Handgun/
|--- Machine_Gun/
|--- No_Gun/
If you'd like to follow the steps to train a new model, please run the "train.ipynb" file (including the unzipping section!)
Run the script to extract frames from videos and convert labels to YOLO format:
# Extract frames and convert labels
python extract_and_convert.py
Custom YOLO models can be trained using the provided scripts. The models are trained on the prepared dataset with specified configurations.
# Initialize YOLO model
model = YOLO('basic_models/choose_favourite_model.pt')
# Custom training function
def custom_train(model, yaml_filename, epochs=10, batch_size=16, imgsz=480):
# Training logic here...
# Execute custom training
custom_train(model, 'choose_file_yaml.yaml', epochs=10, batch_size=16, imgsz=640)
EigenCAM is used to generate visual explanations for the YOLO model's predictions. This helps in understanding why the model makes certain predictions.
from yolov8_cam.eigen_cam import EigenCAM
from yolov8_cam.utils.image import show_cam_on_image
# Load the trained YOLO model
model = YOLO('choose_favourite_model.pt')
# Use EigenCAM for explanations
cam = EigenCAM(model, target_layers, task='od')
grayscale_cam = cam(rgb_img, eigen_smooth=True, principal_comp=principal_comp)
for i in range(grayscale_cam.shape[3]):
cam_image = show_cam_on_image(img, grayscale_cam[0,:,:,i], use_rgb=True)
plt.imshow(cam_image)
plt.show()
After training, evaluate the model using the test set and visualize the results using EigenCAM. The explanations can be saved for further analysis.
# Verify the dataset structure
if check_dataset_structure(dataset_base_path):
print("Dataset structure is correct.")
else:
print("Dataset structure has issues.")
Contributor Name | GitHub |
---|---|
🍎 Piergallini Enrico | Click here |
🍏 Visi Andrea | Click here |
Contributions are welcome! Please open an issue or submit a pull request for any improvements or bug fixes.
This project is licensed under the MIT License. See the LICENSE file for details.