The primary goal is to develop a chatbot that answers computer-science-related questions.
Telegram deployment of the model using the LSTM architecture with a dot-product attention mechanism:
demo.mov
Three different approaches were explored to develop this chatbot: LSTM-LSTM RNN Architecture, DistilRoberta-LSTM Hybrid Architecture, and GPT-2 Transformer Architecture.
Datasets: For this project, we sourced computer-science-related question-answer data from four sources:
- Github repository by Kunal Kumar (Kunal, n.d.)
- Hugging Face - data-science-qa (Team Bay, n.d.)
- Kaggle - Software Engineering Interview Questions Dataset (Syed, n.d.)
- Kaggle - Computer Science Theory QA Dataset (Matin, n.d.)
LSTM-LSTM RNN Architecture:
- A traditional sequence-to-sequence approach with Luong Attention to improve response coherence.
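A minimal sketch of the dot-product (Luong) attention step, assuming PyTorch; tensor shapes are illustrative:

```python
import torch
import torch.nn.functional as F

def luong_dot_attention(decoder_state, encoder_states):
    # decoder_state: (batch, hidden); encoder_states: (batch, src_len, hidden)
    # Score each encoder state against the current decoder state (dot product).
    scores = torch.bmm(encoder_states, decoder_state.unsqueeze(2)).squeeze(2)
    weights = F.softmax(scores, dim=1)   # attention distribution over source positions
    context = torch.bmm(weights.unsqueeze(1), encoder_states).squeeze(1)
    return context, weights              # context vector feeds the next decoder step
```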
DistilRoberta-LSTM Hybrid Architecture:
- Combines DistilRoberta for contextual embeddings with LSTM for sequential decoding.
- Features a fine-tuned encoder with Luong Attention for dynamic focus on inputs.
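A minimal sketch of the hybrid wiring, assuming PyTorch and Hugging Face `transformers`; layer sizes are illustrative, and the Luong attention step (as sketched above) sits between the encoder states and decoder outputs:

```python
import torch
from torch import nn
from transformers import AutoModel, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("distilroberta-base")
encoder = AutoModel.from_pretrained("distilroberta-base")             # contextual embeddings
decoder = nn.LSTM(input_size=768, hidden_size=768, batch_first=True)  # sequential decoding

inputs = tokenizer("What is a binary search tree?", return_tensors="pt")
with torch.no_grad():
    enc_states = encoder(**inputs).last_hidden_state  # (1, seq_len, 768)
dec_out, _ = decoder(enc_states)                      # decode over encoder states
```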
GPT-2 Transformer Architecture:
- A fully Transformer-based model with self-attention mechanisms for robust language understanding and generation.
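A minimal sketch of answer generation with GPT-2, assuming Hugging Face `transformers`; the checkpoint name and prompt format are illustrative (the notebook would load its own fine-tuned weights):

```python
from transformers import GPT2LMHeadModel, GPT2Tokenizer

tokenizer = GPT2Tokenizer.from_pretrained("gpt2")
model = GPT2LMHeadModel.from_pretrained("gpt2")  # swap in the fine-tuned checkpoint

prompt = "Question: What is a hash table? Answer:"
input_ids = tokenizer(prompt, return_tensors="pt").input_ids
output = model.generate(input_ids, max_new_tokens=50,
                        pad_token_id=tokenizer.eos_token_id)  # greedy decoding by default
print(tokenizer.decode(output[0], skip_special_tokens=True))
```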
- Run `preprocessing.ipynb` to clean and unify the datasets into a single CSV file (`combined_data.csv`).
- Required Files: `final1.txt`, `intents.json`, `SoftwareQuestions.csv`
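A minimal sketch of the unification step, assuming pandas; the schemas assumed for the raw files below are illustrative, not the project's actual formats:

```python
import json
import pandas as pd

rows = []

# intents.json: assumed classic "intents" format with patterns and responses.
with open("intents.json") as f:
    for intent in json.load(f)["intents"]:
        for pattern in intent["patterns"]:
            for response in intent["responses"]:
                rows.append({"question": pattern, "answer": response})

# SoftwareQuestions.csv: assumed to carry Question/Answer columns.
csv_qa = pd.read_csv("SoftwareQuestions.csv").rename(
    columns={"Question": "question", "Answer": "answer"})

# final1.txt: parsing depends on its line format and is omitted here.
combined = pd.concat([pd.DataFrame(rows), csv_qa[["question", "answer"]]],
                     ignore_index=True).drop_duplicates().dropna()
combined.to_csv("combined_data.csv", index=False)
```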
- Run `LSTM-LSTM_training.ipynb` to train the RNN model
- Run `DistilRoberta-LSTM_training.ipynb` to train the Hybrid model
- Run `GPT2_Training.ipynb` to train the Transformer model
Note: Instructions on how to run the models are included in the notebooks
- Deploy the chatbot on Telegram:
- Each `.ipynb` file (`LSTM-LSTM_training.ipynb`, `DistilRoberta-LSTM_training.ipynb`, `GPT2_Training.ipynb`) includes a Telegram hosting block of code at the end.
- To deploy the chatbot, simply run the Telegram block, ensuring you replace `YOUR_TOKEN` with the API token obtained from BotFather on Telegram.
- To get a token: search for @BotFather on Telegram and start a chat. Type /newbot and follow the instructions to name your bot and set a username (it must end with "bot", e.g., my_test_bot). Thereafter, BotFather will provide the API token.
- This block integrates the trained model with a Telegram bot, allowing real-time interaction with users.
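The exact hosting block lives in each notebook; a minimal sketch of the idea, assuming the `python-telegram-bot` package (v20+) and a hypothetical `generate_answer` wrapper around the trained model:

```python
from telegram import Update
from telegram.ext import Application, ContextTypes, MessageHandler, filters

def generate_answer(question: str) -> str:
    # Hypothetical wrapper: run the trained model's inference on the question.
    return "..."

async def reply(update: Update, context: ContextTypes.DEFAULT_TYPE) -> None:
    # Feed the user's message to the model and send back its answer.
    await update.message.reply_text(generate_answer(update.message.text))

app = Application.builder().token("YOUR_TOKEN").build()  # token from @BotFather
app.add_handler(MessageHandler(filters.TEXT & ~filters.COMMAND, reply))
app.run_polling()  # poll Telegram for incoming messages
```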
Each model was evaluated using the following metrics (a scoring sketch follows the list):
- BERTScore: Measures semantic similarity between responses and ground truth.
- BLEU: Evaluates precision of n-grams in generated responses.
- ROUGE: Assesses overlap of unigrams and sequences between generated and reference answers.
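A minimal scoring sketch, assuming the `bert-score`, `nltk`, and `rouge-score` packages; the example strings are illustrative:

```python
from bert_score import score as bert_score
from nltk.translate.bleu_score import sentence_bleu, SmoothingFunction
from rouge_score import rouge_scorer

candidate = "A stack is a LIFO data structure."
reference = "A stack is a last-in, first-out data structure."

# BERTScore: semantic similarity via contextual embeddings.
_, _, f1 = bert_score([candidate], [reference], lang="en")
print("BERTScore F1:", f1.item())

# BLEU: n-gram precision of the generated answer (smoothed for short texts).
print("BLEU:", sentence_bleu([reference.split()], candidate.split(),
                             smoothing_function=SmoothingFunction().method1))

# ROUGE-1 / ROUGE-L: unigram and longest-common-subsequence overlap.
scorer = rouge_scorer.RougeScorer(["rouge1", "rougeL"], use_stemmer=True)
scores = scorer.score(reference, candidate)
print("ROUGE-1:", scores["rouge1"].fmeasure, "ROUGE-L:", scores["rougeL"].fmeasure)
```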
LSTM-LSTM RNN Architecture:
- BERTScore: 0.95
- BLEU: 0.75
- ROUGE-1: 0.84
- ROUGE-L: 0.82

DistilRoberta-LSTM Hybrid Architecture:
- BERTScore: 0.97
- BLEU: 0.71
- ROUGE-1: 0.79
- ROUGE-L: 0.77

GPT-2 Transformer Architecture:
- BERTScore: 0.87
- BLEU: 0.18
- ROUGE-1: 0.38
- ROUGE-L: 0.34
- GPT-2 Transformer performed well in generating coherent and semantically meaningful answers.
- DistilRoberta-LSTM Hybrid excelled in contextual understanding, and had the highest BERTScore.
- LSTM-LSTM RNN provided satisfactory results but sometimes lacked depth/relevance to the question.
- Advanced Inference Techniques:
- Implement Beam Search to enhance response coherence (see the sketch after this list).
- Parameter Optimisation:
- Use Grid Search for fine-tuning hyperparameters.
- Fallback Mechanisms:
- Integrate BERTScore for real-time similarity checks and provide alternative resources for low-confidence responses.
- Dataset Expansion:
- Include diverse interview topics beyond computer science, such as behavioural questions and case studies.
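A framework-agnostic sketch of the beam search mentioned above; `next_token_log_probs` is a hypothetical hook returning (token, log-probability) pairs from whichever model is being decoded:

```python
def beam_search(next_token_log_probs, start_token, end_token,
                beam_width=3, max_len=30):
    # Each beam is a (sequence, cumulative log-probability) pair.
    beams = [([start_token], 0.0)]
    for _ in range(max_len):
        candidates = []
        for seq, score in beams:
            if seq[-1] == end_token:        # finished beams carry over unchanged
                candidates.append((seq, score))
                continue
            for token, logp in next_token_log_probs(seq):
                candidates.append((seq + [token], score + logp))
        # Keep only the highest-scoring beams at each step.
        beams = sorted(candidates, key=lambda b: b[1], reverse=True)[:beam_width]
    return beams[0][0]                      # best-scoring sequence
```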
- Bryan Chua Jiaheng
- Leck Yan Qing Elvy
- Syahmim Chukhan Bin Shamsudin
- Thean Zhi Wei
Course: CS425: Natural Language Communication
Instructor: Dr. Wei Gao
Institution: Singapore Management University, AY 2024-2025 T1