Medical QA Fine-Tuning with PEFT and LoRA

This project demonstrates how to fine-tune a GPT-2 model for medical question-pair similarity tasks using Hugging Face's PEFT (Parameter-Efficient Fine-Tuning) library and LoRA (Low-Rank Adaptation). The goal is to achieve high performance while keeping the number of trainable parameters minimal.

Features

Lightweight Fine-Tuning: Uses LoRA to efficiently fine-tune GPT-2 without updating the entire model.
Medical QA Dataset: Processes medical question pairs to train a similarity classifier.
Metrics Logging: Tracks accuracy, F1 score, precision, and recall during training and evaluation.
WandB Integration: Logs training progress and performance metrics to Weights & Biases.
Comparison of Models: Evaluates the performance of the base model and the LoRA-tuned model.

Workflow

1. Dataset Preparation

The medical question-pair dataset is used to train a similarity classifier. This dataset contains pairs of medical questions, where each pair is labeled to indicate whether the questions are semantically similar or not.

2. Model Fine-Tuning

The base GPT-2 model is fine-tuned using LoRA, which updates only a small subset of parameters.
LoRA configuration includes:
- Low-rank matrices (r=8)
- Target modules (c_attn, c_proj)
- LoRA alpha scaling (lora_alpha=32)
- Dropout to prevent overfitting (lora_dropout=0.1)

3. Evaluation

The fine-tuned model is evaluated on the validation and test datasets. The following metrics are computed:

Accuracy
F1 Score
Precision
Recall

4. Performance Comparison

The performance of the base model and the LoRA-tuned model is compared. The results are visualized using a bar chart generated in WandB.

How to Run

Prerequisites

Python 3.10.x

Install required libraries:

pip install torch transformers datasets peft wandb scikit-learn pandas

Steps

Clone the repository:

git clone https://github.com/deepbiolab/gpt2-classification-peft.git
cd gpt2-classification-peft

Run the training script:
```
python main.py
```
View logs and metrics in WandB.

Results

After training, the fine-tuned model achieves improved performance compared to the base model while keeping the number of trainable parameters minimal. The comparison chart illustrates the accuracy improvement.

File Structure

main.py: Main script for training and evaluation.
assets/improved_performance.png: Visualization of model performance comparison.
README.md: Project documentation.

Key Libraries

Hugging Face Transformers: For model loading and training.
PEFT: Implements parameter-efficient fine-tuning methods like LoRA.
WandB: Logs training metrics and visualizations.
Scikit-learn: Computes evaluation metrics.

Future Work

Experiment with different LoRA configurations to optimize performance.
Apply PEFT techniques to other tasks like text summarization or translation.

License

This project is licensed under the MIT License. See the LICENSE file for details.

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
assets		assets
peft_model		peft_model
test_results		test_results
.DS_Store		.DS_Store
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
main.py		main.py
medical_qa_peft.ipynb		medical_qa_peft.ipynb
report.md		report.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Medical QA Fine-Tuning with PEFT and LoRA

Features

Workflow

1. Dataset Preparation

2. Model Fine-Tuning

3. Evaluation

4. Performance Comparison

How to Run

Prerequisites

Steps

Results

File Structure

Key Libraries

Future Work

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Medical QA Fine-Tuning with PEFT and LoRA

Features

Workflow

1. Dataset Preparation

2. Model Fine-Tuning

3. Evaluation

4. Performance Comparison

How to Run

Prerequisites

Steps

Results

File Structure

Key Libraries

Future Work

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages