Deep Learning Models Guide

This guide explains how to train and use the new deep learning models in the NLP Sentiment Analysis project.

Quick Start

Train Your First Deep Learning Model

# Basic CNN training with both TensorFlow and PyTorch
python main.py --train-dl

# Use pre-trained GloVe embeddings for better accuracy
python main.py --train-dl --use-embeddings

View Results in Dashboard

After training, launch the dashboard to see your models:

streamlit run app.py

Navigate to the "Model Performance" tab to see:

✅ Trained DL models highlighted at the top
📊 Performance comparison with classical ML models
🎯 Categorized model rankings
📈 Top performing models

Available Models

CNN (Convolutional Neural Network)

TensorFlow Implementation:

python main.py --train-dl --dl-framework tensorflow --dl-model-type cnn

PyTorch Implementation:

python main.py --train-dl --dl-framework pytorch --dl-model-type cnn

Architecture:

Embedding layer (128d or pre-trained)
Parallel 1D Conv layers with filter sizes 3, 4, 5 (trigrams, 4-grams, 5-grams)
Global max pooling
Dropout (0.5)
Dense layers with softmax output

Expected Performance:

With learned embeddings: 91-92% accuracy
With GloVe embeddings: 92-94% accuracy

BiLSTM (Bidirectional LSTM)

PyTorch Only:

python main.py --train-dl --dl-framework pytorch --dl-model-type lstm

Architecture:

Embedding layer (128d or pre-trained)
2-layer Bidirectional LSTM (128 hidden units)
Dropout between layers
Dense layer with softmax output

Expected Performance:

With learned embeddings: 90-92% accuracy
With GloVe embeddings: 91-93% accuracy

Pre-trained Embeddings

Available Embeddings

Embedding	Dimension	Vocabulary	Best For
`glove-wiki-gigaword-100`	100d	400K	Fast training, good accuracy
`glove-wiki-gigaword-200`	200d	400K	Better accuracy
`glove-wiki-gigaword-300`	300d	400K	Best accuracy, slower
`word2vec-google-news-300`	300d	3M	News/articles domain
`glove-twitter-100`	100d	1.2M	Social media text
`glove-twitter-200`	200d	1.2M	Social media (better)
`fasttext-wiki-news-subwords-300`	300d	1M	Handles rare words

Using Custom Embeddings

# Use Word2Vec
python main.py --train-dl --use-embeddings \
  --embedding-name word2vec-google-news-300

# Use FastText
python main.py --train-dl --use-embeddings \
  --embedding-name fasttext-wiki-news-subwords-300

Training Parameters

Basic Parameters

python main.py --train-dl \
  --dl-epochs 20 \              # Number of training epochs (default: 10)
  --dl-batch-size 64 \          # Batch size (default: 32)
  --dl-framework both \         # tensorflow, pytorch, or both
  --dl-model-type both          # cnn, lstm, or both

Advanced Training

# Train everything with optimal settings
python main.py --train-dl \
  --dl-framework both \
  --dl-model-type both \
  --use-embeddings \
  --embedding-name glove-wiki-gigaword-300 \
  --dl-epochs 25 \
  --dl-batch-size 64 \
  --sample-size 50000

Monitoring Training

TensorBoard

Training metrics are automatically logged to TensorBoard:

# View training metrics in real-time
tensorboard --logdir logs/tensorboard

Open http://localhost:6006 to see:

Training/validation loss curves
Training/validation accuracy
Model architecture graphs
Embedding visualizations

Training Output

The training script provides:

Progress bars for each epoch
Real-time accuracy and loss metrics
Early stopping notifications
Best model checkpoints

Model Files

Trained models are saved to models/dl/:

models/dl/
├── cnn_tensorflow.keras              # TensorFlow CNN, learned embeddings
├── cnn_tensorflow_pretrained.keras   # TensorFlow CNN, GloVe embeddings
├── cnn_pytorch.pt                    # PyTorch CNN, learned embeddings
├── cnn_pytorch_pretrained.pt         # PyTorch CNN, GloVe embeddings
├── lstm_pytorch.pt                   # PyTorch LSTM, learned embeddings
└── lstm_pytorch_pretrained.pt        # PyTorch LSTM, GloVe embeddings

Dashboard Integration

Model Performance Tab

The Model Performance tab shows:

DL Models Detected - Green cards showing trained models
Performance Metrics Table - Categorized by model type:
- 🧠 Deep Learning
- 🤖 Transformers
- 📊 Classical ML
- 📝 Rule-based
Accuracy Comparison - Bar chart colored by category
Top 3 Models - Best performing models
Model Type Breakdown - Average accuracy by category

Sidebar

The sidebar shows DL model status:

✅ Number of trained DL models
Expandable list with details
Quick training command

Performance Benchmarks

On Amazon Reviews Dataset (50K samples)

Model	Accuracy	F1 Score	Training Time*	Inference Speed
CNN + GloVe (TF)	93.2%	0.931	~5 min	Fast
CNN + GloVe (PT)	93.1%	0.930	~6 min	Fast
CNN (TensorFlow)	91.8%	0.917	~4 min	Fast
CNN (PyTorch)	91.7%	0.916	~5 min	Fast
BiLSTM + GloVe (PT)	92.5%	0.924	~12 min	Medium
BiLSTM (PyTorch)	91.2%	0.911	~10 min	Medium
Logistic Regression	89.6%	0.896	~30 sec	Very Fast
Naive Bayes	87.1%	0.871	~20 sec	Very Fast

*GPU training times (NVIDIA RTX 3060). CPU training is 3-5x slower.

GPU Support

The models automatically detect and use available GPUs:

NVIDIA CUDA - Best performance
Apple Silicon (MPS) - Good performance on M1/M2/M3 Macs
CPU - Fallback (slower but works everywhere)

Troubleshooting

Out of Memory

If you get OOM errors:

# Reduce batch size
python main.py --train-dl --dl-batch-size 16

# Or reduce sample size
python main.py --train-dl --sample-size 20000

Slow Training

If training is too slow:

# Use smaller embeddings
python main.py --train-dl --use-embeddings \
  --embedding-name glove-wiki-gigaword-100

# Or train only one framework
python main.py --train-dl --dl-framework pytorch

Models Not Showing in Dashboard

Check that models are in models/dl/ directory
Restart the Streamlit dashboard
Clear cache: Press 'C' in dashboard, then 'Clear cache'

API Integration

Deep learning models are also available via the REST API:

# List available models
curl http://localhost:8000/models

# Predict with DL model (coming soon)
curl -X POST http://localhost:8000/predict \
  -H "Content-Type: application/json" \
  -d '{
    "text": "This product is amazing!",
    "model": "cnn_pytorch_pretrained"
  }'

Best Practices

For Best Accuracy

Use GloVe 300d embeddings
Train on full dataset (50K+ samples)
Use 20-25 epochs with early stopping
Ensemble multiple models

For Fast Prototyping

Use GloVe 100d embeddings
Sample 10-20K reviews
Use 10 epochs
Train only one framework

For Production

Use pre-trained embeddings (smaller model size)
Choose TensorFlow CNN (better deployment support)
Save model with specific version
Monitor inference latency

Next Steps

Fine-tune DistilBERT: python main.py --train-transformer
Custom Architecture: Modify src/dl_models.py
Hyperparameter Tuning: Experiment with filter sizes, dropout rates
Transfer Learning: Use pre-trained BERT embeddings

Resources

Happy training! 🚀

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Deep Learning Models Guide

Quick Start

Train Your First Deep Learning Model

View Results in Dashboard

Available Models

CNN (Convolutional Neural Network)

BiLSTM (Bidirectional LSTM)

Pre-trained Embeddings

Available Embeddings

Using Custom Embeddings

Training Parameters

Basic Parameters

Advanced Training

Monitoring Training

TensorBoard

Training Output

Model Files

Dashboard Integration

Model Performance Tab

Sidebar

Performance Benchmarks

On Amazon Reviews Dataset (50K samples)

GPU Support

Troubleshooting

Out of Memory

Slow Training

Models Not Showing in Dashboard

API Integration

Best Practices

For Best Accuracy

For Fast Prototyping

For Production

Next Steps

Resources

FilesExpand file tree

DEEP_LEARNING_GUIDE.md

Latest commit

History

DEEP_LEARNING_GUIDE.md

File metadata and controls

Deep Learning Models Guide

Quick Start

Train Your First Deep Learning Model

View Results in Dashboard

Available Models

CNN (Convolutional Neural Network)

BiLSTM (Bidirectional LSTM)

Pre-trained Embeddings

Available Embeddings

Using Custom Embeddings

Training Parameters

Basic Parameters

Advanced Training

Monitoring Training

TensorBoard

Training Output

Model Files

Dashboard Integration

Model Performance Tab

Sidebar

Performance Benchmarks

On Amazon Reviews Dataset (50K samples)

GPU Support

Troubleshooting

Out of Memory

Slow Training

Models Not Showing in Dashboard

API Integration

Best Practices

For Best Accuracy

For Fast Prototyping

For Production

Next Steps

Resources