Undergraduate Researcher · B.Tech CSE (Data Science)
Heritage Institute of Technology, Kolkata
I am an undergraduate researcher interested in multi-agent reinforcement learning, computer vision, AI safety, LLMs, and applied ML.
Past projects include applied ML for cancer prognosis, thermographic image segmentation using hybrid CNN‑Transformer architectures, and a modified Safe RLHF pipeline for safety benchmarking and safer alignment of large language models. I also investigate failure modes of LLMs in mathematical reasoning.
My ongoing work focuses on RL environments and simulation for autonomous systems, in particular a multi-agent RL (MARL) drone simulator for defence applications. I am currently interning at IIT Kharagpur, working on India’s first genomic language model (IgLM).
🎯 I plan to pursue a PhD after my Bachelor’s degree.
| Degree | Institution | GPA / Score | Period |
|---|---|---|---|
| B.Tech in CSE (Data Science) | Heritage Institute of Technology, Kolkata | 8.876 / 10 | Aug 2023 – Jun 2027 |
Summer Internship 2026 · Advisor: Dr. Sourangshu Bhattacharya · On‑site
- Selected for the GRISHMA Summer Internship Program at IIT Kharagpur; working on India’s first genomic foundation model (IgLM), a population‑specific genomic language model built on the StripedHyena2 architecture.
Nov 2025 – Present · Advisor: Prof. Debotosh Bhattacharjee · Remote
- Achieved 98.10% Dice and 96.39% IoU on breast thermography tumor segmentation using a ResNet34 + ASPP + ViT decoder with SE attention, evaluated via 5‑fold cross‑validation.
- Conducted a rigorous 20‑run ablation study across U‑Net, ResNet‑only, and hybrid architectures, revealing generalization trade‑offs in medical imaging.
- Built an automated cross‑validation pipeline with checkpointing, experiment logging, and reproducible training for publication‑ready metrics.
Jun – Nov 2025 · Advisor: Dr. Arnob Ghosh · On‑site / Virtual
- Constructed a cost‑model training dataset for a modified Safe RLHF pipeline, leading to improved cost‑model performance.
- Designed and tested a comparative 800‑datapoint safety benchmark against state‑of‑the‑art LLMs (GPT, DeepSeek, Gemini, Qwen, Mistral).
- Demonstrated 50% higher safety compliance than ChatGPT 5 across benchmark scenarios.
Oct 2024 – Mar 2025 · Advisor: Ms. Arpita Talukdar · On‑site
- Developed a dual‑stage ML framework for breast cancer diagnosis and recurrence prediction using WDBC and WPBC datasets, evaluating RF, SVM, Logistic Regression, MLP, and XGBoost with stratified 10‑fold cross‑validation.
- Implemented RFE and SFS feature selection with SMOTE and GridSearchCV tuning, achieving 93.67% (WPBC) and 97.77% (WDBC) accuracy.
- Conducted comparative performance analysis and interpreted clinically relevant nuclear features with confidence intervals.
Ayushi Bhattacharjee, Arnesh Banerjee, Arpita Talukdar. 2026.
4th Analytics Global Conference (AGC 2026), March 2026.
Kartik Pandit, Sourav Ganguly, Arnesh Banerjee, Shaahin Angizi, Arnob Ghosh. 2025.
arXiv:2510.03520
Arnesh Banerjee, Debotosh Bhattacharjee.
Arnesh Banerjee. With the AI for Defence Lab, ULiège, Belgium.
Arnesh Banerjee, Ayushi Bhattacharjee, Subhajit Datta. Advisor: Prof. Subhajit Datta. B.Tech coursework.
Kartik Pandit, Sourav Ganguly, Arnesh Banerjee, Avirup Chakraborty, Arnob Ghosh. Advisor: Dr. Arnob Ghosh.
- 🥉 Department Third — 4th Semester, SGPA: 9.46, B.Tech CSE(DS), Heritage Institute of Technology.
- 🎓 Summer Internship Offers — Selected for IIT Kharagpur, 3× IIT Patna, IIT Dhanbad, and IIM Ahmedabad AI Venture Summer Internship 2026.
📎 Proofs: IIT KGP · IIT Patna 1 · IIT Patna 2 · IIT Patna 3 · IIM Ahmedabad
- 🏅 Institutional Innovation Council (IIC) — One of 10 members representing the CSE(DS) department, HIT Kolkata.
- 🎤 Oral Presentation — Presenting author at AGC 2026.
- 📊 WBJEE 2023 — Top 5.3% in West Bengal.
| Category | Technologies |
|---|---|
| Languages | Python, R, C, Java, SQL, LaTeX |
| Machine Learning | PyTorch, Scikit‑Learn, Keras, H2O, CatBoost, XGBoost, Reinforcement Learning, DQN, Double DQN, PPO, SAC |
| Simulation & RL Environments | Gymnasium, OpenAI Gym, Stable‑Baselines3, JAX |
| Data & Visualization | Pandas, NumPy, Matplotlib, Seaborn, Plotly |
| Frameworks & Tools | Flask, FastAPI, Shiny, Docker, Git |
| Relevant Coursework | Operating Systems, DBMS, Data Structures, Algorithms, Machine Learning, Data Mining, Data Warehousing |
Last updated May 2026 · Built with ❤️ by Arnesh Banerjee (j.k. I used some AI to make this look cooler by sending it my CV)




