ArneshBanerjee/README.md

Arnesh Banerjee

Undergraduate Researcher · B.Tech CSE (Data Science)
Heritage Institute of Technology, Kolkata



🔬 About Me

I am an undergraduate researcher interested in multi-agent reinforcement learning, computer vision, AI safety, LLMs, and applied ML.
Past projects include applied ML for cancer prognosis, thermographic image segmentation using hybrid CNN‑Transformer architectures, and a modified Safe RLHF pipeline for safety benchmarking and safer alignment of large language models. I also investigate failure modes of LLMs in mathematical reasoning.

My ongoing work focuses on RL environments and simulation for autonomous systems, in particular a MARL drone simulator for defence applications. I am currently interning at IIT Kharagpur, working on India’s first genomic language model (IgLM).

🎯 I plan to pursue a PhD after my Bachelor’s degree.


🧠 Research Interests

Multi-Agent RL · RL Environments · AI Safety · Deep Learning · Applied ML · LLMs · Computer Vision


🎓 Education

| Degree | Institution | GPA / Score | Period |
|---|---|---|---|
| B.Tech in Data Science | Heritage Institute of Technology, Kolkata | 8.876 / 10 | Aug 2023 – Jun 2027 |

🔬 Research Experience

IIT Kharagpur
Summer Internship 2026 · Advisor: Dr. Sourangshu Bhattacharya · On‑site
  • Selected for the GRISHMA Summer Internship Program at IIT Kharagpur; working on India’s first genomic foundation model (IgLM), a population‑specific genomic language model built on the StripedHyena2 architecture.
Jadavpur University
Nov 2025 – Present · Advisor: Prof. Debotosh Bhattacharjee · Remote
  • Achieved 98.10% Dice and 96.39% IoU on breast thermography tumor segmentation using a ResNet34 + ASPP + ViT decoder with SE attention, evaluated via 5‑fold cross‑validation.
  • Conducted a rigorous 20‑run ablation study across U‑Net, ResNet‑only, and hybrid architectures, revealing generalization trade‑offs in medical imaging.
  • Built an automated cross‑validation pipeline with checkpointing, experiment logging, and reproducible training for publication‑ready metrics.
New Jersey Institute of Technology
Jun – Nov 2025 · Advisor: Dr. Arnob Ghosh · On‑site / Virtual
  • Constructed a cost‑model training dataset for a modified Safe RLHF pipeline, leading to improved cost‑model performance.
  • Designed and tested a comparative 800‑datapoint safety benchmark against state‑of‑the‑art LLMs (GPT, DeepSeek, Gemini, Qwen, Mistral).
  • Demonstrated 50% higher safety compliance than ChatGPT 5 across benchmark scenarios.
Heritage Institute of Technology
Oct 2024 – Mar 2025 · Advisor: Ms. Arpita Talukdar · On‑site
  • Developed a dual‑stage ML framework for breast cancer diagnosis and recurrence prediction using WDBC and WPBC datasets, evaluating RF, SVM, Logistic Regression, MLP, and XGBoost with stratified 10‑fold cross‑validation.
  • Implemented RFE and SFS feature selection with SMOTE and GridSearchCV tuning, achieving 93.67% (WPBC) and 97.77% (WDBC) accuracy.
  • Conducted comparative performance analysis and interpreted clinically relevant nuclear features with confidence intervals.

📄 Publications

Recursive and Wrapper‑Based Feature Selection for Breast Cancer Diagnosis and Prognosis
Ayushi Bhattacharjee, Arnesh Banerjee, Arpita Talukdar. 2026.
4th Analytics Global Conference (AGC 2026), March 2026.

📝 Pre‑Prints

Certifiable Safe RLHF: Fixed‑Penalty Constraint Optimization for Safer Language Models
Kartik Pandit, Sourav Ganguly, Arnesh Banerjee, Shaahin Angizi, Arnob Ghosh. 2025.
arXiv:2510.03520
Multiscale Transformer‑Enhanced U‑Net with Level‑Set Regularization for Breast Thermography Segmentation
Arnesh Banerjee, Debotosh Bhattacharjee.

🚀 Ongoing Research

Co‑evolutionary Multi‑Agent RL for Autonomous Drones
Arnesh Banerjee. With the AI for Defence Lab, ULiège, Belgium.
Understanding the Limitations of LLMs in Mathematical Reasoning
Arnesh Banerjee, Ayushi Bhattacharjee, Subhajit Datta. Advisor: Prof. Subhajit Datta. B.Tech coursework.
Analyzing Historical Revisionism in LLMs in the Context of Indian History
Kartik Pandit, Sourav Ganguly, Arnesh Banerjee, Avirup Chakraborty, Arnob Ghosh. Advisor: Dr. Arnob Ghosh.

🏆 Achievements

  • 🥉 Department Third — 4th Semester, SGPA: 9.46, B.Tech CSE(DS), Heritage Institute of Technology.
  • 🎓 Summer Internship Offers — Selected for IIT Kharagpur, 3× IIT Patna, IIT Dhanbad, and IIM Ahmedabad AI Venture Summer Internship 2026.
    📎 Proofs: IIT KGP · IIT Patna 1 · IIT Patna 2 · IIT Patna 3 · IIM Ahmedabad
  • 🏅 Institutional Innovation Council (IIC) — One of 10 members representing the CSE(DS) department, HIT Kolkata.
  • 🎤 Oral Presentation — Presenting author at AGC 2026.
  • 📊 WBJEE 2023 — Top 5.3% in West Bengal.

💻 Technical Skills

| Category | Technologies |
|---|---|
| Languages | Python, R, C, Java, SQL, LaTeX |
| Machine Learning | PyTorch, Scikit‑Learn, Keras, H2O, CatBoost, XGBoost, Reinforcement Learning, DQN, Double DQN, PPO, SAC |
| Simulation & RL Environments | Gymnasium, OpenAI Gym, Stable‑Baselines3, JAX |
| Data & Visualization | Pandas, NumPy, Matplotlib, Seaborn, Plotly |
| Frameworks & Tools | Flask, FastAPI, Shiny, Docker, Git |
| Relevant Coursework | Operating Systems, DBMS, Data Structures, Algorithms, Machine Learning, Data Mining, Data Warehousing |

🤝 Let’s Connect

Email · Website

Last updated May 2026 · Built with ❤️ by Arnesh Banerjee (j.k. I used some AI to make this look cooler by sending it my CV)

Pinned

  1. Smart-Competency-Diagnostic-and-Candidate-Profile-Score-Calculator

    Jupyter Notebook

  2. ni5arga/deanonymizer

    Deanonymize anyone based on their public commenting or posting history & pattern.

    TypeScript 297 84

  3. autoresearch-generalized

    A framework for autonomous ML research — configurable autoresearch for any domain

    Shell 1

  4. Moonquake-Classification

    Jupyter Notebook