Skip to content
View Ishan-AI-coder's full-sized avatar

Block or report Ishan-AI-coder

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Ishan-AI-coder/README.md

ISHAN SARKAR

Building Production AI Systems


Designing reliable AI systems where deterministic software engineering meets practical machine learning.


About

Building complete AI systems from backend infrastructure to intelligent applications.

My work focuses on production-grade AI engineering, combining scalable backend development with modern LLM architectures including Agentic AI, Hybrid RAG, Multi-Agent Systems, and Healthcare AI.

I enjoy solving engineering problems where deterministic software meets probabilistic models—designing systems that are reliable, observable, and safe for real-world deployment.


Core Focus

  • Production AI Systems
  • Backend Engineering
  • Agentic AI
  • Retrieval-Augmented Generation
  • Healthcare AI
  • AI Infrastructure

Currently Building

🏥 Synapse Clinical AI

Production-ready
clinical AI platform

🤖 Agentic AI

LangGraph
Pydantic AI
Multi-Agent Systems

🧠 AI Infrastructure

Hybrid RAG

Qdrant

FastAPI

PostgreSQL

🔬 Research

Healthcare AI

Model Context Protocol

LLM Engineering




About Me

I build production-grade AI systems that combine reliable software engineering with practical machine learning. My interests lie at the intersection of backend engineering, large language models, and intelligent systems—where scalable infrastructure meets probabilistic AI.

Rather than treating an LLM as the entire application, I design systems in which the model is only one component of a larger architecture. Authentication, data modeling, retrieval, orchestration, observability, and safety remain deterministic, while language models provide reasoning where they add the most value. This separation makes AI applications easier to understand, test, and maintain in production.

My recent work has focused on Agentic AI, Hybrid Retrieval-Augmented Generation (RAG), multi-agent workflows, and healthcare AI. I enjoy building systems that retrieve trusted context, coordinate specialized tools, and keep humans involved in high-impact decisions through structured approval workflows.

From secure FastAPI services and PostgreSQL-backed applications to LangGraph agents and vector search pipelines, I enjoy solving engineering problems that span the full stack of modern AI applications. I care as much about API design, data integrity, and system reliability as I do about model quality.

Ultimately, I'm interested in building AI software that is grounded, observable, and dependable—systems that engineers can reason about, users can trust, and organizations can confidently deploy.

Engineering Philosophy

Build deterministic software around probabilistic models. The intelligence should come from the AI; the reliability should come from the engineering.

Engineering at a Glance

🚀 Production AI

Building intelligent systems designed for real-world deployment.

  • Healthcare AI Platforms
  • Agentic AI Workflows
  • Hybrid RAG Pipelines
  • FastAPI Backend Services
  • Secure API Architectures
  • Human-in-the-Loop AI

🔬 Research Interests

Exploring the next generation of AI systems.

  • Multi-Agent Systems
  • Model Context Protocol (MCP)
  • Agent Memory
  • Long-Context Reasoning
  • LLM Evaluation
  • AI Infrastructure

🏗️ Currently Building

Projects I'm actively developing.

  • 🏥 Synapse Clinical AI
  • ⚙️ AI SDK & MCP Integrations
  • 📚 Production RAG Pipelines
  • 🤖 AI Automation Workflows

🎯 Open To

Looking for opportunities to build impactful AI systems.

  • AI Engineering Internships
  • Backend Engineering
  • Applied Machine Learning
  • Open Source Collaboration
  • Research Opportunities

Featured Engineering Projects


🏥 Synapse Clinical AI

Production-Grade Clinical AI Platform for End-to-End Patient Care

A full-stack clinical AI platform that combines secure backend engineering, agentic AI, hybrid retrieval, and human-in-the-loop validation to help clinicians manage patient care from a single interface.

Repository


Why I Built It

Most healthcare AI applications stop at answering questions.

I wanted to build something closer to a production system—one that combines secure backend services, AI-assisted clinical workflows, structured patient records, retrieval over historical consultations, and deterministic safety checks.

The goal wasn't simply to integrate an LLM, but to demonstrate how modern AI systems can coexist with reliable software engineering practices in high-stakes environments.


Platform Walkthrough

AI Diagnostic Copilot

Diagnostic Copilot

A LangGraph-powered clinical assistant capable of:

  • Patient search
  • Appointment scheduling
  • Medical literature search
  • Medical image analysis
  • Context-aware clinical reasoning

All recommendations pass through human approval before affecting patient records.


AI Audio Scribe

Audio Pipeline

Consultation recordings are automatically transformed into structured clinical information.

Pipeline:

Audio

↓

Gemini Transcription

↓

Structured Extraction

↓

Drug Interaction Checks

↓

Allergy Validation

↓

Patient Database

The entire workflow is deterministic after transcription, ensuring extracted information conforms to predefined schemas before storage.


Hybrid Clinical Retrieval

Patient Search

Every AI response is grounded using hybrid retrieval over previous consultations.

The retrieval pipeline combines

  • Dense embeddings
  • Sparse retrieval
  • Cohere reranking

to minimize hallucinations while maintaining relevant clinical context.


Report Analysis

Report Analysis

Upload laboratory reports or diagnostic documents to generate:

  • Clinical summaries
  • Abnormal value detection
  • Risk assessment
  • Recommended follow-up actions
  • Downloadable PDF reports

Architecture

flowchart LR

Doctor --> Streamlit

Streamlit --> FastAPI

FastAPI --> PostgreSQL

FastAPI --> LangGraph

FastAPI --> Gemini

LangGraph --> Tavily

LangGraph --> MedGemma

LangGraph --> Qdrant

Qdrant --> ClinicalContext

ClinicalContext --> FinalResponse

FinalResponse --> HumanApproval

HumanApproval --> Database
Loading

Engineering Highlights

🔒 Secure Backend

  • JWT Authentication
  • OAuth2
  • FastAPI
  • SQLAlchemy
  • PostgreSQL

Designed with a strict separation between application logic and AI reasoning.


🧠 Agentic Clinical Copilot

Built using LangGraph with specialized tools including

  • Patient Search
  • Scheduling
  • Literature Search
  • Medical Imaging
  • Hybrid Retrieval

Every action remains observable and auditable.


🎙️ AI Audio Scribe

Combines

  • Gemini transcription
  • Structured extraction
  • Drug interaction validation
  • Allergy detection

before committing changes to patient records.


📚 Grounded Clinical AI

Hybrid RAG over patient consultation history using

  • Dense retrieval
  • Sparse retrieval
  • Cohere reranking
  • Qdrant

ensuring responses remain grounded in verified patient context.


Technology Stack

Layer Technology Purpose
Backend FastAPI REST APIs
Authentication JWT / OAuth2 Secure Access
Database PostgreSQL Patient Records
ORM SQLAlchemy Persistence
Agent Framework LangGraph Diagnostic Copilot
LLM Gemini Transcription & Reasoning
Vision Model MedGemma (Ollama) Medical Imaging
Retrieval Qdrant Hybrid RAG
Search Tavily Live Literature Search
Frontend Streamlit Clinical Dashboard

Engineering Decisions

Rather than allowing the language model to directly manipulate patient data, every AI-generated output flows through deterministic validation layers before persistence.

Key design principles include:

  • Schema-constrained extraction
  • Human approval before updates
  • Deterministic backend logic
  • Retrieval-grounded reasoning
  • Explicit safety checks

These boundaries make the system easier to reason about and significantly reduce opportunities for incorrect AI outputs to propagate into patient records.


Lessons Learned

The most valuable engineering decision wasn't selecting the language model—it was defining clear boundaries around it.

Treating AI as a probabilistic component while enforcing reliability through deterministic software produced a system that is significantly easier to audit, maintain, and extend.


Roadmap

  • FHIR Integration
  • MCP Tool Servers
  • Agent Memory
  • Kubernetes Deployment
  • Evaluation Dashboard
  • Multi-Hospital Support

🤖 Health Nexus

Multi-Agent Healthcare Assistant

A healthcare assistant that orchestrates six specialized AI agents to provide personalized medical guidance and comprehensive report analysis.

Highlights

  • Six domain-specific AI agents
  • LLM-based intent routing
  • Parallel report analysis
  • PDF report generation
  • Interactive Streamlit interface

Repository

https://github.com/Ishan-AI-coder/Health-Nexus


📄 OmniRAG

Multimodal Research Assistant

A multimodal retrieval system capable of answering questions across multiple research papers, including figures, charts, and embedded images.

Highlights

  • Hybrid RAG
  • Concurrent image captioning
  • LangGraph tool-calling agent
  • Safe Python execution
  • Perfect RAGAS evaluation scores

Repository

https://github.com/Ishan-AI-coder/OmniRAG


📊 SheetSense

AI-Powered Data Analysis Platform

A multi-agent analytics platform that enables natural-language exploration of structured datasets without writing code.

Highlights

  • Four specialized AI agents
  • Chart generation
  • Context-aware Q&A
  • Sandboxed Pandas execution
  • Interactive dashboards

Repository

https://github.com/Ishan-AI-coder/SheetSense


Tech Stack

Core Languages



Backend Engineering



AI Engineering



Retrieval & Vector Search



Machine Learning



Data & Storage



Frontend & Visualization



Developer Tools


AI Engineering Experience

Area What I've Built Core Technologies
Agentic AI Built a production-style clinical copilot with multi-tool orchestration, deterministic workflows, and human-in-the-loop approval for clinical safety. LangGraph, LangChain, Gemini
Multi-Agent Systems Designed role-specialized AI teams with LLM-based intent routing, parallel agent execution, and consolidated multi-agent reasoning for healthcare assistance. Pydantic AI, Gemini
Retrieval-Augmented Generation (RAG) Engineered hybrid dense + sparse retrieval pipelines over medical documents and patient histories with grounded responses and citation-aware retrieval. Qdrant, FastEmbed, Cohere Rerank
Multimodal AI Built pipelines that process both text and embedded visual content inside PDFs using AI-generated image captions and vision-language models. Gemini Vision, PyMuPDF, MedGemma
LLM Engineering Developed structured-output pipelines, tool-calling agents, long-context workflows, and production-oriented prompt orchestration. Gemini, Ollama, HuggingFace
LLM Fine-Tuning Fine-tuned transformer models using parameter-efficient techniques for domain-specific adaptation. LoRA, QLoRA, HuggingFace
Backend Engineering Built JWT-secured REST APIs, asynchronous AI services, authentication flows, validation layers, and modular backend architectures. FastAPI, SQLAlchemy, PostgreSQL
AI Safety & Reliability Implemented schema-constrained extraction, deterministic validation, guardrails, and human-in-the-loop workflows for high-stakes AI systems. Pydantic, FastAPI, LangGraph
Evaluation & Observability Evaluated retrieval quality and groundedness using automated metrics while designing AI workflows for traceability and auditing. RAGAS, Qdrant
Data & Vector Infrastructure Built embedding pipelines, semantic search systems, relational schemas, and vector databases powering production-grade AI applications. PostgreSQL, Qdrant, FastEmbed
AI Interfaces Developed interactive applications for clinical workflows, document intelligence, and data analytics with visualization and human-in-the-loop interaction. Streamlit, Plotly, Matplotlib



Let’s Build Intelligent Systems That Scale Beyond Demos


Open to Opportunities

AI Engineering • Backend Engineering • Applied ML • Open Source • Research Collaborations




“Good engineering is making complex systems behave predictably — especially when the complexity comes from probabilistic models in production.”


Popular repositories Loading

  1. SheetSense SheetSense Public

    Python

  2. OmniRAG OmniRAG Public

    Python

  3. Synapse-Clinical-AI Synapse-Clinical-AI Public

    An end-to-end AI healthcare platform for patient management, diagnostic support, automated clinical documentation, and intelligent medical retrieval.

    Python

  4. Ishan-AI-coder Ishan-AI-coder Public

  5. website website Public

    Forked from kubeflow/website

    Kubeflow Website

    HTML