Skip to content
View taeyang0505's full-sized avatar
  • Inha University
  • INCHEON, SOUTH KOREA

Block or report taeyang0505

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
taeyang0505/README.md

Hi, I'm Taeyang

I'm an AI Engineering student at Inha University with a strong interest in Computer Vision and Multimodal AI. I've spent the last year digging into Vision-Language Models (VLMs) as an undergraduate research assistant — specifically on the problem of object hallucination, where models confidently describe things that aren't in the image.

Beyond research, I like building things that actually run. Most of my recent projects sit at the intersection of LLMs, RAG pipelines, and practical deployment.


What I'm working on

  • Finishing up RoboGuard, an RLAIF-based RAG agent for industrial robot manuals that uses a self-correction loop (LangGraph + Gemini) to catch and fix hallucinated responses before they reach the user
  • Exploring how inference-time defense strategies (like B-VCD) can make VLMs more reliable without any retraining

Open-source contributions

  • kubeflow/mcp-server #44 — added HuggingFace model-ID suggestions to the MCP server's pre_flight tool, with unit tests (under review)

Tech I use regularly

ML / DL PyTorch Hugging Face Transformers scikit-learn TensorFlow / Keras

LLM & RAG LangChain LangGraph ChromaDB Gemini API LangSmith

Vision OpenCV PIL

General Python Git Jupyter Google Colab macOS (Apple Silicon)


Projects

RoboGuard-RLAIF An enterprise RAG agent for UR10e robot technical support. The core idea: instead of just retrieving and generating, the system runs a judge model on every response and loops back to revise if it detects hallucinated content. Integrates InstructGPT-style reward modeling, Reflexion-style episodic memory, and Self-RAG critique tokens — all wired together in a cyclic LangGraph pipeline with a Streamlit UI and LangSmith tracing.

B-VCD: Mitigating Object Hallucination in VLMs Training-free, inference-time hallucination defense for VLMs. Perturbs the visual input with a physically-modeled degradation (motion blur + illumination attenuation + Poisson–Gaussian noise) and selects the most grounded answer via a degraded-image-grounded LLM-as-a-Judge (Gemini 2.5 Flash) on VizWiz-VQA.

Emotion-Aware Multimodal Chatbot Combines BiLSTM + Attention for text emotion recognition with EfficientNet for image emotion, then fuses both modalities to generate context-aware responses.

Pneumonia Classification CNN-based chest X-ray classifier built with TensorFlow/Keras.

Forest Cover Classification Multi-class tabular classification using tree-based and ensemble methods.


Certifications

IBM SkillsBuild

Agentic AI with LangChain & LangGraph

DeepLearning.ai

Retrieval Augmented Generation

Naver Boostcourse

AI Basic 1 AI Basic 2 Computer Vision Deep Learning 1 NLP Linear Algebra Model Compression

Codecademy

ML / AI Engineer Career Path DL with TensorFlow Intro to LLMs NLP Course Data & Programming Foundations for AI


Pinned Loading

  1. RoboGuard-RLAIF RoboGuard-RLAIF Public

    Building a zero-hallucination RAG agent for industrial robot manuals using RLAIF, LLM-as-a-Judge, and LangGraph.

    Python 1

  2. B-VCD-Mitigating-Object-Hallucination-in-VLMs B-VCD-Mitigating-Object-Hallucination-in-VLMs Public

    B-VCD: Mitigating Object Hallucination in VLMs for Visually Impaired Navigation using Visual Contrastive Decoding

    Jupyter Notebook 1

  3. Emotion_Aware_Multimodal_Chatbot_Project_Text_Image_and_Response_Generation Emotion_Aware_Multimodal_Chatbot_Project_Text_Image_and_Response_Generation Public

    Generating empathetic responses using multimodal emotion recognition from text and images with deep learning techniques.

    Jupyter Notebook 1

  4. Pneumonia-Classification-Project Pneumonia-Classification-Project Public

    Classifying pneumonia from chest X-ray images using python techniques and deep learning techniques.

    Jupyter Notebook 1

  5. us-medical-insurance-analysis-project us-medical-insurance-analysis-project Public

    Investigating a medical insurance costs dataset in a .csv file using the Python skills.

    Jupyter Notebook 1

  6. Forest_Cover_Classification_Project Forest_Cover_Classification_Project Public

    Predicting forest cover type (the most common kind of tree cover) using deep learning with TensorFlow.

    Jupyter Notebook 1