Engineering the Future of
Agentic AI & Large Language Models
Princeton-trained AI Scientist building production-grade Generative AI systems for finance and accounting. Inventor of 8 US Patents spanning LLM architectures, GNNs, and computer vision. Author of 10+ influential technical articles on agentic AI and LLM fine-tuning.
I'm a Principal Machine Learning Scientist at Sage, where I lead the architecture and deployment of Generative AI applications for finance and accounting. With a Ph.D. from Princeton University and a B.S. from National Taiwan University, I bring rigorous scientific discipline to practical AI systems that matter.
My work centers on Agentic AI — building systems that can reason, plan, and act autonomously. I design multi-agent orchestration pipelines using MCP, Pydantic AI, and LangChain, run distributed LLM fine-tuning with DeepSpeed & Ray on AWS, and ship production ML systems at scale.
I am an inventor on 8 US Patents and Applications spanning Generative AI, LLM architectures, Graph Neural Networks, and Computer Vision — and I actively contribute to the AI community through 10+ technical publications reaching thousands of practitioners globally.
Beyond technical work, I apply AI for social good: using Generative AI to advance math education and create impactful parenting resources for low-resource families.
Inventor on 8 US Patents and Applications spanning Generative AI, LLM architectures, Graph Neural Networks, Computer Vision, and MLOps. Click any patent to view the full filing.
Computer vision framework applying image masking and trained neural networks for accurate product identification and segmentation — enabling scalable visual understanding in multi-modal AI pipelines.
Iterative LLM architectures for handling complex, multi-step queries by dynamically leveraging external APIs and databases. Advances conversational AI beyond single-turn responses into persistent, reasoning-capable agents.
Novel methods for automated, context-aware prompt generation for large language models — improving response quality, task alignment, and downstream performance in enterprise AI applications.
Systematic framework for detecting and mitigating hallucinations in LLM outputs — a critical reliability layer for deploying trustworthy generative AI in high-stakes financial and enterprise environments.
GNN-based framework for constructing and training graph models over complex entity relationships — powering fraud detection, anomaly surfacing, and relational reasoning at enterprise scale.
Methods for detecting covariate drift in production ML systems — enabling continuous monitoring of input distribution shifts to maintain model accuracy and reliability in live financial AI deployments.
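The patent's specific method is not public, so as a purely illustrative baseline: covariate drift between a reference (training-time) feature distribution and a live one is often quantified with the Population Stability Index (PSI). Everything below — the function name, the bins, and the decision thresholds — is a conventional sketch, not the patented technique.

```python
import math

def psi(expected_fracs, actual_fracs, eps=1e-6):
    """Population Stability Index between two binned distributions.

    Each argument is a list of per-bin fractions summing to ~1.
    Common rule of thumb (an assumption, not from the patent):
    PSI < 0.1 stable, 0.1-0.25 moderate shift, > 0.25 significant drift.
    """
    total = 0.0
    for e, a in zip(expected_fracs, actual_fracs):
        e = max(e, eps)  # guard against empty bins before taking the log
        a = max(a, eps)
        total += (a - e) * math.log(a / e)
    return total

# Identical reference and live distributions -> PSI near zero
assert psi([0.25, 0.25, 0.25, 0.25], [0.25, 0.25, 0.25, 0.25]) < 1e-9

# A visibly shifted live distribution -> PSI above the 0.25 drift threshold
drifted = psi([0.25, 0.25, 0.25, 0.25], [0.10, 0.15, 0.30, 0.45])
```

In a monitoring loop, a check like this would run per feature on each scoring batch, with alerts raised when the index crosses the chosen threshold.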
Automated methods for identifying relevant data resources and computing derived metrics — providing intelligent data discovery and quantitative insight generation for AI-driven financial analytics.
Adaptive UI personalization system that dynamically tailors interface elements to individual user behavior and preferences — improving engagement and usability in enterprise software products.
Specialized in computational physics and numerical simulation. Developed deep expertise in high-performance computing and mathematical modeling — a rigorous foundation that now informs how I approach large-scale distributed ML systems.
10+ highly cited articles on Medium (Sage AI, Data Science Collective) and Towards Data Science, focusing on LLM fine-tuning, agentic workflows, and scaling laws.
End-to-end implementation of an LLM agent using Model Context Protocol hosts and servers. Integrates open-source and proprietary LLMs via OpenAI-compatible APIs — a practical guide to production-ready agentic architecture.
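The article covers the full MCP host/server setup; the host-side pattern at its core — the model emits a structured tool call, the host dispatches it to a registered tool and returns the result — can be sketched without any SDK. All names here (`TOOLS`, `dispatch`, the invoice tool) are hypothetical stand-ins, not code from the article.

```python
import json

# Hypothetical registry standing in for tools an MCP server would expose.
TOOLS = {
    "get_invoice_total": lambda invoice_id: {"invoice_id": invoice_id,
                                             "total": 129.99},
}

def dispatch(tool_call_json):
    """Host-side step: parse a model-emitted tool call and run the tool."""
    call = json.loads(tool_call_json)
    tool = TOOLS[call["name"]]
    return tool(**call["arguments"])

# A model served behind an OpenAI-compatible API would emit something like:
model_output = json.dumps({"name": "get_invoice_total",
                           "arguments": {"invoice_id": "INV-42"}})
result = dispatch(model_output)  # then fed back to the model as the tool result
```

The real loop repeats this until the model stops requesting tools; the article walks through wiring the same cycle to actual MCP hosts and servers.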
Pioneering guide on instruction-following fine-tuning for LLMs in a distributed cluster framework. Covers DDP, memory optimization, and acceleration for 3B+ parameter models — a first-of-its-kind resource for the community.
Low-code workflow architecture for building production LLM agents with multi-step reasoning, tool use, and open-source model serving.
Technical bridge between generative and discriminative AI paradigms — adapting LLM next-token prediction for classification tasks.
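One common way to make that adaptation (the general idea the article covers; its exact recipe may differ) is to score only the tokens that verbalize each label and normalize over that restricted set rather than the full vocabulary. The logit values and label tokens below are toy assumptions.

```python
import math

def classify_from_logits(next_token_logits, label_tokens):
    """Turn an LLM's next-token logits into class probabilities.

    next_token_logits: dict of token -> raw logit (hypothetical values here).
    label_tokens: dict of class name -> token that verbalizes it.
    Softmax runs over the label tokens only; the rest of the vocab is ignored.
    """
    scores = {cls: next_token_logits[tok] for cls, tok in label_tokens.items()}
    m = max(scores.values())  # subtract the max for numerical stability
    exps = {cls: math.exp(s - m) for cls, s in scores.items()}
    z = sum(exps.values())
    probs = {cls: e / z for cls, e in exps.items()}
    return max(probs, key=probs.get), probs

# Toy logits as if the prompt were "Sentiment of 'great product!':"
logits = {" positive": 4.1, " negative": 1.3, " the": 6.0}
label, probs = classify_from_logits(
    logits, {"pos": " positive", "neg": " negative"})
# label -> "pos"; " the" is high but irrelevant, so it never competes
```

Restricting the softmax this way is what lets a generative model act discriminatively without any change to its weights.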
Deploying a fully local, privacy-first voice assistant using lightweight LLMs — no GPU required. Part of the lightweight LLMs guide series.
Rigorous empirical evaluation of GPT-family models as embedders, with surprising findings about the effectiveness of different embedding strategies.
Multi-disciplinary hackathons organized by Sage Foundation, applying agentic AI to real-world challenges for underserved communities.
Problem: ParentText — a WhatsApp/SMS parenting course reaching families in South Africa, Mexico & Malaysia — saw completion fall by 50% when delivered without human coaches, with its static comic strips driving much of the drop-off.
Problem: STACK, the world's leading open-source math assessment system, lacked personalized learning paths. Educators spent excessive time compiling data; students had no adaptive progression.
Sharing expertise in Generative AI with the next generation of developers and researchers.
A hands-on course teaching students how to harness Generative AI to build real-world applications. Covers core GenAI concepts, prompt engineering, and practical implementation — empowering the next generation of AI practitioners through project-based learning.
Open to conversations about Agentic AI research, LLM systems, and impactful AI applications. Reach out on any platform below.