Data Scientist · AI Engineer

MINWOO
SOHN.

I TURN MESSY DATA INTO SYSTEMS THAT MATTER.

↗ GitHub↗ LinkedIn
$0M
Capital decisions informed
0×
Faster document retrieval
0%
Manual effort eliminated
Selected WorkView All →
01RAG KNOWLEDGE SYSTEM
RAG · k=5

Offline enterprise RAG with hybrid BM25 + vector retrieval, cross-encoder reranking, and zero external API exposure.

02DEEPAR GAS FORECASTING
nowP10 · P50 · P90

10-year probabilistic demand forecast using DeepAR with ensemble NWP weather covariates and a 4-model comparison dashboard.

03MARKET SIGNAL EXTRACTION
11 sectors · IT

NLP framework extracting thematic + sentiment signals from 68K S&P 500 news articles, validated against real sector returns using Alphalens.

04GAS NOTIFICATION PIPELINE
hourly · alerts

Near-real-time monitoring pipeline that replaced hourly manual SQL checks with automated fault detection and alert dispatch.

05ALPHAPULSE
OHLC · 5d

AI-driven stock advisor aggregating financial news, fundamentals, and economic signals into real-time LLM-generated investment analysis.

06AIRBNB RATING CLASSIFICATION
price?rooms?area?XGBoost · 30+ cities

Classification system predicting rating tiers for unrated Airbnb listings using PySpark on Databricks with XGBoost and Random Forest.

Experience
Jul 2024 – Present
Data Analyst
Knoxville Utilities Board
Production ML systems: RAG, probabilistic forecasting, automated monitoring pipelines, and AI education for 15+ stakeholders.
Jun – Aug 2023
Data Science Intern
Vanderbilt University
Architected and deployed a production LLM-based student assessment system — prompt design, backend logic, evaluation, and full deployment.
Aug 2022 – May 2023
Graduate Data Scientist
Vanderbilt Women's Basketball
Built ETL pipelines for high-frequency player sensor data, implemented ID-level encryption and integrity checks, and delivered time series injury-risk analysis to coaching staff. Improved data refresh efficiency by 80%.
2022 – 2024 · 2018 – 2022
MS Data Science · BS Civil & Environmental Engineering
Vanderbilt University · University of Illinois Urbana-Champaign
Teaching Assistant: Intro to Data Science, Fundamentals of Data Science. Dean's List. Engineering foundation that shapes how I think about ML systems in the real world.