I am a B.S. candidate in AI, Big Data & Management at Kookmin University, with a strong interest in Data Science, Statistical Analysis, and Machine Learning. Through academic research and industry projects, I have developed practical experience in Time Series Analysis, Text Mining, Missing Data Imputation, and Real-time Sensor-based Anomaly Detection, applying data-driven approaches to problems in trade policy, industrial systems, logistics, and consumer analytics.
My research interests include Data Science, Statistical Analysis, Machine Learning, Time Series Analysis, Text Mining, and Deep Learning. I am particularly interested in extracting meaningful insights from complex real-world data and leveraging them to support informed decision-making.
| Period | Institution | Details |
|---|---|---|
| Mar 2022 – Present | Kookmin University · Seoul | AI, Big Data & Management (Major) · Data Science Convergence (Secondary Major) |
| Mar 2019 – Feb 2022 | Daejeon Foreign Language High School · Daejeon | Major in Japanese |
| Activity | Period |
|---|---|
| Director of Planning, Student Council | Mar 2022 – Dec 2023 |
| D&A Data Analytics Society — Basic Session | Jun 2023 |
| D&A Data Analytics Society — ML Session | Nov 2023 |
| Organization | Role | Period | Location | Key Work |
|---|---|---|---|---|
| BITAmin | President & Executive Member | Jan 2024 – Feb 2025 | Seoul | Led Time Series, Recommendation System, Computer Vision, NLP projects; managed club operations |
| J-System Co., Ltd. | Intern, System IT Team | Aug 2024 – Dec 2024 | Anyang | PdM & quality optimization with IoT sensor data; PostgreSQL; FFT denoising & anomaly detection |
| GPANS SmartLo Co., Ltd. | Intern, Data Insight Team | Feb 2024 – Apr 2024 | Seoul | Geofencing-based logistics analysis; MongoDB & PostgreSQL; QA for Expresso app |
| Gallup Korea | Intern | Jan 2023 – Feb 2023 | Seoul | Survey data preprocessing & aggregation for Statistical Analysis |
| UDMTEK Co., Ltd. | Intern | Jun 2022 – Aug 2022 | Suwon | Secondary battery manufacturing data; AI equipment diagnostics proposal (K-water project) |
| # | Title | Status | Repo |
|---|---|---|---|
| [1] | Bae, S., Lee, J. The Impact of Non-Tariff Barriers on Korea's Automobile Exports: Focusing on China's THAAD Retaliation and the U.S. Inflation Reduction Act | Working Paper | NonTariffBarriers_paper |
| [2] | Bae, J., Choi, S., & Bae, S. (2025). An Efficient Method for Imputing Missing Values in Incomplete Process Data from High-Cost Data Acquisition Environments. JKSISE, 48(4), 129–141. | KCI (2025) | MissingDataImputation_paper |
🔗 GitHub links are attached only where a repository exists.
| Project | Type · Period | Keywords | Highlights | Repo |
|---|---|---|---|---|
| Non-Tariff Barriers & Auto Exports | Course · Trade and Big Data Sep 2025 – Dec 2025 |
ITSA, Non-Tariff Barriers | THAAD & IRA impact on auto exports; Extended ITSA with macro controls (KOSPI, base rate, oil, FX) | Link |
| Missing Data Imputation | Industry · CSM Co., Ltd. Jul 2025 – Dec 2025 |
MICE, MissForest, 1D-CNN | Cement process data imputation; MissForest most stable under nonlinearity; NRMSE & PFC evaluation | Link |
| AI vs. Human Text Analysis | Project | Morphological Analysis, Sentence Embeddings | Linguistic comparison of AI-generated vs. human-written texts | Link |
| Drug Safety Assessment (Pregnancy) | Course · Deep Learning Apr 2025 – Jun 2025 |
CNN, OCR, BioBERT | Medicine image + OCR; BioBERT integration; ChatGPT decision-support with prompt engineering | — |
| Beer Brand Review Analysis | Course · Text Data Analysis Apr 2025 – Jun 2025 |
NLP, KoSBERT, KoNLPy | Blog review preprocessing; sentiment & classification for brand insights | Link |
| Electric Vehicle Price Prediction | BITAmin Jan 2025 – Feb 2025 |
XGBoost, Random Forest | EV price prediction; VIF feature selection; ensemble learning | — |
| Retail Demand Forecasting | BITAmin Sep 2024 – Dec 2024 |
Prophet, LSTM | DLC sales forecasting; seasonality & trend modeling | Link |
| IoT Equipment Anomaly Detection | J-System Co., Ltd. Aug 2024 – Dec 2024 |
PostgreSQL, PdM, FFT, Six Sigma | FFT signal processing; real-time anomaly detection with Six Sigma thresholds | Link |
| Tourism Recommendation System | BITAmin Jun 2024 – Aug 2024 |
BERT, Word2Vec, TF-IDF, Streamlit | Review keyword extraction; sentiment analysis; personalized recommendation web app | Link |
| Voice Guidance for Visually Impaired | BITAmin Mar 2024 – May 2024 |
YOLOv8, MediaPipe, TTS | Real-time object detection & voice guidance; IoU tracking & face direction estimation | Link |
| Clothing Recommendation (TPO) | BITAmin Jan 2024 – Feb 2024 |
SegFormer, Semantic Segmentation, Cosine Similarity | Transformer segmentation; style-similarity outfit recommendation via Streamlit | Link |
| Survey Response Prediction | Course · Machine Learning Dec 2023 |
Random Forest, LightGBM, XGBoost, CatBoost | Panel response classification; feature engineering & ensemble modeling | Link |
| COVID-19 · Simpson's Paradox | High School Apr 2021 – Jul 2021 |
Statistical Analysis | Korea–Japan COVID comparison; Simpson's Paradox with testing rates | — |
| Category | Tools |
|---|---|
| Programming | |
| ML / DL | |
| Data & Tools | |
| Languages | 🇰🇷 Korean (Native) · 🇺🇸 English (Professional) · 🇯🇵 Japanese (Intermediate) |
