Skip to content
View baesunny's full-sized avatar
😃
😃

Block or report baesunny

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
baesunny/README.md

SeongYoon Bae

☀️ Sunny's DataLab

📍 Seoul, Korea · 📧 sunabc1023@kookmin.ac.kr · 📝 Tistory


👋 Introduction

I am a B.S. candidate in AI, Big Data & Management at Kookmin University, with a strong interest in Data Science, Statistical Analysis, and Machine Learning. Through academic research and industry projects, I have developed practical experience in Time Series Analysis, Text Mining, Missing Data Imputation, and Real-time Sensor-based Anomaly Detection, applying data-driven approaches to problems in trade policy, industrial systems, logistics, and consumer analytics.

My research interests include Data Science, Statistical Analysis, Machine Learning, Time Series Analysis, Text Mining, and Deep Learning. I am particularly interested in extracting meaningful insights from complex real-world data and leveraging them to support informed decision-making.


🎓 Education

Period Institution Details
Mar 2022 – Present Kookmin University · Seoul AI, Big Data & Management (Major) · Data Science Convergence (Secondary Major)
Mar 2019 – Feb 2022 Daejeon Foreign Language High School · Daejeon Major in Japanese
Activity Period
Director of Planning, Student Council Mar 2022 – Dec 2023
D&A Data Analytics Society — Basic Session Jun 2023
D&A Data Analytics Society — ML Session Nov 2023

💼 Experience

Organization Role Period Location Key Work
BITAmin President & Executive Member Jan 2024 – Feb 2025 Seoul Led Time Series, Recommendation System, Computer Vision, NLP projects; managed club operations
J-System Co., Ltd. Intern, System IT Team Aug 2024 – Dec 2024 Anyang PdM & quality optimization with IoT sensor data; PostgreSQL; FFT denoising & anomaly detection
GPANS SmartLo Co., Ltd. Intern, Data Insight Team Feb 2024 – Apr 2024 Seoul Geofencing-based logistics analysis; MongoDB & PostgreSQL; QA for Expresso app
Gallup Korea Intern Jan 2023 – Feb 2023 Seoul Survey data preprocessing & aggregation for Statistical Analysis
UDMTEK Co., Ltd. Intern Jun 2022 – Aug 2022 Suwon Secondary battery manufacturing data; AI equipment diagnostics proposal (K-water project)

📄 Publications

# Title Status Repo
[1] Bae, S., Lee, J. The Impact of Non-Tariff Barriers on Korea's Automobile Exports: Focusing on China's THAAD Retaliation and the U.S. Inflation Reduction Act Working Paper NonTariffBarriers_paper
[2] Bae, J., Choi, S., & Bae, S. (2025). An Efficient Method for Imputing Missing Values in Incomplete Process Data from High-Cost Data Acquisition Environments. JKSISE, 48(4), 129–141. KCI (2025) MissingDataImputation_paper

🚀 Projects

🔗 GitHub links are attached only where a repository exists.

Project Type · Period Keywords Highlights Repo
Non-Tariff Barriers & Auto Exports Course · Trade and Big Data
Sep 2025 – Dec 2025
ITSA, Non-Tariff Barriers THAAD & IRA impact on auto exports; Extended ITSA with macro controls (KOSPI, base rate, oil, FX) Link
Missing Data Imputation Industry · CSM Co., Ltd.
Jul 2025 – Dec 2025
MICE, MissForest, 1D-CNN Cement process data imputation; MissForest most stable under nonlinearity; NRMSE & PFC evaluation Link
AI vs. Human Text Analysis Project Morphological Analysis, Sentence Embeddings Linguistic comparison of AI-generated vs. human-written texts Link
Drug Safety Assessment (Pregnancy) Course · Deep Learning
Apr 2025 – Jun 2025
CNN, OCR, BioBERT Medicine image + OCR; BioBERT integration; ChatGPT decision-support with prompt engineering
Beer Brand Review Analysis Course · Text Data Analysis
Apr 2025 – Jun 2025
NLP, KoSBERT, KoNLPy Blog review preprocessing; sentiment & classification for brand insights Link
Electric Vehicle Price Prediction BITAmin
Jan 2025 – Feb 2025
XGBoost, Random Forest EV price prediction; VIF feature selection; ensemble learning
Retail Demand Forecasting BITAmin
Sep 2024 – Dec 2024
Prophet, LSTM DLC sales forecasting; seasonality & trend modeling Link
IoT Equipment Anomaly Detection J-System Co., Ltd.
Aug 2024 – Dec 2024
PostgreSQL, PdM, FFT, Six Sigma FFT signal processing; real-time anomaly detection with Six Sigma thresholds Link
Tourism Recommendation System BITAmin
Jun 2024 – Aug 2024
BERT, Word2Vec, TF-IDF, Streamlit Review keyword extraction; sentiment analysis; personalized recommendation web app Link
Voice Guidance for Visually Impaired BITAmin
Mar 2024 – May 2024
YOLOv8, MediaPipe, TTS Real-time object detection & voice guidance; IoU tracking & face direction estimation Link
Clothing Recommendation (TPO) BITAmin
Jan 2024 – Feb 2024
SegFormer, Semantic Segmentation, Cosine Similarity Transformer segmentation; style-similarity outfit recommendation via Streamlit Link
Survey Response Prediction Course · Machine Learning
Dec 2023
Random Forest, LightGBM, XGBoost, CatBoost Panel response classification; feature engineering & ensemble modeling Link
COVID-19 · Simpson's Paradox High School
Apr 2021 – Jul 2021
Statistical Analysis Korea–Japan COVID comparison; Simpson's Paradox with testing rates

💻 Tech Stack

Category Tools
Programming Python R SQL Java
ML / DL PyTorch TensorFlow scikit-learn XGBoost OpenAI
Data & Tools PostgreSQL MySQL MongoDB Docker Jupyter Streamlit
Languages 🇰🇷 Korean (Native) · 🇺🇸 English (Professional) · 🇯🇵 Japanese (Intermediate)

📬 Connect

Gmail Tistory Blog Instagram

Pinned Loading

  1. NonTariffBarriers_paper NonTariffBarriers_paper Public

    The Impact of Non-Tariff Barriers on Korea’s Automobile Exports: Focusing on China’s THAAD Retaliation and the U.S. Inflation Reduction Act

    Jupyter Notebook

  2. MissingDataImputation_paper MissingDataImputation_paper Public

    AI based Data Augmentation and CTQ Prediction | An Efficient Method for Imputing Missing Values in Incomplete Process Data from High- Cost Data Acquisition Environments

    Jupyter Notebook

  3. retailDemandForecasting retailDemandForecasting Public

    유통물류센터 판매 데이터를 활용한 제품군별 수요 예측 모델 구축

    Jupyter Notebook

  4. J-system J-system Public

    IoT 센서를 이용한 설비 예지보전 및 품질 안정화

    Jupyter Notebook