Skip to content
View Tommylee1013's full-sized avatar
🐱
Studying
🐱
Studying

Block or report Tommylee1013

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Tommylee1013/README.md

πŸ‘‹ Hi, I’m Thomas

Undergraduate researcher at Sogang University (Seoul, Korea)
Majors in Financial Economics, Econometrics, Big Data Science, Japanese
Minor in AI & Convergence Software Programming

I focus on quantitative finance, causal inference in asset allocation, and financial machine learning.
My recent work centers on building explainable, causality-aware portfolio construction pipelines and robust backtesting frameworks.


πŸ”¬ Recent Research & Key Projects

Causal Asset Allocation

Causal Inference-Based Portfolio Optimization Pipeline

  • Role: Leader / Researcher / Engineer
  • Summary:
    A portfolio construction framework that explicitly models causal relationships between macro factors and asset clusters using DAGs. The pipeline supports intervention analysis and scenario-based allocation (e.g., shocks to rates, inflation, volatility).
  • Techniques: DAG, Causal Discovery, Double Machine Learning, Intervention Analysis, Portfolio Optimization
  • Keywords: Causal Inference, Asset Allocation, Macro Factors, Explainable Allocation

Posterior-NCO

Improving NCO Portfolio Allocation via Posterior Covariance and Bayesian Updating

  • Role: Leader / Researcher / Engineer
  • Summary:
    An extension of Nested Clustered Optimization (NCO) that replaces sample covariance with posterior covariance estimated via Bayesian updating. The framework integrates manager views and uncertainty to improve out-of-sample stability.
  • Techniques: NCO, Bayesian Updating, Monte Carlo Backtesting, Portfolio Optimization
  • Keywords: Portfolio Construction, Robust Covariance, Bayesian Finance

Causal Factor GAN

The Dawn of Explainable Quantitative Finance

  • Role: Leader / ML Engineer
  • Summary:
    A causal data generating process (DGP) combined with GANs to simulate factor-driven return dynamics under intervention and counterfactual scenarios. Enables stress-testing factor portfolios beyond correlation-based simulations.
  • Techniques: DAG-based Causal Modeling, Conditional GAN, QuantGAN, Time Series Generation
  • Keywords: Causal Inference, GAN, Factor Investing, Synthetic Data

AFML Code Implementation

Reproducible Implementations of Methods from Advances in Financial Machine Learning

  • Role: Contributor / Engineer
  • Summary:
    A clean, modular Python implementation of core methodologies from Marcos LΓ³pez de Prado’s Advances in Financial Machine Learning, including event-driven labeling, purged k-fold CV, meta-labeling, sample weighting, and fractional differentiation.
  • Techniques: Triple Barrier Labeling, Purged K-Fold CV, Meta-Labeling, Sample Weights, Fractional Differentiation
  • Keywords: Financial Machine Learning, Reproducibility, Backtesting Infrastructure

BOBP ETF Listing & Management

Design, Launch, and Ongoing Management of a Rules-Based ETF

  • Role: Product Lead / Quant Researcher
  • Summary:
    End-to-end design of the BOBP ETF, including index construction, portfolio rules, rebalancing logic, and live monitoring. Focused on building a transparent, rules-based ETF with robust backtesting and risk controls.
  • Techniques: Index Construction, Portfolio Rules, Rebalancing, Risk Management, Live Monitoring
  • Keywords: ETF Design, Asset Management, Index Methodology, Portfolio Operations

GDP Nowcasting with Explainable Machine Learning

  • Role: ML Engineer
  • Summary:
    High-frequency macro nowcasting framework using financial and textual indicators. Model explainability is emphasized via SHAP and feature attribution to interpret economic drivers of GDP revisions.
  • Techniques: XGBoost, SHAP, Macro Indicators, Text-based Features
  • Keywords: Nowcasting, Explainable AI, Macroeconomics

Synthetic Data Backtesting Framework

  • Role: ML Engineer
  • Summary:
    A backtesting framework using synthetic return paths generated by statistical and GAN-based models to evaluate robustness under distribution shifts.
  • Techniques: Time Series GAN, Monte Carlo Simulation, Robust Backtesting
  • Keywords: Backtesting, Synthetic Data, Model Robustness

🧠 Research Interests

  • Causal Factor Investing
  • Causal Inference for Asset Allocation
  • LLM-Driven Portfolio Allocation & Decision Support
  • Machine Learning-Based Asset Allocation
  • Alternative Data Feature Engineering
  • Explainable Financial Machine Learning
  • Robust Portfolio Optimization (NCO, Bayesian Covariance)
  • Synthetic Data Generation for Backtesting

πŸ›  Tech Stack

Languages

  • Primary : Python
  • Secondary : SQL, JavaScript, Solidity
  • Others : R, Swift, C/C++

Core Libraries
NumPy, pandas, scikit-learn, PyTorch, statsmodels, econml

Domains
Financial Machine Learning, Quantitative Finance, Time Series Modeling, Causal ML, Portfolio Optimization


πŸ“« Contact

Email: junghun1013@icloud.com
GitHub: https://github.com/tommylee1013

Pinned Loading

  1. I'm a night πŸ¦‰ I'm a night πŸ¦‰
    1
    🌞 Morning    10 commits  β–Žβ–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘   1.7%
    2
    πŸŒ† Daytime   216 commits  β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘  36.7%
    3
    πŸŒƒ Evening   185 commits  β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Œβ–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘  31.4%
    4
    πŸŒ™ Night     178 commits  β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Žβ–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘  30.2%
  2. QuantifiSogang/QuantifiLib QuantifiSogang/QuantifiLib Public

    QuantifiLib is a modular Python library for event-driven strategy research, developed by Quantifi Sogang. It supports data loading, signal engineering, backtesting, portfolio optimization, time ser…

    Python 4 5

  3. FinancialMachineLearning FinancialMachineLearning Public

    Financial Machine Learning Repository

    Python 11 7

  4. QuantifiSogang/MLFinance QuantifiSogang/MLFinance Public

    2024학년도 1ν•™κΈ° MLfinLab Project Team repository

    Python 7 8

  5. EconomicCycle EconomicCycle Public

    2023학년도 2ν•™κΈ° 경기변동둠 ν”„λ‘œμ νŠΈ νŽ˜μ΄μ§€

    Python 1

  6. nco-research nco-research Public

    This repository contains research code and experiments for Posterior-NCO, an extension of Nested Clustered Optimization (NCO) that incorporates Managers’ Views via Bayesian posterior covariance/cor…

    Python