Skip to content
View ny1031's full-sized avatar
🐯
🐯

Block or report ny1031

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
ny1031/README.md

Hi, I'm Nayeon πŸ‘‹

AI/ML Researcher at Kakao Corp., working on LLM pre-training. Personally diving deep into alignment (SFT, RLHF/RLVR) and modern architectures.

Google Scholar LinkedIn Hugging Face


πŸŽ“ Background

  • M.S. in Artificial Intelligence, Seoul National University
  • B.S. in Biology and Business Administration, Seoul National University

πŸ’Ό Currently

  • LLM Pre-training @ Kakao Corp.
  • Developing the Kanana model family β€” Kakao's bilingual LLMs
  • Building large-scale pre-training pipelines on JAX/TPU and PyTorch/GPU

πŸ”¬ Research Interests

  • Pre-training at scale β€” Β΅P, hyperparameter transfer, scaling laws
  • Architectures β€” Mixture-of-Experts, Linear attention (DeltaNet, Gated DeltaNet), MLA
  • Alignment β€” SFT, RLHF, RLVR, GRPO

🎀 Talks & Writings

  • Kanana-2 μ–Έμ–΄λͺ¨λΈ 훑어보기
    Instruct.KR 2025 Dec Meetup β€” Agents, Seoul, Dec 2025 Β· [event] β€” co-presented Pre-training of Kakao's MoE LLM
  • Featured in μš”μ¦˜IT
    Speaker interview ahead of Instruct.KR Dec 2025 β€” on Kanana team's MoE development journey
  • λ°μ΄ν„°λŠ” μ—†μ§€λ§Œ LLM은 ν•™μŠ΅ν•˜κ³  μ‹Άμ–΄ β€” Code, Math 데이터 개발기
    if(kakao) 25, Sep 2025 Β· [video] β€” data pipeline development for LLM pre-training (code & math corpora)
  • Kakao's journey with JAX and Cloud TPUs
    Google Cloud Blog, Aug 2025 β€” co-authored technical deep-dive on Kakao's JAX/TPU adoption for Kanana pre-training
  • κ΅­λ‚΄ 졜초 MoE λͺ¨λΈ 'Kanana-MoE' 개발기
    Kakao Tech Blog, Jul 2025 β€” co-authored development story of Korea's first open-source MoE LLM
  • Building and Serving the Next Generation AI Models with JAX
    Google Cloud Next '25, Las Vegas, Apr 2025 Β· [session]

πŸ“š Selected Publications

πŸ›  Stack

Python PyTorch JAX SLURM

πŸ“« Reach me


Pinned Loading

  1. TM-HGNN TM-HGNN Public

    [ACL 2023 Oral] Clinical Note Owns its Hierarchy: Multi-Level Hypergraph Neural Networks for Patient-Level Representation Learning

    Python 9 2

  2. kanana kanana Public

    Forked from kakao/kanana

    Kanana: Compute-efficient Bilingual Language Models