Joon Ha Kim
  • about
  • projects
  • notes (current)
  • cv
  • Disaggregated Serving

    notes on nvidia dynamo (framework for inference)

    4 min read   ·   September 12, 2025

    2025   ·   llm   inference

  • Study on vLLM

    background notes on vllm

    5 min read   ·   September 12, 2025

    2025   ·   llm   inference

  • Study on LLM Inference Communication

    background notes on llm inference communication

    4 min read   ·   August 17, 2025

    2025   ·   llm   communication   inference

  • Study on LLM Inference

    background notes on llm inference

    6 min read   ·   August 16, 2025

    2025   ·   llm

  • NCCL Intra/Inter-Node Communication

    understanding nccl intra/inter-node communication

    3 min read   ·   July 27, 2025

    2025   ·   communication   llm   training   inference

  • LLM Training Optimization: Megatron and Deepspeed

    notes on megatron and deepspeed

    8 min read   ·   July 02, 2025

    2025   ·   llm   training

  • Survey on LLM Training Today

    general notes on sota llm infrastructure

    7 min read   ·   June 19, 2025

    2025   ·   llm

  • Parallelism in Distributed Training

    3 min read   ·   June 02, 2025

    2025   ·   llm   training   inference

© Copyright 2025 Joon Ha Kim. Last updated: September 13, 2025.