Joon Ha Kim
  • llm
  • training
  • communication
  • inference
  • Study on Batching

    Orca notes

    6 min read   ·   September 21, 2025

    2025   ·   llm   inference

  • Tools

    Tools Used Recently

    3 min read   ·   September 18, 2025

    2025   ·   llm   inference   simulation

  • Disaggregated Serving

    notes on NVIDIA Dynamo (an inference framework)

    6 min read   ·   September 12, 2025

    2025   ·   llm   inference

  • Study on vLLM

    background notes on vLLM

    5 min read   ·   September 12, 2025

    2025   ·   llm   inference

  • Inference Simulation Notes

    Quotes from Papers that Hint at the Significance of Communication in Inference

    4 min read   ·   September 12, 2025

    2025   ·   llm   inference   simulation

  • Study on LLM Inference Communication

    background notes on LLM inference communication

    4 min read   ·   August 17, 2025

    2025   ·   llm   communication   inference

  • Study on LLM Inference

    background notes on LLM inference

    6 min read   ·   August 16, 2025

    2025   ·   llm

  • NCCL Intra/Inter-Node Communication

    understanding NCCL intra/inter-node communication

    3 min read   ·   July 27, 2025

    2025   ·   communication   llm   training   inference

  • LLM Training Optimization: Megatron and DeepSpeed

    notes on Megatron and DeepSpeed

    8 min read   ·   July 02, 2025

    2025   ·   llm   training

  • Survey on LLM Training Today

    general notes on SOTA LLM infrastructure

    7 min read   ·   June 19, 2025

    2025   ·   llm

  • Parallelism in Distributed Training

    3 min read   ·   June 02, 2025

    2025   ·   llm   training   inference

© Copyright 2025 Joon Ha Kim. Last updated: September 22, 2025.