notes | Joon Ha Kim

Study on Batching

Orca notes

6 min read · September 21, 2025

2025 · llm inference
Tools

Tools Used Recently

3 min read · September 18, 2025

2025 · llm inference simulation
Disaggregated Serving

notes on nvidia dynamo (framework for inference)

6 min read · September 12, 2025

2025 · llm inference
Study on vLLM

background notes on vllm

5 min read · September 12, 2025

2025 · llm inference
Inference Simulation Notes

Quotes from Papers that Hint at Significance of Communication in Inference

4 min read · September 12, 2025

2025 · llm inference simulation
Study on LLM Inference Communication

background notes on llm inference communication

4 min read · August 17, 2025

2025 · llm communication inference
Study on LLM Inference

background notes on llm inference

6 min read · August 16, 2025

2025 · llm
NCCL Intra/Inter-Node Communication

understanding nccl intra/inter-node communication

3 min read · July 27, 2025

2025 · communication llm training inference
LLM Training Optimization: Megatron and Deepspeed

notes on megatron and deepspeed

8 min read · July 02, 2025

2025 · llm training
Survey on LLM Training Today

general notes on sota llm infrastructure

7 min read · June 19, 2025

2025 · llm
Parallelism in Distributed Training

3 min read · June 02, 2025

2025 · llm training inference