-
Study on LLM Inference Communication
background notes on llm inference communication
-
Study on LLM Inference
background notes on llm inference
-
NCCL Intra/Inter-Node Communication
understanding nccl intra/inter-node communication
-
NVIDIA Dynamo
notes on nvidia dynamo (framework for inference)
-
LLM Training Optimization: Megatron and Deepspeed
notes on megatron and deepspeed
-
Survey on LLM Training Today
general notes on sota llm infrastructure
-
Parallelism in Distributed Training