-
NVIDIA Dynamo
notes on NVIDIA Dynamo (a framework for inference)
-
LLM Training Optimization: Megatron and DeepSpeed
notes on Megatron and DeepSpeed
-
Study on LLM Inference
background notes on LLM inference
-
Survey on LLM Training Today
general notes on SOTA LLM infrastructure
-
Parallelism in Distributed Training