Performing inference for a Llama 405B-BF16 model on Multi-Node SLURM Clusters with SGLang
Saved my life
Haha
Btw there is an indepth tutorial in the SGL Docs now based on this blog - https://docs.sglang.ai/references/multi_node_deployment/multi_node.html
Saved my life
Haha
Btw there is an indepth tutorial in the SGL Docs now based on this blog - https://docs.sglang.ai/references/multi_node_deployment/multi_node.html