Index of /traces/ai/vllm/Llama_3.1_70B_Instruct_N2_GPU8_TP8_Short_Prompts
Name                         Last modified     Size  Description
Parent Directory                               -
nsys_reports/                2025-08-28 13:04  -
vllm_inference.py            2025-08-28 13:12  6.3K
vllm_llama_N2_GPU8_PP8.bin   2025-08-28 13:11  99M
vllm_llama_N2_GPU8_PP8.goal  2025-08-28 13:09  147M
Apache/2.4.57 (Debian) Server at storage2.spcl.ethz.ch Port 80