NVIDIA TensorRT-LLM

Published on Apr 28, 2025
Updated on Jan 15, 2026

A TensorRT Toolbox for Optimized Large Language Model Inference

References

https://github.com/NVIDIA/TensorRT-LLM
https://nvidia.github.io/TensorRT-LLM/latest/