NVIDIA Triton Inference Server

==Introduction==
NVIDIA Triton Inference Server is open-source inference-serving software that standardizes model deployment and execution, providing fast and scalable AI in production environments. As part of the [[NVIDIA AI]] platform, Triton enables teams to deploy, run, and scale trained AI models from any framework on GPU- or CPU-based infrastructure, offering high-performance inference across cloud, on-premises, edge, and embedded devices.
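
Clients submit inference requests to Triton over its HTTP/REST or gRPC endpoints. The sketch below uses the official <code>tritonclient</code> Python package; the model name <code>simple_model</code> and the tensor names and shapes are hypothetical placeholders, not part of any real deployment.

<syntaxhighlight lang="python">
import numpy as np
import tritonclient.http as httpclient

# Connect to a Triton server running locally (default HTTP port is 8000).
client = httpclient.InferenceServerClient(url="localhost:8000")

# Hypothetical model with a single FP32 input "INPUT0" of shape [1, 16].
input_data = np.random.rand(1, 16).astype(np.float32)
infer_input = httpclient.InferInput("INPUT0", list(input_data.shape), "FP32")
infer_input.set_data_from_numpy(input_data)

# Request the output tensor "OUTPUT0" and run inference.
response = client.infer(
    model_name="simple_model",
    inputs=[infer_input],
    outputs=[httpclient.InferRequestedOutput("OUTPUT0")],
)
print(response.as_numpy("OUTPUT0"))
</syntaxhighlight>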


==Features==
=== Support for Multiple Frameworks ===
Triton supports all major training and inference frameworks, including TensorFlow, PyTorch, ONNX Runtime, NVIDIA TensorRT, and OpenVINO, so teams can serve models in production without converting them to a single proprietary format.
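
Because every backend is exposed through the same HTTP/gRPC API, a client can interact with models from different frameworks uniformly. A minimal sketch using the <code>tritonclient</code> Python package is shown below; the model names are hypothetical examples of models served by different backends.

<syntaxhighlight lang="python">
import tritonclient.http as httpclient

client = httpclient.InferenceServerClient(url="localhost:8000")

# Hypothetical model names; each could be backed by a different framework
# (e.g. a TensorRT plan, a TorchScript file, an ONNX model).
for model_name in ["resnet50_trt", "bert_pytorch", "detector_onnx"]:
    if client.is_model_ready(model_name):
        # The metadata reports which backend platform serves the model.
        meta = client.get_model_metadata(model_name)
        print(model_name, "->", meta["platform"])
</syntaxhighlight>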

