AI inference server
C25929
concept
An AI inference server is a system that hosts trained machine learning models and processes incoming requests to generate predictions or responses in real time.
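The definition above can be sketched as a toy HTTP inference server. This is a minimal illustration only: the `predict` function is a hypothetical stand-in for a real trained model, and the `/predict` route and JSON schema are assumptions, not the API of any listed instance.

```python
import json
import threading
import urllib.request
from http.server import BaseHTTPRequestHandler, HTTPServer

# Hypothetical stand-in for a trained model: scores a feature vector.
def predict(features):
    return {"score": sum(features) / max(len(features), 1)}

class InferenceHandler(BaseHTTPRequestHandler):
    def do_POST(self):
        # Read the JSON request body and run the "model" on it.
        body = self.rfile.read(int(self.headers["Content-Length"]))
        features = json.loads(body)["features"]
        payload = json.dumps(predict(features)).encode()
        self.send_response(200)
        self.send_header("Content-Type", "application/json")
        self.send_header("Content-Length", str(len(payload)))
        self.end_headers()
        self.wfile.write(payload)

    def log_message(self, fmt, *args):
        pass  # silence per-request logging

# Serve on an ephemeral port in a background thread.
server = HTTPServer(("127.0.0.1", 0), InferenceHandler)
threading.Thread(target=server.serve_forever, daemon=True).start()

# Client side: send one prediction request and read the response.
req = urllib.request.Request(
    f"http://127.0.0.1:{server.server_port}/predict",
    data=json.dumps({"features": [1.0, 2.0, 3.0]}).encode(),
    headers={"Content-Type": "application/json"},
)
with urllib.request.urlopen(req) as resp:
    result = json.loads(resp.read())
print(result)
server.shutdown()
```

Real serving systems such as those listed below add the parts this sketch omits: model loading and versioning, request batching, hardware acceleration, and horizontal scaling.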
Observed surface forms (5)
- AI inference platform ×1
- cloud-based image analysis service ×1
- model serving library ×1
- model serving system ×1
- single-record prediction API ×1
Instances (6)
- NVIDIA Triton Inference Server
- Ray Serve via concept surface "model serving library"
- TensorFlow Serving via concept surface "model serving system"
- Google Vision API via concept surface "cloud-based image analysis service"
- PredictionEngine via concept surface "single-record prediction API"
- NVIDIA inference platform via concept surface "AI inference platform"