Inference latency

shape