vllm.tracing
BaseSpanAttributes
SpanAttributes (source: vllm/tracing.py)
GEN_AI_LATENCY_TIME_IN_MODEL_DECODE (class-attribute, instance-attribute)
GEN_AI_LATENCY_TIME_IN_MODEL_EXECUTE (class-attribute, instance-attribute)
GEN_AI_LATENCY_TIME_IN_MODEL_FORWARD (class-attribute, instance-attribute)
GEN_AI_LATENCY_TIME_IN_MODEL_INFERENCE (class-attribute, instance-attribute)
GEN_AI_LATENCY_TIME_IN_MODEL_PREFILL (class-attribute, instance-attribute)
GEN_AI_LATENCY_TIME_IN_QUEUE (class-attribute, instance-attribute)
GEN_AI_LATENCY_TIME_IN_SCHEDULER (class-attribute, instance-attribute)
GEN_AI_LATENCY_TIME_TO_FIRST_TOKEN (class-attribute, instance-attribute)
GEN_AI_REQUEST_MAX_TOKENS (class-attribute, instance-attribute)
GEN_AI_REQUEST_TEMPERATURE (class-attribute, instance-attribute)
GEN_AI_REQUEST_TOP_P (class-attribute, instance-attribute)
GEN_AI_RESPONSE_MODEL (class-attribute, instance-attribute)
GEN_AI_USAGE_COMPLETION_TOKENS (class-attribute, instance-attribute)
GEN_AI_USAGE_NUM_SEQUENCES (class-attribute, instance-attribute)
GEN_AI_USAGE_PROMPT_TOKENS (class-attribute, instance-attribute)
contains_trace_headers
extract_trace_context
extract_trace_headers
get_span_exporter (source: vllm/tracing.py)
init_tracer (source: vllm/tracing.py)
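
The names contains_trace_headers and extract_trace_headers suggest helpers that detect and select trace propagation headers on an incoming request. The following is a minimal, self-contained sketch of that idea, assuming the W3C Trace Context header names `traceparent` and `tracestate`. The functions `has_trace_headers` and `pick_trace_headers` are hypothetical stand-ins written for illustration; they are not the vllm.tracing implementation, and the real signatures may differ.

```python
from collections.abc import Mapping

# Header names defined by the W3C Trace Context specification.
# Assumption: these are the headers a tracing helper would look for.
TRACE_HEADERS = ("traceparent", "tracestate")


def has_trace_headers(headers: Mapping[str, str]) -> bool:
    """Return True if at least one trace propagation header is present.

    Hypothetical helper; lookup is case-sensitive, so header names are
    assumed to be lower-cased already.
    """
    return any(name in headers for name in TRACE_HEADERS)


def pick_trace_headers(headers: Mapping[str, str]) -> dict[str, str]:
    """Return only the trace propagation headers found in `headers`.

    Hypothetical helper; headers that are absent are simply skipped.
    """
    return {name: headers[name] for name in TRACE_HEADERS if name in headers}


# Example request headers; the traceparent value follows the W3C format
# version-traceid-parentid-flags.
request_headers = {
    "content-type": "application/json",
    "traceparent": "00-4bf92f3577b34da6a3ce929d0e0e4736-00f067aa0ba902b7-01",
}

if has_trace_headers(request_headers):
    trace_headers = pick_trace_headers(request_headers)
    print(trace_headers)
```

Filtering down to the propagation headers before passing them on keeps unrelated request headers out of the tracing path, which is one plausible reason for separating detection from extraction.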