vllm.v1.core.kv_cache_metrics ¶
KV cache metrics tracking.
BlockMetricsState ¶
Tracks lifecycle metrics for a single KV cache block.
Source code in vllm/v1/core/kv_cache_metrics.py
KVCacheMetricsCollector ¶
Collects KV cache residency metrics with sampling.