vllm.v1.core.sched.output
CachedRequestData dataclass
Source code in vllm/v1/core/sched/output.py
_req_id_to_num_output_tokens cached property
Cached mapping from req_id to num_output_tokens, for O(1) lookup.
Caching is safe because CachedRequestData instances are created fresh on each scheduling iteration and are not mutated while that iteration's details are computed.
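
The caching pattern described above can be illustrated with a minimal sketch. It is not the vLLM source: the class name `CachedRequestDataSketch`, the helper `output_tokens_for`, and the assumption that the data is held as two parallel lists (`req_ids` and `num_output_tokens`) are all hypothetical and chosen only for illustration. The sketch also shows why the "not mutated" condition matters: a mutation after the first access is not reflected in the cached mapping.

```python
from dataclasses import dataclass
from functools import cached_property


@dataclass
class CachedRequestDataSketch:
    # Hypothetical stand-in, not the vLLM class: two parallel lists
    # where position i in each list describes the same request.
    req_ids: list[str]
    num_output_tokens: list[int]

    @cached_property
    def _req_id_to_num_output_tokens(self) -> dict[str, int]:
        # Built once on first access, then stored on the instance,
        # so later accesses do not rebuild the dict.
        return dict(zip(self.req_ids, self.num_output_tokens))

    def output_tokens_for(self, req_id: str) -> int:
        # Hypothetical helper: O(1) dict lookup instead of an O(n)
        # search through req_ids.
        return self._req_id_to_num_output_tokens[req_id]


data = CachedRequestDataSketch(req_ids=["a", "b"], num_output_tokens=[3, 7])
first = data.output_tokens_for("b")

# Mutating the list after the first access does not update the cached
# mapping, which is why the pattern relies on instances staying unchanged.
data.num_output_tokens[1] = 99
stale = data.output_tokens_for("b")
```

Because each instance lives for only one scheduling iteration, the stale-cache case in the last lines should not arise in practice; a fresh instance builds a fresh mapping.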