Modules:
| Name | Description | 
|---|---|
| benchmark |  | 
| collect_env |  | 
| main | The CLI entrypoints of vLLM | 
| openai |  | 
| run_batch |  | 
| serve |  | 
| types |  | 
 module-attribute  ¶
 __all__: list[str] = [
    "BenchmarkLatencySubcommand",
    "BenchmarkServingSubcommand",
    "BenchmarkThroughputSubcommand",
]
 
  Bases: BenchmarkSubcommandBase
The latency subcommand for vllm bench.
Source code in vllm/entrypoints/cli/benchmark/latency.py
  class-attribute instance-attribute  ¶
   classmethod  ¶
 add_cli_args(parser: ArgumentParser) -> None
 
  Bases: BenchmarkSubcommandBase
The serve subcommand for vllm bench.
Source code in vllm/entrypoints/cli/benchmark/serve.py
  classmethod  ¶
 add_cli_args(parser: ArgumentParser) -> None
 
  Bases: BenchmarkSubcommandBase
The throughput subcommand for vllm bench.
Source code in vllm/entrypoints/cli/benchmark/throughput.py
  classmethod  ¶
 add_cli_args(parser: ArgumentParser) -> None