python3 -m vllm.entrypoints.openai.api_server \ --served-model-name=vllm-a40-gpt-oss-20b \ --dtype=bfloat16 \ --port=5000 \ --model=/model/gpt-oss-20b \ --tensor ...
You can create a release to package software, along with release notes and links to binary files, for other people to use. Learn more about releases in our docs.