Get Started with vLLM

Hands-on notebooks for compressing, serving, and benchmarking LLMs with vLLM — built for the Red Hat OpenShift AI Developer Sandbox.

Notebooks

#	Folder	Notebook	What you'll do
1	`llm-compression/`	Optimizing a Model with LLM Compressor	Apply GPTQ W4A16 quantization to Qwen3-0.6B and compare it against the original
2	`vllm-inference/`	Serving LLMs Efficiently with vLLM	Query a vLLM server with the OpenAI-compatible API, explore logprobs, continuous batching, KV cache metrics, and prefix caching
3	`benchmarks-evals/`	Measuring What Matters: Benchmarking and Evaluation	Run GuideLLM serving benchmarks and lm_eval quality checks, then decide if a quantization tradeoff is worth deploying

The recommended flow is 1 → 2 → 3, but each notebook is self-contained — pick whichever topic interests you.

Note: Pre-quantized models are already included in llm-compression/ so you can skip straight to inference or benchmarking if you prefer.

Prerequisites

A free Developer Sandbox account
An OpenShift API token (used as your LLM API key) — follow the sandbox LLM guide to retrieve it

Getting started

Open OpenShift AI from the Developer Sandbox and create a workbench using the Jupyter Minimal — CPU — Python 3.12 image.

Clone this repo inside the workbench:

git clone https://github.com/cedricclyburn/get-started-with-vllm.git

Open the notebook you want, install its dependencies (pip install -r requirements.txt in the matching folder), and run the cells.

License

Apache 2.0

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
benchmarks-evals		benchmarks-evals
llm-compression		llm-compression
vllm-inference		vllm-inference
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Get Started with vLLM

Notebooks

Prerequisites

Getting started

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

Get Started with vLLM

Notebooks

Prerequisites

Getting started

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages