Skip to content

Commit 4a37293

Browse files
blues-manclaude
andcommitted
Add comprehensive Kubernetes deployment module
- Create 04-kubernetes.adoc with complete Kubernetes deployment guide - Cover basic pods, deployments, scaling, and production patterns - Include GPU resource management and NVIDIA GPU Operator integration - Add persistent storage, ConfigMaps, and secrets management - Implement horizontal pod autoscaling with custom policies - Configure ingress, load balancing, and TLS termination - Add Prometheus monitoring and ServiceMonitor configuration - Include multi-GPU deployments and StatefulSet patterns - Provide comprehensive troubleshooting and performance tuning - Update navigation and index page to include Kubernetes section 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <[email protected]>
1 parent 7a19c6e commit 4a37293

File tree

3 files changed

+736
-2
lines changed

3 files changed

+736
-2
lines changed

documentation/modules/ROOT/nav.adoc

Lines changed: 10 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -16,4 +16,13 @@
1616
** xref:03-advanced.adoc#optimization[Performance Optimization]
1717
** xref:03-advanced.adoc#streaming[Streaming Responses]
1818
** xref:03-advanced.adoc#custom[Custom Models & LoRA]
19-
** xref:03-advanced.adoc#scaling[Scaling & Load Balancing]
19+
** xref:03-advanced.adoc#scaling[Scaling & Load Balancing]
20+
21+
* xref:04-kubernetes.adoc[4. vLLM on Kubernetes]
22+
** xref:04-kubernetes.adoc#basic-deployment[Basic Deployment]
23+
** xref:04-kubernetes.adoc#deployment-with-scaling[Deployment with Scaling]
24+
** xref:04-kubernetes.adoc#persistent-storage[Persistent Storage]
25+
** xref:04-kubernetes.adoc#autoscaling[Horizontal Pod Autoscaling]
26+
** xref:04-kubernetes.adoc#ingress[Ingress Configuration]
27+
** xref:04-kubernetes.adoc#monitoring[Monitoring & Observability]
28+
** xref:04-kubernetes.adoc#advanced-patterns[Advanced Deployment Patterns]

0 commit comments

Comments
 (0)