From dedb38f5741d230254f940fc2b7862a759862451 Mon Sep 17 00:00:00 2001
From: kpwrennbu <kpwrenn@bu.edu>
Date: Thu, 26 Feb 2026 17:10:11 -0500
Subject: [PATCH 1/6] add Apache Kafka on OpenShift documentation under Other
 Tools On NERC

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
---
 docs/other-tools/kafka/kafka-on-openshift.md | 353 +++++++++++++++++++
 mkdocs.yaml                                  |   1 +
 2 files changed, 354 insertions(+)
 create mode 100644 docs/other-tools/kafka/kafka-on-openshift.md
diff --git a/docs/other-tools/kafka/kafka-on-openshift.md b/docs/other-tools/kafka/kafka-on-openshift.md
new file mode 100644
index 000000000..bfd195a7e
--- /dev/null
+++ b/docs/other-tools/kafka/kafka-on-openshift.md
@@ -0,0 +1,353 @@
+# Apache Kafka on NERC OpenShift
+
+## Apache Kafka Overview
+
+[Apache Kafka](https://kafka.apache.org/) is a distributed event streaming platform
+capable of handling trillions of events per day. Originally developed at LinkedIn and
+open-sourced in 2011, Kafka is designed for high-throughput, fault-tolerant, and
+scalable real-time data pipelines and streaming applications.
+
+Kafka uses a **publish-subscribe** model organized around the following core concepts:
+
+- **Broker**: A Kafka server that stores and serves messages.
+- **Topic**: A named stream to which producers publish records and from which consumers
+  read records.
+- **Partition**: Topics are split into partitions for parallelism and fault tolerance.
+- **Producer**: A client application that publishes records to one or more topics.
+- **Consumer**: A client application that subscribes to topics and processes records.
+- **Consumer Group**: A group of consumers that collectively consume a topic, with each
+  partition assigned to exactly one member.
+
+Running Kafka on [NERC OpenShift](https://nerc-project.github.io/nerc-docs/openshift/)
+is the recommended approach for course workloads requiring persistent, scalable message
+streaming. This guide uses the **[Strimzi Operator](https://strimzi.io/)**, which is
+the standard Kubernetes-native method for deploying Kafka on OpenShift.
+
+## Prerequisites
+
+Before proceeding, ensure you have:
+
+- Access to a [NERC OpenShift project](https://nerc-project.github.io/nerc-docs/openshift/logging-in/access-the-openshift-web-console/)
+- The `oc` CLI installed and authenticated to the NERC OpenShift cluster
+- Sufficient quota in your project (at least 3 vCPUs and 6 GiB memory recommended
+  for a minimal Kafka cluster)
+
+!!! note "Checking Your Quota"
+
+    You can view your project's resource quota by running:
+
+    ```sh
+    oc describe quota -n <your-project>
+    ```
+
+    If you need additional resources, contact your project PI or NERC support.
+
+## Deploy Kafka Using the Strimzi Operator
+
+Strimzi provides a Kubernetes Operator that manages the full lifecycle of Kafka
+clusters on OpenShift. On NERC OpenShift, you will install Strimzi into your own
+namespace.
+
+### Install the Strimzi Operator
+
+-   Log in to the NERC OpenShift cluster and switch to your project namespace:
+
+    ```sh
+    oc login <your-openshift-api-url>
+    oc project <your-project>
+    ```
+
+    For example:
+
+    ```sh
+    oc login https://api.shift.nerc.mghpcc.org:6443
+    oc project ds551-kafka
+    ```
+
+-   Download the Strimzi installation YAML files. Always check the
+    [Strimzi releases page](https://github.com/strimzi/strimzi-kafka-operator/releases)
+    for the latest version:
+
+    ```sh
+    STRIMZI_VERSION="0.45.0"
+    wget https://github.com/strimzi/strimzi-kafka-operator/releases/download/${STRIMZI_VERSION}/strimzi-${STRIMZI_VERSION}.tar.gz
+    tar -xzf strimzi-${STRIMZI_VERSION}.tar.gz
+    cd strimzi-${STRIMZI_VERSION}
+    ```
+
+    !!! warning "Very Important Note"
+
+        Check the [Strimzi compatibility matrix](https://strimzi.io/downloads/) to
+        confirm the Strimzi version supports the Kafka version and Kubernetes/OpenShift
+        version running on NERC. Mismatched versions can prevent the operator from
+        starting.
+
+-   Update the installation files to use your project namespace. Replace all
+    occurrences of `myproject` with your actual namespace:
+
+    ```sh
+    sed -i 's/namespace: .*/namespace: <your-project>/' install/cluster-operator/*RoleBinding*.yaml
+    ```
+
+    For example:
+
+    ```sh
+    sed -i 's/namespace: .*/namespace: ds551-kafka/' install/cluster-operator/*RoleBinding*.yaml
+    ```
+
+-   Apply the Strimzi Cluster Operator installation files:
+
+    ```sh
+    oc apply -f install/cluster-operator/ -n <your-project>
+    ```
+
+-   Verify the operator pod is running:
+
+    ```sh
+    oc get pods -n <your-project> -l name=strimzi-cluster-operator
+    ```
+
+    The output should look similar to:
+
+    ```
+    NAME                                        READY   STATUS    RESTARTS   AGE
+    strimzi-cluster-operator-7d96bf8c59-kfzwp   1/1     Running   0          45s
+    ```
+
+    !!! note "Note"
+
+        It may take 1–2 minutes for the operator pod to reach `Running` status.
+
+### Create a Kafka Cluster
+
+Once the Strimzi Operator is running, you can deploy a Kafka cluster by creating
+a `Kafka` custom resource.
+
+-   Create a file named `kafka-cluster.yaml` with the following minimal cluster
+    definition. This deploys a single-broker Kafka cluster with 3 ZooKeeper replicas
+    and ephemeral (non-persistent) storage suitable for development and testing:
+
+    ```yaml
+    apiVersion: kafka.strimzi.io/v1beta2
+    kind: Kafka
+    metadata:
+      name: my-cluster
+      namespace: <your-project>
+    spec:
+      kafka:
+        version: 3.8.0
+        replicas: 1
+        listeners:
+          - name: plain
+            port: 9092
+            type: internal
+            tls: false
+          - name: tls
+            port: 9093
+            type: internal
+            tls: true
+        config:
+          offsets.topic.replication.factor: 1
+          transaction.state.log.replication.factor: 1
+          transaction.state.log.min.isr: 1
+          default.replication.factor: 1
+          min.insync.replicas: 1
+        storage:
+          type: ephemeral
+      zookeeper:
+        replicas: 3
+        storage:
+          type: ephemeral
+      entityOperator:
+        topicOperator: {}
+        userOperator: {}
+    ```
+
+    !!! warning "Very Important Note"
+
+        This configuration uses `ephemeral` storage, meaning all Kafka data will
+        be lost if pods restart. For persistent storage across pod restarts, change
+        the `storage` type to `persistent-claim` and specify a `size` and
+        `storageClass`. See the
+        [Strimzi storage documentation](https://strimzi.io/docs/operators/latest/full/deploying.html#type-EphemeralStorage-reference)
+        for details.
+
+-   Apply the Kafka cluster definition:
+
+    ```sh
+    oc apply -f kafka-cluster.yaml -n <your-project>
+    ```
+
+-   Watch the cluster come up. It may take 3–5 minutes for all pods to reach
+    `Running` status:
+
+    ```sh
+    oc get pods -n <your-project> -l strimzi.io/cluster=my-cluster -w
+    ```
+
+    A healthy cluster will show output similar to:
+
+    ```
+    NAME                                          READY   STATUS    RESTARTS   AGE
+    my-cluster-entity-operator-6d7f9c7d4b-xqtlp   2/2     Running   0          2m
+    my-cluster-kafka-0                             1/1     Running   0          3m
+    my-cluster-zookeeper-0                         1/1     Running   0          4m
+    my-cluster-zookeeper-1                         1/1     Running   0          4m
+    my-cluster-zookeeper-2                         1/1     Running   0          4m
+    ```
+
+### Create a Kafka Topic
+
+-   Create a file named `kafka-topic.yaml`:
+
+    ```yaml
+    apiVersion: kafka.strimzi.io/v1beta2
+    kind: KafkaTopic
+    metadata:
+      name: my-topic
+      namespace: <your-project>
+      labels:
+        strimzi.io/cluster: my-cluster
+    spec:
+      partitions: 3
+      replicas: 1
+      config:
+        retention.ms: 7200000
+        segment.bytes: 1073741824
+    ```
+
+-   Apply the topic:
+
+    ```sh
+    oc apply -f kafka-topic.yaml -n <your-project>
+    ```
+
+-   Verify the topic was created:
+
+    ```sh
+    oc get kafkatopic my-topic -n <your-project>
+    ```
+
+    Expected output:
+
+    ```
+    NAME       CLUSTER      PARTITIONS   REPLICATION FACTOR   READY
+    my-topic   my-cluster   3            1                    True
+    ```
+
+## Test the Kafka Cluster
+
+Strimzi ships with example producer and consumer jobs you can use to verify your
+cluster is working correctly.
+
+### Run a Producer
+
+-   Start a producer pod that sends 100 messages to `my-topic`:
+
+    ```sh
+    oc run kafka-producer -ti \
+      --image=quay.io/strimzi/kafka:0.45.0-kafka-3.8.0 \
+      --rm=true --restart=Never \
+      -- bin/kafka-console-producer.sh \
+      --bootstrap-server my-cluster-kafka-bootstrap:9092 \
+      --topic my-topic
+    ```
+
+-   Type a few test messages at the prompt and press `Enter` after each:
+
+    ```
+    > Hello from NERC OpenShift!
+    > This is a Kafka test message.
+    ```
+
+    Press `Ctrl+C` to stop the producer.
+
+### Run a Consumer
+
+-   In a separate terminal, start a consumer pod to read messages from the beginning:
+
+    ```sh
+    oc run kafka-consumer -ti \
+      --image=quay.io/strimzi/kafka:0.45.0-kafka-3.8.0 \
+      --rm=true --restart=Never \
+      -- bin/kafka-console-consumer.sh \
+      --bootstrap-server my-cluster-kafka-bootstrap:9092 \
+      --topic my-topic \
+      --from-beginning
+    ```
+
+    You should see the messages published by the producer:
+
+    ```
+    Hello from NERC OpenShift!
+    This is a Kafka test message.
+    ```
+
+    Press `Ctrl+C` to stop the consumer.
+
+    !!! tip "Consumer Groups"
+
+        To simulate multiple consumers sharing a topic workload, add the flag
+        `--group <group-name>` to the consumer command. Each consumer in the same
+        group will receive messages from a distinct subset of partitions.
+
+## Connecting Applications to Kafka
+
+Applications running inside the same OpenShift project can reach the Kafka broker
+using the internal bootstrap address:
+
+```
+my-cluster-kafka-bootstrap:9092   # plaintext (no TLS)
+my-cluster-kafka-bootstrap:9093   # TLS
+```
+
+For Python applications, use the [kafka-python](https://kafka-python.readthedocs.io/)
+or [confluent-kafka](https://docs.confluent.io/kafka-clients/python/current/overview.html)
+client libraries:
+
+```python
+from kafka import KafkaProducer, KafkaConsumer
+
+# Producer example
+producer = KafkaProducer(bootstrap_servers='my-cluster-kafka-bootstrap:9092')
+producer.send('my-topic', b'Hello from Python!')
+producer.flush()
+
+# Consumer example
+consumer = KafkaConsumer(
+    'my-topic',
+    bootstrap_servers='my-cluster-kafka-bootstrap:9092',
+    auto_offset_reset='earliest',
+    group_id='my-group'
+)
+for msg in consumer:
+    print(f"Received: {msg.value.decode()}")
+```
+
+!!! note "Note"
+
+    The bootstrap address `my-cluster-kafka-bootstrap` is an OpenShift Service
+    created automatically by Strimzi. It is only reachable from within the same
+    project namespace. If you need external access, configure a `route` or
+    `loadbalancer` type listener in the Kafka CR.
+
+## Clean Up Resources
+
+When you are finished, remove all Kafka resources to free up project quota:
+
+```sh
+# Delete the Kafka topic
+oc delete kafkatopic my-topic -n <your-project>
+
+# Delete the Kafka cluster (also removes ZooKeeper and Entity Operator pods)
+oc delete kafka my-cluster -n <your-project>
+
+# Remove the Strimzi Operator
+oc delete -f install/cluster-operator/ -n <your-project>
+```
+
+!!! danger "Very Important Note"
+
+    Deleting the Kafka cluster with ephemeral storage permanently destroys all
+    messages stored in that cluster. Make sure you have consumed or exported any
+    data you need before running these commands.
\ No newline at end of file
diff --git a/mkdocs.yaml b/mkdocs.yaml
index 48ad2b4dd..da737a170 100644
--- a/mkdocs.yaml
+++ b/mkdocs.yaml
@@ -181,6 +181,7 @@ nav:
                 - RStudio: other-tools/R/rstudio-server/running-rstudio-on-NERC.md
                 - R Shiny: other-tools/R/r-shiny-server/running-rshiny-on-NERC.md
           - Apache Spark: other-tools/apache-spark/spark.md
+          - Apache Kafka on OpenShift: other-tools/kafka/kafka-on-openshift.md
           - MLflow:
                 - MLflow Overview: other-tools/mlflow/mlflow-overview.md
                 - MLflow Server Setup: other-tools/mlflow/mlflow-server-setup.md

From 3f819c7ab565bcd3fdd519fc19489d3f129e97a2 Mon Sep 17 00:00:00 2001
From: kpwrennbu <kpwrenn@bu.edu>
Date: Tue, 3 Mar 2026 12:45:16 -0500
Subject: [PATCH 2/6] fixes kafka doc with thorough testing

---
 kafka-on-openshift.md | 414 ++++++++++++++++++++++++++++++++++++++++++
 1 file changed, 414 insertions(+)
 create mode 100644 kafka-on-openshift.md

diff --git a/kafka-on-openshift.md b/kafka-on-openshift.md
new file mode 100644
index 000000000..9270a6d27
--- /dev/null
+++ b/kafka-on-openshift.md
@@ -0,0 +1,414 @@
+# Apache Kafka on NERC OpenShift
+
+## Apache Kafka Overview
+
+[Apache Kafka](https://kafka.apache.org/) is a distributed event streaming platform
+capable of handling trillions of events per day. Originally developed at LinkedIn and
+open-sourced in 2011, Kafka is designed for high-throughput, fault-tolerant, and
+scalable real-time data pipelines and streaming applications.
+
+Kafka uses a **publish-subscribe** model organized around the following core concepts:
+
+- **Broker**: A Kafka server that stores and serves messages.
+- **Topic**: A named stream to which producers publish records and from which consumers
+  read records.
+- **Partition**: Topics are split into partitions for parallelism and fault tolerance.
+- **Producer**: A client application that publishes records to one or more topics.
+- **Consumer**: A client application that subscribes to topics and processes records.
+- **Consumer Group**: A group of consumers that collectively consume a topic, with each
+  partition assigned to exactly one member.
+
+Running Kafka on [NERC OpenShift](https://nerc-project.github.io/nerc-docs/openshift/)
+is the recommended approach for course workloads requiring persistent, scalable message
+streaming. This guide uses the **[Strimzi Operator](https://strimzi.io/)**, which is
+the standard Kubernetes-native method for deploying Kafka on OpenShift.
+
+## Prerequisites
+
+Before proceeding, ensure you have:
+
+- Access to a [NERC OpenShift project](https://nerc-project.github.io/nerc-docs/openshift/logging-in/access-the-openshift-web-console/)
+- The `oc` CLI installed and authenticated to the NERC OpenShift cluster
+- Sufficient quota in your project (at least 3 vCPUs and 6 GiB memory recommended
+  for a minimal Kafka cluster)
+
+!!! note "Checking Your Quota"
+
+    You can view your project's resource quota by running:
+
+    ```sh
+    oc describe quota -n <your-project>
+    ```
+
+    If you need additional resources, contact your project PI or NERC support.
+
+## Deploy Kafka Using the Strimzi Operator
+
+Strimzi provides a Kubernetes Operator that manages the full lifecycle of Kafka
+clusters on OpenShift. On NERC OpenShift, you will install Strimzi into your own
+namespace.
+
+### Install the Strimzi Operator
+
+-   Log in to the NERC OpenShift cluster and switch to your project namespace:
+
+    ```sh
+    oc login <your-openshift-api-url>
+    oc project <your-project>
+    ```
+
+    For example:
+
+    ```sh
+    oc login https://api.edu.nerc.mghpcc.org:6443
+    oc project ds551-2026-spring-9ab13b
+    ```
+
+-   Download the Strimzi installation YAML files. Always check the
+    [Strimzi releases page](https://github.com/strimzi/strimzi-kafka-operator/releases)
+    for the latest version:
+
+    ```sh
+    STRIMZI_VERSION="0.50.1"
+    wget https://github.com/strimzi/strimzi-kafka-operator/releases/download/${STRIMZI_VERSION}/strimzi-${STRIMZI_VERSION}.tar.gz
+    tar -xzf strimzi-${STRIMZI_VERSION}.tar.gz
+    cd strimzi-${STRIMZI_VERSION}
+    ```
+
+    !!! warning "Very Important Note"
+
+        Check the [Strimzi compatibility matrix](https://strimzi.io/downloads/) to
+        confirm the Strimzi version supports the Kafka version and Kubernetes/OpenShift
+        version running on NERC. Mismatched versions can prevent the operator from
+        starting. For Kafka 4.0+, use Strimzi 0.50.0 or later.
+
+-   Update the installation files to use your project namespace. Replace all
+    occurrences of `myproject` with your actual namespace:
+
+    ```sh
+    sed -i '' 's/namespace: .*/namespace: <your-project>/' install/cluster-operator/*RoleBinding*.yaml
+    ```
+
+    For example:
+
+    ```sh
+    sed -i '' 's/namespace: .*/namespace: ds551-2026-spring-9ab13b/' install/cluster-operator/*RoleBinding*.yaml
+    ```
+
+    !!! important "Make sure to update the namespace"
+
+        The `-n <your-project>` flag sets the namespace only for the `oc` command it
+        is passed to. Include it on every command when working with multiple
+        projects to avoid accidentally operating on the wrong namespace.
+
+-   Apply the Strimzi Cluster Operator installation files:
+
+    ```sh
+    oc apply -f install/cluster-operator/ -n <your-project>
+    ```
+
+-   Verify the operator pod is running:
+
+    ```sh
+    oc get pods -n <your-project> -l name=strimzi-cluster-operator
+    ```
+
+    The output should look similar to:
+
+    ```
+    NAME                                        READY   STATUS    RESTARTS   AGE
+    strimzi-cluster-operator-7d96bf8c59-kfzwp   1/1     Running   0          45s
+    ```
+
+    !!! note "Note"
+
+        It may take 1–2 minutes for the operator pod to reach `Running` status.
+
+### Create a Kafka Cluster
+
+Once the Strimzi Operator is running, you can deploy a Kafka cluster by creating
+a `Kafka` custom resource and a `KafkaNodePool` resource.
+
+!!! warning "Important: KafkaNodePool is Required"
+
+    For Kafka 4.0 and later, Strimzi uses `KafkaNodePool` resources to define broker and
+    controller nodes, so a `Kafka` resource needs at least one node pool. In KRaft mode
+    the pools must cover both the `broker` and `controller` roles; this guide uses a
+    single pool with both. Without a `KafkaNodePool`, the Kafka cluster will not deploy.
+
+-   Create a file named `kafka-cluster.yaml` with the Kafka cluster definition:
+
+    ```yaml
+    apiVersion: kafka.strimzi.io/v1
+    kind: KafkaNodePool
+    metadata:
+      name: dual-role
+      namespace: <your-project>
+      labels:
+        strimzi.io/cluster: my-cluster
+    spec:
+      replicas: 1
+      roles:
+        - broker
+        - controller
+      storage:
+        type: ephemeral
+    ---
+    apiVersion: kafka.strimzi.io/v1beta2
+    kind: Kafka
+    metadata:
+      name: my-cluster
+      namespace: <your-project>
+    spec:
+      kafka:
+        version: 4.1.1
+        listeners:
+          - name: plain
+            port: 9092
+            type: internal
+            tls: false
+          - name: tls
+            port: 9093
+            type: internal
+            tls: true
+        config:
+          offsets.topic.replication.factor: 1
+          transaction.state.log.replication.factor: 1
+          transaction.state.log.min.isr: 1
+          default.replication.factor: 1
+          min.insync.replicas: 1
+      entityOperator:
+        topicOperator: {}
+        userOperator: {}
+    ```
+
+    !!! warning "Very Important Note"
+
+        - With Kafka 4.0+, Strimzi requires a `KafkaNodePool` covering both the `broker` and
+          `controller` roles for KRaft (Kafka Raft) consensus mode operation.
+        - This configuration uses `ephemeral` storage, meaning all Kafka data will
+          be lost if pods restart. For persistent storage across pod restarts, change
+          the `storage` type to `persistent-claim` and specify a `size` and
+          `storageClass`. See the
+          [Strimzi storage documentation](https://strimzi.io/docs/operators/latest/full/deploying.html#type-EphemeralStorage-reference)
+          for details.
+        - Make sure the `KafkaNodePool` metadata includes the label `strimzi.io/cluster: my-cluster`
+          to link it to the Kafka resource.
+
+-   Apply the Kafka cluster definition:
+
+    ```sh
+    oc apply -f kafka-cluster.yaml -n <your-project>
+    ```
+
+-   Watch the cluster come up. It may take 3–5 minutes for all pods to reach
+    `Running` status:
+
+    ```sh
+    oc get pods -n <your-project> -l strimzi.io/cluster=my-cluster -w
+    ```
+
+    A healthy cluster will show output similar to:
+
+    ```
+    NAME                                          READY   STATUS    RESTARTS   AGE
+    my-cluster-dual-role-0                        1/1     Running   0          3m
+    my-cluster-entity-operator-6d7f9c7d4b-xqtlp   2/2     Running   0          2m
+    ```
+
+    !!! note "Note about Kafka 4.0+ Differences"
+
+        In Kafka 4.0+:
+        - There are **no ZooKeeper pods**. Cluster metadata is managed by KRaft controllers instead.
+        - Pod names follow the pattern `<cluster-name>-<nodepool-name>-<id>`.
+        - With this single-node setup using `dual-role`, you'll see pods named `my-cluster-dual-role-0`.
+
+### Create a Kafka Topic
+
+-   Create a file named `kafka-topic.yaml`:
+
+    ```yaml
+    apiVersion: kafka.strimzi.io/v1beta2
+    kind: KafkaTopic
+    metadata:
+      name: my-topic
+      namespace: <your-project>
+      labels:
+        strimzi.io/cluster: my-cluster
+    spec:
+      partitions: 3
+      replicas: 1
+      config:
+        retention.ms: 7200000
+        segment.bytes: 1073741824
+    ```
+
+-   Apply the topic:
+
+    ```sh
+    oc apply -f kafka-topic.yaml -n <your-project>
+    ```
+
+-   Verify the topic was created:
+
+    ```sh
+    oc get kafkatopic my-topic -n <your-project>
+    ```
+
+    Expected output:
+
+    ```
+    NAME       CLUSTER      PARTITIONS   REPLICATION FACTOR   READY
+    my-topic   my-cluster   3            1                    True
+    ```
+
+## Test the Kafka Cluster
+
+Strimzi ships with pre-built container images with Kafka command-line tools that
+you can use to verify your cluster is working correctly.
+
+!!! note "API Deprecation Warnings"
+
+    You may see deprecation warnings about Strimzi resource API versions during deployment
+    and testing. These are safe to ignore: they appear because the `v1beta2` API version
+    is being phased out in favor of `v1`. Your cluster will still function correctly.
+
+### Run a Producer
+
+The producer tool lets you send messages to a Kafka topic. In interactive mode, you can
+type messages directly:
+
+-   Start a producer pod in interactive mode:
+
+    ```sh
+    oc run kafka-producer -ti \
+      --image=quay.io/strimzi/kafka:0.50.1-kafka-4.1.1 \
+      --rm=true --restart=Never \
+      -n <your-project> \
+      -- bash -c 'bin/kafka-console-producer.sh \
+        --bootstrap-server my-cluster-kafka-bootstrap:9092 \
+        --topic my-topic'
+    ```
+
+    The `-ti` flags enable **interactive terminal mode**, which allows you to type messages
+    at a prompt. The `--rm=true` flag automatically removes the pod after it exits.
+
+-   At the prompt, type test messages and press `Enter` after each one:
+
+    ```
+    > Hello from NERC OpenShift!
+    > This is a Kafka test message.
+    ```
+
+    Press `Ctrl+C` to stop the producer and exit.
+
+    !!! warning "Important: Interactive Mode (`-ti --rm`)"
+
+        The `-ti --rm` flags work together to create an interactive session that automatically
+        cleans up the pod. In scripts or CI/CD pipelines, drop `-t` and `--rm`, keep `-i`, and
+        pipe your messages to stdin or use a heredoc. For example:
+
+        ```sh
+        echo -e "message1\nmessage2" | oc run kafka-producer \
+          --image=quay.io/strimzi/kafka:0.50.1-kafka-4.1.1 \
+          --restart=Never \
+          -n <your-project> \
+          -i \
+          -- bin/kafka-console-producer.sh \
+            --bootstrap-server my-cluster-kafka-bootstrap:9092 \
+            --topic my-topic
+        ```
+
+### Run a Consumer
+
+-   In a separate terminal, start a consumer pod to read messages from the beginning:
+
+    ```sh
+    oc run kafka-consumer -ti \
+      --image=quay.io/strimzi/kafka:0.50.1-kafka-4.1.1 \
+      --rm=true --restart=Never \
+      -n <your-project> \
+      -- bash -c 'bin/kafka-console-consumer.sh \
+        --bootstrap-server my-cluster-kafka-bootstrap:9092 \
+        --topic my-topic \
+        --from-beginning'
+    ```
+
+    You should see the messages published by the producer:
+
+    ```
+    Hello from NERC OpenShift!
+    This is a Kafka test message.
+    ```
+
+    Press `Ctrl+C` to stop the consumer.
+
+    !!! tip "Consumer Groups"
+
+        To test multiple consumers sharing a topic workload, add the flag
+        `--group <group-name>` to the consumer command. Each consumer in the same
+        group will receive messages from a distinct subset of partitions.
+
+## Connecting Applications to Kafka
+
+Applications running inside the same OpenShift project can reach the Kafka broker
+using the internal bootstrap address:
+
+```
+my-cluster-kafka-bootstrap:9092   # plaintext (no TLS)
+my-cluster-kafka-bootstrap:9093   # TLS
+```
+
+For Python applications, use the [kafka-python](https://kafka-python.readthedocs.io/)
+or [confluent-kafka](https://docs.confluent.io/kafka-clients/python/current/overview.html)
+client libraries:
+
+```python
+from kafka import KafkaProducer, KafkaConsumer
+
+# Producer example
+producer = KafkaProducer(bootstrap_servers='my-cluster-kafka-bootstrap:9092')
+producer.send('my-topic', b'Hello from Python!')
+producer.flush()
+
+# Consumer example
+consumer = KafkaConsumer(
+    'my-topic',
+    bootstrap_servers='my-cluster-kafka-bootstrap:9092',
+    auto_offset_reset='earliest',
+    group_id='my-group'
+)
+for msg in consumer:
+    print(f"Received: {msg.value.decode()}")
+```
+
+!!! note "Note"
+
+    The bootstrap address `my-cluster-kafka-bootstrap` is an OpenShift Service
+    created automatically by Strimzi. It is only reachable from within the same
+    project namespace. If you need external access, configure a `route` or
+    `loadbalancer` type listener in the Kafka CR.
+
+## Clean Up Resources
+
+When you are finished, remove all Kafka resources to free up project quota:
+
+```sh
+# Delete the Kafka topic
+oc delete kafkatopic my-topic -n <your-project>
+
+# Delete the Kafka cluster (also removes Entity Operator pods)
+oc delete kafka my-cluster -n <your-project>
+
+# Delete the KafkaNodePool created alongside the cluster (ignored if already gone)
+oc delete kafkanodepool dual-role -n <your-project> 2>/dev/null || true
+
+# Remove the Strimzi Operator
+oc delete -f install/cluster-operator/ -n <your-project>
+```
+
+!!! danger "Very Important Note"
+
+    Deleting the Kafka cluster with ephemeral storage permanently destroys all
+    messages stored in that cluster. Make sure you have consumed or exported any
+    data you need before running these commands.
\ No newline at end of file

From 929e0b3255f7ed3c990bc53a6a7505ab02e771a5 Mon Sep 17 00:00:00 2001
From: kpwrennbu <kpwrenn@bu.edu>
Date: Tue, 3 Mar 2026 12:47:08 -0500
Subject: [PATCH 3/6] update Kafka doc with Strimzi 0.50.1, Kafka 4.0+
 KafkaNodePool support, and corrected namespace/endpoint

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
---
 docs/other-tools/kafka/kafka-on-openshift.md | 149 +++++++++++++------
 1 file changed, 105 insertions(+), 44 deletions(-)

diff --git a/docs/other-tools/kafka/kafka-on-openshift.md b/docs/other-tools/kafka/kafka-on-openshift.md
index bfd195a7e..9270a6d27 100644
--- a/docs/other-tools/kafka/kafka-on-openshift.md
+++ b/docs/other-tools/kafka/kafka-on-openshift.md
@@ -60,8 +60,8 @@ namespace.
     For example:
 
     ```sh
-    oc login https://api.shift.nerc.mghpcc.org:6443
-    oc project ds551-kafka
+    oc login https://api.edu.nerc.mghpcc.org:6443
+    oc project ds551-2026-spring-9ab13b
     ```
 
 -   Download the Strimzi installation YAML files. Always check the
@@ -69,7 +69,7 @@ namespace.
     for the latest version:
 
     ```sh
-    STRIMZI_VERSION="0.45.0"
+    STRIMZI_VERSION="0.50.1"
     wget https://github.com/strimzi/strimzi-kafka-operator/releases/download/${STRIMZI_VERSION}/strimzi-${STRIMZI_VERSION}.tar.gz
     tar -xzf strimzi-${STRIMZI_VERSION}.tar.gz
     cd strimzi-${STRIMZI_VERSION}
@@ -80,21 +80,27 @@ namespace.
         Check the [Strimzi compatibility matrix](https://strimzi.io/downloads/) to
         confirm the Strimzi version supports the Kafka version and Kubernetes/OpenShift
         version running on NERC. Mismatched versions can prevent the operator from
-        starting.
+        starting. For Kafka 4.0+, use Strimzi 0.50.0 or later.
 
 -   Update the installation files to use your project namespace. Replace all
     occurrences of `myproject` with your actual namespace:
 
     ```sh
-    sed -i 's/namespace: .*/namespace: <your-project>/' install/cluster-operator/*RoleBinding*.yaml
+    sed -i '' 's/namespace: .*/namespace: <your-project>/' install/cluster-operator/*RoleBinding*.yaml
     ```
 
     For example:
 
     ```sh
-    sed -i 's/namespace: .*/namespace: ds551-kafka/' install/cluster-operator/*RoleBinding*.yaml
+    sed -i '' 's/namespace: .*/namespace: ds551-2026-spring-9ab13b/' install/cluster-operator/*RoleBinding*.yaml
     ```
 
+    !!! important "Make sure to update the namespace"
+
+        The `-n <your-project>` flag sets the namespace only for the `oc` command it
+        is passed to. Include it on every command when working with multiple
+        projects to avoid accidentally operating on the wrong namespace.
+
 -   Apply the Strimzi Cluster Operator installation files:
 
     ```sh
@@ -121,13 +127,33 @@ namespace.
 ### Create a Kafka Cluster
 
 Once the Strimzi Operator is running, you can deploy a Kafka cluster by creating
-a `Kafka` custom resource.
+a `Kafka` custom resource and a `KafkaNodePool` resource.
+
+!!! warning "Important: KafkaNodePool is Required"
 
--   Create a file named `kafka-cluster.yaml` with the following minimal cluster
-    definition. This deploys a single-broker Kafka cluster with 3 ZooKeeper replicas
-    and ephemeral (non-persistent) storage suitable for development and testing:
+    For Kafka 4.0 and later, Strimzi uses `KafkaNodePool` resources to define broker and
+    controller nodes, so a `Kafka` resource needs at least one node pool. In KRaft mode
+    the pools must cover both the `broker` and `controller` roles; this guide uses a
+    single pool with both. Without a `KafkaNodePool`, the Kafka cluster will not deploy.
+
+-   Create a file named `kafka-cluster.yaml` with the Kafka cluster definition:
 
     ```yaml
+    apiVersion: kafka.strimzi.io/v1
+    kind: KafkaNodePool
+    metadata:
+      name: dual-role
+      namespace: <your-project>
+      labels:
+        strimzi.io/cluster: my-cluster
+    spec:
+      replicas: 1
+      roles:
+        - broker
+        - controller
+      storage:
+        type: ephemeral
+    ---
     apiVersion: kafka.strimzi.io/v1beta2
     kind: Kafka
     metadata:
@@ -135,8 +161,7 @@ a `Kafka` custom resource.
       namespace: <your-project>
     spec:
       kafka:
-        version: 3.8.0
-        replicas: 1
+        version: 4.1.1
         listeners:
           - name: plain
             port: 9092
@@ -152,12 +177,6 @@ a `Kafka` custom resource.
           transaction.state.log.min.isr: 1
           default.replication.factor: 1
           min.insync.replicas: 1
-        storage:
-          type: ephemeral
-      zookeeper:
-        replicas: 3
-        storage:
-          type: ephemeral
       entityOperator:
         topicOperator: {}
         userOperator: {}
@@ -165,12 +184,16 @@ a `Kafka` custom resource.
 
     !!! warning "Very Important Note"
 
-        This configuration uses `ephemeral` storage, meaning all Kafka data will
-        be lost if pods restart. For persistent storage across pod restarts, change
-        the `storage` type to `persistent-claim` and specify a `size` and
-        `storageClass`. See the
-        [Strimzi storage documentation](https://strimzi.io/docs/operators/latest/full/deploying.html#type-EphemeralStorage-reference)
-        for details.
+        - Kafka 4.0+ requires `KafkaNodePool` with both `broker` and `controller` roles
+          for KRaft (Kafka Raft) consensus mode operation.
+        - This configuration uses `ephemeral` storage, meaning all Kafka data will
+          be lost if pods restart. For persistent storage across pod restarts, change
+          the `storage` type to `persistent-claim` and specify a `size` and
+          `storageClass`. See the
+          [Strimzi storage documentation](https://strimzi.io/docs/operators/latest/full/deploying.html#type-EphemeralStorage-reference)
+          for details.
+        - Make sure the `KafkaNodePool` metadata includes the label `strimzi.io/cluster: my-cluster`
+          to link it to the Kafka resource.
 
 -   Apply the Kafka cluster definition:
 
@@ -189,13 +212,17 @@ a `Kafka` custom resource.
 
     ```
     NAME                                          READY   STATUS    RESTARTS   AGE
+    my-cluster-dual-role-0                        1/1     Running   0          3m
     my-cluster-entity-operator-6d7f9c7d4b-xqtlp   2/2     Running   0          2m
-    my-cluster-kafka-0                             1/1     Running   0          3m
-    my-cluster-zookeeper-0                         1/1     Running   0          4m
-    my-cluster-zookeeper-1                         1/1     Running   0          4m
-    my-cluster-zookeeper-2                         1/1     Running   0          4m
     ```
 
+    !!! note "Note about Kafka 4.0+ Differences"
+
+        - There are **no ZooKeeper pods**. Cluster metadata is managed by the controller role using KRaft.
+        - Pod names follow the pattern `<cluster-name>-<nodepool-name>-<id>`.
+        - With this single-node setup using the `dual-role` pool, the Kafka pod is named
+          `my-cluster-dual-role-0`.
+
 ### Create a Kafka Topic
 
 -   Create a file named `kafka-topic.yaml`:
@@ -237,30 +264,60 @@ a `Kafka` custom resource.
 
 ## Test the Kafka Cluster
 
-Strimzi ships with example producer and consumer jobs you can use to verify your
-cluster is working correctly.
+Strimzi provides pre-built container images containing the Kafka command-line tools that
+you can use to verify your cluster is working correctly.
+
+!!! note "API Deprecation Warnings"
+
+    You may see deprecation warnings about the `kafka.strimzi.io/v1beta2` API version during
+    deployment and testing. These are safe to ignore: the `v1beta2` API version is being
+    phased out in favor of `v1`, but your cluster will still function correctly.
 
 ### Run a Producer
 
--   Start a producer pod that sends 100 messages to `my-topic`:
+The producer tool lets you send messages to a Kafka topic. In interactive mode, you can
+type messages directly:
+
+-   Start a producer pod in interactive mode:
 
     ```sh
     oc run kafka-producer -ti \
-      --image=quay.io/strimzi/kafka:0.45.0-kafka-3.8.0 \
+      --image=quay.io/strimzi/kafka:0.50.1-kafka-4.1.1 \
       --rm=true --restart=Never \
-      -- bin/kafka-console-producer.sh \
-      --bootstrap-server my-cluster-kafka-bootstrap:9092 \
-      --topic my-topic
+      -n <your-project> \
+      -- bash -c 'bin/kafka-console-producer.sh \
+        --bootstrap-server my-cluster-kafka-bootstrap:9092 \
+        --topic my-topic'
     ```
 
--   Type a few test messages at the prompt and press `Enter` after each:
+    The `-ti` flags enable **interactive terminal mode**, which allows you to type messages
+    at a prompt. The `--rm=true` flag automatically removes the pod after it exits.
+
+-   At the prompt, type test messages and press `Enter` after each one:
 
     ```
     > Hello from NERC OpenShift!
     > This is a Kafka test message.
     ```
 
-    Press `Ctrl+C` to stop the producer.
+    Press `Ctrl+C` to stop the producer and exit.
+
+    !!! warning "Important: Interactive Mode (`-ti --rm`)"
+
+        The `-ti --rm` flags work together to create an interactive session that automatically
+        cleans up the pod. Do not use these flags in scripts or CI/CD pipelines—instead,
+        pipe your messages to stdin or use a heredoc. For example:
+
+        ```sh
+        echo -e "message1\nmessage2" | oc run kafka-producer \
+          --image=quay.io/strimzi/kafka:0.50.1-kafka-4.1.1 \
+          --restart=Never \
+          -n <your-project> \
+          -i \
+          -- bin/kafka-console-producer.sh \
+            --bootstrap-server my-cluster-kafka-bootstrap:9092 \
+            --topic my-topic
+        ```
 
 ### Run a Consumer
 
@@ -268,12 +325,13 @@ cluster is working correctly.
 
     ```sh
     oc run kafka-consumer -ti \
-      --image=quay.io/strimzi/kafka:0.45.0-kafka-3.8.0 \
+      --image=quay.io/strimzi/kafka:0.50.1-kafka-4.1.1 \
       --rm=true --restart=Never \
-      -- bin/kafka-console-consumer.sh \
-      --bootstrap-server my-cluster-kafka-bootstrap:9092 \
-      --topic my-topic \
-      --from-beginning
+      -n <your-project> \
+      -- bash -c 'bin/kafka-console-consumer.sh \
+        --bootstrap-server my-cluster-kafka-bootstrap:9092 \
+        --topic my-topic \
+        --from-beginning'
     ```
 
     You should see the messages published by the producer:
@@ -287,7 +345,7 @@ cluster is working correctly.
 
     !!! tip "Consumer Groups"
 
-        To simulate multiple consumers sharing a topic workload, add the flag
+        To test multiple consumers sharing a topic workload, add the flag
         `--group <group-name>` to the consumer command. Each consumer in the same
         group will receive messages from a distinct subset of partitions.
 
@@ -339,9 +397,12 @@ When you are finished, remove all Kafka resources to free up project quota:
 # Delete the Kafka topic
 oc delete kafkatopic my-topic -n <your-project>
 
-# Delete the Kafka cluster (also removes ZooKeeper and Entity Operator pods)
+# Delete the Kafka cluster (also removes Entity Operator pods)
 oc delete kafka my-cluster -n <your-project>
 
+# Delete the KafkaNodePool that was created alongside the cluster
+oc delete kafkanodepool dual-role -n <your-project> 2>/dev/null || true
+
 # Remove the Strimzi Operator
 oc delete -f install/cluster-operator/ -n <your-project>
 ```

From 0b17c9040d519de39d4932ec1392c5c5e0594e50 Mon Sep 17 00:00:00 2001
From: kpwrennbu <kpwrenn@bu.edu>
Date: Tue, 3 Mar 2026 12:49:46 -0500
Subject: [PATCH 4/6] switch storage to persistent-claim 1Gi and update storage
 docs

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
---
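A note on the storage stanza this patch introduces, offered as a sketch rather than a required change. In the Strimzi `persistent-claim` storage schema, the storage class is selected with the `class` field, and `deleteClaim` controls whether the PersistentVolumeClaim is removed together with the cluster. The class name below is a hypothetical placeholder, not a class known to exist on NERC.

```yaml
# Sketch only: optional fields for the KafkaNodePool storage stanza.
# "example-storage-class" is a hypothetical name; substitute a class
# that is actually available in your project, or omit the field.
spec:
  storage:
    type: persistent-claim
    size: 1Gi
    class: example-storage-class   # optional; omit to use the cluster default class
    deleteClaim: false             # Strimzi default: the PVC is kept when the cluster is deleted
```

With `deleteClaim: false`, the claim outlives `oc delete kafka my-cluster` and may keep counting against project quota until it is deleted separately.
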
 docs/other-tools/kafka/kafka-on-openshift.md | 12 ++++++------
 kafka-on-openshift.md                        | 12 ++++++------
 2 files changed, 12 insertions(+), 12 deletions(-)

diff --git a/docs/other-tools/kafka/kafka-on-openshift.md b/docs/other-tools/kafka/kafka-on-openshift.md
index 9270a6d27..4730af798 100644
--- a/docs/other-tools/kafka/kafka-on-openshift.md
+++ b/docs/other-tools/kafka/kafka-on-openshift.md
@@ -152,7 +152,8 @@ a `Kafka` custom resource and a `KafkaNodePool` resource.
         - broker
         - controller
       storage:
-        type: ephemeral
+        type: persistent-claim
+        size: 1Gi
     ---
     apiVersion: kafka.strimzi.io/v1beta2
     kind: Kafka
@@ -186,11 +187,10 @@ a `Kafka` custom resource and a `KafkaNodePool` resource.
 
         - Kafka 4.0+ requires `KafkaNodePool` with both `broker` and `controller` roles
           for KRaft (Kafka Raft) consensus mode operation.
-        - This configuration uses `ephemeral` storage, meaning all Kafka data will
-          be lost if pods restart. For persistent storage across pod restarts, change
-          the `storage` type to `persistent-claim` and specify a `size` and
-          `storageClass`. See the
-          [Strimzi storage documentation](https://strimzi.io/docs/operators/latest/full/deploying.html#type-EphemeralStorage-reference)
+        - This configuration uses persistent storage (1Gi) suitable for testing and demo purposes.
+          For production or larger workloads, increase the `size` value or set a storage class
+          with the `class` field. See the
+          [Strimzi storage documentation](https://strimzi.io/docs/operators/latest/full/deploying.html#type-PersistentClaimStorage-reference)
           for details.
         - Make sure the `KafkaNodePool` metadata includes the label `strimzi.io/cluster: my-cluster`
           to link it to the Kafka resource.
diff --git a/kafka-on-openshift.md b/kafka-on-openshift.md
index 9270a6d27..4730af798 100644
--- a/kafka-on-openshift.md
+++ b/kafka-on-openshift.md
@@ -152,7 +152,8 @@ a `Kafka` custom resource and a `KafkaNodePool` resource.
         - broker
         - controller
       storage:
-        type: ephemeral
+        type: persistent-claim
+        size: 1Gi
     ---
     apiVersion: kafka.strimzi.io/v1beta2
     kind: Kafka
@@ -186,11 +187,10 @@ a `Kafka` custom resource and a `KafkaNodePool` resource.
 
         - Kafka 4.0+ requires `KafkaNodePool` with both `broker` and `controller` roles
           for KRaft (Kafka Raft) consensus mode operation.
-        - This configuration uses `ephemeral` storage, meaning all Kafka data will
-          be lost if pods restart. For persistent storage across pod restarts, change
-          the `storage` type to `persistent-claim` and specify a `size` and
-          `storageClass`. See the
-          [Strimzi storage documentation](https://strimzi.io/docs/operators/latest/full/deploying.html#type-EphemeralStorage-reference)
+        - This configuration uses persistent storage (1Gi) suitable for testing and demo purposes.
+          For production or larger workloads, increase the `size` value or set a storage class
+          with the `class` field. See the
+          [Strimzi storage documentation](https://strimzi.io/docs/operators/latest/full/deploying.html#type-PersistentClaimStorage-reference)
           for details.
         - Make sure the `KafkaNodePool` metadata includes the label `strimzi.io/cluster: my-cluster`
           to link it to the Kafka resource.

From 223935492d8b11b661b83f8970a918a60348024a Mon Sep 17 00:00:00 2001
From: kpwrennbu <kpwrenn@bu.edu>
Date: Tue, 3 Mar 2026 13:23:33 -0500
Subject: [PATCH 5/6] add trailing horizontal rule to kafka doc

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
---
 docs/other-tools/kafka/kafka-on-openshift.md | 4 +++-
 kafka-on-openshift.md                        | 4 +++-
 2 files changed, 6 insertions(+), 2 deletions(-)

diff --git a/docs/other-tools/kafka/kafka-on-openshift.md b/docs/other-tools/kafka/kafka-on-openshift.md
index 4730af798..8426da8d9 100644
--- a/docs/other-tools/kafka/kafka-on-openshift.md
+++ b/docs/other-tools/kafka/kafka-on-openshift.md
@@ -411,4 +411,6 @@ oc delete -f install/cluster-operator/ -n <your-project>
 
     Deleting the Kafka cluster with ephemeral storage permanently destroys all
     messages stored in that cluster. Make sure you have consumed or exported any
-    data you need before running these commands.
\ No newline at end of file
+    data you need before running these commands.
+
+---
\ No newline at end of file
diff --git a/kafka-on-openshift.md b/kafka-on-openshift.md
index 4730af798..8426da8d9 100644
--- a/kafka-on-openshift.md
+++ b/kafka-on-openshift.md
@@ -411,4 +411,6 @@ oc delete -f install/cluster-operator/ -n <your-project>
 
     Deleting the Kafka cluster with ephemeral storage permanently destroys all
     messages stored in that cluster. Make sure you have consumed or exported any
-    data you need before running these commands.
\ No newline at end of file
+    data you need before running these commands.
+
+---
\ No newline at end of file

From 2a3689c57eb546035b534d3f3b647090097db719 Mon Sep 17 00:00:00 2001
From: kpwrennbu <kpwrenn@bu.edu>
Date: Tue, 3 Mar 2026 14:53:12 -0500
Subject: [PATCH 6/6] Fix markdown linting issues: add language specs to code
 blocks and adjust line lengths

---
 docs/other-tools/kafka/kafka-on-openshift.md | 47 +++++++++++---------
 kafka-on-openshift.md                        | 47 +++++++++++---------
 2 files changed, 50 insertions(+), 44 deletions(-)

diff --git a/docs/other-tools/kafka/kafka-on-openshift.md b/docs/other-tools/kafka/kafka-on-openshift.md
index 8426da8d9..68948f2bb 100644
--- a/docs/other-tools/kafka/kafka-on-openshift.md
+++ b/docs/other-tools/kafka/kafka-on-openshift.md
@@ -3,25 +3,26 @@
 ## Apache Kafka Overview
 
 [Apache Kafka](https://kafka.apache.org/) is a distributed event streaming platform
-capable of handling trillions of events per day. Originally developed at LinkedIn and
-open-sourced in 2011, Kafka is designed for high-throughput, fault-tolerant, and
-scalable real-time data pipelines and streaming applications.
+capable of handling trillions of events per day. Originally developed at LinkedIn
+and open-sourced in 2011, Kafka is designed for high-throughput, fault-tolerant,
+and scalable real-time data pipelines and streaming applications.
 
 Kafka uses a **publish-subscribe** model organized around the following core concepts:
 
 - **Broker**: A Kafka server that stores and serves messages.
-- **Topic**: A named stream to which producers publish records and from which consumers
-  read records.
+- **Topic**: A named stream to which producers publish records and from which
+  consumers read records.
 - **Partition**: Topics are split into partitions for parallelism and fault tolerance.
 - **Producer**: A client application that publishes records to one or more topics.
 - **Consumer**: A client application that subscribes to topics and processes records.
-- **Consumer Group**: A group of consumers that collectively consume a topic, with each
-  partition assigned to exactly one member.
+- **Consumer Group**: A group of consumers that collectively consume a topic, with
+  each partition assigned to exactly one member.
 
 Running Kafka on [NERC OpenShift](https://nerc-project.github.io/nerc-docs/openshift/)
 is the recommended approach for course workloads requiring persistent, scalable message
-streaming. This guide uses the **[Strimzi Operator](https://strimzi.io/)**, which is
-the standard Kubernetes-native method for deploying Kafka on OpenShift.
+streaming. This guide uses the **[Strimzi Operator](https://strimzi.io/)**,
+which is the standard Kubernetes-native method for deploying Kafka on OpenShift
+on NERC.
 
 ## Prerequisites
 
@@ -115,7 +116,7 @@ namespace.
 
     The output should look similar to:
 
-    ```
+    ```text
     NAME                                        READY   STATUS    RESTARTS   AGE
     strimzi-cluster-operator-7d96bf8c59-kfzwp   1/1     Running   0          45s
     ```
@@ -210,7 +211,7 @@ a `Kafka` custom resource and a `KafkaNodePool` resource.
 
     A healthy cluster will show output similar to:
 
-    ```
+    ```text
     NAME                                          READY   STATUS    RESTARTS   AGE
     my-cluster-dual-role-0                        1/1     Running   0          3m
     my-cluster-entity-operator-6d7f9c7d4b-xqtlp   2/2     Running   0          2m
@@ -257,7 +258,7 @@ a `Kafka` custom resource and a `KafkaNodePool` resource.
 
     Expected output:
 
-    ```
+    ```text
     NAME       CLUSTER      PARTITIONS   REPLICATION FACTOR   READY
     my-topic   my-cluster   3            1                    True
     ```
@@ -275,8 +276,8 @@ you can use to verify your cluster is working correctly.
 
 ### Run a Producer
 
-The producer tool lets you send messages to a Kafka topic. In interactive mode, you can
-type messages directly:
+The producer tool lets you send messages to a Kafka topic. In interactive mode,
+you can type messages directly:
 
 -   Start a producer pod in interactive mode:
 
@@ -290,12 +291,13 @@ type messages directly:
         --topic my-topic'
     ```
 
-    The `-ti` flags enable **interactive terminal mode**, which allows you to type messages
-    at a prompt. The `--rm=true` flag automatically removes the pod after it exits.
+    The `-ti` flags enable **interactive terminal mode**, which allows you to type
+    messages at a prompt. The `--rm=true` flag automatically removes the pod after
+    it exits.
 
 -   At the prompt, type test messages and press `Enter` after each one:
 
-    ```
+    ```text
     > Hello from NERC OpenShift!
     > This is a Kafka test message.
     ```
@@ -304,8 +306,9 @@ type messages directly:
 
     !!! warning "Important: Interactive Mode (`-ti --rm`)"
 
-        The `-ti --rm` flags work together to create an interactive session that automatically
-        cleans up the pod. Do not use these flags in scripts or CI/CD pipelines—instead,
+        The `-ti --rm` flags work together to create an interactive session that
+        automatically cleans up the pod. In scripts or CI/CD pipelines, where no
+        terminal is attached, omit the TTY flag (`-t`) and
         pipe your messages to stdin or use a heredoc. For example:
 
         ```sh
@@ -336,7 +339,7 @@ type messages directly:
 
     You should see the messages published by the producer:
 
-    ```
+    ```text
     Hello from NERC OpenShift!
     This is a Kafka test message.
     ```
@@ -354,7 +357,7 @@ type messages directly:
 Applications running inside the same OpenShift project can reach the Kafka broker
 using the internal bootstrap address:
 
-```
+```text
 my-cluster-kafka-bootstrap:9092   # plaintext (no TLS)
 my-cluster-kafka-bootstrap:9093   # TLS
 ```
@@ -413,4 +416,4 @@ oc delete -f install/cluster-operator/ -n <your-project>
     messages stored in that cluster. Make sure you have consumed or exported any
     data you need before running these commands.
 
----
\ No newline at end of file
+---
diff --git a/kafka-on-openshift.md b/kafka-on-openshift.md
index 8426da8d9..68948f2bb 100644
--- a/kafka-on-openshift.md
+++ b/kafka-on-openshift.md
@@ -3,25 +3,26 @@
 ## Apache Kafka Overview
 
 [Apache Kafka](https://kafka.apache.org/) is a distributed event streaming platform
-capable of handling trillions of events per day. Originally developed at LinkedIn and
-open-sourced in 2011, Kafka is designed for high-throughput, fault-tolerant, and
-scalable real-time data pipelines and streaming applications.
+capable of handling trillions of events per day. Originally developed at LinkedIn
+and open-sourced in 2011, Kafka is designed for high-throughput, fault-tolerant,
+and scalable real-time data pipelines and streaming applications.
 
 Kafka uses a **publish-subscribe** model organized around the following core concepts:
 
 - **Broker**: A Kafka server that stores and serves messages.
-- **Topic**: A named stream to which producers publish records and from which consumers
-  read records.
+- **Topic**: A named stream to which producers publish records and from which
+  consumers read records.
 - **Partition**: Topics are split into partitions for parallelism and fault tolerance.
 - **Producer**: A client application that publishes records to one or more topics.
 - **Consumer**: A client application that subscribes to topics and processes records.
-- **Consumer Group**: A group of consumers that collectively consume a topic, with each
-  partition assigned to exactly one member.
+- **Consumer Group**: A group of consumers that collectively consume a topic, with
+  each partition assigned to exactly one member.
 
 Running Kafka on [NERC OpenShift](https://nerc-project.github.io/nerc-docs/openshift/)
 is the recommended approach for course workloads requiring persistent, scalable message
-streaming. This guide uses the **[Strimzi Operator](https://strimzi.io/)**, which is
-the standard Kubernetes-native method for deploying Kafka on OpenShift.
+streaming. This guide uses the **[Strimzi Operator](https://strimzi.io/)**,
+which is the standard Kubernetes-native method for deploying Kafka on OpenShift
+on NERC.
 
 ## Prerequisites
 
@@ -115,7 +116,7 @@ namespace.
 
     The output should look similar to:
 
-    ```
+    ```text
     NAME                                        READY   STATUS    RESTARTS   AGE
     strimzi-cluster-operator-7d96bf8c59-kfzwp   1/1     Running   0          45s
     ```
@@ -210,7 +211,7 @@ a `Kafka` custom resource and a `KafkaNodePool` resource.
 
     A healthy cluster will show output similar to:
 
-    ```
+    ```text
     NAME                                          READY   STATUS    RESTARTS   AGE
     my-cluster-dual-role-0                        1/1     Running   0          3m
     my-cluster-entity-operator-6d7f9c7d4b-xqtlp   2/2     Running   0          2m
@@ -257,7 +258,7 @@ a `Kafka` custom resource and a `KafkaNodePool` resource.
 
     Expected output:
 
-    ```
+    ```text
     NAME       CLUSTER      PARTITIONS   REPLICATION FACTOR   READY
     my-topic   my-cluster   3            1                    True
     ```
@@ -275,8 +276,8 @@ you can use to verify your cluster is working correctly.
 
 ### Run a Producer
 
-The producer tool lets you send messages to a Kafka topic. In interactive mode, you can
-type messages directly:
+The producer tool lets you send messages to a Kafka topic. In interactive mode,
+you can type messages directly:
 
 -   Start a producer pod in interactive mode:
 
@@ -290,12 +291,13 @@ type messages directly:
         --topic my-topic'
     ```
 
-    The `-ti` flags enable **interactive terminal mode**, which allows you to type messages
-    at a prompt. The `--rm=true` flag automatically removes the pod after it exits.
+    The `-ti` flags enable **interactive terminal mode**, which allows you to type
+    messages at a prompt. The `--rm=true` flag automatically removes the pod after
+    it exits.
 
 -   At the prompt, type test messages and press `Enter` after each one:
 
-    ```
+    ```text
     > Hello from NERC OpenShift!
     > This is a Kafka test message.
     ```
@@ -304,8 +306,9 @@ type messages directly:
 
     !!! warning "Important: Interactive Mode (`-ti --rm`)"
 
-        The `-ti --rm` flags work together to create an interactive session that automatically
-        cleans up the pod. Do not use these flags in scripts or CI/CD pipelines—instead,
+        The `-ti --rm` flags work together to create an interactive session that
+        automatically cleans up the pod. In scripts or CI/CD pipelines, where no
+        terminal is attached, omit the TTY flag (`-t`) and
         pipe your messages to stdin or use a heredoc. For example:
 
         ```sh
@@ -336,7 +339,7 @@ type messages directly:
 
     You should see the messages published by the producer:
 
-    ```
+    ```text
     Hello from NERC OpenShift!
     This is a Kafka test message.
     ```
@@ -354,7 +357,7 @@ type messages directly:
 Applications running inside the same OpenShift project can reach the Kafka broker
 using the internal bootstrap address:
 
-```
+```text
 my-cluster-kafka-bootstrap:9092   # plaintext (no TLS)
 my-cluster-kafka-bootstrap:9093   # TLS
 ```
@@ -413,4 +416,4 @@ oc delete -f install/cluster-operator/ -n <your-project>
     messages stored in that cluster. Make sure you have consumed or exported any
     data you need before running these commands.
 
----
\ No newline at end of file
+---