@ben-z (Member) commented Aug 6, 2025

No description provided.

@ben-z ben-z requested review from alexboden and Copilot August 6, 2025 01:10
Copilot AI (Contributor) left a comment

Pull Request Overview

This PR creates a new blog post about WATcloud's networking architecture, detailing their dual-network design that separates campus connectivity from high-performance cluster communication. The post covers the evolution from complex software overlay networks to simplified physical infrastructure.

Key changes:

  • Added asset configuration for blog post images (square and wide formats)
  • Created comprehensive blog post covering network architecture, performance optimizations, and engineering decisions
  • Documented the transition from Yggdrasil overlay networks to reliable physical infrastructure

Reviewed Changes

Copilot reviewed 2 out of 2 changed files in this pull request and generated 2 comments.

| File | Description |
| --- | --- |
| scripts/asset-config.json | Added image asset configurations for the networking blog post |
| pages/blog/networking.mdx | New blog post covering WATcloud's dual-network architecture and design evolution |

- **Suitability**: Good for low-traffic environments; may need to switch to external load balancer for high-traffic scenarios

#### Cluster Traffic Strategy
For cluster communication, we use DNS round-robin with health-aware routing. Our [custom-built Automatic DNS Failover Agent](https://github.com/WATonomous/automatic-dns-failover) continuously monitors node health and dynamically updates DNS records, adding or removing entries as nodes become available or go offline.

Copilot AI Aug 6, 2025

[nitpick] This paragraph appears to be disconnected from the surrounding context about external traffic strategy. Consider restructuring this content or adding transitional text to improve flow.

Suggested change

Original:
For cluster communication, we use DNS round-robin with health-aware routing. Our [custom-built Automatic DNS Failover Agent](https://github.com/WATonomous/automatic-dns-failover) continuously monitors node health and dynamically updates DNS records, adding or removing entries as nodes become available or go offline.

Suggested:
While external traffic is funneled through a single highly available entry point, internal cluster communication requires a different approach to ensure resilience and scalability. For cluster communication, we use DNS round-robin with health-aware routing. Our [custom-built Automatic DNS Failover Agent](https://github.com/WATonomous/automatic-dns-failover) continuously monitors node health and dynamically updates DNS records, adding or removing entries as nodes become available or go offline.
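
For readers of this thread, the health-aware DNS round-robin that the quoted paragraph describes can be sketched roughly as below. This is a minimal illustration under stated assumptions, not the Automatic DNS Failover Agent's actual code: `RecordSet`, `reconcile`, `round_robin`, the hostname, and the IP addresses are all hypothetical, the record set lives in memory rather than at a real DNS provider, and the health check is passed in as a plain function.

```python
from dataclasses import dataclass, field
from itertools import cycle
from typing import Callable, Dict, Iterable, List, Set


@dataclass
class RecordSet:
    """Hypothetical in-memory stand-in for the A records under one hostname."""
    name: str
    addresses: Set[str] = field(default_factory=set)


def reconcile(
    records: RecordSet,
    nodes: Iterable[str],
    is_healthy: Callable[[str], bool],
) -> Dict[str, List[str]]:
    """Make the record set match the currently healthy nodes.

    Healthy nodes missing from the record set are added; entries that are
    unhealthy (or no longer listed in `nodes`) are removed. The returned
    change list is what a real agent would push to its DNS provider.
    """
    desired = {ip for ip in nodes if is_healthy(ip)}
    added = sorted(desired - records.addresses)
    removed = sorted(records.addresses - desired)
    records.addresses = desired
    return {"added": added, "removed": removed}


def round_robin(records: RecordSet, count: int) -> List[str]:
    """Simulate clients rotating through the published addresses."""
    if not records.addresses:
        return []
    rotation = cycle(sorted(records.addresses))
    return [next(rotation) for _ in range(count)]


if __name__ == "__main__":
    # Assumed example values; a real check might probe a port or an HTTP endpoint.
    nodes = ["10.0.0.1", "10.0.0.2", "10.0.0.3"]
    offline = {"10.0.0.2"}
    records = RecordSet("cluster.example.internal")
    print(reconcile(records, nodes, lambda ip: ip not in offline))
    print(round_robin(records, 4))
```

Running `reconcile` on a timer gives the "adding or removing entries as nodes become available or go offline" behavior; keeping the health check injectable makes the loop easy to test without touching the network.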

@ben-z ben-z removed the request for review from alexboden August 6, 2025 01:26
@ben-z (Member, Author) commented Aug 6, 2025

Moved to internal repo: https://github.com/WATonomous/infra-config/pull/4487

@ben-z ben-z closed this Aug 6, 2025
@ben-z ben-z deleted the benz/blog-networking branch August 6, 2025 01:27