Network Management
The Network Management module in eEKAS provides a centralized and intuitive interface for configuring and monitoring all network interfaces and connections across the cluster. In a Ceph-based, high-availability environment, reliable and well-structured networking is a critical foundation for storage performance, cluster stability, and service availability.
This module allows administrators to define, modify, and troubleshoot network configurations without disrupting active workloads, ensuring consistent operation during both normal conditions and failure scenarios.
Network Architecture and Requirements
To ensure stable cluster operation, eEKAS requires a minimum of two logically separated networks, each serving a distinct and essential role within the cluster. Together they maintain availability, consistency, and predictable performance under both normal operation and failure conditions.
Cluster / Internal Communication Network: This network is used for node-to-node communication, cluster coordination, and heartbeat signaling. It allows the cluster to continuously monitor node health, detect failures rapidly, and trigger automated failover actions. A dedicated, low-latency internal network is mandatory for fast failure detection and reliable high-availability behavior.
Ceph Storage Network: The Ceph network is used exclusively for storage-related traffic, including data replication, recovery, and rebalancing between nodes. Isolating this traffic prevents storage operations from impacting client access and keeps performance predictable, especially during rebuild or recovery events.
In addition to these mandatory networks, one or more client access networks are typically configured to serve storage protocols such as SMB, NFS, iSCSI, NVMe-oF, and S3.
Separating client traffic from internal cluster and storage replication traffic significantly improves scalability, security, and overall system stability.
IP Groups and Client Access
For block and file services, eEKAS networking uses IP Groups: logical collections of one or more IP addresses that can be assigned to specific services. IP Groups can be moved transparently between cluster nodes during planned maintenance or automatically during failover events, ensuring uninterrupted client connectivity.
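The IP Group behavior described above can be illustrated with a small model. This is only a sketch under stated assumptions: the `IPGroup` class, the `move_group` and `fail_over` functions, the node names, and the least-loaded placement rule are all hypothetical and are not taken from eEKAS. The point it shows is that the addresses stay with the group while only the hosting node changes, which is why clients keep connecting to the same IPs.

```python
from dataclasses import dataclass


@dataclass
class IPGroup:
    """Hypothetical model of an IP Group: addresses bound to one service."""
    name: str
    addresses: list[str]
    service: str
    node: str  # node currently hosting the group


def move_group(group: IPGroup, target: str, healthy: set[str]) -> None:
    """Planned maintenance: relocate a group to a healthy target node."""
    if target not in healthy:
        raise ValueError(f"target node {target!r} is not healthy")
    group.node = target


def fail_over(groups: list[IPGroup], failed: str, healthy: set[str]) -> list[IPGroup]:
    """Failover: reassign every group hosted on the failed node.

    Assumed placement rule: pick the surviving node that currently
    hosts the fewest groups (ties broken by node name).
    """
    candidates = sorted(healthy - {failed})
    if not candidates:
        raise RuntimeError("no healthy node left to host IP groups")
    load = {n: sum(g.node == n for g in groups) for n in candidates}
    moved = []
    for g in groups:
        if g.node == failed:
            target = min(candidates, key=lambda n: (load[n], n))
            g.node = target
            load[target] += 1
            moved.append(g)
    return moved
```

For example, if `node1` hosts an SMB group and an NFS group and then fails, calling `fail_over(groups, "node1", {"node2", "node3"})` reassigns both groups to the surviving nodes without changing any address in `addresses`.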
Key Capabilities
- Centralized Interface Management – Configure physical interfaces, bonded interfaces for redundancy and performance, and VLAN-tagged interfaces across all nodes from a single interface.
- VLAN Support – Logically separate traffic types such as management, cluster communication, storage replication, and client access without requiring additional physical NICs.
- Multi-Network Architecture – Enforce strict separation between internal cluster communication, Ceph storage traffic, and client-facing services to improve performance and resilience.
- High Availability Integration – Automatic failover mechanisms ensure services remain available even during node or hardware failures.
- Real-Time Monitoring – Monitor link status, bandwidth utilization, and traffic patterns for proactive diagnostics and capacity planning.
By combining a multi-network architecture with centralized management and automated failover logic, eEKAS ensures stable, high-performance connectivity for both the storage backend and the services that depend on it.
Network Bandwidth Requirements
To ensure predictable performance and efficient Ceph replication, high-bandwidth and low-latency networking is required for all internal and storage-related networks.
- Minimum: 10 GbE per network – suitable for small clusters, test environments, or moderate workloads.
- Recommended: 25 GbE or 40 GbE – ideal for production workloads with consistent performance during rebuilds.
- Best Practice: 100 GbE or higher – optimal for large-scale or performance-critical deployments.
Network Bandwidth Comparison
| Tier | Bandwidth | Typical Use Case | Operational Impact |
| --- | --- | --- | --- |
| Minimum | 10 GbE | Small clusters, testing, low I/O | Longer rebuild and recovery times |
| Recommended | 25 / 40 GbE | Production, virtualization, mixed workloads | Stable performance during rebuilds |
| Best Practice | 100 GbE+ | Large clusters, high-density systems | Minimal impact during failures or maintenance |
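To make the difference between the bandwidth tiers concrete, the following back-of-the-envelope sketch estimates how long it takes to move a given amount of data over each link speed. The 50 TB data volume and the 70% usable-throughput factor are illustrative assumptions, not eEKAS figures, and real rebuild times also depend on disk speed, CPU load, and Ceph recovery throttling.

```python
def transfer_hours(data_tb: float, link_gbps: float, efficiency: float = 0.7) -> float:
    """Estimate hours needed to move data_tb terabytes over one link.

    efficiency is an assumed fraction of the line rate that is usable
    for payload (protocol overhead, contention); 0.7 is illustrative.
    """
    if link_gbps <= 0 or not 0 < efficiency <= 1:
        raise ValueError("link_gbps must be > 0 and efficiency in (0, 1]")
    bits = data_tb * 1e12 * 8                      # decimal terabytes to bits
    seconds = bits / (link_gbps * 1e9 * efficiency)
    return seconds / 3600


if __name__ == "__main__":
    tiers = [("Minimum", 10), ("Recommended", 25), ("Recommended", 40), ("Best Practice", 100)]
    for tier, gbps in tiers:
        print(f"{tier:<13} {gbps:>3} GbE: {transfer_hours(50, gbps):.1f} h")
```

Under these assumptions, moving 50 TB takes roughly 16 hours at 10 GbE and about 1.6 hours at 100 GbE, which is the network-side reason the Minimum tier is associated with longer rebuild and recovery times.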
Example VLAN Layout
- VLAN 10 – Management & Cluster Communication: Node management, monitoring, and heartbeat traffic.
- VLAN 20 – Ceph Storage Network: Replication, recovery, rebalancing, and internal Ceph communication.
- VLAN 30 – Client Access: SMB, NFS, iSCSI, NVMe-oF, and optional S3 access.
This separation keeps critical cluster and storage operations isolated from client workloads, which minimizes contention, improves security, and makes performance more predictable. The model can be extended with additional networks or VLANs for backup traffic, dedicated S3 access, or other specialized workloads as required.
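For readers who want to see what the three-VLAN layout corresponds to on a generic Linux host, the sketch below generates standard iproute2 commands for the tagged sub-interfaces. In eEKAS this configuration is done through the Network Management module, so this is purely illustrative: the parent interface name `bond0`, the `192.168.x.11/24` addresses, and the `Vlan` and `vlan_commands` names are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Vlan:
    vlan_id: int
    role: str
    cidr: str  # hypothetical address of this node on the VLAN


# The example layout: one VLAN per traffic class.
PLAN = [
    Vlan(10, "management-cluster", "192.168.10.11/24"),
    Vlan(20, "ceph-storage", "192.168.20.11/24"),
    Vlan(30, "client-access", "192.168.30.11/24"),
]


def vlan_commands(parent: str, plan: list[Vlan]) -> list[str]:
    """Return iproute2 commands that create one tagged interface per VLAN."""
    ids = [v.vlan_id for v in plan]
    if len(ids) != len(set(ids)):
        raise ValueError("each traffic class needs its own VLAN ID")
    commands = []
    for v in plan:
        if not 1 <= v.vlan_id <= 4094:  # valid 802.1Q VLAN ID range
            raise ValueError(f"invalid VLAN ID: {v.vlan_id}")
        dev = f"{parent}.{v.vlan_id}"
        commands += [
            f"ip link add link {parent} name {dev} type vlan id {v.vlan_id}",
            f"ip addr add {v.cidr} dev {dev}",
            f"ip link set dev {dev} up",
        ]
    return commands


if __name__ == "__main__":
    print("\n".join(vlan_commands("bond0", PLAN)))
```

The duplicate-ID check reflects the intent of the layout: if two traffic classes shared a VLAN, storage and client traffic would no longer be isolated from each other.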