Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.
Table of Contents
minLevel1
maxLevel2
outlinefalse
typelist
printablefalse

This section describes how to set up the network services for a storage cluster.

Info

Platform Server

Skip this section if using Legacy Swarm Platform: network services are set up.

Setting Up NTP for Time Synchronization

The Network Time Protocol (NTP) server provides time synchronization between the cluster nodes, which is critical for many Swarm components. For best results, configure multiple NTP servers in close proximity to a cluster. For example, use the NTP Pool Project's continental zones, which are pools of NTP servers.

...

See Configuring an External Time Server

Info

NTP 3.0

NTP 3.0 included a design limitation causing the time value to wrap in the year 2036. NTP cannot correct the time if the BIOS clock in a cluster node is set beyond this wrap point. Verify the BIOS clocks in all nodes are set to a year prior to 2036 before booting Swarm in a cluster. This issue was resolved in NTP 4.0.

The node does not boot if the configured NTP server(s) cannot be reached. See Configuring a Node without NTP if the cluster nodes cannot access an external or internal NTP server.

Setting Up DHCP for IP Address Administration

The Dynamic Host Configuration Protocol (DHCP) server provides IP addresses to the cluster nodes and other devices enabled as DHCP clients. While Swarm nodes are not required to have static IP addresses to discover and communicate with each other, administrators may find it easier to manage and monitor a cluster where each node receives a predetermined IP address.

...

  1. Map the Ethernet media access control (MAC) address of each node to a static IP address.

  2. Configure the DHCP server to provide each node with an IP address for each of these:

    • network mask

    • default gateway

    • DNS server

Setting Up DNS for Name Resolution

The Domain Name Service (DNS) is used to resolve host names into IP addresses. While DNS is not required for Swarm nodes to communicate with each other, DNS can be very useful for client applications to reach the cluster. DNS is one method to use to enable access to objects over the Internet if using named objects.

Tip

Best Practice

Although client applications can initiate first contact with any node in the storage cluster – even choosing to access the same node every time – best practice is for the node of first contact to be distributed evenly around the cluster.

...

In this example, the Time To Live (TTL) value for each of the records in the round-robin group is very small (0-2 seconds). This configuration is necessary so clients who cache the resolution results quickly flush them. This process allows the first contact node to be distributed and allows a client to move quickly to another node if the first contact node is unavailable.

Tip

Best Practice

Applications are recommended to implement robust mechanisms such as Zero Configuration Networking for distributing the node of first contact and skipping failed nodes. An administrator can use DNS to assist with less complex applications.

Preparing for Domains

To allow clients to access named objects over the Internet, enable incoming HTTP requests to resolve to the correct domain. (A cluster can contain many domains, each of which can contain many buckets, each of which can contain many named objects.) Cluster and domain names should both be Internet Assigned Numbers Authority (IANA) compatible host names, such as cluster.example.com.

...

  1. Set up host files to map domain names to IP address(es) of the first contact node.

    • For a Linux system, configure the /etc/hosts file.

    • For a Windows system, configure the %SystemRoot%\system32\drivers\etc\hosts file.
      Example of a configured hosts file: 

      Code Block
      languagetext
      192.168.1.111 cluster.example.com 
      192.168.1.112 vault.example.com
  2. Define multiple DNS entries ("A" or "CNAME" records) that identify the IP address(es) of the first contact node in the storage cluster. This process creates a DNS round-robin that provides client request load balancing.

    • For help setting up DNS for Swarm, see Setting up DNS for name resolution, above.

    • For information about setting up a DNS server, see the DNS software documentation.

Setting Up a Syslog Server for Critical Alerts

A syslog server must be set up to capture critical operational alerts from the nodes in a storage cluster. The server captures messages sent by the Swarm nodes on UDP port 514.

See Configuring External Logging on configuring an rsyslog server and the log.host and log.level parameters used to send Swarm messages to a syslog server.

Setting Up SNMP for Monitoring

Swarm provides monitoring information and administrative controls using the Simple Network Management Protocol (SNMP). Using an SNMP console, an administrator can monitor a storage cluster from a central location.

Info

Disabling SNMP

Disable the Swarm Storage setting snmp.enabled (v12.0) if SNMP needs to be disabled cluster-wide, such as for a security need or using Swarm in containers.

Swarm uses an SNMP management information base (MIB) definition file to map SNMP object identifiers (OIDs) to logical names. The MIB can be located in one of two locations, depending on the configuration:

  • The aggregate MIB for the entire cluster is located at /usr/share/snmp/mibs if cluster nodes boot from a Platform Server.

  • The MIB is located in the root directory of the Swarm software distribution if cluster nodes do not boot from a Platform Server.

See Using SNMP with Swarm

Setting Up Network Load Balancing

Although the Swarm Storage Cluster nodes interact with client applications using the HTTP communication protocol, the nodes operate differently from traditional web servers. Placing storage nodes behind an HTTP load balancer is usually an unnecessary configuration. A properly configured load balancer can add value-added services like SSL off-load and centralized certificate management.

During normal operations, a storage node routinely redirects a client to another node within the cluster. The client must initiate another HTTP request to the redirected node when this process occurs. Any process that virtualizes the storage node IP addresses or attempts to control the nodes connected to the client generates communication errors.

Setting Up the Network Interfaces

Gigabit Ethernet or faster NICs provide the recommended 1000 Mbps data communications speed between storage cluster nodes. Swarm automatically uses multiple NICs to provide a redundant network connection.

...