What is the Difference Between Elasticsearch Master Nodes and Data Nodes? - 面试题

Elasticsearch clusters achieve functional separation through node roles, preventing single points of failure and optimizing resource utilization. In earlier versions, master nodes handled cluster management while data nodes managed data storage; however, from Elasticsearch 7.0 onwards, roles can be flexibly configured (e.g., nodes can simultaneously hold both master and data roles), but mixed role configurations introduce performance risks. Understanding the distinction between the two is fundamental for building highly available clusters, especially in scenarios like log analysis and full-text search. This article focuses on core differences and avoids common misconceptions.

Master Nodes and Data Nodes Overview

In Elasticsearch clusters, node roles are defined by the node.roles parameter. Key role divisions include:

Core Responsibilities of Master Nodes

Master nodes serve as the cluster's 'brain,' responsible for metadata management and cluster coordination, specifically including:

Cluster state maintenance: Handling index creation/deletion, shard allocation, and replica strategies.
Election mechanism: Electing a new master node when the cluster is unavailable (configured via cluster.initial_master_nodes).
Metadata storage: Managing index mappings (mapping) and settings, but does not store user data.

Typical use case: In distributed logging systems, master nodes ensure metadata consistency for log indices, preventing shard allocation chaos.

Core Responsibilities of Data Nodes

Data nodes serve as the cluster's 'workhorse,' focused on data storage and query processing, specifically including:

Data storage: Storing index data in shard form (e.g., _doc type documents).
Search and indexing operations: Executing query requests and writing data to shards.
Replica management: Maintaining replicas of primary shards to enhance query throughput and disaster recovery.

Typical use case: In search applications, data nodes directly respond to user queries, requiring vertical scaling under high loads to avoid latency.

Key Differences Analysis

Through technical parameters, the distinctions between the two are clear:

Dimension	Master Node	Data Node
Core Responsibilities	Cluster management (metadata, state maintenance)	Data processing (storage, queries)
Data Handling	None (only metadata)	Yes (index data)
Resource Consumption	Low CPU (lightweight management tasks)	High CPU (query/index intensive)
Configuration Example	`node.roles: [master]`	`node.roles: [data]`
Mixed Role Risks	Master role should not be enabled on data nodes	Data processing should not be performed on master nodes

Technical details: From Elasticsearch 7.0 onwards, role configurations are implemented via elasticsearch.yml, but mixed role configurations can lead to performance bottlenecks. For example, when master nodes handle data requests, cluster coordination tasks consume significant CPU, reducing data processing throughput (refer to official performance guidelines).

Practical Configuration: Avoiding Common Errors

In production environments, misconfiguration is the primary cause of cluster failures. Recommended practices include:

Dedicated Node Principle:

yaml
# Recommended configuration: master nodes handle only management tasks
node.roles: [master]
cluster.initial_master_nodes: ['node1', 'node2', 'node3']

# Recommended configuration: data nodes handle only data processing
node.roles: [data]

Key point: cluster.initial_master_nodes must be configured with all initial cluster nodes (to avoid election failures).
Monitoring: Track CPU and memory usage in real-time using monitoring tools to detect anomalies early.
Scaling: For large clusters, optimize shard allocation using the cluster.routing.allocation.enable parameter to balance load.

Mixed Role Risks

Although Elasticsearch allows nodes to simultaneously hold both master and data roles, it is not recommended for production environments. Performance impact occurs when master nodes handle data requests, blocking cluster coordination tasks. Testing shows mixed role clusters experience a 300% increase in query latency. Failure risk arises when a master node fails, as data nodes cannot handle cluster recovery, leading to data loss. Best practices include: for small clusters (<3 nodes), a single node can be used with node.roles: [master,data]; in production environments, roles must be separated with at least 3 master nodes (for redundancy) and 5+ data nodes (for scalability).