在这里插入图片描述

1.概述

转载：https://blog.csdn.net/UbuntuTouch/article/details/108973356

当es集群我们可以使用如下的命令来对集群进行查看：

GET _cluster/health?level=indices

上面的命令可以让我们定位到到底是哪一个或者哪一些索引有问题。

上面的命令显示的结果为：

在这里插入图片描述
从上面我们可以看出来 restored_logs_4 这个索引是有问题的。它显示的状态为 red，也即是红色。

我们也可以对 shard 进行查询：

GET _cluster/health?level=shards

上面的命令显示的结果为：

在这里插入图片描述
上面的命令显示 restored_logs_4 这个索引的 shard 0 的状态是0。这种情况发生在这个 shard 从未被分配过，或者曾经被分配过，但是整个 node 可能由于某种原因而造成这个 shard 的丢失。

我们甚至直接使用如下的方法来得到这个索引的所有情况：

GET _cluster/health/restored_logs_4?level=shards

上面显示的结果为：

{
    
    
  "cluster_name" : "my_cluster",
  "status" : "red",
  "timed_out" : false,
  "number_of_nodes" : 1,
  "number_of_data_nodes" : 1,
  "active_primary_shards" : 0,
  "active_shards" : 0,
  "relocating_shards" : 0,
  "initializing_shards" : 0,
  "unassigned_shards" : 2,
  "delayed_unassigned_shards" : 0,
  "number_of_pending_tasks" : 0,
  "number_of_in_flight_fetch" : 0,
  "task_max_waiting_in_queue_millis" : 0,
  "active_shards_percent_as_number" : 63.41463414634146,
  "indices" : {
    
    
    "restored_logs_4" : {
    
    
      "status" : "red",
      "number_of_shards" : 1,
      "number_of_replicas" : 1,
      "active_primary_shards" : 0,
      "active_shards" : 0,
      "relocating_shards" : 0,
      "initializing_shards" : 0,
      "unassigned_shards" : 2,
      "shards" : {
    
    
        "0" : {
    
    
          "status" : "red",
          "primary_active" : false,
          "active_shards" : 0,
          "relocating_shards" : 0,
          "initializing_shards" : 0,
          "unassigned_shards" : 2
        }
      }
    }
  }
}

为了能够更进一步查出来到底是什么原因造成的，我们可以如下的命令来进行查询：

GET _cluster/allocation/explain

在实际的使用中，我们需要配置一些参数来得到某个具体索引的分配情况，比如：

GET _cluster/allocation/explain
{
    
    
  "index": "restored_logs_4",
  "shard": 0,
  "primary": true
}

上面的命令显示的结果为：

{
    
    
  "index" : "restored_logs_4",
  "shard" : 0,
  "primary" : true,
  "current_state" : "unassigned",
  "unassigned_info" : {
    
    
    "reason" : "CLUSTER_RECOVERED",
    "at" : "2020-10-05T08:08:54.241Z",
    "last_allocation_status" : "no_valid_shard_copy"
  },
  "can_allocate" : "no_valid_shard_copy",
  "allocate_explanation" : "cannot allocate because a previous copy of the primary shard existed but can no longer be found on the nodes in the cluster",
  "node_allocation_decisions" : [
    {
    
    
      "node_id" : "Ohi9yhffThGZ5X8gq4AXLw",
      "node_name" : "node1",
      "transport_address" : "127.0.0.1:9300",
      "node_attributes" : {
    
    
        "ml.machine_memory" : "34359738368",
        "xpack.installed" : "true",
        "transform.node" : "true",
        "ml.max_open_jobs" : "20",
        "my_rack" : "rack1"
      },
      "node_decision" : "no",
      "store" : {
    
    
        "found" : false
      }
    }
  ]
}

从上面的描述中，我们可以看到为啥我们的 shard 是分配不成功的。

在实际的使用中，我们也可以通过如下的方式来得到一个集群变为另外一种状态，比如：

GET _cluster/health?wait_for_status=yellow

上面的调用表示当集群的状态变为黄色时才会返回结果，否则一直处于 block 状态。

【Elasticsearch】es查看有问题的索引或者分片

1.概述

猜你喜欢