Elasticsearch

Indices

Data in a index is partitioned acrossed shards to make storage more managable. An index may be too large to fit on a single disk, but shards are smaller and can be allocated across different nodes as needed

  • Searches can be run across different shards in parallel speeding up query processing
  • Each shard may have a number of replicas
  • The replica shards process queries but do not index documents
  • They are allocated at to a different node from the primary shard

Frozen indices

For very old indices that are rarely accessed, we should free up the memory that they use

Frozen indices | Elasticsearch Reference [7.9] | Elastic

  • When an index is frozen, it becomes read-only, and its resources are no longer kept active.
  • The tradeoff is that frozen indices are slower to search

Index lifecycle management

  • Hot indices: Actively receiving data index and are frequently serving queries. Typical actions for this phase includes
    • Setting high priority for recovery
    • Specifying rollover policy to create a new index when the current one becomes too large, too old, or has too many documents.
  • Warm indices are no longer having data indexed in them, but they still serve read request.
    • Setting medium priority for recovery.
    • Optimizing the indices by shrinking them, force-merging them, or setting them to read-only.
    • Allocating the indices to less performant hardware.
  • Cold indices are rarely queried at all.
Last updated on