Description
Elasticsearch version: 5.0.0
Plugins installed: none
JVM version: 1.8.0_77-b03
OS version: CentOS release 6.4 (Final)
Description of the problem including expected versus actual behavior:
One of our data nodes suffered from high heap usage last night, and old GC was not able to reclaim any heap space. At the time, bulk and query load was light and all thread pools were fairly idle. The node is one of 120 data nodes in a cluster used for log analysis. Every night a maintenance job deletes/force-merges cold data and creates indices/aliases for the new day.
The node is configured with 31GB of heap and holds about 450 shards with 2k-2.5k segments. Over the past week the segment count and segment memory remained constant or even dropped, yet heap usage kept crawling up until I restarted the node last night. I looked at all memory-related stats in our monitoring systems and could not find the culprit for the increasing heap usage.
Before restarting the node, I took a heap dump and analyzed it with MAT. The huge number of org.elasticsearch.cluster.metadata.AliasOrIndex$Alias objects looks suspicious: they retained nearly 7GB of memory.
We use aliases intensively; there are about 40k aliases in total across the whole cluster. After the node recovered, another heap dump was taken. This time the number of org.elasticsearch.cluster.metadata.AliasOrIndex$Alias objects had dropped to 673,427 instances, retaining only 16MB of memory.
Does this suggest a memory leak in the alias metadata?
ywelsch commented on Dec 7, 2016
xgwu commented on Dec 7, 2016
@ywelsch
There are currently about 40,000 aliases created in the cluster.
Below is a screenshot of the expanded QueryShardContext.
I am willing to share the heap dump but it's 30GB in size. It would be hard for me to upload it to S3 considering I'm located in China. :(
ywelsch commented on Dec 7, 2016
If I see this correctly, the indices request cache (IndicesRequestCache) is holding onto a search context, which in turn holds onto the cluster state from when the request was started. This has been fixed in 5.0.1, see here: #21284
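To make that retention chain concrete, here is a minimal, self-contained Java sketch of the pattern being described. The class names (ClusterStateSnapshot, RequestContext, RequestCacheRetentionSketch) are hypothetical stand-ins for the real cluster state, QueryShardContext, and IndicesRequestCache, not Elasticsearch code; the sketch only illustrates how long-lived cache entries that capture a per-request context can pin every old cluster-state snapshot, and with it all of its alias metadata, until the entries are evicted.

```java
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;

/**
 * Minimal sketch (hypothetical names) of the retention pattern described above:
 * a long-lived request cache whose entries capture a per-request context, which
 * in turn references the full cluster-state snapshot that was current when the
 * request ran. As long as a cache entry lives, the snapshot it references
 * (including all alias metadata) stays reachable and cannot be collected.
 */
public class RequestCacheRetentionSketch {

    // Stand-in for a cluster-state snapshot: with ~40k aliases this object
    // graph is large, and a fresh copy is published on every cluster change.
    static class ClusterStateSnapshot {
        final byte[] aliasMetadata = new byte[1024 * 1024]; // 1 MB toy payload
    }

    // Stand-in for a per-request context (e.g. a shard/query context) that
    // keeps a reference to the snapshot it was built from.
    static class RequestContext {
        final ClusterStateSnapshot state;
        RequestContext(ClusterStateSnapshot state) {
            this.state = state;
        }
    }

    // Long-lived cache: the problem is that the cached value transitively
    // holds the RequestContext and therefore the old snapshot.
    static final Map<String, RequestContext> REQUEST_CACHE = new ConcurrentHashMap<>();

    public static void main(String[] args) {
        for (int i = 0; i < 100; i++) {
            // Each "cluster state update" produces a fresh snapshot ...
            ClusterStateSnapshot snapshot = new ClusterStateSnapshot();
            // ... and caching a context built from it pins that snapshot.
            REQUEST_CACHE.put("request-" + i, new RequestContext(snapshot));
        }
        // The heap now retains 100 independent snapshots (~100 MB in this toy
        // model), mirroring how many duplicated alias graphs can accumulate.
        System.out.println("cached entries: " + REQUEST_CACHE.size());
    }
}
```

Under this reading, the number of retained AliasOrIndex$Alias instances tracks how many old cluster states are pinned by cached entries, which would explain the sharp drop observed after the node was restarted.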