search.default_search_timeout NOT working #21595
Comments
Thanks for reporting, I think I see the issue.
@jasontedor thanks for the quick response! Besides the global search timeout not working in v5.0.0, the timeout set in the query body does not seem to work as well as it did in v2.4.0 under heavy search load. We have a large cluster, with 120 data nodes running on 70 physical servers, for search and analysis of trillions of logs. In v2.4.0 the search timeout protected the cluster pretty well by terminating expensive queries from inept users. This has not been the case since we upgraded to v5.0.0 last weekend: some data nodes now saturate their disk I/O during peak usage time on a daily basis, and we have to restart some nodes quite frequently.
When a node is saturated, iostat shows sustained peak disk reads, and I observed from the task management API that some queries seem to run endlessly, although they should have been timed out by the node. Even after we stopped sending queries to the node, it still seemed to be working very hard on its active search threads, and the completed-search counter showed no sign of increasing. We also tried other measures, such as reducing the search thread pool size and queue length in the hope of lowering search concurrency and rejecting queries early during peak time, but without any luck. Below is one example output from the tasks API when the node is "stuck" searching:
{ "nodes": {
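For anyone wanting to run the same check, here is a minimal sketch of how task management output could be scanned for searches that have outlived their timeout. It assumes the response has already been fetched (for example from `GET /_tasks?detailed`) and parsed into a dict with the usual `nodes` / `tasks` / `action` / `running_time_in_nanos` layout. The function name and the threshold are illustrative and not part of Elasticsearch.

```python
def long_running_searches(tasks_response, limit_seconds):
    """Return (node_name, task_id, seconds) for search tasks running past the limit.

    tasks_response is the parsed JSON of a task management API reply.
    The helper name and limit_seconds are hypothetical, chosen for this sketch.
    """
    found = []
    for node in tasks_response.get("nodes", {}).values():
        for task_id, task in node.get("tasks", {}).items():
            # Only look at search-related actions, e.g. indices:data/read/search
            if "search" not in task.get("action", ""):
                continue
            seconds = task.get("running_time_in_nanos", 0) / 1e9
            if seconds > limit_seconds:
                found.append((node.get("name"), task_id, seconds))
    # Longest-running tasks first
    return sorted(found, key=lambda item: item[2], reverse=True)
```

With a 1s timeout configured, any search task this reports as running for tens of seconds is a candidate for the behaviour described above.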
Okay, I've confirmed what I see; I'll open a PR to fix soon.
I opened #21599.
This will be fixed in 5.0.2.
Elasticsearch version:
5.0.0
Plugins installed: []
NONE
JVM version:
1.8.0_77
OS version:
CentOS release 6.4 (Final)
Linux 2.6.32-431.29.2.el6.x86_64 #1 SMP Tue Sep 9 21:36:05 UTC 2014 x86_64 x86_64 x86_64 GNU/Linux
Description of the problem including expected versus actual behavior:
Since upgrading from v2.4.0 to v5.0.0, the global search timeout has no effect any more.
Steps to reproduce:
PUT /_cluster/settings
{
  "transient": {
    "search.default_search_timeout": "1s"
  }
}
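A rough way to script this reproduction is sketched below. It builds the same settings body as above and checks a search response for a query that ran past the limit without reporting `timed_out`. The node address, the helper names and the `logs-*` index pattern in the usage note are assumptions for illustration. Since search timeouts are best-effort, a `took` slightly over the limit is not proof of the bug by itself.

```python
import json
import urllib.request

ES_URL = "http://localhost:9200"  # assumption: a local test node


def default_timeout_settings(timeout="1s"):
    """Body for PUT /_cluster/settings, matching the steps above."""
    return {"transient": {"search.default_search_timeout": timeout}}


def timeout_was_ignored(search_response, limit_ms):
    """True if the search ran longer than limit_ms yet did not report timed_out."""
    return search_response["took"] > limit_ms and not search_response["timed_out"]


def send(method, path, body=None):
    """Send a JSON request to the node and return the parsed reply."""
    data = json.dumps(body).encode("utf-8") if body is not None else None
    request = urllib.request.Request(
        ES_URL + path,
        data=data,
        method=method,
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(request) as response:
        return json.load(response)
```

Against a running node, one would call `send("PUT", "/_cluster/settings", default_timeout_settings("1s"))`, then run an expensive search such as `send("POST", "/logs-*/_search", {"query": {"match_all": {}}})` and pass the result to `timeout_was_ignored(result, 1000)`.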