-
Notifications
You must be signed in to change notification settings - Fork 25.1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add HierarchyCircuitBreakerService #6739
Add HierarchyCircuitBreakerService #6739
Conversation
Note that this is currently marked as a breaking change because I've changed the way that |
The settings start with |
/** | ||
* @return the maximum size this circuit breaker can grow to | ||
*/ | ||
public long getMaximum(); |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Can you document how getMaximum differs from getLimit?
@jpountz I agree it would fit better there, I've moved the classes and incorporated the other feedback you asked for as well. |
moving out to 1.4 for now |
@@ -364,38 +366,62 @@ public T set(long index, T value) { | |||
|
|||
final PageCacheRecycler recycler; | |||
final AtomicLong ramBytesUsed; |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The purpose of this thing was to do circuit breaking, I think we can remove it now
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
ramBytesUsed
is still required to keep track of the used memory so it can be decremented when released.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think the used memory is only used for testing so that would be ok to remove it?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Ahh okay, yes, since we test the breaker is reset we no longer need this in the tests also, I'll remove it.
This looks better but I'm not very happy with this boolean on every BigArrays method. I would rather have one instance that does accounting+tripping and another one that would only do accounting. API-wise maybe what we need is for the singleton BigArrays (the one created by Guice) to only do accounting (no breaking), and then have a Regarding circuit breaking, I'd like to see if we can make it simpler. For instance, do we actually need overheads, limits or the ability to disable distribution? I think we also need to have logic in |
I think this would work and make it a bit simpler, I'll try to do this.
I think we can get rid of maximum sizes and just use the limit as the max, but keep minimum sizes for breakers (so you can force space to always be available for some field data or aggregations). For overhead, it's a nob in the event that the field data estimations don't estimate correctly, so I think it's important to keep. It's not used for BigArrays though, so it's set to 1.0. I'm not sure it (overheads) should be removed either.
I added the logic to do this already, see: https://github.com/elasticsearch/elasticsearch/pull/6739/files#diff-e6761c31a9d75e74580a93c5aeb3aaa3R198 |
Agreed on minimum sizes, they are important.
But does it really help to overestimate memory usage reports by 3%?
I had missed it, great! |
The additional 3% is to err on the side of estimating too high rather than too low. I can change the estimation function to include the extra 3%, but having a configurable value in case my estimation isn't accurate for some people is important in my opinion, since it's dynamically changeable and the estimation formula is not. |
Is it something that could be done outside of the circuit-breaking API only for field data? In the |
@jpountz I added the change to add Still working on the other changes. |
public int compare(CircuitBreaker b1, CircuitBreaker b2) { | ||
long b1Free = b1.getMaximum() - b1.getUsed(); | ||
long b2Free = b2.getMaximum() - b2.getUsed(); | ||
if (b1Free > b2Free) { |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
can't you use Long.compare(b1Free, b2Free)
?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Yes, good idea, I will use that.
@jpountz pushed a couple of new commits that remove the concept of a I also removed configuring the Parent breaker entirely (now its only purpose is to track the total memory usage across all breakers and the number of times the breaker tripped and wasn't able to redistribute). I think this makes it a lot simpler. |
So the issue with this is that the |
public void circuitBreak(String fieldName, long bytesNeeded); | ||
|
||
/** | ||
* add bytes to the breaker and maybe trip |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think we also need to document if there is a circuit break then bytes will not be accounted? (that's what the implementation seems to do?)
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Not sure I follow, the bytes are still accounted, they just aren't allowed to increase. In the case of fielddata an exception is thrown and the generated data never makes it into the cache, for bigarrays the exception is thrown and memory is released in the .close()
method of AbstractArray.java
this.bigarrays = bigarrays.withCircuitBreaking(); | ||
} else { | ||
this.bigarrays = bigarrays; | ||
} |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Should we leave that responsibility to the callers of this constructor?
@dakrone Agreed that it is much simpler and on the need to keep the overhead! Thanks! I left some minor comments about code cosmetics. The last thing I'd like to discuss is the redistribution logic, but other than that, this looks good to me! |
@jpountz I think I've addressed all of the comments. I also changed this entirely to use the parent-check method we talked about. Instead of redistributing the children's limits, each child has a set limit while still checking the global ("parent") breaker hasn't been tripped. Changed the default limits to 60% for field data and 40% for bigarrays/requests, overall limit is 70%. |
this.ordinal = ordinal; | ||
} | ||
|
||
public int getOrdinal() { |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Can you rename it to something else? This is close to ordinal
which also exists on enums but should not be used for serialization as it would change if we added/removed entries in that enum.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Sure, I'll rename this.
@dakrone Left minor comments. I think it's close now. |
LGTM |
Squashed this, rebased on master and resolved all the conflicts (there were quite a few since this PR was opened 21 days ago and we released a new version of ES between now and then). I also added documentation and changed the version check to be 1.4.0 instead of 2.0.0. I'm planning on merging this Monday since it received Adrien's +1 if there are no more comments by then. |
Adds a breaker for request BigArrays, which are used for parent/child queries as well as some aggregations. Certain operations like Netty HTTP responses and transport responses increment the breaker, but will not trip. This also changes the output of the nodes' stats endpoint to show the parent breaker as well as the fielddata and request breakers. There are a number of new settings for breakers now: `indices.breaker.total.limit`: starting limit for all memory-use breaker, defaults to 70% `indices.breaker.fielddata.limit`: starting limit for fielddata breaker, defaults to 60% `indices.breaker.fielddata.overhead`: overhead for fielddata breaker estimations, defaults to 1.03 (the fielddata breaker settings also use the backwards-compatible setting `indices.fielddata.breaker.limit` and `indices.fielddata.breaker.overhead`) `indices.breaker.request.limit`: starting limit for request breaker, defaults to 40% `indices.breaker.request.overhead`: request breaker estimation overhead, defaults to 1.0 The breaker service infrastructure is now generic and opens the path to adding additional circuit breakers in the future. Fixes elastic#6129 Conflicts: src/main/java/org/elasticsearch/index/fielddata/IndexFieldData.java src/main/java/org/elasticsearch/index/fielddata/IndexFieldDataService.java src/main/java/org/elasticsearch/index/fielddata/RamAccountingTermsEnum.java src/main/java/org/elasticsearch/index/fielddata/ordinals/GlobalOrdinalsBuilder.java src/main/java/org/elasticsearch/index/fielddata/ordinals/InternalGlobalOrdinalsBuilder.java src/main/java/org/elasticsearch/index/fielddata/plain/AbstractIndexOrdinalsFieldData.java src/main/java/org/elasticsearch/index/fielddata/plain/DisabledIndexFieldData.java src/main/java/org/elasticsearch/index/fielddata/plain/IndexIndexFieldData.java src/main/java/org/elasticsearch/index/fielddata/plain/NonEstimatingEstimator.java src/main/java/org/elasticsearch/index/fielddata/plain/PackedArrayIndexFieldData.java src/main/java/org/elasticsearch/index/fielddata/plain/ParentChildIndexFieldData.java src/main/java/org/elasticsearch/index/fielddata/plain/SortedSetDVOrdinalsIndexFieldData.java src/main/java/org/elasticsearch/node/internal/InternalNode.java src/test/java/org/elasticsearch/index/aliases/IndexAliasesServiceTests.java src/test/java/org/elasticsearch/index/codec/CodecTests.java src/test/java/org/elasticsearch/index/fielddata/AbstractFieldDataTests.java src/test/java/org/elasticsearch/index/fielddata/IndexFieldDataServiceTests.java src/test/java/org/elasticsearch/index/mapper/MapperTestUtils.java src/test/java/org/elasticsearch/index/query/IndexQueryParserFilterCachingTests.java src/test/java/org/elasticsearch/index/query/SimpleIndexQueryParserTests.java src/test/java/org/elasticsearch/index/query/guice/IndexQueryParserModuleTests.java src/test/java/org/elasticsearch/index/search/FieldDataTermsFilterTests.java src/test/java/org/elasticsearch/index/search/child/ChildrenConstantScoreQueryTests.java src/test/java/org/elasticsearch/index/similarity/SimilarityTests.java
Adds a breaker for request BigArrays, which are used for parent/child
queries as well as some aggregations. Certain operations like Netty HTTP
responses and transport responses increment the breaker, but will not
trip.
This also changes the output of the nodes' stats endpoint to show the
parent breaker as well as the fielddata and request breakers.
There are a number of new settings for breakers now:
indices.breaker.total.limit
: starting limit for all memory-use breaker,defaults to 70%
indices.breaker.fielddata.limit
: starting limit for fielddata breaker,defaults to 60%
indices.breaker.fielddata.overhead
: overhead for fielddata breakerestimations, defaults to 1.03
(the fielddata breaker settings also use the backwards-compatible
setting
indices.fielddata.breaker.limit
andindices.fielddata.breaker.overhead
)indices.breaker.request.limit
: starting limit for request breaker,defaults to 40%
indices.breaker.request.overhead
: request breaker estimation overhead,defaults to 1.0
The breaker service infrastructure is now generic and opens the path to
adding additional circuit breakers in the future.
Fixes #6129