-
Notifications
You must be signed in to change notification settings - Fork 10.4k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[benchmark] Add benchmark for UnsafeMutableRawBufferPointer firstIndex(of:) and lastIndex(of:) #63106
[benchmark] Add benchmark for UnsafeMutableRawBufferPointer firstIndex(of:) and lastIndex(of:) #63106
Conversation
@swift-ci please benchmark |
Benchmark looks reasonable. Do we really need to run it with 4 different orders of magnitude? |
Benchmarking issue:
Removing the versions that take fewer than 10k elements should fix this. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The benchmark harness needs each iteration to run for longer than some minimum amount of time. It appears that only the largest of these benchmark invocations meets the bar.
Search in 100K byte buffer looks to happen quite rare. Contrary, search in buffer of few hundred bytes (or shorter) looks more realistic. E.g. search for nil terminator in string buffers. Are we interested to keep an eye on how future code change might slow down search in short buffer? If yes, then we are interested to keep benchmarks of search in short buffers. Should we just increase multiplicator from 10 to 100 or 1000 and keep all benchmarks? |
If you can increase the multiplier, then that would take care of the sanitizer's complaint. I'm not sure we need all those lengths, though. It matters to have a short length (less than 1 stride), a short length that vectorizes (more than 1 stride and less than 2 strides), and a large length (many strides.) Probably 7, 15 and 1000 are fine. We don't want to have an unnecessarily large number of benchmarks because of the load on the resources. |
Created new issue to describe this work: #63200 |
Done. Let's see how it works. |
I don't know how review request from @lorentey got removed. It was unintentional. Sorry. |
@swift-ci please benchmark |
Looks like we can make the multipliers much larger:
Since we expect to speed up the search a lot, we can probably shoot for the millisecond range, so that the benchmark still executes for long enough after the vectorized implementations are in place. |
4ed9f68
to
b8e0e38
Compare
Increased multiplier to 1000_000 |
@swift-ci please benchmark |
1 similar comment
@swift-ci please benchmark |
…multiplier from 10 to 10000
b8e0e38
to
93c9a64
Compare
@swift-ci please benchmark |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks for this benchmark! We can tune the remaining details once we have an implementation.
The benchmark checker still warns that the 1000-length iterations are slightly too long, but if the improvements are reasonable, then the result will be under the warning threshold.
@swift-ci please smoke test and merge |
No description provided.