Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[benchmark] Add benchmark for UnsafeMutableRawBufferPointer firstIndex(of:) and lastIndex(of:) #63106

Merged

Conversation

valeriyvan
Copy link
Contributor

No description provided.

@valeriyvan valeriyvan changed the title Add benchmark for UnsafeMutableRawBufferPointer firstIndex(of:) and lastIndex(of:) [benchmark] Add benchmark for UnsafeMutableRawBufferPointer firstIndex(of:) and lastIndex(of:) Jan 19, 2023
@glessard
Copy link
Contributor

@swift-ci please benchmark

@glessard
Copy link
Contributor

Benchmark looks reasonable. Do we really need to run it with 4 different orders of magnitude?

@glessard glessard requested review from glessard and lorentey January 20, 2023 18:04
@glessard
Copy link
Contributor

Benchmarking issue:

Benchmark Check Report
⚠️ RawBuffer.1000.findFirst execution took 3 μs.
Increase the workload of RawBuffer.1000.findFirst to be more than 20 μs.

Removing the versions that take fewer than 10k elements should fix this.

Copy link
Contributor

@glessard glessard left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The benchmark harness needs each iteration to run for longer than some minimum amount of time. It appears that only the largest of these benchmark invocations meets the bar.

@valeriyvan
Copy link
Contributor Author

The benchmark harness needs each iteration to run for longer than some minimum amount of time. It appears that only the largest of these benchmark invocations meets the bar.

Search in 100K byte buffer looks to happen quite rare. Contrary, search in buffer of few hundred bytes (or shorter) looks more realistic. E.g. search for nil terminator in string buffers. Are we interested to keep an eye on how future code change might slow down search in short buffer? If yes, then we are interested to keep benchmarks of search in short buffers. Should we just increase multiplicator from 10 to 100 or 1000 and keep all benchmarks?

@glessard
Copy link
Contributor

If you can increase the multiplier, then that would take care of the sanitizer's complaint. I'm not sure we need all those lengths, though. It matters to have a short length (less than 1 stride), a short length that vectorizes (more than 1 stride and less than 2 strides), and a large length (many strides.) Probably 7, 15 and 1000 are fine. We don't want to have an unnecessarily large number of benchmarks because of the load on the resources.

@glessard
Copy link
Contributor

Created new issue to describe this work: #63200

@valeriyvan
Copy link
Contributor Author

If you can increase the multiplier, then that would take care of the sanitizer's complaint. I'm not sure we need all those lengths, though. It matters to have a short length (less than 1 stride), a short length that vectorizes (more than 1 stride and less than 2 strides), and a large length (many strides.) Probably 7, 15 and 1000 are fine. We don't want to have an unnecessarily large number of benchmarks because of the load on the resources.

Done. Let's see how it works.

@valeriyvan valeriyvan requested review from glessard and removed request for lorentey January 25, 2023 09:34
@valeriyvan
Copy link
Contributor Author

I don't know how review request from @lorentey got removed. It was unintentional. Sorry.

@glessard glessard requested a review from lorentey January 25, 2023 18:17
@glessard
Copy link
Contributor

@swift-ci please benchmark

@glessard
Copy link
Contributor

glessard commented Jan 25, 2023

Looks like we can make the multipliers much larger:

Benchmark Check Report
⚠️ RawBuffer.7.findLast execution took 3 μs.
Increase the workload of RawBuffer.7.findLast to be more than 20 μs.
unable to compute memory footprint of RawBuffer.7.findLast
⚠️ RawBuffer.7.findFirst execution took 4 μs.
Increase the workload of RawBuffer.7.findFirst to be more than 20 μs.
unable to compute memory footprint of RawBuffer.7.findFirst
unable to compute memory footprint of RawBuffer.1000.findFirst
unable to compute memory footprint of RawBuffer.1000.findLast
⚠️ RawBuffer.15.findLast execution took 5 μs.
Increase the workload of RawBuffer.15.findLast to be more than 20 μs.
unable to compute memory footprint of RawBuffer.15.findLast
⚠️ RawBuffer.15.findFirst execution took 6 μs.
Increase the workload of RawBuffer.15.findFirst to be more than 20 μs.
unable to compute memory footprint of RawBuffer.15.findFirst

Since we expect to speed up the search a lot, we can probably shoot for the millisecond range, so that the benchmark still executes for long enough after the vectorized implementations are in place.

@valeriyvan valeriyvan force-pushed the Benchmark-UnsafeRawBufferPointer-first branch from 4ed9f68 to b8e0e38 Compare January 26, 2023 08:25
@valeriyvan
Copy link
Contributor Author

Since we expect to speed up the search a lot, we can probably shoot for the millisecond range, so that the benchmark still executes for long enough after the vectorized implementations are in place.

Increased multiplier to 1000_000

@glessard
Copy link
Contributor

@swift-ci please benchmark

1 similar comment
@glessard
Copy link
Contributor

@swift-ci please benchmark

@valeriyvan valeriyvan force-pushed the Benchmark-UnsafeRawBufferPointer-first branch from b8e0e38 to 93c9a64 Compare January 29, 2023 12:59
@glessard
Copy link
Contributor

@swift-ci please benchmark

Copy link
Contributor

@glessard glessard left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks for this benchmark! We can tune the remaining details once we have an implementation.

The benchmark checker still warns that the 1000-length iterations are slightly too long, but if the improvements are reasonable, then the result will be under the warning threshold.

@glessard
Copy link
Contributor

glessard commented Feb 1, 2023

@swift-ci please smoke test and merge

@swift-ci swift-ci merged commit afe43a4 into swiftlang:main Feb 1, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Implement _customIndexOfEquatableElement for UnsafeRawBufferPointer
3 participants