-
-
Notifications
You must be signed in to change notification settings - Fork 18.4k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
CLN: Remove special handling of nans in the float64-case of isin #22117
CLN: Remove special handling of nans in the float64-case of isin #22117
Conversation
Codecov Report
@@ Coverage Diff @@
## master #22117 +/- ##
==========================================
- Coverage 92.08% 92.08% -0.01%
==========================================
Files 169 169
Lines 50704 50703 -1
==========================================
- Hits 46691 46690 -1
Misses 4013 4013
Continue to review full report at Codecov.
|
can you run the asv suite for value_counts, isin, factorize and report any changes. |
pandas/tests/test_algos.py
Outdated
@@ -623,6 +623,27 @@ def test_empty(self, empty): | |||
result = algos.isin(vals, empty) | |||
tm.assert_numpy_array_equal(expected, result) | |||
|
|||
def test_different_nans_as_float64(self): | |||
# create different nans from bit-patterns, |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
can you add the issue number as a comment here
I had to add some performance tests for For
|
Hello @realead! Thanks for updating the PR. Cheers ! There are no PEP8 issues in this Pull Request. 🍻 Comment last updated on August 09, 2018 at 21:45 Hours UTC |
@jreback to bump it and to show, that there is really a performance benefit for some scenarios:
wiih The strange thing: when I put all tests into |
can you rebase |
It is no longer needed because the hash-table handles the nans correctly out of the box (see GH21866) Not having to scan the values via isna(...).any() will improve the perfomance.
9432f86
to
567e7bb
Compare
Performance test:
|
looks fine. ping on green. |
@jreback Done |
thanks @realead nice patch, keep em coming! |
It is no longer needed because the hash-table handles the nans correctly out of the box (see #21866)
Not having to scan the values via isna(...).any() will improve the performance.
git diff upstream/master -u -- "*.py" | flake8 --diff
There are no changes visible for the user (others than slightly better performance).