added option keep=False to nlargests/nsmallest #18656

tdpetrou · 2017-12-06T03:19:06Z

closes Make parameter keep=False keep duplicates for nlargest/nsmallest #16818
tests added / passed
passes git diff upstream/master -u -- "*.py" | flake8 --diff
whatsnew entry

I made a minor change to nlargest/nsmallest to keep all the values of the last n when keep=False

gfyoung · 2017-12-06T04:42:50Z

Closed due to decision in #18559.

jreback · 2017-12-06T17:21:17Z

actually this is prob reasonable.

codecov · 2017-12-06T17:21:49Z

Codecov Report

Merging #18656 into master will decrease coverage by 0.01%.
The diff coverage is 100%.

@@            Coverage Diff             @@
##           master   #18656      +/-   ##
==========================================
- Coverage   91.59%   91.57%   -0.02%     
==========================================
  Files         153      153              
  Lines       51221    51223       +2     
==========================================
- Hits        46917    46910       -7     
- Misses       4304     4313       +9

Flag	Coverage Δ
#multiple	`89.44% <100%> (ø)`	⬆️
#single	`40.67% <0%> (-0.11%)`	⬇️

Impacted Files	Coverage Δ
pandas/core/algorithms.py	`94.16% <100%> (+0.01%)`	⬆️
pandas/io/gbq.py	`25% <0%> (-58.34%)`	⬇️
pandas/core/frame.py	`97.81% <0%> (-0.1%)`	⬇️

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 13f6267...fcfc563. Read the comment docs.

codecov · 2017-12-06T17:21:53Z

Codecov Report

Merging #18656 into master will decrease coverage by 0.01%.
The diff coverage is 100%.

@@            Coverage Diff             @@
##           master   #18656      +/-   ##
==========================================
- Coverage   91.61%    91.6%   -0.02%     
==========================================
  Files         153      153              
  Lines       51367    51369       +2     
==========================================
- Hits        47061    47056       -5     
- Misses       4306     4313       +7

Flag	Coverage Δ
#multiple	`89.46% <100%> (ø)`	⬆️
#single	`40.76% <0%> (-0.12%)`	⬇️

Impacted Files	Coverage Δ
pandas/core/frame.py	`97.81% <ø> (-0.1%)`	⬇️
pandas/core/algorithms.py	`94.16% <100%> (+0.01%)`	⬆️
pandas/io/gbq.py	`25% <0%> (-58.34%)`	⬇️
pandas/util/testing.py	`82.52% <0%> (+0.19%)`	⬆️

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 265e327...56954b4. Read the comment docs.

jreback

change the whatsnew note which remove the keep=False (but keep the actual issue number attached there)

jreback · 2017-12-06T17:21:44Z

pandas/tests/frame/test_analytics.py

@@ -2162,6 +2162,21 @@ def test_n_duplicate_index(self, df_duplicates, n, order):
        expected = df.sort_values(order, ascending=False).head(n)
        tm.assert_frame_equal(result, expected)

+    def test_keep_false(self):
+        df = pd.DataFrame({'a': [5, 4, 4, 2, 3, 3, 3, 3],


can you add the issue number here

make a more informative name for the test

jreback · 2017-12-06T17:22:24Z

pandas/tests/series/test_analytics.py

@@ -1865,3 +1865,14 @@ def test_n(self, n):
        result = s.nsmallest(n)
        expected = s.sort_values().head(n)
        assert_series_equal(result, expected)
+
+    def test_keep_false(self):
+        s = Series([10, 9, 8, 7, 7, 7, 7, 6])


same as above

jreback · 2017-12-06T17:22:52Z

pandas/core/algorithms.py

@@ -910,8 +910,8 @@ def __init__(self, obj, n, keep):
        self.n = n
        self.keep = keep

-        if self.keep not in ('first', 'last'):


need to add back to the docs-string (don't need a version added as it was tehre before)

gfyoung · 2017-12-06T17:27:36Z

@jreback : What do you think of making keep=False be the option? Besides API consistency, semantically, it doesn't make as much sense to me (or @tdpetrou ) to use because it would imply that we drop ties.

tdpetrou · 2017-12-11T14:23:16Z

@jreback As @gfyoung mentioned, can we choose the parameter value first before making more changes. Should we do 'all', False, or something completely different?

edit: I am highly in favor of 'all'

jreback · 2017-12-13T14:41:07Z

yeah False doesn't seem intuitive, all I think would be ok. Not sure if we have this option anywhere else (e.g. it doesn't exist in .drop_duplicates nor should it as that would be silly:>)

tdpetrou · 2017-12-13T22:00:33Z

@jreback Changes complete. Also, I didn't see anything in whatsnew on the other issues.

jreback · 2018-02-10T18:43:46Z

pls rebase

jreback · 2018-06-26T10:36:50Z

closing as stale

Closes pandas-devgh-16818. Closes pandas-devgh-18656.

gfyoung · 2018-06-27T07:14:55Z

@jreback : Revived in #21650.

Closes pandas-devgh-16818. Closes pandas-devgh-18656.

gfyoung added Algos Non-arithmetic algos: value_counts, factorize, sorting, isin, clip, shift, diff API Design labels Dec 6, 2017

gfyoung added this to the No action milestone Dec 6, 2017

gfyoung closed this Dec 6, 2017

tdpetrou mentioned this pull request Dec 6, 2017

Make parameter keep=False keep duplicates for nlargest/nsmallest #16818

Closed

jreback reopened this Dec 6, 2017

jreback removed this from the No action milestone Dec 6, 2017

jreback reviewed Dec 6, 2017

View reviewed changes

tdpetrou added 4 commits December 13, 2017 12:30

added option keep=False to nlargests/nsmallest

e8395b9

add "all" argument for nlargest/nsmallest

5b1e7b0

added whatsnew and cleaned up docstrings

5cd3a8d

cleaned up docstrings

56954b4

jreback closed this Jun 26, 2018

gfyoung added a commit to forking-repos/pandas that referenced this pull request Jun 27, 2018

ENH: Allow keep='all' for nlargest/nsmallest

8d3b883

Closes pandas-devgh-16818. Closes pandas-devgh-18656.

gfyoung mentioned this pull request Jun 27, 2018

ENH: Allow keep='all' for nlargest/nsmallest #21650

Merged

gfyoung added a commit to forking-repos/pandas that referenced this pull request Jun 27, 2018

ENH: Allow keep='all' for nlargest/nsmallest

4b7f199

Closes pandas-devgh-16818. Closes pandas-devgh-18656.

gfyoung added a commit to forking-repos/pandas that referenced this pull request Jun 28, 2018

ENH: Allow keep='all' for nlargest/nsmallest

a07c686

Closes pandas-devgh-16818. Closes pandas-devgh-18656.

gfyoung added a commit to forking-repos/pandas that referenced this pull request Jun 28, 2018

ENH: Allow keep='all' for nlargest/nsmallest

4998050

Closes pandas-devgh-16818. Closes pandas-devgh-18656.

gfyoung added a commit to forking-repos/pandas that referenced this pull request Jun 28, 2018

ENH: Allow keep='all' for nlargest/nsmallest (pandas-dev#21650)

0801b8c

Closes pandas-devgh-16818. Closes pandas-devgh-18656.

Sup3rGeo pushed a commit to Sup3rGeo/pandas that referenced this pull request Oct 1, 2018

ENH: Allow keep='all' for nlargest/nsmallest (pandas-dev#21650)

db18fd6

Closes pandas-devgh-16818. Closes pandas-devgh-18656.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

added option keep=False to nlargests/nsmallest #18656

added option keep=False to nlargests/nsmallest #18656

tdpetrou commented Dec 6, 2017 •

edited

Loading

gfyoung commented Dec 6, 2017

jreback commented Dec 6, 2017

codecov bot commented Dec 6, 2017

codecov bot commented Dec 6, 2017 •

edited

Loading

jreback left a comment

jreback Dec 6, 2017

jreback Dec 6, 2017

jreback Dec 6, 2017

jreback Dec 6, 2017

gfyoung commented Dec 6, 2017

tdpetrou commented Dec 11, 2017 •

edited

Loading

jreback commented Dec 13, 2017

tdpetrou commented Dec 13, 2017

jreback commented Feb 10, 2018

jreback commented Jun 26, 2018

gfyoung commented Jun 27, 2018

added option keep=False to nlargests/nsmallest #18656

added option keep=False to nlargests/nsmallest #18656

Conversation

tdpetrou commented Dec 6, 2017 • edited Loading

gfyoung commented Dec 6, 2017

jreback commented Dec 6, 2017

codecov bot commented Dec 6, 2017

Codecov Report

codecov bot commented Dec 6, 2017 • edited Loading

Codecov Report

jreback left a comment

Choose a reason for hiding this comment

jreback Dec 6, 2017

Choose a reason for hiding this comment

jreback Dec 6, 2017

Choose a reason for hiding this comment

jreback Dec 6, 2017

Choose a reason for hiding this comment

jreback Dec 6, 2017

Choose a reason for hiding this comment

gfyoung commented Dec 6, 2017

tdpetrou commented Dec 11, 2017 • edited Loading

jreback commented Dec 13, 2017

tdpetrou commented Dec 13, 2017

jreback commented Feb 10, 2018

jreback commented Jun 26, 2018

gfyoung commented Jun 27, 2018

tdpetrou commented Dec 6, 2017 •

edited

Loading

codecov bot commented Dec 6, 2017 •

edited

Loading

tdpetrou commented Dec 11, 2017 •

edited

Loading