Select Pandas rows based on list index

Question

I have a dataframe df:

20060930  10.103       NaN     10.103   7.981
20061231  15.915       NaN     15.915  12.686
20070331   3.196       NaN      3.196   2.710
20070630   7.907       NaN      7.907   6.459

I want to select rows by position, using the positions given in a list. For example, with [1, 3], the rows left are:

20061231  15.915       NaN     15.915  12.686
20070630   7.907       NaN      7.907   6.459

How can I do that, and which function does it?

legel · Accepted Answer · 2022-08-02 00:02:46Z

252

Use .iloc for integer-based (positional) indexing and .loc for label-based indexing. See the example below:

ind_list = [1, 3]
df.iloc[ind_list]
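To make the difference concrete, here is a minimal sketch that rebuilds the question's frame and selects the same two rows both ways. The column names a to d are made up for illustration, since the question does not show them.

```python
import numpy as np
import pandas as pd

# Rebuild the question's frame; the column names are hypothetical.
df = pd.DataFrame(
    {
        "a": [10.103, 15.915, 3.196, 7.907],
        "b": [np.nan] * 4,
        "c": [10.103, 15.915, 3.196, 7.907],
        "d": [7.981, 12.686, 2.710, 6.459],
    },
    index=[20060930, 20061231, 20070331, 20070630],
)

ind_list = [1, 3]

# Positional selection: rows at positions 1 and 3, whatever their labels are.
by_position = df.iloc[ind_list]

# Label selection: the same rows, addressed by their index labels.
by_label = df.loc[[20061231, 20070630]]

print(by_position)
```

Both selections return the 20061231 and 20070630 rows here; they only coincide because the labels passed to .loc happen to sit at positions 1 and 3.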

edited Aug 2, 2022 at 0:02

legel


answered Oct 3, 2013 at 9:43

Woody Pride



4 Comments

t_warsop Over a year ago

This is now deprecated, .iloc should be used for positional indexing

MogaGennis Over a year ago

This didn't work for me, I had to use df.iloc[[1,3],:]

user42 Over a year ago

The solution in the update doesn't work either, I have the latest working solution as of March 2021 here

ba_ul Over a year ago

This will work as long as you can guarantee that ind_list is a subset of df.index. If ind_list contains even a single element that doesn't exist in df.index, pandas will raise a KeyError. If you can't guarantee that, use isin as suggested in other answers.
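A small sketch of the failure mode described in the last comment, assuming pandas 1.0 or later (where a list passed to .loc with any missing label raises KeyError). The frame and the list are made up for illustration.

```python
import pandas as pd

# Hypothetical frame with labels 0, 1, 2.
df = pd.DataFrame({"value": [10, 20, 30]}, index=[0, 1, 2])
ind_list = [1, 5]  # 5 is not a label in df.index

# Label lookup with .loc fails as soon as one label is missing.
try:
    df.loc[ind_list]
    raised = False
except KeyError:
    raised = True

# A boolean mask built with isin simply ignores the missing label.
tolerant = df[df.index.isin(ind_list)]

print(raised)
print(tolerant)
```

The mask version returns only the rows whose labels are actually present, so it trades an explicit error for silent omission.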

Community · Accepted Answer · 2020-04-17 01:22:58Z

154

You can also use iloc:

df.iloc[[1,3],:]

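Because iloc selects by position, this will not return the rows labelled 1 and 3 if the index labels in your dataframe no longer correspond to the order of the rows, for example after prior computations. In that case, select by label with a boolean mask:

df[df.index.isin([1,3])]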

... as suggested in other responses.
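Here is a minimal sketch of the case this answer warns about, using a made-up frame whose labels get out of step with row order after a sort.

```python
import pandas as pd

# Hypothetical frame; sorting by value reorders the rows but keeps the labels.
df = pd.DataFrame({"value": [5, 2, 9, 1]})
sorted_df = df.sort_values("value")  # index labels are now 3, 1, 0, 2

# Positional: rows at positions 1 and 3 of the sorted frame (labels 1 and 2).
by_position = sorted_df.iloc[[1, 3]]

# Label-based: rows labelled 1 and 3, returned in the frame's current order.
by_label = sorted_df[sorted_df.index.isin([1, 3])]

print(by_position)
print(by_label)
```

Note that the isin mask keeps the frame's row order, not the order of the list you pass in.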

edited Apr 17, 2020 at 1:22

CommunityBot


answered Oct 10, 2013 at 12:17

yemu



Community · Accepted Answer · 2020-06-20 09:12:55Z

120

Another way, although the code is longer, which can be faster than the approaches above. Check it with the %timeit magic:

df[df.index.isin([1,3])]

PS: You figure out the reason

edited Jun 20, 2020 at 9:12

CommunityBot


answered Jan 8, 2019 at 11:14

Amruth Lakkavaram


2 Comments

CiaranWelsh Over a year ago

use df.index.get_level_values(0).isin for multiindex

Tom Price Over a year ago

Your comparison is invalid, because it should compare with df.iloc instead of df.loc.
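If you want to check the speed claim yourself outside a notebook, a rough sketch with the standard timeit module could look like this. The frame size and repeat count are arbitrary choices, and the numbers will vary by machine and pandas version, so no expected timings are given.

```python
import timeit

import numpy as np
import pandas as pd

# Hypothetical frame with a default 0..n-1 index, so labels equal positions.
df = pd.DataFrame({"value": np.arange(100_000)})
wanted = [1, 3]

# Time each approach over the same number of repetitions.
t_isin = timeit.timeit(lambda: df[df.index.isin(wanted)], number=100)
t_iloc = timeit.timeit(lambda: df.iloc[wanted], number=100)
t_loc = timeit.timeit(lambda: df.loc[wanted], number=100)

print(f"isin mask: {t_isin:.4f}s  iloc: {t_iloc:.4f}s  loc: {t_loc:.4f}s")
```

Keep in mind that the three calls only return the same rows here because the index is the default one; isin and .loc match labels while .iloc matches positions.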

user42 · Accepted Answer · 2021-03-11 09:13:10Z

27

If index_list contains your desired indices, you can get the dataframe with the desired rows by doing

index_list = [1,2,3,4,5,6]
df.loc[df.index[index_list]]

This is based on the latest documentation as of March 2021.

answered Mar 11, 2021 at 9:13

user42


1 Comment

Gabriel Over a year ago

This is a great answer. The advantage of this method is that you can use the full power of df.loc. For example you can select the column you want with df.loc[df.index[index_list], "my_column"] and even set values with df.loc[df.index[index_list], "my_column"] = "my_value"
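A short sketch of the pattern from this answer and comment, on a made-up frame with string labels. It assumes the index labels are unique; with duplicate labels, .loc would return every matching row.

```python
import pandas as pd

# Hypothetical frame; "my_column" follows the name used in the comment.
df = pd.DataFrame(
    {"my_column": ["a", "b", "c", "d"], "other": [1, 2, 3, 4]},
    index=["w", "x", "y", "z"],
)

index_list = [1, 3]

# Translate positions into labels, then hand the labels to .loc.
labels = df.index[index_list]

# Read a single column for those rows.
selected = df.loc[labels, "my_column"]

# Assign to the same rows and column.
df.loc[labels, "my_column"] = "my_value"

print(selected)
print(df)
```

Going through df.index[...] lets you think in positions while still using .loc for column selection and assignment.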

Community · Accepted Answer · 2020-06-20 09:12:55Z

6

For large datasets, it is memory efficient to read only selected rows via the skiprows parameter.

Example

pred = lambda x: x not in [1, 3]
pd.read_csv("data.csv", skiprows=pred, index_col=0, names=...)

This will now return a DataFrame from a file that skips all rows except 1 and 3.

Details

From the docs:

skiprows : list-like or integer or callable, default None

...

If callable, the callable function will be evaluated against the row indices, returning True if the row should be skipped and False otherwise. An example of a valid callable argument would be lambda x: x in [0, 2]

This feature is available in pandas 0.20.0 and later. See also the corresponding issue and a related post.
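A self-contained sketch of the callable form, using io.StringIO in place of data.csv so it runs without a file. The CSV contents and the column names date, a, and b are assumptions for illustration; the row numbers passed to the callable are 0-based line positions in the file.

```python
import io

import pandas as pd

# Stand-in for data.csv: four data lines, no header line.
csv_text = (
    "20060930,10.103,7.981\n"
    "20061231,15.915,12.686\n"
    "20070331,3.196,2.710\n"
    "20070630,7.907,6.459\n"
)

keep = {1, 3}

# The callable returns True for lines to skip, so every line not in keep is dropped.
df = pd.read_csv(
    io.StringIO(csv_text),
    skiprows=lambda i: i not in keep,
    header=None,
    names=["date", "a", "b"],  # hypothetical column names
    index_col=0,
)

print(df)
```

If your file has a header line, it counts as line 0, so the callable needs to let it through as well.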

edited Jun 20, 2020 at 9:12

CommunityBot


answered Jun 20, 2018 at 18:13

pylang



Julio · Accepted Answer · 2022-05-27 10:29:34Z

5

What you are trying to do is to filter your dataframe by index. The best way to do that in pandas at the moment is the following:

Single Index

desired_index_list = [1,3]
df[df.index.isin(desired_index_list)]

Multiindex

desired_index_list = [1,3]
index_level_to_filter = 0
df[df.index.get_level_values(index_level_to_filter).isin(desired_index_list)]
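For the MultiIndex case, a minimal runnable sketch might look like the following. The level names id and group and the data are made up for illustration.

```python
import pandas as pd

# Hypothetical two-level index.
index = pd.MultiIndex.from_tuples(
    [(1, "a"), (2, "a"), (3, "b"), (4, "b")],
    names=["id", "group"],
)
df = pd.DataFrame({"value": [10, 20, 30, 40]}, index=index)

desired_index_list = [1, 3]
index_level_to_filter = 0

# Build a boolean mask from one level of the index, then filter with it.
mask = df.index.get_level_values(index_level_to_filter).isin(desired_index_list)
result = df[mask]

print(result)
```

get_level_values also accepts the level name, so "id" would work in place of 0 in this example.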

answered May 27, 2022 at 10:29

Julio



Loochie · Accepted Answer · 2020-11-05 03:05:56Z

4

There are many ways of solving this problem, and the ones listed above are the most commonly used ways of achieving the solution. I want to add two more ways, just in case someone is looking for an alternative.

index_list = [1,3]

df.take(index_list)  # selects by position, like iloc

#or

df.query('index in @index_list')  # selects by index label
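These two alternatives are not interchangeable: take works on positions and query on index labels. A small sketch on a made-up frame whose labels differ from its positions shows the difference. It assumes the index is unnamed, so that the name index inside the query string refers to it.

```python
import pandas as pd

# Hypothetical frame whose labels do not match row positions.
df = pd.DataFrame({"value": [100, 200, 300, 400]}, index=[10, 1, 3, 7])
index_list = [1, 3]

# take: rows at positions 1 and 3 (labels 1 and 7).
by_position = df.take(index_list)

# query: rows whose index label is 1 or 3.
by_label = df.query("index in @index_list")

print(by_position)
print(by_label)
```

With a default 0..n-1 index both calls return the same rows, which is why the distinction is easy to miss.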

answered Nov 5, 2020 at 3:05

Loochie


2 Comments

user27221 Over a year ago

This is the correct answer if you have, say, a named index like:

pd.DataFrame({'num_legs': [2, 4, 8, 0, 6, 10], 'num_wings': [2, 0, 0, 0, 4, 0], 'num_specimen_seen': [10, 2, 1, 8, 3, 0], 'do_I_like_it': [0, 1, 1, 1, 0, 0]}, index=['falcon', 'dog', 'spider', 'fish', 'dragonfly', 'limulus'])

vault Over a year ago

@user27221 could you please take your DataFrame, transpose it, and then explain to me how to select num_legs based on num_wings == 0 and do_I_like_it == 1?

user3503711 · Accepted Answer · 2022-11-22 17:17:17Z

4

To get a new DataFrame from filtered indexes:

For my problem, I needed a new dataframe from the indexes. I found a straightforward way to do this (note that filter matches index labels, not positions):

iloc_list=[1,2,4,8]
df_new = df.filter(items = iloc_list , axis=0)

You can also filter columns using this. Please see the documentation for details.
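A quick sketch of how filter behaves on rows, with a made-up frame. The list is named label_list here to stress that the items are matched against index labels; labels that do not exist are dropped without an error.

```python
import pandas as pd

# Hypothetical frame with default labels 0..4.
df = pd.DataFrame({"value": list("abcde")})

label_list = [1, 2, 4, 8]  # 8 is not a label in df

# Keep only the rows whose label appears in label_list.
df_new = df.filter(items=label_list, axis=0)

print(df_new)
```

If you need an error for missing labels instead, df.loc[label_list] is the stricter choice.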

answered Nov 22, 2022 at 17:17

user3503711

