How to get the first column of a pandas DataFrame as a Series?

Question

I tried:

x=pandas.DataFrame(...)
s = x.take([0], axis=1)

And s gets a DataFrame, not a Series.

cs95 · Accepted Answer · 2019-01-22 09:14:33Z

162

From v0.11+, ... use df.iloc.

In [7]: df.iloc[:,0]
Out[7]: 
0    1
1    2
2    3
3    4
Name: x, dtype: int64

edited Jan 22, 2019 at 9:14

cs95

406k106 gold badges744 silver badges797 bronze badges

answered Mar 12, 2013 at 14:49

Jeff

130k21 gold badges223 silver badges189 bronze badges

Sign up to request clarification or add additional context in comments.

1 Comment

gaborous Over a year ago

This is the most compatible version with the new releases and also with the old ones. And probably the most efficient since the dev team is officially promoting this approach.

herrfz · Accepted Answer · 2018-10-23 08:32:05Z

155

>>> import pandas as pd
>>> df = pd.DataFrame({'x' : [1, 2, 3, 4], 'y' : [4, 5, 6, 7]})
>>> df
   x  y
0  1  4
1  2  5
2  3  6
3  4  7
>>> s = df.ix[:,0]
>>> type(s)
<class 'pandas.core.series.Series'>
>>>

===========================================================================

UPDATE

If you're reading this after June 2017, ix has been deprecated in pandas 0.20.2, so don't use it. Use loc or iloc instead. See comments and other answers to this question.

edited Oct 23, 2018 at 8:32

answered Mar 12, 2013 at 13:33

herrfz

4,9044 gold badges29 silver badges37 bronze badges

7 Comments

herrfz Over a year ago

df.set_index('x').y

sapo_cosmico Over a year ago

Would be worth adding the .iloc alternative (as proposed by Jeff further down on this page), as it is not ambiguous in the presence of columns with numbers for names.

herrfz Over a year ago

The answer was given in 2013; as far as I remember, .iloc wasn't there yet back then. In 2016, the correct answer is Jeff's (after all he's pandas God, mind you ;-)). I'm not sure what's SO's policy regarding update of answers due to API change; I'm honestly surprised by the number of votes for this answer, didn't think it was that useful to people...

user2285236 Over a year ago

Another note: ix was deprecated in version 0.20.

normanius Over a year ago

ix should not be used anymore, use iloc instead: s = df.ix[:,0]. See this post for a comparison of iloc and ix.

|

HYRY · Accepted Answer · 2013-03-12 12:42:57Z

125

You can get the first column as a Series by following code:

x[x.columns[0]]

answered Mar 12, 2013 at 12:42

HYRY

97.8k28 gold badges197 silver badges192 bronze badges

4 Comments

Polly Over a year ago

how can i get the last column like that?

elPastor Over a year ago

The others work fine as well, but this one seems more intuitive.

Vishal Over a year ago

This is no good if you have multiple columns with the same name. Whether column names should be unique or not is a separate discussion.

fujianjin6471 Over a year ago

@Polly x[x.columns[x.columns.size-1]]

ImportanceOfBeingErnest · Accepted Answer · 2017-06-17 17:40:54Z

13

Isn't this the simplest way?

By column name:

In [20]: df = pd.DataFrame({'x' : [1, 2, 3, 4], 'y' : [4, 5, 6, 7]})
In [21]: df
Out[21]:
    x   y
0   1   4
1   2   5
2   3   6
3   4   7

In [23]: df.x
Out[23]:
0    1
1    2
2    3
3    4
Name: x, dtype: int64

In [24]: type(df.x)
Out[24]:
pandas.core.series.Series

edited Jun 17, 2017 at 17:40

ImportanceOfBeingErnest

342k61 gold badges737 silver badges771 bronze badges

answered Dec 23, 2016 at 5:30

SamJ

1471 silver badge2 bronze badges

2 Comments

ponadto Over a year ago

In this particular case you know the name of the first column ("x"), but what the question meant was: "How can I access the first column, REGARDLESS of it's name". Also, accessing columns like this (df.x) is not generic -- what if the column name contains spaces? What if the name of the column coincides with DataFrame-s attribute name? It's more general to access columns using __getitem__ (i.e. like so: df["x"]).

Jean-François Corbett Over a year ago

Also doesn't work if the column's header has e.g. spaces in it.

Christopher Pfeifer · Accepted Answer · 2018-04-07 23:06:28Z

4

This works great when you want to load a series from a csv file

x = pd.read_csv('x.csv', index_col=False, names=['x'],header=None).iloc[:,0]
print(type(x))
print(x.head(10))


<class 'pandas.core.series.Series'>
0    110.96
1    119.40
2    135.89
3    152.32
4    192.91
5    177.20
6    181.16
7    177.30
8    200.13
9    235.41
Name: x, dtype: float64

answered Apr 7, 2018 at 23:06

Christopher Pfeifer

996 bronze badges

Comments

BlackList96 · Accepted Answer · 2020-07-07 17:51:49Z

4

df[df.columns[i]]

where i is the position/number of the column(starting from 0).

So, i = 0 is for the first column.

You can also get the last column using i = -1

answered Jul 7, 2020 at 17:51

BlackList96

4895 silver badges14 bronze badges

Collectives™ on Stack Overflow

How to get the first column of a pandas DataFrame as a Series?

6 Answers 6

1 Comment

7 Comments

4 Comments

2 Comments

Comments

Comments

Your Answer

Linked

Hot Network Questions

Collectives™ on Stack Overflow

6 Answers 6

1 Comment

7 Comments

4 Comments

2 Comments

Comments

Comments

Your Answer

Sign up or log in

Post as a guest

Linked

Related