How to add a column with values 1 to len(df) to a dataframe

Question

The index that I have in the dataframe (with 30 rows) is of the form:

Int64Index([171, 174, 173, 172, 199, …, 175, 200])

The index is not strictly increasing because the data frame is the output of a sort().

I want to add a column which is the series:

[1, 2, 3, 4, 5, …, 30]

How should I go about doing that?

Chang She · Accepted Answer · 2012-08-29 03:13:24Z

178

How about:

df['new_col'] = range(1, len(df) + 1)

Alternatively if you want the index to be the ranks and store the original index as a column:

df = df.reset_index()

answered Aug 29, 2012 at 3:13

Chang She

17k8 gold badges43 silver badges26 bronze badges

Sign up to request clarification or add additional context in comments.

2 Comments

javabeangrinder Over a year ago

This answer got me halfway to where I wanted since I already had an index that I wanted replaced. In such a case you can complement with: df = df.reset_index(drop=True)

panzerpower Over a year ago

Using np.arange instead of native range, like df['new_col'] = np.arange(1, df.shape[0] + 1) should speed up the runtime, especially when dealing with large datasets.

user1225054 · Accepted Answer · 2013-10-13 18:57:05Z

113

I stumbled on this question while trying to do the same thing (I think). Here is how I did it:

df['index_col'] = df.index

You can then sort on the new index column, if you like.

answered Oct 13, 2013 at 18:57

user1225054

2 Comments

pacholik Over a year ago

No, that would be unsorted.

citynorman Over a year ago

more dynamic df[df.index.name] = df.index

Rudy Matela · Accepted Answer · 2019-07-17 19:47:09Z

23

How about this:

from pandas import *

idx = Int64Index([171, 174, 173])
df = DataFrame(index = idx, data =([1,2,3]))
print df

It gives me:

Is this what you are looking for?

edited Jul 17, 2019 at 19:47

Rudy Matela

6,5102 gold badges35 silver badges37 bronze badges

answered Aug 28, 2012 at 23:13

nitin

7,42811 gold badges44 silver badges53 bronze badges

2 Comments

Navneet Over a year ago

Almost. So, in sum, I need to create another data frame which contains the rank/position of the row. And then, I need to join these.

nitin Over a year ago

Yes you combine add this df to your existing dataframe by using df.combine_first(df2)

XiB · Accepted Answer · 2021-10-20 19:21:22Z

9

The way to do that would be this:

Resetting the index:

df.reset_index(drop=True, inplace=True)

Sorting an index:

df.sort_index(inplace=True)

Setting a new index from a column:

df.set_index('column_name', inplace=True)

Setting a new index from a range:

df.index = range(1, 31, 1) #a range starting at one ending at 30 with a stepsize of 1.

Sorting a dataframe based on column value:

df.sort_values(by='column_name', inplace=True)

Reassigning variables works as-well:

df=df.reset_index(drop=True)
df=df.sort_index()
df=df.set_index('column_name')
df.index = range(1, 31, 1) #a range starting at one ending at 30 with a stepsize of 1.
df=df.sort_values(by='column_name')

answered Oct 20, 2021 at 19:21

XiB

77011 silver badges24 bronze badges

1 Comment

flywire Over a year ago

I don't think you answered: I want to add a column which is the [sort order] series ie set a column to the index.

Collectives™ on Stack Overflow

How to add a column with values 1 to len(df) to a dataframe

4 Answers 4

2 Comments

2 Comments

2 Comments

1 Comment

Your Answer

Linked

Hot Network Questions

Collectives™ on Stack Overflow

4 Answers 4

2 Comments

2 Comments

2 Comments

1 Comment

Your Answer

Sign up or log in

Post as a guest

Linked

Related