Pandas make new column from string slice of another column

Question

I want to create a new column in Pandas using a string sliced for another column in the dataframe.

For example.

Sample  Value  New_sample
AAB     23     A
BAB     25     B

Where New_sample is a new column formed from a simple [:1] slice of Sample

I've tried a number of things to no avail - I feel I'm missing something simple.

What's the most efficient way of doing this?

EdChum · Accepted Answer · 2014-09-11 14:21:13Z

149

You can call the str method and apply a slice, this will be much quicker than the other method as this is vectorised (thanks @unutbu):

df['New_Sample'] = df.Sample.str[:1]

You can also call a lambda function on the df but this will be slower on larger dataframes:

In [187]:

df['New_Sample'] = df.Sample.apply(lambda x: x[:1])
df
Out[187]:
  Sample  Value New_Sample
0    AAB     23          A
1    BAB     25          B

edited Sep 11, 2014 at 14:21

answered Sep 11, 2014 at 14:02

EdChum

397k204 gold badges836 silver badges583 bronze badges

Sign up to request clarification or add additional context in comments.

Comments

Gaurav Singh · Accepted Answer · 2020-07-03 08:03:32Z

24

Adding solution to a common variation when the slice width varies across DataFrame Rows:

#--Here i am extracting the ID part from the Email (i.e. the part before @)

#--First finding the position of @ in Email
d['pos'] = d['Email'].str.find('@')

#--Using position to slice Email using a lambda function
d['new_var'] = d.apply(lambda x: x['Email'][0:x['pos']],axis=1)

#--Imagine x['Email'] as a string on which, slicing is applied

Hope this Helps !

answered Jul 3, 2020 at 8:03

Gaurav Singh

8797 silver badges8 bronze badges

2 Comments

fortuneRice Over a year ago

Thanks for adding this common variation solution, just what I was looking for! And to combine into a single line: d['new_var'] = d.apply(lambda x: x['Email'][0:x['Email'].find('@')],axis=1)

Lucas Meier Over a year ago

You could also do d["new_var"] = np.vectorize(lambda x : x.split("@")[0])(np.array(d["Email"],dtype=str)), which would spare you an extra column.

niraj · Accepted Answer · 2018-07-29 16:33:03Z

18

You can also use slice() to slice string of Series as following:

df['New_sample'] = df['Sample'].str.slice(0,1)

From pandas documentation:

Series.str.slice(start=None, stop=None, step=None)

Slice substrings from each element in the Series/Index

For slicing index (if index is of type string), you can try:

df.index = df.index.str.slice(0,1)

answered Jul 29, 2018 at 16:33

niraj

18.2k4 gold badges36 silver badges50 bronze badges

1 Comment

Kristen G. Over a year ago

is there any preference between df.somecolumn.str[0:1] and df.somecolumn.str.slice(0,1)?

Sarah · Accepted Answer · 2023-12-01 21:30:52Z

0

Adding a solution for when you want to take the second element from your pandas dataframe index, which is a tuple, and move it into its own column. Not sure if there is a shorter way to do this but this way works:

df["newcol"]=df.index
df["newcol"]=df["newcol"].apply(lambda x: x[1])

answered Dec 1, 2023 at 21:30

Sarah

1034 bronze badges

Collectives™ on Stack Overflow

Pandas make new column from string slice of another column

4 Answers 4

Comments

2 Comments

1 Comment

Comments

Your Answer

Linked

Hot Network Questions

Collectives™ on Stack Overflow

4 Answers 4

Comments

2 Comments

1 Comment

Comments

Your Answer

Sign up or log in

Post as a guest

Linked

Related