Add column to dataframe with constant value

Question

I have an existing dataframe which I need to add an additional column to which will contain the same value for every row.

Existing df:

Date, Open, High, Low, Close
01-01-2015, 565, 600, 400, 450

New df:

Name, Date, Open, High, Low, Close
abc, 01-01-2015, 565, 600, 400, 450

I know how to append an existing series / dataframe column. But this is a different situation, because all I need is to add the 'Name' column and set every row to the same value, in this case 'abc'.

EdChum · Accepted Answer · 2015-04-08 14:09:52Z

569

df['Name']='abc' will add the new column and set all rows to that value:

In [79]:

df
Out[79]:
         Date, Open, High,  Low,  Close
0  01-01-2015,  565,  600,  400,    450
In [80]:

df['Name'] = 'abc'
df
Out[80]:
         Date, Open, High,  Low,  Close Name
0  01-01-2015,  565,  600,  400,    450  abc

answered Apr 8, 2015 at 14:09

EdChum

397k204 gold badges836 silver badges583 bronze badges

Sign up to request clarification or add additional context in comments.

1 Comment

s.paszko Over a year ago

What when i would like to insert new column with each constant value equal to some list? For example i would like to insert Trajectory column with each row value equal to [0,0,0]? It doesn't work your way df['Trajectory'] = [0,0,0] because pandas understands list as whole column.

piRSquared · Accepted Answer · 2017-06-20 15:42:00Z

124

You can use insert to specify where you want to new column to be. In this case, I use 0 to place the new column at the left.

df.insert(0, 'Name', 'abc')

  Name        Date  Open  High  Low  Close
0  abc  01-01-2015   565   600  400    450

answered Jun 20, 2017 at 15:42

piRSquared

296k68 gold badges509 silver badges654 bronze badges

1 Comment

Kallol Bosu Roy Choudhuri Over a year ago

This gives more control on the position of the new column.

Jolta · Accepted Answer · 2021-01-28 09:00:38Z

100

Summing up what the others have suggested, and adding a third way

You can:

assign(**kwargs):
```
 df.assign(Name='abc')
```
access the new column series (it will be created) and set it:
```
 df['Name'] = 'abc'
```
insert(loc, column, value, allow_duplicates=False)
```
 df.insert(0, 'Name', 'abc')
```

where the argument loc ( 0 <= loc <= len(columns) ) allows you to insert the column where you want.

'loc' gives you the index that your column will be at after the insertion. For example, the code above inserts the column Name as the 0-th column, i.e. it will be inserted before the first column, becoming the new first column. (Indexing starts from 0).

All these methods allow you to add a new column from a Series as well (just substitute the 'abc' default argument above with the series).

edited Jan 28, 2021 at 9:00

Jolta

2,7362 gold badges31 silver badges45 bronze badges

answered May 14, 2018 at 9:20

Michele Piccolini

2,9831 gold badge22 silver badges31 bronze badges

1 Comment

Att Righ Over a year ago

Plus one for assign because it avoids mutating the underlying data frame.

Zero · Accepted Answer · 2015-04-08 14:10:27Z

67

Single liner works

df['Name'] = 'abc'

Creates a Name column and sets all rows to abc value

answered Apr 8, 2015 at 14:10

Zero

77.4k22 gold badges153 silver badges153 bronze badges

2 Comments

KansaiRobot Over a year ago

That gives me a warning A value is trying to be set on a copy of a slice from a DataFrame.

MorLavender Over a year ago

df['New Col'] = pd.Series(["abc" for x in range(len(df.index))]) example below show

törzsmókus · Accepted Answer · 2024-11-22 17:59:07Z

I want to draw more attention to a portion of @michele-piccolini's answer.

I strongly believe that .assign is the best solution here. In the real world, these operations are not in isolation, but in a chain of operations. And if you want to support a chain of operations, you should probably use the .assign method.

Here is an example using snowfall data at a ski resort (but the same principles would apply to say ... financial data).

This code reads like a recipe of steps. Both assignment (with =) and .insert make this much harder:

raw = pd.read_csv('https://github.com/mattharrison/datasets/raw/master/data/alta-noaa-1980-2019.csv',
                  parse_dates=['DATE'])
def clean_alta(df):
    return (df
            .loc[:, ['STATION', 'NAME', 'LATITUDE', 'LONGITUDE', 'ELEVATION', 'DATE', 
                     'PRCP', 'SNOW', 'SNWD', 'TMAX', 'TMIN', 'TOBS']]
            .groupby(pd.Grouper(key='DATE', freq='W'))
            .agg({'PRCP': 'sum', 'TMAX': 'max', 'TMIN': 'min', 'SNOW': 'sum', 'SNWD': 'mean'})
            .assign(LOCATION='Alta', 
                    T_RANGE=lambda w_df: w_df.TMAX-w_df.TMIN)
    )

clean_alta(raw)

Notice the line .assign(LOCATION='Alta', that creates a column with a single value in the middle of the rest of the operations.

Zachary · Accepted Answer · 2021-07-15 11:49:53Z

17

One Line did the job for me.

df['New Column'] = 'Constant Value'
df['New Column'] = 123

answered Jul 15, 2021 at 11:49

Zachary

3073 silver badges7 bronze badges

1 Comment

Ben Farmer Over a year ago

For me this adds a new row, not a column.

Imam_AI · Accepted Answer · 2021-09-08 14:16:46Z

11

You can Simply do the following:

df['New Col'] = pd.Series(["abc" for x in range(len(df.index))])

answered Sep 8, 2021 at 14:16

Imam_AI

1511 silver badge7 bronze badges

2 Comments

Roni Antonio Over a year ago

Thanks, that was specially good to avoid the damn chained indexing warning.

BeRT2me Over a year ago

Thank you, this worked perfectly for assigning a dataframe to a column, aka df['date'] = pd.Series([pd.date_range('2020-01-01', '2023-12-31') for x in range(len(df.index))])

Arvind Singh Kandasi · Accepted Answer · 2022-09-21 05:24:26Z

7

This single line will work.

df['name'] = 'abc'

answered Sep 21, 2022 at 5:24

Arvind Singh Kandasi

711 silver badge1 bronze badge

1 Comment

KansaiRobot Over a year ago

gives a warning

Anurag Dhadse · Accepted Answer · 2023-02-02 09:46:52Z

The `append` method has been deprecated since Pandas 1.4.0

So instead use the above method only if using actual pandas DataFrame object:

df["column"] = "value"

Or, if setting value on a view of a copy of a DataFrame, use concat() or assign():

This way the new Series created has the same index as original DataFrame, and so will match on exact rows

# adds a new column in view `where_there_is_one` named 
# `client` with value `display_name`
# `df` remains unchanged
df = pd.DataFrame({"number": ([1]*5 + [0]*5 )})

where_there_is_one = df[ df["number"] == 1]
where_there_is_one = pd.concat([
    where_there_is_one,
    pd.Series(["display_name"]*df.shape[0],
              index=df.index, 
              name="client")
   ], 
join="inner", axis=1)


# Or use assign
where_there_is_one = where_there_is_one.assign(client = "display_name")

Output:

where_there_is_one:                df:
| 0   | number | client       |    | 0 | number |
| --- | ---    | ---          |    |---| -------|
| 0   | 1      | display_name |    | 0 | 1      |
| 1   | 1      | display_name |    | 1 | 1      |
| 2   | 1      | display_name |    | 2 | 1      |
| 3   | 1      | display_name |    | 3 | 1      |
| 4   | 1      | display_name |    | 4 | 1      |
                                   | 5 | 0      |
                                   | 6 | 0      |
                                   | 7 | 0      |
                                   | 8 | 0      |
                                   | 9 | 0      |

paperskilltrees · Accepted Answer · 2024-05-03 23:15:01Z

0

While df[name] = value works, it might not create a Series of the data type you desire. When dealing with large numerical arrays, you want to avoid wasting 64 bits on what could fit in just 8 and also bypass unnecessary type conversions .astype(), which can fragment and waste your memory too.

To assign a constant column of a specific data type, you can write something like:

df[name] = pd.Series(0, index=df.index, dtype='Int8')

In this example, we create a pandas Series around the existing DataFrame's index and assign it to a column name. The column is of a nullable integer type (note the capital "I" in Int8) and is filled with all zeros.

answered May 3, 2024 at 23:15

paperskilltrees

3815 silver badges16 bronze badges

Collectives™ on Stack Overflow

Add column to dataframe with constant value

10 Answers 10

1 Comment

1 Comment

1 Comment

2 Comments

Comments

1 Comment

2 Comments

1 Comment

The `append` method has been deprecated since Pandas 1.4.0

Comments

Comments

Your Answer

Linked

Hot Network Questions

Collectives™ on Stack Overflow

10 Answers 10

1 Comment

1 Comment

1 Comment

2 Comments

Comments

1 Comment

2 Comments

1 Comment

The append method has been deprecated since Pandas 1.4.0

Comments

Comments

Your Answer

Sign up or log in

Post as a guest

Linked

Related

The `append` method has been deprecated since Pandas 1.4.0