Python: Appending Value to a dataframe

Question

I am trying to declare an empty DataFrame and then assign some values to it:

df = pd.DataFrame()
df["a"] = 1234
df["b"] = b # Already defined earlier
df["c"] = c # Already defined earlier
df["t"] = df["b"]/df["c"]

I am getting the below output:

Empty DataFrame
Columns: [a, b, c, t]
Index: []

Can anyone explain why I am getting this empty DataFrame even though I am assigning values? Sorry if my question is rather basic.

Does this answer your question? Assigning a scalar value to an empty DataFrame doesn't appear to do anything
– deadshot, Commented Aug 5, 2020 at 9:21

Daisuke Akagawa · Answer · 2020-08-05 09:17:11Z

3

I think you have to initialize the DataFrame like this:

df = pd.DataFrame(data=[[1234, b, c, b/c]], columns=list("abct"))

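When you create a DataFrame with no initial data, it has no columns and, more importantly, no rows. Assigning a scalar to a new column only fills the rows that already exist, so with an empty index there is nothing to fill and the DataFrame stays empty.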
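
A minimal sketch of that behaviour, assuming pandas is imported as pd. The values 2 and 3 for b and c are placeholders, since the question does not show them. A scalar is broadcast over the existing index, so the first frame stays empty, while a frame created with a one-element index (one possible workaround, not the only one) gets a single filled row.

```python
import pandas as pd

b, c = 2, 3  # placeholder values; the question does not show b and c

# Scalar assignment broadcasts over the existing index, which is empty here.
empty = pd.DataFrame()
empty["a"] = 1234
print(len(empty), list(empty.columns))  # zero rows, one column

# With a one-element index, the same scalar assignments each fill one row.
df = pd.DataFrame(index=[0])
df["a"] = 1234
df["b"] = b
df["c"] = c
df["t"] = df["b"] / df["c"]
print(df)
```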

2 Comments

sushanth Over a year ago

This will work if b & c are numeric values, what if those are iterables ?

Daisuke Akagawa Over a year ago

With this method you probably can't use iterables, because it specifies the values of one whole row. And in general, plain Python iterables such as lists can't be divided by one another.

Just Honza · Answer · 2020-08-05 09:23:24Z

3

Simply add those values as a list, e.g.:

df["a"] = [123]
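
Applied to all four columns from the question, this could look like the sketch below. The values of b and c are placeholders, because the question does not show them. The first one-element list creates row 0, and the later columns are filled against that row.

```python
import pandas as pd

b, c = 2, 3  # placeholder values; the question does not show b and c

df = pd.DataFrame()
df["a"] = [1234]  # a one-element list creates row 0
df["b"] = [b]
df["c"] = [c]
df["t"] = df["b"] / df["c"]  # column-wise division now has a row to work on
print(df)
```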

squeezer44 · Answer · 2020-08-05 09:44:32Z

1

You have started by initialising an empty DataFrame:

# Initialising an empty dataframe
df = pd.DataFrame()

# Print the DataFrame
print(df)

Result
Empty DataFrame
Columns: []
Index: []

Next, you created a column inside the empty DataFrame:

df["a"] = 1234
print(df)

Result
Empty DataFrame
Columns: [a]
Index: []

But because the DataFrame has no rows, the scalar had nothing to fill, so column "a" still holds no values. You can add values, for example, with a dictionary (key "a", value the list [1, 2, 3, 4]):

df = pd.DataFrame({"a":[1, 2, 3, 4]})
print(df)

Result
   a
0  1
1  2
2  3
3  4

When a list of values is added, each value gets its own index entry.
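
The same dictionary approach also covers the case where b and c are iterables. The sketch below uses made-up lists for b and c, which are assumptions and not values from the question. The scalar for "a" is repeated for every row, and the division is done element by element.

```python
import pandas as pd

# Hypothetical list-valued b and c; the question does not show the real ones.
b = [2, 4, 6, 8]
c = [1, 2, 3, 4]

# The scalar for "a" is repeated once per row defined by the lists.
df = pd.DataFrame({"a": 1234, "b": b, "c": c})
df["t"] = df["b"] / df["c"]  # element-wise division, one result per row
print(df)
```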

nick · Answer · 2020-08-05 10:28:45Z

0

The problem is that a cell in a table needs both a row index value and a column index value to insert the cell value. So you need to decide if "a", "b", "c" and "t" are columns or row indexes.

If they are column indexes, then you'd need a row index (0 in the example below) along with what you have written above:

df = pd.DataFrame()
df.loc[0, "a"] = 1234
df.loc[0, "b"] = 2 
df.loc[0, "c"] = 3

Result:

In : df
Out:
        a    b    c
0  1234.0  2.0  3.0

Now that you have data in the dataframe you can perform column operations (i.e., create a new column "t" and for each row assign the value of the corresponding item under "b" divided by the corresponding items under "c"):

df["t"] = df["b"]/df["c"]

Of course, you can also use different indexes for each item as follows:

df = pd.DataFrame()
df.loc[0, "a"] = 1234
df.loc[1, "b"] = 2 
df.loc[2, "c"] = 3

Result:

In : df
Out:
        a    b    c
0  1234.0  NaN  NaN
1     NaN  2.0  NaN
2     NaN  NaN  3.0

But as you can see, the cells for which you have not specified a (row, column, value) tuple are now NaN. This means that df["b"]/df["c"] returns only NaN values, because every row has NaN in at least one of the two columns and any arithmetic operation involving NaN yields NaN.

In : df["b"]/df["c"]
Out:
0   NaN
1   NaN
2   NaN
dtype: float64
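
A small sketch of how to check for this situation before dividing. It rebuilds the "b" and "c" columns from the example directly rather than through .loc, which is a shortcut for illustration only.

```python
import numpy as np
import pandas as pd

# Same layout as above: b and c never share a row.
df = pd.DataFrame({
    "b": [np.nan, 2.0, np.nan],
    "c": [np.nan, np.nan, 3.0],
})

ratio = df["b"] / df["c"]   # row-by-row division, aligned on the index
print(ratio.isna().all())   # every row is missing at least one operand

# Rows where both b and c are present; none exist in this layout.
complete = df.dropna(subset=["b", "c"])
print(len(complete))
```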

The converse case is inserting all the items under one column. You now need a column header for this (0 in the example below):

df = pd.DataFrame()
df.loc["a", 0] = 1234
df.loc["b", 0] = 2 
df.loc["c", 0] = 3

Result:

In : df
Out:
        0
a  1234.0
b     2.0
c     3.0

When inserting the value for "t", you need to specify exactly which cells you are referring to, because df["b"] looks up a column label, not a row label; rows are addressed through .loc:

df.loc["t", 0] = df.loc["b", 0]/df.loc["c", 0]
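
As an alternative sketch, whole rows can be selected with .loc and combined in one step. Here the frame is built directly with the index labels from the example, which is an illustrative shortcut. Each row selection is a Series indexed by column label, so the division covers every column at once.

```python
import pandas as pd

# Same layout as above: items as row labels, a single column named 0.
df = pd.DataFrame({0: [1234.0, 2.0, 3.0]}, index=["a", "b", "c"])

# df.loc["b"] and df.loc["c"] are Series indexed by column label,
# so their quotient has one value per column and becomes the new row "t".
df.loc["t"] = df.loc["b"] / df.loc["c"]
print(df)
```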

edited Aug 5, 2020 at 10:28

answered Aug 5, 2020 at 10:23

2 Comments

user2859263 Over a year ago

Thanks a lot for the very detailed explanation. This helped and now I can print out the entire row as desired :)

nick Over a year ago

Absolutely :). If you like the answer, I'd appreciate it if you accept it.
