752

I have constructed a condition that extracts exactly one row from my dataframe:

d2 = df[(df['l_ext']==l_ext) & (df['item']==item) & (df['wn']==wn) & (df['wd']==1)]

Now I would like to take a value from a particular column:

val = d2['col_name']

But as a result, I get a dataframe that contains one row and one column (i.e., one cell). It is not what I need. I need one value (one float number). How can I do it in pandas?

2
  • 2
    If you tried some of these answers but ended up with a SettingWithCopyWarning, you can take a look at this post for an explanation of the warning and possible workarounds/solutions. Commented Jan 22, 2019 at 23:52
  • 9
    df['col'].iloc[0] is faster than df.iloc[0]['col'] Commented Mar 12, 2022 at 11:12

19 Answers

837

If you have a DataFrame with only one row, then access the first (only) row as a Series using iloc, and then the value using the column name:

In [3]: sub_df
Out[3]:
          A         B
2 -0.133653 -0.030854

In [4]: sub_df.iloc[0]
Out[4]:
A   -0.133653
B   -0.030854
Name: 2, dtype: float64

In [5]: sub_df.iloc[0]['A']
Out[5]: -0.13365288513107493
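The same pattern as a self-contained sketch (data made up for illustration); selecting the column first avoids chained indexing:

```python
import pandas as pd

# One-row DataFrame produced by a filter
df = pd.DataFrame({'A': [0.1, 0.2], 'B': [0.3, 0.4]})
sub_df = df[df['A'] > 0.15]

# Row as a Series, then the value by column name
val = sub_df.iloc[0]['A']          # 0.2

# Equivalent, but avoids chained indexing
val2 = sub_df['A'].iloc[0]         # 0.2
```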

4 Comments

Note that this solution returns a Series, not a value!
@mLstudent33 It is iloc for the call to the row, and then the column name is given
Warning: iloc itself is not deprecated as of pandas 2.2.0, but chained indexing like iloc[0]['A'] can operate on a copy and trigger warnings; prefer sub_df['A'].iloc[0].
388

These are fast access methods for scalars:

In [15]: df = pandas.DataFrame(numpy.random.randn(5, 3), columns=list('ABC'))

In [16]: df
Out[16]:
          A         B         C
0 -0.074172 -0.090626  0.038272
1 -0.128545  0.762088 -0.714816
2  0.201498 -0.734963  0.558397
3  1.563307 -1.186415  0.848246
4  0.205171  0.962514  0.037709

In [17]: df.iat[0, 0]
Out[17]: -0.074171888537611502

In [18]: df.at[0, 'A']
Out[18]: -0.074171888537611502
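A minimal sketch of both accessors with a labeled index (example data assumed):

```python
import pandas as pd

df = pd.DataFrame({'A': [1.0, 2.0], 'B': [3.0, 4.0]}, index=['x', 'y'])

by_label = df.at['y', 'A']     # label-based scalar access -> 2.0
by_pos = df.iat[1, 0]          # position-based scalar access -> 2.0
```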

4 Comments

I like this answer a lot. But whereas you can do .iloc[-1]['A'] you cannot do at[-1,'A'] to get the last row entry
this should be the answer, because it doesn't copy a useless row in memory just to get a single element.
@hartmut You can always just do at[df.index[-1],'A']
I like this answer the best. You can also refer to named indexes, which makes your code more readable: df.at['my_row_name', 'my_column_name']
344

You can turn your 1x1 dataframe into a NumPy array, then access the first and only value of that array:

val = d2['col_name'].values[0]
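For newer pandas, .to_numpy() is the documented spelling of the same idea; a sketch with made-up data:

```python
import pandas as pd

d2 = pd.DataFrame({'col_name': [7.5]})        # 1x1 DataFrame
val = d2['col_name'].values[0]                # 7.5
val_modern = d2['col_name'].to_numpy()[0]     # same, via the newer accessor
```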

2 Comments

I think this is the best answer since it does not return a pandas.series, and it's the simplest.
As of now, this works within pandas as well; .values is itself a pandas method, so you aren't stepping outside pandas to use it.
56

It doesn't need to be complicated:

val = df.loc[df.wd==1, 'col_name'].values[0]

Comments

54

Most answers are using iloc which is good for selection by position.

If you need selection-by-label, loc would be more convenient.

For getting a value explicitly (equivalent to the deprecated df.get_value('a', 'A')):

# This is also equivalent to df1.at['a','A']
In [55]: df1.loc['a', 'A']
Out[55]: 0.13200317033032932
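A self-contained sketch of label-based scalar access (index labels assumed for illustration):

```python
import pandas as pd

df1 = pd.DataFrame({'A': [0.132, 0.5]}, index=['a', 'b'])

val = df1.loc['a', 'A']        # 0.132, selected by row and column label
same = df1.at['a', 'A']        # equivalent fast path for a single scalar
```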

Comments

36

I needed the value of one cell, selected by column and index names. This solution worked for me:

df.loc[1,:].values[0]

1 Comment

This creates a slice, which can be memory-consuming.
26

It looks like the behavior changed between pandas 0.10.1 and 0.13.1.

I upgraded from 0.10.1 to 0.13.1. Before, iloc was not available.

Now with 0.13.1, iloc[0]['label'] gets a single-value Series rather than a scalar.

Like this:

lastprice = stock.iloc[-1]['Close']

Output:

date
2014-02-26    118.2
Name: Close, dtype: float64

Comments

19

The quickest and easiest options I have found are the following. 501 represents the row index.

df.at[501, 'column_name']
df.get_value(501, 'column_name')

1 Comment

get_value is deprecated now (v0.21.0 RC1, October 13, 2017); reference here: .get_value and .set_value on Series, DataFrame, Panel, SparseSeries, and SparseDataFrame are deprecated in favor of the .iat[] or .at[] accessors (GH15269).
15

In later versions, you can fix it by simply doing:

val = float(d2['col_name'].iloc[0])
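A sketch of the same fix with made-up data; note that iloc[0] already returns a scalar here, so float() only normalizes the type:

```python
import pandas as pd

d2 = pd.DataFrame({'col_name': [3]})          # one-row frame, integer dtype
val = float(d2['col_name'].iloc[0])           # 3.0 as a plain Python float
```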

Comments

9
df_gdp.columns

Index([u'Country', u'Country Code', u'Indicator Name', u'Indicator Code', u'1960', u'1961', u'1962', u'1963', u'1964', u'1965', u'1966', u'1967', u'1968', u'1969', u'1970', u'1971', u'1972', u'1973', u'1974', u'1975', u'1976', u'1977', u'1978', u'1979', u'1980', u'1981', u'1982', u'1983', u'1984', u'1985', u'1986', u'1987', u'1988', u'1989', u'1990', u'1991', u'1992', u'1993', u'1994', u'1995', u'1996', u'1997', u'1998', u'1999', u'2000', u'2001', u'2002', u'2003', u'2004', u'2005', u'2006', u'2007', u'2008', u'2009', u'2010', u'2011', u'2012', u'2013', u'2014', u'2015', u'2016'], dtype='object')

df_gdp[df_gdp["Country Code"] == "USA"]["1996"].values[0]

8100000000000.0

Comments

8

I am not sure if this is a good practice, but I noticed I can also get just the value by casting the Series to float.

E.g.,

rate

3 0.042679

Name: Unemployment_rate, dtype: float64

float(rate)

0.0426789

1 Comment

Does that work with a multi-element series as well?
8

If a single row was filtered from a dataframe, one way to get a scalar value from a single cell is squeeze() (or item()):

df = pd.DataFrame({'A':range(5), 'B': range(5)})
d2 = df[df['A'].le(5) & df['B'].eq(3)]
val = d2['A'].squeeze()                 # 3

val = d2['A'].item()                    # 3

In fact, item() may be called on the index, so item + at combo could work.

msk = df['A'].le(5) & df['B'].eq(3)
val = df.at[df.index[msk].item(), 'B']  # 3

This at-based method is much faster than any other method listed here for getting a single cell value.

df = pd.DataFrame({'A':range(10000), 'B': range(10000)})
msk = df['A'].le(5) & df['B'].eq(3)

%timeit df.at[df.index[msk].item(), 'A']
# 31.4 µs ± 5.83 µs per loop (mean ± std. dev. of 7 runs, 10,000 loops each)
%timeit df.loc[msk, 'A'].squeeze()
# 143 µs ± 8.99 µs per loop (mean ± std. dev. of 7 runs, 10,000 loops each)
%timeit df.loc[msk, 'A'].item()
# 125 µs ± 1.56 µs per loop (mean ± std. dev. of 7 runs, 10,000 loops each)
%timeit df.loc[msk, 'A'].iat[0]
# 125 µs ± 1.96 µs per loop (mean ± std. dev. of 7 runs, 10,000 loops each)
%timeit df[msk]['A'].values[0]
# 189 µs ± 8.67 µs per loop (mean ± std. dev. of 7 runs, 10,000 loops each)

Comments

7

I've run across this when using dataframes with MultiIndexes and found squeeze useful.

From the documentation:

Squeeze 1 dimensional axis objects into scalars.

Series or DataFrames with a single element are squeezed to a scalar. DataFrames with a single column or a single row are squeezed to a Series. Otherwise the object is unchanged.

# Example for a dataframe with MultiIndex
> import pandas as pd

> df = pd.DataFrame(
                    [
                        [1, 2, 3],
                        [4, 5, 6],
                        [7, 8, 9]
                    ],
                    index=pd.MultiIndex.from_tuples( [('i', 1), ('ii', 2), ('iii', 3)] ),
                    columns=pd.MultiIndex.from_tuples( [('A', 'a'), ('B', 'b'), ('C', 'c')] )
)

> df
       A  B  C
       a  b  c
i   1  1  2  3
ii  2  4  5  6
iii 3  7  8  9

> df.loc['ii', 'B']
   b
2  5

> df.loc['ii', 'B'].squeeze()
5

Note that while df.at[] also works (if you aren't needing to use conditionals) you then still AFAIK need to specify all levels of the MultiIndex.

Example:

> df.at[('ii', 2), ('B', 'b')]
5

I have a dataframe with a six-level index and two-level columns, so only having to specify the outer level is quite helpful.

Comments

6

For pandas 0.10, where iloc is unavailable, filter a DF and get the first row data for the column VALUE:

df_filt = df[(df['C1'] == C1val) & (df['C2'] == C2val)]
result = df_filt.get_value(df_filt.index[0], 'VALUE')

If more than one row matches the filter, this obtains the first row's value. An exception is raised if the filter results in an empty dataframe.
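Since get_value was removed from modern pandas, here is a hedged modern equivalent of the same pattern (column names C1, C2, and VALUE are illustrative):

```python
import pandas as pd

df = pd.DataFrame({'C1': [1, 2], 'C2': [3, 4], 'VALUE': [10.0, 20.0]})

# Parenthesize each comparison: & binds tighter than ==
df_filt = df[(df['C1'] == 2) & (df['C2'] == 4)]
result = df_filt.at[df_filt.index[0], 'VALUE']   # 20.0
```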

2 Comments

get_value is deprecated now(v0.21.0 RC1 (October 13, 2017)) reference is here .get_value and .set_value on Series, DataFrame, Panel, SparseSeries, and SparseDataFrame are deprecated in favor of using .iat[] or .at[] accessors (GH15269)
But iat or at cannot get the value based on the column name.
5

Converting it to integer worked for me but if you need float it is also simple:

int(sub_df.iloc[0])

for float:

float(sub_df.iloc[0])

2 Comments

But the question says "I need one value (one float number).".
added float version too. Thx for pointing it out
2

Using .item() returns a scalar (not a Series), and it only works if there is a single element selected. It's much safer than .values[0] which will return the first element regardless of how many are selected.

>>> df = pd.DataFrame({'a': [1,2,2], 'b': [4,5,6]})
>>> df[df['a'] == 1]['a']  # Returns a Series
0    1
Name: a, dtype: int64
>>> df[df['a'] == 1]['a'].item()
1
>>> df2 = df[df['a'] == 2]
>>> df2['b']
1    5
2    6
Name: b, dtype: int64
>>> df2['b'].values[0]
5
>>> df2['b'].item()
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
  File "/usr/lib/python3/dist-packages/pandas/core/base.py", line 331, in item
    raise ValueError("can only convert an array of size 1 to a Python scalar")
ValueError: can only convert an array of size 1 to a Python scalar

1 Comment

This is the best answer as it does inform user about multiple matches (if any) - through raising an exception
1

Display the data from a certain cell in pandas dataframe

Using dataframe.iloc,

dataframe.iloc should be used when the given index is the default positional index created with the dataframe.

Avoid using dataframe.iloc with custom indices.

print(df['REVIEWLIST'].iloc[df.index[1]])

Using dataframe.loc,

Use dataframe.loc if you're using a custom index; it can also be used instead of iloc even when the dataframe contains default indices.

print(df['REVIEWLIST'].loc[df.index[1315]])

Comments

0

You can get the values like this:

df[(df['column1']==any_value) & (df['column2']==any_value) & (df['column']==any_value)]['column_with_values_to_get']

And you can add as many conditions like (df['columnx']==any_value) as you want
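A sketch with made-up column names; note that the filter above yields a Series, so a final .item() (or .values[0]) is still needed to reduce it to one value:

```python
import pandas as pd

df = pd.DataFrame({'column1': [1, 1], 'column2': [2, 3], 'vals': [10.0, 20.0]})

selected = df[(df['column1'] == 1) & (df['column2'] == 3)]['vals']
val = selected.item()          # 20.0 -- fails loudly if more than one row matched
```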

Comments

-3

To get the full row's values as JSON (instead of a Series):

row = df.iloc[0]

Use the to_json method like below:

row.to_json()

2 Comments

How is json involved in this question?
Re "Serie": Do you mean "Series"?
