Replace only specific values in df column based on specific value in another column

Question

I have the following datframe:

>>> name   ID     geom                                                geometry_error
0  Lily   1234  POLYGON ((5.351418786 7.471461148, 5.352018786...     overlap
1  Pil    3248  POLYGON ((7.351657486 9.341445548, 1.346718786...     overlap
2  Poli   9734  -                                                     -
0  Lily   1234  POLYGON ((5.351265486 2.471876538, 6.33355018786...   overlap

I want to "edit" the geometry_erro column, with a condition that if geom value is '-' , the geometry error value will be "no geometry", e.g:

>>> name   ID     geom                                                geometry_error
0  Lily   1234  POLYGON ((5.351418786 7.471461148, 5.352018786...     overlap
1  Pil    3248  POLYGON ((7.351657486 9.341445548, 1.346718786...     overlap
2  Poli   9734  -                                                     no geometry
0  Lily   1234  POLYGON ((5.351265486 2.471876538, 6.33355018786...   overlap

I have tried to do it with this:

def gg(row):
    if row['geom'] == '-':
        val = 'no geometry generated'   
    return val

df['geometry errors'] = df.apply(gg, axis=1)

>>>UnboundLocalError: local variable 'val' referenced before assignment

I don't understand why I get this error because I have used this varuabke name val in different function in the same script so why now do I get this? and is there maybe better way to do it?

your val is never initialized. your if case is never satisfied for val to get initialized — Yash
– Yash, Commented Sep 24, 2020 at 12:48
your code never goes inside the if case. so val is not initiated at all. add a default val= — Yash
– Yash, Commented Sep 24, 2020 at 12:53

s3dev · Accepted Answer · 2020-09-24 13:09:36Z

1

Use this, nice and simple. np.where is doing the test for you.

Code:

import numpy as np

# ...

df['geometry_error'] = np.where(df['geom'] == '-', 
                                'no geometry generated', 
                                df['geometry_error'])

Output:

   name    ID                                               geom  \
0  Lily  1234   POLYGON ((5.351418786 7.471461148, 5.352018786))   
1   Pil  3248   POLYGON ((7.351657486 9.341445548, 1.346718786))   
2  Poli  9734                                                  -   
3  Lily  1234  POLYGON ((5.351265486 2.471876538, 6.333550187...   

          geometry_error  
0                overlap  
1                overlap  
2  no geometry generated  
3                overlap

answered Sep 24, 2020 at 13:09

s3dev

9,8713 gold badges34 silver badges49 bronze badges

Sign up to request clarification or add additional context in comments.

Comments

adir abargil · Accepted Answer · 2020-09-24 12:51:57Z

0

df[df['geom'] == '-']['geometry_error'] = 'no geometry generated'

answered Sep 24, 2020 at 12:51

adir abargil

5,7453 gold badges23 silver badges29 bronze badges

1 Comment

mooga Over a year ago

can you specify what are you doing in that statement and how it is an answer

mullinscr · Accepted Answer · 2020-09-24 12:58:02Z

0

A couple of approaches:

Replaces all Null cases of geometery_error with 'no geometry'

df['geometry_error'] = df['geometry_error'].fillna('no geometry')

Find all rows where geom == '-' and set their geometry_error to 'no geometry'

df.loc[df['geom'] == '-', 'geometry_error'] = 'no geometry'

I think your function isn't working because you need to change the indent on the return statement:

def gg(row):
    if row['geom'] == '-':
        val = 'no geometry generated'   
        return val

answered Sep 24, 2020 at 12:58

mullinscr

1,7681 gold badge8 silver badges14 bronze badges

1 Comment

Reut Over a year ago

I don't know why it doesn't work, Imaybe is because something I though to be minoric- the 'geometry errors' column is not null, it has '-', I have edit my original post but still don't know why it doesn't work

Collectives™ on Stack Overflow

Replace only specific values in df column based on specific value in another column

3 Answers 3

Comments

1 Comment

1 Comment

Your Answer

Hot Network Questions

Collectives™ on Stack Overflow

3 Answers 3

Comments

1 Comment

1 Comment

Your Answer

Sign up or log in

Post as a guest

Related