Replacing value in some of data.frame columns using dplyr

Question

I would like to replace 0 with 1 in my data.frame, but only in factor columns that contain no values other than 0, 1 or NA. I also want to avoid specifying columns by name, since my real data set is fairly large and that would be cumbersome. So I thought I could use dplyr::mutate_if and tried something like:

df %>% mutate_if(~(is.factor(.) & (unique(.) %in% c(0, 1, NA))), ~replace(., . == 0, 1))

but ended up with the following error:

Error in selected[[i]] <- .p(.tbl[[vars[[i]]]], ...) : more elements supplied than there are to replace

What is wrong with this formula? How can I use dplyr to replace 0 with 1? My example dataset looks like this:

df <- structure(list(
    a1 = structure(c(1L, NA, NA, 2L, NA, 1L, NA), .Label = c("0", "1"), class = "factor"),
    a2 = structure(c(NA, NA, NA, 1L, NA, NA, NA), .Label = "1", class = "factor"),
    a3 = structure(c(NA, 1L, 2L, 3L, NA, 4L, 2L), .Label = c("0", "1", "2", "6"), class = "factor"),
    a4 = structure(c(1L, 1L, NA, NA, NA, NA, 1L), .Label = "0", class = "factor"),
    a5 = c(0L, 1L, 1L, NA, 1L, 0L, NA)),
    .Names = c("a1", "a2", "a3", "a4", "a5"),
    class = c("tbl_df", "tbl", "data.frame"), row.names = c(NA, -7L))

All the columns in your example are numeric, not factors.

– talat, Commented Jun 21, 2018 at 12:39
Example edited to match the case.

– jakes, Commented Jun 21, 2018 at 12:55

Andre Elrico · Accepted Answer · 2018-06-21 13:05:24Z

1

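The predicate passed to mutate_if has to return a single TRUE or FALSE per column. In the question it returns one logical per unique value, which is what triggers the error. Wrapping the %in% test in all() fixes that, so it can be solved like this: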

df %>%
    mutate_if(~(is.factor(.) && all(unique(.) %in% c(0, 1, NA))), ~plyr::revalue(., c("0"="1")))

# # A tibble: 7 x 5
#   a1    a2    a3    a4       a5
#   <fct> <fct> <fct> <fct> <int>
# 1 1     <NA>  <NA>  1         0
# 2 <NA>  <NA>  0     1         1
# 3 <NA>  <NA>  1     <NA>      1
# 4 1     1     2     <NA>     NA
# 5 <NA>  <NA>  <NA>  <NA>      1
# 6 1     <NA>  6     <NA>      0
# 7 <NA>  <NA>  1     1        NA
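
To see why the original predicate failed, it may help to run it on a single column outside of dplyr. The following is a minimal base R sketch, not part of the original answer. The data frame is a stand-in rebuilt with factor() to mirror the question's example, and `is_binary_factor` is a hypothetical helper name.

```r
# Stand-in for the question's data (same columns, rebuilt with factor()).
df <- data.frame(
  a1 = factor(c("0", NA, NA, "1", NA, "0", NA)),
  a2 = factor(c(NA, NA, NA, "1", NA, NA, NA)),
  a3 = factor(c(NA, "0", "1", "2", NA, "6", "1")),
  a4 = factor(c("0", "0", NA, NA, NA, NA, "0")),
  a5 = c(0L, 1L, 1L, NA, 1L, 0L, NA)
)

# The question's test gives one logical per unique value (0, NA, 1),
# so its length is 3 rather than the single value mutate_if expects.
n_results <- length(unique(df$a1) %in% c(0, 1, NA))

# Hypothetical helper: all() collapses the test to one TRUE/FALSE,
# and && skips the second check for non-factor columns.
is_binary_factor <- function(x) {
  is.factor(x) && all(unique(x) %in% c(0, 1, NA))
}

# One logical per column: which columns would be modified.
keep <- vapply(df, is_binary_factor, logical(1))
print(keep)
```

Here a3 is excluded because it also contains 2 and 6, and a5 is excluded because it is an integer column rather than a factor.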


Maurits Evers · Accepted Answer · 2018-06-21 13:04:01Z

0

How about this?

df %>%
    mutate_if(is.factor, funs(ifelse(as.character(.) == "0", "1", as.character(.)))) %>%
    mutate_if(is.character, as.factor)
## A tibble: 7 x 5
#  a1    a2    a3    a4       a5
#  <fct> <fct> <fct> <fct> <int>
#1 1     NA    NA    1         0
#2 NA    NA    1     1         1
#3 NA    NA    1     NA        1
#4 1     1     2     NA       NA
#5 NA    NA    NA    NA        1
#6 1     NA    6     NA        0
#7 NA    NA    1     1        NA


3 Comments

jakes Over a year ago

Not entirely applicable, as I also have character variables in my original dataset.

Andre Elrico Over a year ago

Not correct. The OP wants the rule applied only to columns consisting solely of 0, 1 or NA.

Maurits Evers Over a year ago

@jakes OK, I see. I wasn’t entirely clear on which columns the rule needed to be applied to. I understood it to mean all factor columns. Should’ve read more carefully...
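
For readers on a newer dplyr, the two concerns raised in these comments (leave character columns alone, and touch only factors made up of 0, 1 or NA) can be sketched as below. This is not from either answer. It assumes dplyr 1.0.0 or later for across() and where(), the `id` column is an invented character column added only to show that it is left untouched, and `is_binary_factor` and `zero_to_one` are hypothetical helper names.

```r
library(dplyr)

# Stand-in data: the question's columns plus an invented character column.
df <- data.frame(
  id = letters[1:7],
  a1 = factor(c("0", NA, NA, "1", NA, "0", NA)),
  a2 = factor(c(NA, NA, NA, "1", NA, NA, NA)),
  a3 = factor(c(NA, "0", "1", "2", NA, "6", "1")),
  a4 = factor(c("0", "0", NA, NA, NA, NA, "0")),
  a5 = c(0L, 1L, 1L, NA, 1L, 0L, NA),
  stringsAsFactors = FALSE
)

# Hypothetical predicate: a single TRUE/FALSE per column.
is_binary_factor <- function(x) {
  is.factor(x) && all(unique(x) %in% c(0, 1, NA))
}

# Hypothetical recoder: rename level "0" to "1"; assigning a duplicate
# level name merges the two levels, and NA entries stay NA.
zero_to_one <- function(f) {
  levels(f)[levels(f) == "0"] <- "1"
  f
}

# Only the qualifying factor columns (a1, a2, a4) are passed to the recoder.
out <- df %>%
  mutate(across(where(is_binary_factor), zero_to_one))
```

Because the result stays a factor throughout, there is no round trip through character, so existing character columns such as `id` are never converted.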
