Drop rows containing missing values

drop_na(data, ...)

Arguments

data

A data frame.

...

A selection of columns. If empty, all variables are selected. You can supply bare variable names, select all variables between x and z with x:z, exclude y with -y. For more options, see the dplyr::select() documentation. See also the section on selection rules below.

Rules for selection

Arguments for selecting columns are passed to tidyselect::vars_select() and are treated specially. Unlike other verbs, selecting functions make a strict distinction between data expressions and context expressions.

  • A data expression is either a bare name like x or an expression like x:y or c(x, y). In a data expression, you can only refer to columns from the data frame.

  • Everything else is a context expression in which you can only refer to objects that you have defined with <-.

For instance, col1:col3 is a data expression that refers to data columns, while seq(start, end) is a context expression that refers to objects from the contexts.

If you really need to refer to contextual objects from a data expression, you can unquote them with the tidy eval operator !!. This operator evaluates its argument in the context and inlines the result in the surrounding function call. For instance, c(x, !! x) selects the x column within the data frame and the column referred to by the object x defined in the context (which can contain either a column name as string or a column position).

Examples

library(dplyr) df <- tibble(x = c(1, 2, NA), y = c("a", NA, "b")) df %>% drop_na()
#> # A tibble: 1 x 2 #> x y #> <dbl> <chr> #> 1 1 a
df %>% drop_na(x)
#> # A tibble: 2 x 2 #> x y #> <dbl> <chr> #> 1 1 a #> 2 2 <NA>