Ignore / delete NA values in read.csv

Question

Ignore / delete NA values in read.csv

I have a csv file as shown below, which I read in R using read.csv, where column C has 12/30 null values. I want to work out the maximum of each column, but the function "max" R returns "NA" when used in column C. How do I get R to ignore empty values / NA, I do not see "rm.na" in read.csv?

data<-data.frame(read.csv("test.csv")) data ABC 1 5 6 15 2 3 8 3 3 7 5 4 5 3 8 4 1 4 5 3 4 2 2 10 4 3 8 6 5 2 1 4 4 10 8 4 0 6 0 7 3 8 5 3 3 13 12 13 6 0 0 0 0 2 5 2 NA 7 3 NA 1 8 NA 11 1 NA 1 4 NA 0 7 NA 4 5 NA 3 10 NA 2 0 NA 6 4 NA 0 19 NA 1 5 NA > max(C) [1] NA

+7

r read.csv na

moadeep Apr 04 '13 at 10:12

source share

4 answers

  data<-na.omit(data)

then

  max(data)

If you do not want to change the data frame, then

  max(na.omit(data))

+12

Anurag priyadarshi Nov 12 '13 at 9:06

source share

I suggest removing NA after reading as others suggested. If, however, you insist on reading only lines other than NA, you can use the bash tool linux to delete them and create a new file:

 grep -Ev file_with_NA.csv NA > file_without_NA.csv

If you run linux or mac, you already have this tool. On Windows, you must install MinGW or Cygwin to get the tools.

+1

Paul hiemstra Apr 04 '13 at 10:50

source share

You should be able to use

 max(x,na.rm=TRUE)

-one

Jess2332 Dec 21 '15 at 14:23

source share

Aditya sihag · Accepted Answer · 2013-04-04T10:48:08+0000

you have two options that I can think of

  apply(data,2,max,na.rm=TRUE); # this will remove the NA from columns that contain them

OR

 apply(na.omit(data),2,max); ## this will remove the NA rows from the data frame and then calculate the max values

Ignore / delete NA values ​​in read.csv

More articles:

Ignore / delete NA values in read.csv