I have a large dataset with thread durations on a discussion board. I want the histogram to show the distribution of life expectancy, so I did this:
dall <- read.csv("lifespan.csv") colnames(dall) <- c("thread.id", "seconds.alive", "start.time") hist(dall$seconds.alive)
which generated this hard to read image:
My questions are: a) changing the y axis to a logarithmic scale - a good way to make it more readable? Apparently, some people think this is a bad idea to change the y axis to a log.
b) how to do it?
Instead, I will try to use hist(log10(dall$seconds.alive)) .
hist(log10(dall$seconds.alive))
Also try specifying breaks=100 or less / more:
breaks=100
hist(log10(dall$seconds.alive), breaks=100)