Here is one possible solution for a graph of downsampling along the x axis, if it is converted to a log. He logs the x-axis, rounds this amount and selects the average x value in this bin:
downsampled_qplot <- function(x,y,data,rounding=0, ...) {
Using the definition of ccdf() above, we can compare the original plot of the CCDF distribution with the downsampling version:
myccdf=ccdf(rlnorm(10000,3,2.4)) qplot(x,count,data=myccdf,log='xy',main='original')

downsampled_qplot(x,count,data=myccdf,log='xy',rounding=1,main='rounding = 1')

downsampled_qplot(x,count,data=myccdf,log='xy',rounding=0,main='rounding = 0')

In PDF format, the original graph occupies 640K, and versions with a reduced selection occupy 20K and 8K, respectively.
eytan
source share