Is there any Python (possibly pandas ) equivalent to R
install.packages("caTools") library(caTools) set.seed(88) split = sample.split(df$col, SplitRatio = 0.75)
which will generate the exact same split value?
My current context for this is, for example, getting Pandas data that exactly matches the R ( qualityTrain , qualityTest ) data frames created by:
# https://courses.edx.org/c4x/MITx/15.071x/asset/quality.csv quality = read.csv("quality.csv") set.seed(88) split = sample.split(quality$PoorCare, SplitRatio = 0.75) qualityTrain = subset(quality, split == TRUE) qualityTest = subset(quality, split == FALSE)
orome source share