Suppose I have a DataFrame with 100k rows and a column DataFrame . I would like to break this name into first and last name as efficiently as possible. My current method:
def splitName(name): return pandas.Series(name.split()[0:2]) df[['first', 'last']] = df.apply(lambda x: splitName(x['name']), axis=1)
Unfortunately, DataFrame.apply really, really slow. Is there anything I can do to make this line operation almost as fast as the numpy operation?
Thanks!
python pandas
duckworthd
source share