Pandas DataFrame.add () - ignore missing columns

Question

Pandas DataFrame.add () - ignore missing columns

I have the following two DataFrames:

>>> history
              above below
asn   country
12345 US          5     4
      MX          6     3
54321 MX          4     5
>>> current
              above below
asn   country
12345 MX          1     0
54321 MX          0     1
      US          1     0

I save the counter "above" and "below" in the historyDataFrame like this:

>>> history = history.add(current, fill_value=0)
>>> history
               above  below
asn   country              
12345 MX         7.0    3.0
      US         5.0    4.0
54321 MX         4.0    6.0
      US         1.0    0.0

This works as long as currentthere are no extra columns in the DataFrame. However, when I add an extra column:

>>> current
              above below cruft
asn   country
12345 MX          1     0   999
54321 MX          0     1   999
      US          1     0   999

I get the following:

>>> history = history.add(current, fill_value=0)
>>> history
               above  below cruft
asn   country              
12345 MX         7.0    3.0 999.0
      US         5.0    4.0   NaN
54321 MX         4.0    6.0 999.0
      US         1.0    0.0 999.0

I want this extra column to be ignored, as it is not present in both DataFrames. Desired Result:

>>> history
               above  below
asn   country              
12345 MX         7.0    3.0
      US         5.0    4.0
54321 MX         4.0    6.0
      US         1.0    0.0

+6

python pandas dataframe

stevendesu Feb 28 '18 at 10:06

source share

3 answers

Ummm, a new way

pd.concat([df1,df2],join ='inner',axis=0).sum(level=[0,1])

+7

Wen Feb 28 '18 at 22:20

source share

, , :

cols_to_return = ["above", "below"]
history = history[cols_to_return].add(current[cols_to_return], fill_value=0)

, , .

+4

Yilun Zhang 28 . '18 22:11

Maxu · Accepted Answer · 2018-02-28T22:13:36+0000

In [27]: history.add(current, fill_value=0)[history.columns]
Out[27]:
               above  below
asn   country
12345 MX         7.0    3.0
      US         5.0    4.0
54321 MX         4.0    6.0
      US         1.0    0.0

Pandas DataFrame.add () - ignore missing columns

More articles: