A function that takes in two arguments: a column in the first DataFrame and a column in the second DataFrame that are to be combined, both as type Series. The return value of this function must be a Series that represent the resulting column.

3. fill_value | scalar | optional

The value that will fill the instances of missing values (NaN). The filling happens before the merging process. By default, fill_value=None.

4. overwritelink | boolean | optional

The meaning of the boolean values is as follows:

Value	Description
`True`	If a column in one DataFrame does not exist in the other DataFrame, then the merged column will have its entries filled with `NaN`.
`False`	If a column in the source DataFrame does not exist in the other DataFrame, then the column will appear in the merged DataFrame with its entries kept intact. However, the reverse is not true; if the other DataFrame has columns that do no exist in the source DataFrame, then those columns will also appear in the final DataFrame, but with its entries filled with `NaN`.

Value

Description

True

If a column in one DataFrame does not exist in the other DataFrame, then the merged column will have its entries filled with NaN.

False

If a column in the source DataFrame does not exist in the other DataFrame, then the column will appear in the merged DataFrame with its entries kept intact.

However, the reverse is not true; if the other DataFrame has columns that do no exist in the source DataFrame, then those columns will also appear in the final DataFrame, but with its entries filled with NaN.

By default, overwrite=True. See examples below for clarification.

Return Value

A DataFrame with the columns combined as per the parameters.

Examples

Basic usage

Consider the following DataFrames:


        
        
            
                
                
                    df = pd.DataFrame({"A":[3,4], "B":[5,6]})
df_other = pd.DataFrame({"A":[1,8], "B":[2,9]})
                
            
               A  B   |      A  B
0  3  5   |   0  1  2
1  4  6   |   1  8  9

To combine the columns of the two DataFrames to leave only the higher values:


        
        
            
                
                
                    df.combine(df_other, np.maximum)
                
            
               A  B
0  3  5
1  8  9

Custom function

We can also pass in a custom function for func:


        
        
            
                
                
                    def foo(col, col_other):   # a pair of Series
    return col + col_other

df.combine(df_other, foo)
                
            
               A   B
0  4   7
1  12  15

Note the following:

foo simply computes and returns the sum of a pair of matching columns in the two DataFrames.
foo is called twice here since there are two matching pairs of column labels.

Specifying overwrite

Consider the following DataFrames that have mismatches in the column labels:


        
        
            
                
                
                    df = pd.DataFrame({"A":[3,4], "B":[5,6]})
df_other = pd.DataFrame({"A":[1,8], "C":[2,9]})
                
            
               A  B   |      A  C
0  3  5   |   0  1  2
1  4  6   |   1  8  9

By default, overwrite=True, which means that columns that do not exist in the other DataFrame will be filled with NaN and vice versa:


        
        
            
                
                
                    df.combine(df_other, np.maximum)
                
            
               A  B    C
0  3  NaN  NaN
1  8  NaN  NaN

Here, columns B and C are NaN because df did not have column C, while df_other did not have column B.

We can keep the columns of the source DataFrame intact by setting overwrite=False:


        
        
            
                
                
                    df.combine(df_other, np.maximum, overwrite=False)
                
            
               A  B  C
0  3  5  NaN
1  8  6  NaN

Here, notice how column C, which is a column present only in df_other, still have its entries filled with NaN.

Published by Isshin Inada

Edited by 0 others

Did you find this page useful?

thumb_up

thumb_down

Comment

Citation

Ask a question or leave a feedback...

Official Pandas Documentation

https://pandas.pydata.org/pandas-docs/dev/reference/api/pandas.DataFrame.combine.html

thumb_up

thumb_down

chat_bubble_outline

settings

Enjoy our search

Hit / to insta-search docs and recipes!