Pandas
keyboard_arrow_down 655 guides
chevron_leftHandling Missing Values
Adding missing dates in Datetime IndexChecking if a certain value in a DataFrame is NaNChecking if a DataFrame contains any missing valuesConverting a column with missing values to integer typeCounting non-missing valuesCounting number of rows with missing valuesCounting the number of NaN in each row of a DataFrameCounting number of NaN values in each column of a DataFrameCounting the total number of NaN values of a DataFrameFilling missing values using another columnFilling missing values with the mean of the columnFinding columns with missing valuesGetting integer indexes of rows with NaNGetting rows with missing valuesGetting rows with missing values in certain columnsGetting index of rows with missing values (NaNs)Getting index of rows without missing valuesMapping NaN values to 0 and non-NaN values to 1Mapping NaN values to False and non-NaN values to TrueRemoving columns where some rows contain missing valuesRemoving rows from a DataFrame with missing valuesReplacing all NaN values of a DataFrameReplacing all NaN values with zeros in a DataFrameReplacing missing valuesReplacing missing values with constantsReplacing NaN with blank stringReplacing NaNs for certain columnsReplacing NaNs with preceding valuesReplacing values with NaNsUsing interpolation to fill missing values
check_circle
Mark as learned thumb_up
0
thumb_down
1
chat_bubble_outline
0
Comment auto_stories Bi-column layout
settings
Filling missing values (NaNs) with the mean of the column in Pandas DataFrame
schedule Aug 11, 2023
Last updated local_offer
Tags Python●Pandas
tocTable of Contents
expand_more Master the mathematics behind data science with 100+ top-tier guides
Start your free 7-days trial now!
Start your free 7-days trial now!
Example
Consider the following DataFrame:
df = pd.DataFrame({"A":[np.nan,3,np.nan],"B":[4,np.nan,5],"C":[np.nan,7,8]}, index=["a","b","c"])df
A B Ca NaN 4.0 NaNb 3.0 NaN 7.0c NaN 5.0 8.0
Solution
To fill the missing values with the mean of the column:
df.fillna(df.mean())
A B Ca 3.0 4.0 7.5b 3.0 4.5 7.0c 3.0 5.0 8.0
Here, a new DataFrame is returned, and the original df
is kept intact.
Explanation
Here, df.mean()
returns a Series
that holds the mean of each column:
df.mean()
A 3.0B 4.5C 7.5dtype: float64
Conveniently, this Series
provides the mapping as to which value should be used as the filler for each column. We then directly use fillna(~)
to perform the filling.
Performing the fill in-place
The fillna(~)
method allows for the filling to be performed in-place. Note that in-place means that the original DataFrame is directly modified, and no new DataFrame is returned.
Set inplace=True
like so:
df.fillna(df.mean(), inplace=True)df
A B Ca 3.0 4.0 7.5b 3.0 4.5 7.0c 3.0 5.0 8.0
Published by Isshin Inada
Edited by 0 others
Did you find this page useful?
thumb_up
thumb_down
Comment
Citation
Ask a question or leave a feedback...
thumb_up
0
thumb_down
1
chat_bubble_outline
0
settings
Enjoy our search
Hit / to insta-search docs and recipes!