Changing the type of a DataFrame's column in Pandas

Last updated: May 9, 2022
Tags: Python, Pandas

To change the data type of a DataFrame's column in Pandas, use the Series' astype(~) method.

Changing type to float

Consider the following DataFrame:

import pandas as pd

df = pd.DataFrame({"A":[3,4], "B":[5,6]})
df
   A  B
0  3  5
1  4  6

To change the data type of column A to float:

df["A"] = df["A"].astype("float")

To confirm that column A is now of type float, check the dtypes property:

df.dtypes
A    float64
B      int64
dtype: object
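As a minimal, self-contained sketch of the steps above, the following records the column dtypes before and after the conversion. The names `before`, `after` and `df_alt` are illustrative and not part of the original example. It also shows `DataFrame.astype(~)` with a dict, which should give the same per-column result without reassigning the column.

```python
import pandas as pd

df = pd.DataFrame({"A": [3, 4], "B": [5, 6]})

# Record dtype names before and after converting column A (illustrative names)
before = df.dtypes.astype(str).to_dict()
df["A"] = df["A"].astype("float")
after = df.dtypes.astype(str).to_dict()

# Alternative sketch: convert selected columns via a {column: dtype} mapping
df_alt = pd.DataFrame({"A": [3, 4], "B": [5, 6]}).astype({"A": "float"})

print(before)
print(after)
print(df_alt.dtypes)
```

Column B is left untouched in both variants, since only column A is named in the conversion.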

Changing type to string

Consider the following DataFrame:

df = pd.DataFrame({"A":[3,4]})
df
   A
0  3
1  4

To convert column A to type string:

df["A"] = df["A"].astype("string")
df.dtypes
A    string
dtype: object

Dedicated string type

Prior to version 1.0.0, Pandas did not have a data type dedicated to strings. For instance, suppose we created a DataFrame with a column containing just strings:

df = pd.DataFrame({"A":["a","b"]})
df.dtypes
A    object
dtype: object

You might expect the column type to be string, but you get object instead. For backward compatibility, string-based columns still default to object even after version 1.0.0.

Pandas officially recommends the string type over the object type for better:

  • readability - text should inherently be string and not an object.

  • performance - the Pandas documentation anticipates future speed and memory improvements for the string type.

  • compatibility with methods like select_dtypes(~), where you can specifically pick string-typed columns.
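To illustrate the last point, here is a small sketch of selecting only string-typed columns with `select_dtypes(~)`. The DataFrame and its column names (`name`, `mixed`, `n`) are made up for this example. The `mixed` column holds both an integer and a string, so it stays as object and is not selected.

```python
import pandas as pd

# Hypothetical data: one text column, one mixed-object column, one integer column
df = pd.DataFrame({
    "name": ["a", "b"],
    "mixed": [1, "x"],
    "n": [3, 4],
})

# Opt in to the dedicated string type for the text column
df["name"] = df["name"].astype("string")

# Pick only the columns that use the string type
string_cols = df.select_dtypes(include="string").columns.tolist()
print(string_cols)  # ['name']
```

With object-typed text there is no equally direct way to separate text columns from other object columns, which is why this is listed as a compatibility benefit.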

Note that passing str as the argument converts the column to type object instead of string:

df["A"] = df["A"].astype(str) # not recommended - use "string" instead
df.dtypes
A    object
dtype: object
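The difference can be seen side by side in the sketch below, which converts the same integer Series both ways. The variable names are illustrative. The dtype reported for `astype(str)` is object on the Pandas 1.x and 2.x releases this article targets, and may differ in other versions, so no expected dtype output is shown for it.

```python
import pandas as pd

s = pd.Series([3, 4], name="A")

as_str = s.astype(str)          # builtin str: values become Python strings
as_string = s.astype("string")  # dedicated string type (StringDtype)

print(as_str.dtype)
print(as_string.dtype)

# The stored values are the same text either way; only the dtype differs
print(as_str.tolist() == as_string.tolist())  # True
```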

Changing type to category

Consider the following DataFrame:

df = pd.DataFrame({"group":["A","B"]})
df
  group
0     A
1     B

To convert the group column to type category:

df["group"] = df["group"].astype("category")
df.dtypes
group    category
dtype: object
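As a rough sketch of what the conversion produces, the example below inspects the resulting column through the `.cat` accessor. The extra third row is added here only to show a repeated value. Each distinct label becomes a category, and each row stores an integer code pointing at its category.

```python
import pandas as pd

# Same idea as above, with one repeated label added for illustration
df = pd.DataFrame({"group": ["A", "B", "A"]})
df["group"] = df["group"].astype("category")

categories = list(df["group"].cat.categories)  # distinct labels
codes = df["group"].cat.codes.tolist()         # integer code per row

print(df["group"].dtype)  # category
print(categories)         # ['A', 'B']
print(codes)              # [0, 1, 0]
```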
Published by Isshin Inada