menu

login

Log in

Linear Algebra

Prob and Stats

Other math topics

Machine Learning

Dagster (NEW)

search

Search

Login

Unlock 100+ guides

menu

menu

search toc

close

Outline

Parameters Return value Examples Variance of a 1D array Computing sample variance Computing population variance Variance of a 2D array Entire array Column-wise Row-wise

Comments

Log in or sign up

Cancel

Post

account_circle

exit_to_app

Sign out

What does this mean?

Why is this true?

Give me some examples!

search

keyboard_voice

close

Searching Tips

Search for a recipe:
"Creating a table in MySQL"

Search for an API documentation: "@append"

Search for code: "!dataframe"

Apply a tag filter: "#python"

Useful Shortcuts

/ to open search panel

Esc to close search panel

↑↓ to navigate between search results

⌘d to clear all current filters

⌘Enter to expand content preview

icon_star

Doc Search

icon_star

Code Search Beta

SORRY NOTHING FOUND!

mic

Start speaking...

Voice search is only supported in Safari and Chrome.

fullscreen_exit

Shrink

Navigate to

NumPy

319 guides

keyboard_arrow_down

Linear Algebra

Prob and Stats

Machine Learning

Other math topics

chevron_leftDocumentation

Method argpartition

NumPy Random Generator4 topics

Method choice Method dot Method finfo Method histogram Method iinfo Method max Method mean Method place Method roots Method seed Method uniform Method view Method zeros Method sum Object busdaycalendar Method is_busday Property dtype Method unique Method loadtxt Method vsplit Method fliplr Method setdiff1d Method msort Method argsort Method lexsort Method around Method nanmax Method nanmin Method nanargmax Method nanargmin Method argmax Method argmin Property itemsize Method spacing Method fix Method ceil Method diff Property flat Property real Property base Method flip Method delete Method amax Method amin Method logical_xor Method logical_or Method logical_not Method logical_and Method logaddexp Method logaddexp2 Method logspace Method not_equal Method equal Method greater_equal Method less Method less_equal Method remainder Method mod Method empty Method greater Method isfinite Method busday_count Method repeat Method var Method random_sample Method random Method sign Method std Method absolute Method abs Method sort Method randint Method isreal Method linspace Method gradient Method all Method sample Property T Property imag Method cov Method insert Method log Method log1p Method exp2 Method expm1 Method exp Method arccos Method cos Method arcsin Method sin Method tan Method fromiter Method trim_zeros Method diagflat Method savetxt Method count_nonzero Property size Property shape Method reshape Method resize Method triu Method tril Method eye Method arange Method fill_diagonal Method tile Method save Method transpose Method swapaxes Method meshgrid Property mgrid Method rot90 Method log2 Method radians Method deg2rad Method rad2deg Method degrees Method log10 Method append Method cumprod Property nbytes Method tostring Property data Method modf Method fmod Method tolist Method datetime_as_string Method datetime_data Method array_split Method itemset Method floor Method put_along_axis Method cumsum Method bincount Method put Method putmask Method take Method hypot Method sqrt Method square Method floor_divide Method tri Method signbit Method flatten Method ravel Method roll Method isrealobj Method diag Method diagonal Method quantile Method ones Method iscomplexobj Method iscomplex Method isscalar Method divmod Method isnat Method percentile Method isnan Method divide Method add Method reciprocal Method positive Method subtract Method median Method isneginf Method isposinf Method float_power Method power Method negative Method maximum Method average Method isinf Method multiply Method busday_offset Method identity Method interp Method squeeze Method get_printoptions Method savez_compressed Method savez Method load Method asfarray Method clip Method array Method array_equiv Method array_equal Method frombuffer Method set_string_function Method matmul Method genfromtxt Method fromfunction Method asscalar Method searchsorted Method full_like Method full Method shares_memory Method ptp Method digitize Method argwhere Method geomspace Method zeros_like Method fabs Method flatnonzero Method vstack Method dstack Method fromstring Method tobytes Method expand_dims Method ranf Method arctan Method item Method extract Method compress Method choose Method asarray Method asmatrix Method allclose Method isclose Method any Method corrcoef Method trunc Method prod Method cross Method true_divide Method hsplit Method split Method rint Method ediff1d Method lcm Method gcd Method cbrt Method flipud Property ndim Method array2string Method set_printoptions Method where Method hstack

Char32 topics

check_circle

Mark as learned

thumb_up

0

thumb_down

0

chat_bubble_outline

0

Comment

auto_stories Bi-column layout

settings

NumPy | var method

schedule Aug 12, 2023

Last updated

local_offer

Python●NumPy

Tags

tocTable of Contents

expand_more

Parameters Return value Examples Variance of a 1D array Computing sample variance Computing population variance Variance of a 2D array Entire array Column-wise Row-wise

Master the mathematics behind data science with 100+ top-tier guides
Start your free 7-days trial now!

NumPy's var(~) method computes the variance of values in the input array. The variance is computed using the following formula:

$$\frac{1}{N}\sum_{i=0}^{N}\left(x_i-\bar{x}^2\right)$$

Where:

$N$ is the size of the given array (i.e. the sample size)
$x_i$ is the value of the $i$th index in the Numpy array
$\bar{x}$ is the sample mean

NOTE

var(~) method can also compute the unbiased estimate of the variance. We do this by setting ddof=1 in the parameters, as we shall see later in the examples.

Parameters

1. a | array-like

The array on which to perform the method.

2. axislink | int or tuple | optional

The axis along which we compute the variance. For 2D arrays, the allowed values are as follows:

Axis	Meaning
0	Variance will be computed column-wise
1	Variance will be computed row-wise
None	Variance will be computed on a flattened array

By default, axis=None.

3. dtype | string or type | optional

The type used to compute the variance. If the input array is of type int, then float32 will be used. If the input array is of another numerical type, then its type will be used.

4. ddoflink | int | optional

The delta degree of freedom. This can be used to modify the denominator in the front:

$$\frac{1}{N\color{blue}{-ddof}}\sum_{i=0}^{N}\left(x_i-\bar{x}^2\right)$$

By default, ddof=0.

Return value

If axis=None, then a single float representing the variance of all the values in the array is returned. Otherwise, a Numpy array is returned.

Examples

Variance of a 1D array


        
        
            
                
                
                    np.var([1,2,3,4])
                
            
            1.25

Computing sample variance

To compute the sample variance, set ddof=1:


        
        
            
                
                
                    np.var([1,2,3,4], ddof=1)
                
            
            1.6666666666666667

Computing population variance

To compute the population variance, leave out the ddof parameter or explicitly set ddof=0:


        
        
            
                
                
                    np.var([1,2,3,4])   # By default, ddof=0
                
            
            1.25

Variance of a 2D array

Entire array

Without specifying the axis parameter, Numpy will just regard your Numpy array as a flattened array.


        
        
            
                
                
                    np.var([[1,2],[3,4]])
                
            
            1.25

This code is fundamentally the same as np.var([1,2,3,4]).

Column-wise

To compute the variance column-wise, specify axis=0 in the parameters:


        
        
            
                
                
                    np.var([[1,4],[2,6], [3,8]], axis=0)
                
            
            array([0.66666667, 2.66666667])

Here, we're computing the variance of [1,2,3] (i.e. the first column) as well as [4,6,8] (i.e. the second column).

Row-wise

To compute the variance column-wise, specify axis=1 in the parameters:


        
        
            
                
                
                    np.var([[1,4],[2,6], [3,8]], axis=1)
                
            
            array([2.25, 4.  , 6.25])

Here, we're computing three variances: first row (i.e. [1,4]), second row (i.e. [2,6]) and third row (i.e. [3,8]).

WARNING

Sometimes the numerical type float32 may not be accurate enough for your needs. If your application requires more accurate numbers, then set dtype=np.float64 in the argument. This will take up more memory, but will provide a more accurate result.

robocat

Published by Isshin Inada

Edited by 0 others

Did you find this page useful?

thumb_up

thumb_down

Comment

Citation

Ask a question or leave a feedback...

thumb_up

0

thumb_down

0

chat_bubble_outline

0

settings

Enjoy our search

Hit / to insta-search docs and recipes!

Navigation

Contact us

Resources

Python Pandas MySQL Beautiful Soup Matplotlib NumPy PySpark

Community

Join our Discord

Join our newsletter for updates on new comprehensive DS/ML guides

|