PySpark SparkSession | range method
PySpark SparkSession's range(~) method creates a new PySpark DataFrame containing a series of values. This method is analogous to Python's built-in range(~) function.
Parameters
1. start | int
The starting value (inclusive).
2. end | int | optional
The ending value (exclusive).
3. step | int | optional
The value by which to increment. By default, step=1.
4. numPartitions | int | optional
The number of partitions into which to divide the values.
Return Value
A PySpark DataFrame.
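The returned DataFrame holds a single column named id of type long. A minimal check of the schema, assuming the same spark session used throughout this page:
df = spark.range(3)
df.printSchema()
root
 |-- id: long (nullable = false)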
Examples
Creating a PySpark DataFrame using range (series of values)
To create a PySpark DataFrame that holds a series of values, use the range(~) method:
df = spark.range(1, 4)
df.show()
+---+
| id|
+---+
|  1|
|  2|
|  3|
+---+
Notice how the starting value is included while the ending value is not.
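As a quick sanity check, collecting the rows of the same df confirms that only the values 1, 2 and 3 are present:
df.collect()
[Row(id=1), Row(id=2), Row(id=3)]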
Note that if only one argument is supplied, then the range will start from 0 (inclusive) and the argument will represent the end-value (exclusive):
df = spark.range(3)
df.show()
+---+
| id|
+---+
|  0|
|  1|
|  2|
+---+
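In other words, the single-argument form behaves as if start=0 had been passed explicitly. A small sketch comparing the two forms, assuming the same spark session:
spark.range(3).collect() == spark.range(0, 3).collect()
True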
Setting an incremental value
Instead of the default incremental value of step=1, we can choose a specific incremental value using the third argument:
df = spark.range(1, 6, 2)
df.show()
+---+
| id|
+---+
|  1|
|  3|
|  5|
+---+
Series of values in descending order
We can also get a series of values in descending order:
df = spark.range(4, 1, -1)
df.show()
+---+
| id|
+---+
|  4|
|  3|
|  2|
+---+
Note the following:
- the starting value must be larger than the ending value
- the incremental value must be negative
If either condition is not met, the result is simply an empty DataFrame (see the sketch below).
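A minimal sketch of this behaviour, assuming the same spark session: a positive step combined with a start value larger than the end value produces no rows at all:
df = spark.range(4, 1)
df.count()
0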
Specifying the number of partitions
By default, the number of partitions into which the resulting PySpark DataFrame is split is governed by our PySpark configuration. In my case, the default number of partitions is 8:
df = spark.range(1, 4)
df.rdd.getNumPartitions()
8
We can override our configuration by specifying the numPartitions parameter:
df = spark.range(1, 4, numPartitions=2)
df.rdd.getNumPartitions()
2
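On many setups, the default partition count for range(~) follows the Spark context's default parallelism (for example, the number of local cores). As a rough check, and assuming a local session like the one used above:
spark.sparkContext.defaultParallelism
8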