PySpark
keyboard_arrow_down 147 guides
chevron_leftPySpark RDD
check_circle
Mark as learned thumb_up
1
thumb_down
0
chat_bubble_outline
0
Comment auto_stories Bi-column layout
settings
PySpark RDD | first method
schedule Aug 12, 2023
Last updated local_offer
Tags PySpark
tocTable of Contents
expand_more Master the mathematics behind data science with 100+ top-tier guides
Start your free 7-days trial now!
Start your free 7-days trial now!
PySpark RDD's first(~)
method returns the first element of the RDD.
Parameters
This method does not take in any parameters.
Return Value
The type will be that of the first element of the RDD.
Examples
We create a RDD using the parallelize(~)
method:
rdd
ParallelCollectionRDD[61] at readRDDFromInputStream at PythonRDD.scala:413
Fetching the first element of a RDD
To fetch the first element in the RDD, use the first()
method:
rdd.first()
2
Published by Isshin Inada
Edited by 0 others
Did you find this page useful?
thumb_up
thumb_down
Comment
Citation
Ask a question or leave a feedback...
Official PySpark Documentation
https://spark.apache.org/docs/latest/api/python/reference/api/pyspark.RDD.first.html
thumb_up
1
thumb_down
0
chat_bubble_outline
0
settings
Enjoy our search
Hit / to insta-search docs and recipes!