PySpark DataFrame | show method
Start your free 7-days trial now!
PySpark DataFrame's show(~)
method prints the rows of the DataFrame on the console.
Parameters
1. n
| int
| optional
The number of rows to show. By default, n=20
.
2. truncate
| boolean
or int
| optional
If
True
, then strings that are longer than 20 characters will be truncated.If
False
, then whole strings will be shown.If
int
, then strings that are longer thantruncate
will be truncated.
If truncation occurs, then the left part of the string is preserved. By default, truncate=True
.
3. vertical
| boolean
| optional
If True
, then the rows are printed with one line for each column value. By default, vertical=False
.
Return Value
None
.
Examples
Consider the following PySpark DataFrame:
columns = ["name", "age"]data = [("Alex", 15), ("Bob", 20), ("Cathy", 25)]
Printing the first n rows of PySpark DataFrame
To print the first 20 rows of the PySpark DataFrame:
+-----+---+| name|age|+-----+---+| Alex| 15|| Bob| 20||Cathy| 25|+-----+---+
To print the first 2 rows of the DataFrame:
+----+---+|name|age|+----+---+|Alex| 15|| Bob| 20|+----+---+only showing top 2 rows
Truncating strings in printed rows of PySpark DataFrame
To truncate strings that are longer than 2:
df.show(truncate=2)
+----+---+|name|age|+----+---+| Al| 15|| Bo| 20|| Ca| 25|+----+---+
Disabling truncation of strings in printed rows of PySpark DataFrame
To disable truncation of strings in printed rows:
df.show(truncate=False)
+-----+---+|name |age|+-----+---+|Alex |15 ||Bob |20 ||Cathy|25 |+-----+---+
Printing rows of PySpark DataFrame vertically
To print each column value in a separate line:
df.show(vertical=True)
-RECORD 0----- name | Alex age | 15 -RECORD 1----- name | Bob age | 20 -RECORD 2----- name | Cathy age | 25