Data Manipulation in pyspark - Part 1
Data manipulation methods in pyspark
This article with cover various string transformations in spark dataframe. For the demo, we are going to use Store Sales dataset available from kaggle.
Data manipulation methods in pyspark
This article with cover various string transformations in spark dataframe. For the demo, we are going to use Store Sales dataset available from kaggle.
Renaming Columns in a dataframe
This article with cover various ways of renaming columns in spark dataframe. For the demo, we are going to use Auto-mpg dataset available from kaggle.
Tips and tricks for using databricks platform
In this section, I am going to provide information on how to add external files in databricks community edition and then use these files in notebooks. For the demo, we are going to use Auto-mpg dataset](https://www.kaggle.com/datasets/uciml/autompg-dataset?resource=download).