spark

Pyspark Data Sources

Pyspark supported data sources

Vasav

13 minute read

Spark support various data sources. Spark has some core data sources built into it while the others are available and maintained by other developers from the community. In this post, I am going to explain the core data sources supported by pyspark.

How Apache Spark works

Spark Architecture

Vasav

4 minute read

Apache spark is a distributed compute engine used to process large volume/amount of data. In this article I am going to provide information on how it works behind the scenes.