In this article, we are going to cover other topics. Basics of Scala was covered in the earlier article which was found at this link. In these series, I am mostly going to provide code snippets which can help me revise what I have learnt and act as a cheatsheet. For better and in-depth understanding, feel free to visit the reference section at the very end, it lists all the documents/articles I have referred to for creating this blog post.
In this spark series, I will continue to add more details on the spark dataframe. This post will specifically cover various aspects of dataframe creation. I am going to include how a python datastructure can be converted into a spark dataframe and how to specify the data type/schema associated with the spark dataframe.These series will have more code snippets and less explaination as most of the things are self explanatory.
In this article, I am going to walk you through the process of using duckdb local instance with cube js. For smaller data sets, cube js sementic layer can be employed to drive any supported BI tool. A general workflow would be using excel files to load data in duckdb and using that data with a BI tool can add a lot of value for a small data team.