From the course: Advanced Microsoft Fabric Implementation and Governance

Unlock the full course today

Join today to access over 24,400 courses taught by industry experts.

Getting started with Spark

Getting started with Spark

- [Instructor] Apache Spark is an open-source method for performing all kinds of data transformation as it can handle large data volumes. The version implemented inside of Microsoft Fabric provides a very quick, default starting session and the ability to create custom configurations to better meet the needs of your organization. While a notebook support our Scala, SQL, or Python, we will primarily focus on Python as it is the most used language. You can magically change to any of these supported languages by using the magic commands, which start with a double percent science. Because Microsoft Fabric uses open-source Spark, you can import and export Spark Notebooks and incorporate code created for other applications. Accessing data inside of a Spark Notebook has been simplified inside of Microsoft Fabric, as all you need to do is add a lakehouse to a Spark notebook and you can drag and drop data to create a Spark data frame. Notebooks can be configured to share Spark sessions without…

Contents