From the course: Advanced Microsoft Fabric Implementation and Governance
Unlock the full course today
Join today to access over 24,400 courses taught by industry experts.
Getting started with Spark
From the course: Advanced Microsoft Fabric Implementation and Governance
Getting started with Spark
- [Instructor] Apache Spark is an open-source method for performing all kinds of data transformation as it can handle large data volumes. The version implemented inside of Microsoft Fabric provides a very quick, default starting session and the ability to create custom configurations to better meet the needs of your organization. While a notebook support our Scala, SQL, or Python, we will primarily focus on Python as it is the most used language. You can magically change to any of these supported languages by using the magic commands, which start with a double percent science. Because Microsoft Fabric uses open-source Spark, you can import and export Spark Notebooks and incorporate code created for other applications. Accessing data inside of a Spark Notebook has been simplified inside of Microsoft Fabric, as all you need to do is add a lakehouse to a Spark notebook and you can drag and drop data to create a Spark data frame. Notebooks can be configured to share Spark sessions without…
Practice while you learn with exercise files
Download the files the instructor uses to teach the course. Follow along and learn by watching, listening and practicing.