Fugue with Cloud Providers

This section covers using Fugue on top of cloud providers such as Databricks or Coiled. The fugue-cloudproviders package lets users spin up ephemeral compute for their workflows.


Databricks is the most common provider for Spark clusters. Using the databricks-connect library, we can spin up an ephemeral Spark cluster and connect to its SparkSession from a local machine.
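
As a rough illustration of connecting from a local machine, the sketch below passes a remote SparkSession to Fugue's `transform`. It assumes the classic databricks-connect client is installed and already configured, so that `SparkSession.builder.getOrCreate()` returns a session backed by the Databricks cluster; newer databricks-connect releases expose a different entry point. The function names `add_double` and `run_on_databricks` and the `value` column are hypothetical, chosen only for this example.

```python
import pandas as pd


def add_double(df: pd.DataFrame) -> pd.DataFrame:
    """Plain pandas logic (hypothetical example); Fugue can run it unchanged on Spark."""
    return df.assign(doubled=df["value"] * 2)


def run_on_databricks(df: pd.DataFrame):
    """Run add_double on the remote cluster.

    Assumes databricks-connect is installed and configured for a cluster.
    """
    # Imported lazily so the pandas logic above works without Spark installed.
    from pyspark.sql import SparkSession
    from fugue import transform

    # Assumption: with classic databricks-connect, this session points at
    # the Databricks cluster rather than a local Spark instance.
    spark = SparkSession.builder.getOrCreate()

    # "*" keeps the input columns; "doubled:long" declares the new column.
    return transform(df, add_double, schema="*,doubled:long", engine=spark)
```

Keeping the logic in a plain pandas function means it can be checked locally, for example with `add_double(pd.DataFrame({"value": [1, 2, 3]}))`, before it is sent to the cluster.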