Skip to main content

Databricks documentation

Databricks on Google Cloud is a Databricks environment hosted on Google Cloud, running on Google Compute Engine (GCE) and providing built-in integration with Google Cloud Identity, Google Cloud Storage, BigQuery, and other Google Cloud technologies.

tip

Databricks technical documentation is organized by cloud provider. Use the cloud switcher in the upper right corner of the page to choose between Amazon Web Services, Google Cloud Platform, or Microsoft Azure.

Try Databricks

Task

Description

Start a Databricks free trial

Start your journey with Databricks by setting up a free trial account and configuring your first environment.

Workspace UI

Learn the fundamentals of navigating and using the Databricks workspace interface.

Tutorial: Query and visualize data from a notebook

Get hands-on experience with data upload, SQL queries, and creating visualizations in Databricks.

Tutorial: Build an ETL pipeline with Lakeflow Declarative Pipelines

Create your first ETL pipeline to transform and process data using Databricks.

Explore Databricks

Task

Description

Data guides

Discover and connect to data sources, manage data assets, and perform exploratory data analysis.

Data engineering with Databricks

Build and manage ETL pipelines, process large data sets, and orchestrate data workflows.

AI and machine learning on Databricks

Develop, train, and deploy machine learning models and generative AI applications using MLflow and Databricks tools.

Databricks AI/BI

Create dashboards, reports, and visualizations for business insights and BI analytics.

Data warehousing on Databricks

Query and analyze data using SQL, manage schemas, and optimize data warehouse performance.

Develop on Databricks

Build applications, integrate APIs, and extend Databricks functionality with custom code.

Manage Databricks

Task

Description

Administration

Configure account settings and manage workspaces, users, and administrative policies across your Databricks environment.

Security and compliance

Implement security controls, configure access policies, and ensure compliance with industry standards.

Data governance with Databricks

Establish data governance frameworks, manage data lineage, and implement data quality controls.

Link

Description

Reference

Overview of API reference documentation, including reference for the Databricks REST API, SDKs, Python APIs, and Databricks SQL.

Databricks release notes

Stay updated with the latest product updates, new features, and platform improvements.

Status page

Information about the Databricks Status Page to monitor system status, service availability, and maintenance schedules across all regions.

Databricks technical terminology glossary

Find definitions for technical terms, concepts, and terminology used throughout Databricks.

Other resources

Limits and quotas, regions, support, product feedback, free training, migration guides, and more.