At Databricks, we’re all the time working arduous to make your queries run sooner. Nonetheless, there are occasions when it’s useful to look a bit of deeper to see how your queries are become execution plans and distributed for parallel execution.
That’s the place Question Profiles are available. We launched them in the very first launch of Databricks SQL. Since then, we’ve prolonged protection to SQL and Python code working on Serverless Compute for Notebooks and Workflows, in addition to DLT Pipelines.
Because of your suggestions, we’ve made Question Profiles even higher. Now accessible throughout all Clouds, the brand new expertise makes it simpler and extra satisfying to fine-tune efficiency and clear up bottlenecks.
Question Profiles: Your efficiency tuning companion
Question Profiles enable you to perceive how your queries run, whether or not you’re utilizing SQL, Python DataFrames, or DLT pipelines. They enable you to spot gradual components for every question, perceive what occurs throughout execution, and information your performance-tuning choices. Try the video beneath to see Question Profiles in motion—and take a look at them out.
The upgraded interface is extremely interactive and intuitive. You’ll be able to discover execution plans visually, see which operations have been concerned, like scans or joins, and rapidly dive into metrics that present the place time and assets have been spent, whether or not the question continues to be working or already accomplished.
You’ll discover Question Profiles throughout Databricks: on the Question Historical past web page, Notebooks, the SQL Editor, Jobs UI, and DLT Pipelines. They’re additionally built-in with the Databricks Assistant when utilizing the /optimize command.
From fine-tuning queries throughout growth to investigating gradual jobs or pipelines or digging into particulars after you’ve noticed outliers utilizing the Question Historical past system desk, Question Profiles are your go-to device for understanding and bettering efficiency.
A better overview earlier than you dive in
We’ve reimagined the question abstract panel to provide you a clearer image of your question earlier than you even open the whole profile. Whether or not you’re reviewing an announcement from Question Historical past or actively growing in an editor, you get an outline at a look.
You’ll see a visible abstract of learn/write metrics and your filters’ effectiveness, so you possibly can instantly inform how a lot information was pruned. You’ll additionally get a sneak peek into your question profile’s total form and complexity, together with a high-level breakdown of the place time was spent (execution vs. different steps like optimization).
A fast hyperlink takes you straight to the brand new Prime Operators panel, and the question supply is now only a click on away, making it straightforward to leap again to the precise piece of code that generated the question, even from locations like Question Historical past or Jobs pages the place direct enhancing isn’t attainable.
Plus, you’ll discover a abstract of key metrics aggregated throughout all operators, so you possibly can rapidly spot purple flags even earlier than trying on the total execution plan.
Model-new Prime operators panel
The brand new Prime operators panel surfaces the most costly components of your question instantly, so you possibly can rapidly zero in on the largest alternatives for optimization. You get a ranked listing of operators, making it straightforward to focus your tuning efforts on the place they’ll have essentially the most affect.
We’ve added interactive controls: simply click on an operator within the panel to zoom into that a part of the graph and immediately see detailed metrics. It’s a sooner method to discover efficiency hotspots in your question plan.
A sleeker and sooner method to discover the execution graph
We’ve redesigned the execution graph to make navigating extra easy and environment friendly. Now you can zoom on to any node, filter nodes by key phrase, and consider richer particulars, all inside a cleaner, extra polished interface.
Giant graphs are additionally simpler to handle. We’ve launched a minimized node view when zoomed out, which reduces visible noise whereas highlighting the most costly nodes in your plan. This characteristic enables you to rapidly spot efficiency hotspots and resolve the place to zoom in and examine additional.
You’ll be able to select what to give attention to: toggle between time spent, reminiscence used, or rows processed. The time spent metric, specifically, helps pinpoint the place essentially the most intensive work occurred—it aggregates execution time throughout all duties that executed your code in parallel throughout a number of employee nodes in your clusters.
Simpler entry to operator metrics
We’ve simplified exploring and analyzing operator metrics. The up to date format presents key particulars extra clearly, and a brand new filter possibility enables you to rapidly slender down the metrics you care about—no extra limitless scrolling.
Have to take your evaluation elsewhere? Now you can export operator metrics to CSV with a single click on. Plus, we’ve added table-level insights for Scan operators to provide you an outline of key particulars for the tables you learn.
What’s subsequent
We’re not stopping right here. Right here’s a preview of what we’re at the moment exploring:
- Smarter insights and tuning strategies constructed into the question profile.
- Tighter integration with the Databricks Assistant’s /optimize characteristic.
- Question insights as a system desk so you possibly can monitor efficiency at scale.
Tell us what else you’d prefer to see — your suggestions drives what we construct.
Able to dive in? Discover the brand new Question Profile in Databricks SQL, or attempt Databricks SQL without spending a dime. Question Profiles additionally assist Serverless Compute for Notebooks, Workflows and DLT!