power-query

Installation

SKILL.md

Power Query for Semantic Models

Author, validate, and test Power Query M expressions in semantic model import partitions. Covers writing correct M code, preserving query folding, validating expressions, and testing them by executing against real data sources.

Partition Expressions

Each import table in a semantic model has a partition with an M expression defining what data gets loaded during refresh. The expression typically connects to a data source, navigates to a table/view, and applies transformations.

Structure of a Partition Expression

let
    Source = Sql.Database(#"SqlEndpoint", #"Database"),
    Data = Source{[Schema="dbo", Item="Orders"]}[Data],
    #"Removed Columns" = Table.RemoveColumns(Data, {"InternalId"}),
    #"Changed Type" = Table.TransformColumnTypes(#"Removed Columns", {{"Amount", Currency.Type}})
in
    #"Changed Type"

Key elements:

Parameters: #"SqlEndpoint", #"Database" are shared M parameters defined at the model level
Navigation: Source{[Schema="dbo", Item="Orders"]}[Data] navigates to a specific table
Steps: Each step is a named variable in the let...in chain
Quoted identifiers: Step names with spaces use #"Step Name" syntax

Extracting Expressions

# Get partition expression from TMDL via fab
fab get "<Workspace>.Workspace/<Model>.SemanticModel" -f \
  -q "definition.parts[?path=='definition/tables/<Table>.tmdl'].payload"

# Get shared M parameters
fab get "<Workspace>.Workspace/<Model>.SemanticModel" -f \
  -q "definition.parts[?path=='definition/expressions.tmdl'].payload"

Writing M Expressions

Query Folding

Query folding is the most important performance concept. The M engine translates compatible steps into native data source queries (e.g., SQL). When folding breaks, subsequent steps run in the mashup engine, pulling all data into memory first.

Steps that typically fold (for SQL sources):

Table.SelectColumns / Table.RemoveColumns -> SELECT
Table.SelectRows -> WHERE
Table.Sort -> ORDER BY
Table.FirstN -> TOP
Table.Group -> GROUP BY
Table.RenameColumns -> AS aliases

Steps that may or may not fold (source-dependent):

Table.TransformColumnTypes -- frequently breaks folding for text-to-numeric/date conversions on SQL Server sources. Use Table.TransformColumns with explicit conversion functions (e.g., Number.From) as a more reliable foldable alternative.

Steps that break folding:

Table.AddColumn with custom M functions (not translatable to SQL)
Table.Buffer (forces materialization; prefer Table.StopFolding to stop folding without the memory overhead)
Table.LastN (no SQL equivalent without subquery)
Table.Combine across different data sources (cross-database folding within the same SQL Server is possible via EnableCrossDatabaseFolding)
Complex each expressions with M-specific logic
Any step after a fold-breaking step

Best practice: Apply folding-compatible steps (filter, select, type) early; add custom columns and M-only transforms after all foldable work is done.

Column Pruning and Row Filtering

Remove unused columns and filter rows as early as possible:

let
    Source = Sql.Database(SqlEndpoint, Database),
    Data = Source{[Schema="dbo", Item="Orders"]}[Data],
    // Filter and select BEFORE any custom transforms
    #"Filtered" = Table.SelectRows(Data, each [Status] <> "Cancelled"),
    #"Selected" = Table.SelectColumns(#"Filtered", {"OrderId", "Date", "Amount", "CustomerId"})
in
    #"Selected"

These steps fold to SQL: SELECT OrderId, Date, Amount, CustomerId FROM dbo.Orders WHERE Status <> 'Cancelled'

Type Handling

Apply Table.TransformColumnTypes early (folds to CAST in SQL)
Use explicit M types: Int64.Type, type text, type date, Currency.Type, type logical
Avoid implicit type inference on large datasets

Naming Conventions

Step names should describe the transformation: #"Removed Duplicates", #"Filtered Active"
Avoid generic names like #"Custom1" or #"Step1"
Use quoted identifiers #"Name" for steps with spaces (Power Query convention)

Validating M Expressions

Two approaches to validate that an M expression is syntactically correct and produces expected results:

1. Execute via the Power Query API (Recommended)

Test the expression by running it against real data. This validates syntax, data source connectivity, and transformation correctness in one step.

The executing-power-query skill in the etl plugin provides the full workflow. In summary:

Create or reuse a runner dataflow in the workspace
Bind the data source connection to the runner
Wrap the expression in a section document, inline parameters
Execute via POST /v1/workspaces/{wsId}/dataflows/{dfId}/executeQuery
Parse the Arrow response to verify data

MASHUP='section Section1;
shared SqlEndpoint = "myserver.database.windows.net";
shared Database = "MyDB";
shared Result = let
    Source = Sql.Database(SqlEndpoint, Database),
    Data = Table.FirstN(Source{[Schema="dbo",Item="Orders"]}[Data], 10)
in Data;'

curl -s -o result.bin -X POST ".../executeQuery" \
  -H "Authorization: Bearer ${TOKEN}" -H "Content-Type: application/json" \
  -d "$(jq -n --arg m "$MASHUP" '{queryName:"Result",customMashupDocument:$m}')"

See references/validation.md for step-by-step instructions and error handling.

2. Save the Partition via XMLA / TOM

Write the expression back to the model; Analysis Services validates the M syntax on save. This doesn't execute the query but catches structural errors:

Missing or mismatched let/in
Undefined step references
Invalid function calls
Type mismatches in TransformColumnTypes

# Edit the TMDL partition source directly and deploy via fab import,
# or use the XMLA endpoint with Tabular Editor or SSMS to modify
# the partition expression on the deployed model.

AS returns an error if the expression is malformed. This is faster than a full execute but doesn't catch runtime errors (wrong column names, data source issues).

Choosing a Validation Approach

Need	Use
Full data validation (correct columns, types, values)	Execute via API
Quick syntax check	Save to model via XMLA/TOM
Step-by-step debugging	Execute with truncated `in` clause
Performance testing (check folding)	Execute with full data, observe timing

Previewing Partition Steps

See the data at any point in the transformation chain by truncating the let...in:

-- See raw source data (all columns)
in Data;

-- See after column removal
in #"Removed Columns";

-- See final result
in #"Changed Type";

Add Table.FirstN(stepName, 100) before the in to limit rows for large tables. See references/validation.md for the complete procedure.

Common Patterns

Incremental Refresh Partitions

Incremental refresh partitions use RangeStart and RangeEnd parameters:

let
    Source = Sql.Database(#"SqlEndpoint", #"Database"),
    Data = Source{[Schema="dbo", Item="Orders"]}[Data],
    #"Filtered" = Table.SelectRows(Data, each
        [OrderDate] >= #"RangeStart" and [OrderDate] < #"RangeEnd")
in
    #"Filtered"

When testing, inline concrete date values for RangeStart and RangeEnd.

Lakehouse Sources

let
    Source = Lakehouse.Contents(null),
    Data = Source{[Id="lakehouse-guid"]}[Data],
    Table = Data{[Id="table-name", ItemKind="Table"]}[Data]
in
    Table

SQL with Native Query

For complex SQL that can't be expressed in M:

let
    Source = Sql.Database("server", "db"),
    Data = Value.NativeQuery(Source, "SELECT * FROM dbo.MyView WHERE Year = 2024", null, [EnableFolding=true])
in
    Data

Value.NativeQuery with EnableFolding=true allows subsequent M steps to fold on top of the native query.

References

references/validation.md -- Detailed validation workflow with executeQuery API, step preview, error handling
references/best-practices.md -- Query folding guidance, fold-breaker list, anti-patterns, performance tips
examples/execute_m.py -- Python script to execute M expressions via the Fabric API (CLI tool)
examples/preview_partition.py -- Python script to preview partition data at any step (uses fab get + execute_m.py)
Power Query M Reference
Query Folding Guidance

Related skills

More from data-goblin/power-bi-agentic-development

Installs

Repository

data-goblin/pow…elopment

GitHub Stars

563

First Seen

Apr 5, 2026

Security Audits

Gen Agent Trust HubWarn

SocketPass

SnykWarn