Skip to main content

Custom SQL rules for Data Quality

Beyond predefined rules, you can now create custom rules using SQL queries. These rules provide flexibility to test specific conditions and tailor rules to your data quality requirements.

Follow this guide to see how how to set up permissions and create custom SQL rules.

Permissions

Since custom rules allow direct querying of your data, you must have Power Data Steward assigned to create them. This role is required to create and manage custom SQL queries. It includes all Data Steward capabilities, plus the additional Manage External Queries permission.

Power Data Steward role

Create a custom SQL rule

Start the rule creation wizard

  1. Go to Data Governance > Data Quality and select the Rule Instances tab.

  2. Click the Create Rule Instance button.

    tip

    You can create a rule directly from a table or column.

  3. If you have the Power Data Steward role, a dialog will appear, allowing you to choose between:

    • A predefined rule from the rule list
    • A Custom SQL Rule (select this option)
Create Rule Instance dialog

Step 1: Definition

In the Definition step:

  1. Choose the database where your SQL rule will run. The dropdown menu lists only data sources supported for Data Quality feature.

    Create Rule Instance dialog
  2. Write the SQL query that defines the condition to be checked. The query must return rows that fail the data quality rule—these rows will be flagged as errors.


    Sample custom SQL query
    SELECT * FROM orders WHERE status NOT IN ('shipped', 'delivered');
  3. Now Define tested rows count.
    This query specifies the total number of rows tested by the rule. It helps contextualize results but is not required.


    Sample custom SQL query for tested rows
    SELECT COUNT(*) FROM orders;
Create Rule Instance dialog
tip

Both input fields support syntax highlighting for better readability.


Keep in mind that custom queries are not validated, giving you full control over the SQL logic.

Step 2: Select object

Define the Object Type:

  • Column-based rule: Select a table first, then choose a column.
  • Table-based rule: Choose from available tables.

A search input helps you quickly locate the correct object.

tip

Suggestions for available objects appear based on the query entered in the first step.

If you're selecting a column, suggestions for matching columns are shown, along with inputs for both the table and column names.

If you're selecting a table, suggestions for matching tables are shown, with an input to specify the table name.

Object suggestions
Assign custom rules

Step 3: Save failed rows

By default, failed rows are not saved. To activate this feature, activate the Save Failed Rows option. This allows storing and analyzing rows that do not meet the rule’s conditions.

Save failed rows

Step 4: Settings

  1. Enter a name and an optional description for the rule
  2. Assign it to a Library to keep rules organized
  3. Define the Severity level for failed rows
  4. Click the Create rule instance button to save the rule
Settings

Once saved, the custom SQL rule will be applied to incoming data, helping to identify and address quality issues efficiently.