Expect_column_pair_values_to_be_equal does not generate a row level report?

lsantosessex · December 20, 2023, 6:45pm

Hi everyone!

I am very new to GE, so please forgive me if this is a dumb question.

I am using “expect_column_pair_values_to_be_equal” to compare two columns. So far almost everything works fine: GE validates the table and provides me with statistics.

My issue is that I need GE to show me which keys are failing, in other words, I need a report of which records are failing the data diff between A and B, but I am not finding how to do it.

Can anyone please share any insights?

This is my script:


# Setup imports
import great_expectations as gx
from great_expectations.checkpoint import Checkpoint

# Table to validate with expect_column_pair_values_to_be_equal
data_asset = 'stg_QualityCheck'

# Setup context
context = gx.get_context()

# Connect
MSSQL_CONNECTION_STRING = ""

# Setup DataSource
mssql_datasource = context.sources.add_sql(
    name="mssql_datasource", connection_string=MSSQL_CONNECTION_STRING
)

# Setup Data Asset 
mssql_datasource.add_table_asset(
    name=data_asset, table_name=data_asset
)

# Setup Batch Request
batch_request = mssql_datasource.get_asset(data_asset).build_batch_request()

# Setup Validator
expectation_suite_name = "test_expectation"
context.add_or_update_expectation_suite(expectation_suite_name=expectation_suite_name)
validator = context.get_validator(
    batch_request=batch_request,
    expectation_suite_name=expectation_suite_name,
)

print(validator.head())

# Setup Expectations
validator.expect_column_pair_values_to_be_equal('count_r', 'count_l')
validator.save_expectation_suite(discard_failed_expectations=False)

# Setup checkpoint
my_checkpoint_name = "my_sql_checkpoint"

checkpoint = Checkpoint(
    name=my_checkpoint_name,
    run_name_template="%Y%m%d-%H%M%S-test-validation-checkpoint",
    data_context=context,
    batch_request=batch_request,
    expectation_suite_name=expectation_suite_name,
    action_list=[
        {
            "name": "store_validation_result",
            "action": {"class_name": "StoreValidationResultAction"},
        },
        {"name": "update_data_docs", "action": {"class_name": "UpdateDataDocsAction"}},
    ],
)

context.add_or_update_checkpoint(checkpoint=checkpoint)

checkpoint_result = checkpoint.run()

context.open_data_docs()

Thank you!

lsantosessex · December 20, 2023, 6:55pm

Ok, I inserted this configuration in my checkpoint:

runtime_configuration={
    "result_format": {
        "result_format": "COMPLETE",
        "unexpected_index_column_names": ["tb_name_l"],
        "return_unexpected_index_query": True,
        "include_unexpected_rows": True,
    },
},
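For anyone unsure where this fragment goes, here is a minimal sketch of how it might sit alongside the other Checkpoint arguments from the script above. It assumes a GX version in which `Checkpoint` accepts a `runtime_configuration` keyword, as the fragment implies. `build_checkpoint_kwargs` is a hypothetical helper written only for illustration; it is not part of Great Expectations.

```python
def build_checkpoint_kwargs(name, suite_name, index_columns):
    """Assemble keyword arguments for Checkpoint, including result_format.

    Hypothetical helper for illustration; not part of Great Expectations.
    """
    return {
        "name": name,
        "expectation_suite_name": suite_name,
        "action_list": [
            {
                "name": "store_validation_result",
                "action": {"class_name": "StoreValidationResultAction"},
            },
            {
                "name": "update_data_docs",
                "action": {"class_name": "UpdateDataDocsAction"},
            },
        ],
        # Ask for row-level detail, keyed by the given index column(s).
        "runtime_configuration": {
            "result_format": {
                "result_format": "COMPLETE",
                "unexpected_index_column_names": list(index_columns),
                "return_unexpected_index_query": True,
                "include_unexpected_rows": True,
            },
        },
    }


kwargs = build_checkpoint_kwargs(
    "my_sql_checkpoint", "test_expectation", ["tb_name_l"]
)

# With the context and batch_request from the script above, this would become:
# checkpoint = Checkpoint(data_context=context, batch_request=batch_request, **kwargs)
```

Keeping the arguments in a plain dict makes it easy to reuse the same `result_format` across several checkpoints, but passing them directly to `Checkpoint(...)` works just as well.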

And now I can see the full report with the column keys.

So I think the issue is solved. I am posting what I did here in case it is helpful to others in the future.
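The failing keys can also be pulled out programmatically rather than read from Data Docs. The sketch below walks a dictionary shaped like the output of `checkpoint_result.to_json_dict()` and collects the `unexpected_index_list` entries of every failed expectation. The nesting (`run_results`, then `validation_result`, then `results`) is an assumption based on the GX version used above and may differ in other releases. `collect_unexpected_rows` is a hypothetical helper, and the sample data is invented for illustration.

```python
def collect_unexpected_rows(checkpoint_json):
    """Return (expectation_type, index_entry) pairs for failed expectations.

    Assumes the layout run_results -> validation_result -> results -> result.
    """
    failures = []
    for run in checkpoint_json.get("run_results", {}).values():
        for res in run.get("validation_result", {}).get("results", []):
            if res.get("success"):
                continue  # only failed expectations carry rows of interest
            expectation = res.get("expectation_config", {}).get("expectation_type")
            for entry in res.get("result", {}).get("unexpected_index_list") or []:
                failures.append((expectation, entry))
    return failures


# Invented sample; in practice this would be checkpoint_result.to_json_dict().
sample = {
    "run_results": {
        "hypothetical-run-key": {
            "validation_result": {
                "results": [
                    {
                        "success": False,
                        "expectation_config": {
                            "expectation_type": "expect_column_pair_values_to_be_equal"
                        },
                        "result": {
                            "unexpected_index_list": [
                                {"tb_name_l": "orders"},
                                {"tb_name_l": "customers"},
                            ]
                        },
                    },
                    {
                        "success": True,
                        "expectation_config": {
                            "expectation_type": "expect_column_to_exist"
                        },
                        "result": {},
                    },
                ]
            }
        }
    }
}

failures = collect_unexpected_rows(sample)
for expectation, entry in failures:
    print(expectation, entry)
```

The helper reads everything with `.get(...)`, so a result that lacks `unexpected_index_list` (for example, when `result_format` is left at a less detailed level) is skipped instead of raising an error.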
