ExpectTableRowCountToBeBetween seems to ignore row_condition

mike_f50 · August 5, 2025, 12:57pm

In GX 1.5.7, my ExpectTableRowCountToBeBetween expectations seem to be ignoring their row_conditions. Inspecting the results, the observed_value in each case is the entire table row count, not just the rows satisfying the condition.

Is this expected? expect_table_row_count_to_be_between is not included in the list of limitations for using row conditions.

I have tried setting the condition_parser to both "spark" (which worked perfectly in GX 0.18) and to "great_expectations" (as recommended by the newer documentation), but in both cases the condition is not applied.

Here are some example expectations:

    {
      "type": "expect_table_row_count_to_be_between",
      "kwargs": {
        "min_value": 99,
        "max_value": 101,
        "row_condition": "col(\"Cohort\") == \"MyCohort\"",
        "condition_parser": "great_expectations"
      },
      "meta": {
        "notes": {
          "format": "markdown",
          "content": [
            "Expect row count for **Cohort == MyCohort** to be within **1%** of the row count seen in the most recent successful validation run"
          ]
        }
      }
    },
    {
      "type": "expect_table_row_count_to_be_between",
      "kwargs": {
        "min_value": 198,
        "max_value": 202,
        "row_condition": "col(\"Cohort\") == \"MyCohort2\"",
        "condition_parser": "great_expectations"
      },
      "meta": {
        "notes": {
          "format": "markdown",
          "content": [
            "Expect row count for **Cohort == MyCohort2** to be within **1%** of the row count seen in the most recent successful validation run"
          ]
        }
      }
    },

If my data file has 100 rows for MyCohort and 200 for MyCohort2, both expectations fail with an observed value of 300.
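
For anyone reproducing this, here is a rough sketch of how bounds like the ones above can be derived from a previous row count and turned into expectation configs. The helper names (`row_count_bounds`, `cohort_row_count_expectation`) are hypothetical and not part of GX; the sketch only builds the JSON shown above and does not call the GX API.

```python
import json


def row_count_bounds(previous_count, percent=1):
    """Return (min_value, max_value) within `percent` % of previous_count.

    Hypothetical helper: uses integer arithmetic and rounds the margin down.
    """
    margin = previous_count * percent // 100
    return previous_count - margin, previous_count + margin


def cohort_row_count_expectation(cohort, previous_count, percent=1):
    """Build one expectation config dict for a single cohort (hypothetical helper)."""
    min_value, max_value = row_count_bounds(previous_count, percent)
    return {
        "type": "expect_table_row_count_to_be_between",
        "kwargs": {
            "min_value": min_value,
            "max_value": max_value,
            # Same condition syntax as in the examples above.
            "row_condition": f'col("Cohort") == "{cohort}"',
            "condition_parser": "great_expectations",
        },
    }


# Previous counts taken from the example: 100 rows for MyCohort, 200 for MyCohort2.
expectations = [
    cohort_row_count_expectation("MyCohort", 100),
    cohort_row_count_expectation("MyCohort2", 200),
]
print(json.dumps(expectations, indent=2))
```

With previous counts of 100 and 200, this gives the 99 to 101 and 198 to 202 ranges used above.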

mike_f50 · August 28, 2025, 1:36pm

I have come up with a workaround, which I am sharing here in case anyone else faces the same issue.

ExpectColumnSumToBeBetween works with a row_condition. So, if I add a column of 1s to my data frame before passing it to the checkpoint…

    from pyspark.sql.functions import lit
    spark_data_frame = spark_data_frame.withColumn("RowCount", lit(1))

… then, instead of using ExpectTableRowCountToBeBetween I can use ExpectColumnSumToBeBetween, looking at the RowCount column and with whatever min_value and max_value I expect for the row count.

    {
      "type": "expect_column_sum_to_be_between",
      "kwargs": {
        "column": "RowCount",
        "min_value": 99,
        "max_value": 101,
        "row_condition": "col(\"Cohort\") == \"MyCohort\"",
        "condition_parser": "great_expectations"
      },
      "meta": {
        "notes": {
          "format": "markdown",
          "content": [
            "Expect row count for **Cohort == MyCohort** to be within **1%** of the row count seen in the most recent successful validation run"
          ]
        }
      }
    },
    {
      "type": "expect_column_sum_to_be_between",
      "kwargs": {
        "column": "RowCount",
        "min_value": 198,
        "max_value": 202,
        "row_condition": "col(\"Cohort\") == \"MyCohort2\"",
        "condition_parser": "great_expectations"
      },
      "meta": {
        "notes": {
          "format": "markdown",
          "content": [
            "Expect row count for **Cohort == MyCohort2** to be within **1%** of the row count seen in the most recent successful validation run"
          ]
        }
      },
    },
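
To illustrate why this works, here is a small plain-Python sketch (no Spark or GX involved, so it is only a stand-in for the real data frame): once every row carries a constant 1, summing that column over the rows matching a condition gives the same number as counting those rows. The helper names are made up for illustration.

```python
# Plain-Python stand-in for the data: 100 rows for MyCohort, 200 for MyCohort2.
cohorts = ["MyCohort"] * 100 + ["MyCohort2"] * 200

# Equivalent of withColumn("RowCount", lit(1)): every row gets a constant 1.
rows = [{"Cohort": cohort, "RowCount": 1} for cohort in cohorts]


def conditional_sum(rows, column, cohort):
    """Sum `column` over the rows whose Cohort matches (hypothetical helper)."""
    return sum(row[column] for row in rows if row["Cohort"] == cohort)


def conditional_count(rows, cohort):
    """Count the rows whose Cohort matches (hypothetical helper)."""
    return len([row for row in rows if row["Cohort"] == cohort])


for cohort in ("MyCohort", "MyCohort2"):
    # The conditional sum of the 1s column equals the conditional row count.
    print(cohort, conditional_sum(rows, "RowCount", cohort), conditional_count(rows, cohort))
```

The unconditional sum of `RowCount` is still the full table size, which is the value the original expectation was reporting.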

Nathan · October 22, 2025, 6:01pm

Hi @mike_f50, we fixed the bug with ExpectTableRowCountToBeBetween here. The fix should go out with this week’s release.

mike_f50 · October 31, 2025, 11:59am

Nice! I’ll give it a try.
