Validation with Checkpoint VS Batch

lauraz · October 8, 2024, 11:29pm

From the tutorials, we can either validate with a checkpoint.run() or directly validate on a batch: batch.validate(expectation).

Any suggestions of when to use which, or it really doesn’t matter? it is a bit confusing to beginners which way to go

ToivoMattila · October 9, 2024, 10:57am

The recommendation I’ve heard is to use batches when developing the validation and then use Checkpoints when pushing the data validation to production to easily validate data repeatedly.

Also, Checkpoints can be used to perform additional tasks such as building the Data Docs or sending out failure notifications after the data has been validated.

Topic		Replies	Views
Is a checkpoint with varied batch parameters possible? GX Core Support help-wanted	2	66	September 27, 2024
Issue with checkpoint.run() results when we have multiple validation definitions GX Core Support help-wanted , s3	1	91	April 1, 2025
Multi-batch vs single-batch checkpoints Archive	1	605	March 22, 2021
Bug in great expectation's checkpoint.py GX Core Support	0	39	July 10, 2024
Validate different dataframes with respective expectation suites using checkpoint GX Core Support how-to , databricks	4	273	October 10, 2024

Validation with Checkpoint VS Batch

Related topics