Validation with Checkpoint VS Batch

From the tutorials, we can either validate with a checkpoint.run() or directly validate on a batch: batch.validate(expectation).

Any suggestions of when to use which, or it really doesn’t matter? it is a bit confusing to beginners which way to go

The recommendation I’ve heard is to use batches when developing the validation and then use Checkpoints when pushing the data validation to production to easily validate data repeatedly.

Also, Checkpoints can be used to perform additional tasks such as building the Data Docs or sending out failure notifications after the data has been validated.

1 Like