We are using AI (Machine Learning and Deep Learning) to discover cases that we normally:
either did not know about,
or would otherwise require numerous business rules to detect.
Based on our experience, sampling and manually verifying the predictions of AI models is a long round trip. The question is: how can the verification (testing) of such predictions be automated, given the two situations above?
Great question! First of all: yes, Great Expectations is absolutely a great fit for this use case. It's actually one of the original use cases that inspired my work on it.
There's a lot in your question that deserves a deeper discussion about the practice of monitoring and measuring the reliability of an ML system, but a concrete way to frame the action here is to view the ML model itself as a node in a DAG: one that consumes input data (the features used for modeling) and produces output data (the predictions).
In that simple model, Great Expectations is useful on both the input side and the output side. On the input side, you're likely to have expectations about (see the sketch after this list):
the structure of the data, to ensure the preprocessing is working as intended, and
the distribution of the data, to ensure you're seeing data similar to what you trained on.
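For concreteness, here is a minimal sketch of input-side checks, assuming the classic pandas-flavored Great Expectations API (`ge.from_pandas`); newer GE releases organize this around Data Contexts and Validators, but the expectations are the same idea. The column names, values, and thresholds below are hypothetical, not from your pipeline.

```python
import pandas as pd
import great_expectations as ge

# Stand-in for the real feature batch produced by your preprocessing step.
features = pd.DataFrame({
    "customer_age": [34, 51, 27, 43, 66],
    "transaction_amount": [12.5, 80.0, 45.2, 33.1, 19.9],
})
batch = ge.from_pandas(features)

# Structural checks: is preprocessing producing what the model expects?
batch.expect_column_to_exist("customer_age")
batch.expect_column_values_to_not_be_null("transaction_amount")

# Distributional checks: does this batch look like the data you trained on?
batch.expect_column_values_to_be_between("customer_age", min_value=18, max_value=100)
batch.expect_column_mean_to_be_between("transaction_amount", min_value=10.0, max_value=100.0)

results = batch.validate()
print(results["success"])  # overall pass/fail; exact result shape varies by GE version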
On the output side, you can essentially think of your expectations as falling into the same two categories, though of course the expectations themselves are likely to be different. A sketch of output-side checks follows.
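Here is a similar sketch for the output side, under the same assumptions as above. In practice the `prediction` and `score` columns would come from your model's predict / predict_proba calls; the values and thresholds here are hypothetical stand-ins.

```python
import pandas as pd
import great_expectations as ge

# Stand-in for real model output on a scoring batch.
predictions = pd.DataFrame({
    "prediction": [0, 1, 0, 0, 1],
    "score": [0.03, 0.91, 0.12, 0.07, 0.88],
})
batch = ge.from_pandas(predictions)

# Structural checks on the model output.
batch.expect_column_values_to_not_be_null("prediction")
batch.expect_column_values_to_be_in_set("prediction", [0, 1])

# Distributional checks: e.g. keep the positive rate near what you saw in
# training/backtesting; a breach is a signal to sample more heavily for review.
batch.expect_column_mean_to_be_between("prediction", min_value=0.0, max_value=0.5)
batch.expect_column_values_to_be_between("score", min_value=0.0, max_value=1.0)

results = batch.validate()
print(results["success"])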
There are alternatives to GE, but I think of them as falling at two extremes, with GE sitting in the middle:
Process-oriented quality checks, such as sampling predictions and routing them to human reviewers. I believe, by the way, that such an approach is absolutely essential, but it's best done in a way where the level of review effort is informed by GE.
Anomaly detection checks, where you ask a system to flag a change in distributions, say, but without the benefit of your knowledge as the system's designer about what it should see (a minimal contrast is sketched below).
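For contrast with the GE approach, here is a minimal sketch of that purely statistical extreme: a two-sample Kolmogorov-Smirnov test (via SciPy) that flags a change in distribution without encoding any of the designer's knowledge about what the values should be. The data and the significance threshold are illustrative.

```python
import numpy as np
from scipy.stats import ks_2samp

rng = np.random.default_rng(0)
training_scores = rng.normal(loc=0.05, scale=0.02, size=1_000)    # reference (training-time) scores
production_scores = rng.normal(loc=0.09, scale=0.02, size=1_000)  # a drifted production batch

# Two-sample KS test: small p-value suggests the two samples come from
# different distributions, i.e. a potential drift to investigate.
stat, p_value = ks_2samp(training_scores, production_scores)
if p_value < 0.01:
    print(f"Distribution shift detected (KS statistic={stat:.3f}, p={p_value:.2e})")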