ModuleNotFoundError: No module named 'great_expectations.dataset'

Hi everyone,

I’m encountering a ‘ModuleNotFoundError: No module named ‘great_expectations.dataset’’ error when attempting to import the dataset module in Great Expectations. This issue has arisen recently, as the code was functioning correctly for the past 6-7 months.

Has there been a recent update to Great Expectations that might have caused this incompatibility? I’m curious if anyone else has encountered a similar problem

Current version: 1.0.0

Import code:

from great_expectations.dataset.sparkdf_dataset import SparkDFDataset

Dataset has been deprecated for many versions and was fully removed in 1.0 - please pin your version if you rely on this behavior
Note: @Chetan Kini originally posted this reply in Slack. It might not have transferred perfectly.

1 Like

oh, is there any alternative option that I can consider instead? I know I could try implementing this using Pandas, but given the large dataset, that might not be the best option.
Note: @ originally posted this reply in Slack. It might not have transferred perfectly.

the Dataset API hasn’t been supported in quite some time so I think a migration would be necessary if you wanted to stick with a newer, more supported version. I’d take a look at our docs and see what that might mean for your workflow: https://docs.greatexpectations.io/docs/core/introduction/
Note: @Chetan Kini originally posted this reply in Slack. It might not have transferred perfectly.

sure, thanks <@U04BDU6292A>
Note: @ originally posted this reply in Slack. It might not have transferred perfectly.