Hey There,
Anyone knows how to properly configure the datasource for json files composed with lines (one json per line)?
Here’s my datasource configuration at the moment:
datasources:
raw:
class_name: PandasDatasource
data_asset_type:
class_name: PandasDataset
module_name: great_expectations.dataset
batch_kwargs_generators:
subdir_reader:
class_name: SubdirReaderBatchKwargsGenerator
base_directory: ../data/raw
module_name: great_expectations.datasource
Using pandas I can read my file with pd.read_json(‘data/raw/file.json’, lines=True). How can I configure a file like this in the datasources?
I tried configuring batch_kwargs_generators
like this with no luck:
batch_kwargs_generators:
subdir_reader:
class_name: SubdirReaderBatchKwargsGenerator
base_directory: ../data/raw
reader_method: read_json
reader_options:
lines: true