How to configure a Pandas/S3 Datasource

kyle · May 27, 2020, 8:36pm

This article is for comments to: https://docs.greatexpectations.io/en/latest/how_to_guides/configuring_datasources/how_to_configure_a_pandas_s3_datasource.html

Please comment +1 if this How to is important to you.

krisp · May 28, 2020, 1:04pm

+1
S3 datasource, not necessarily pandas

JH2012 · November 20, 2020, 6:08am

+1
Hi, the data asset is not picking up all files (only one is retrieved) that satisfy the regex rule in the target folder when creating the scaffold and checkpoint. is this expected?

leerssej · February 24, 2021, 12:28am

Where may I add the reader_options: sep: "|" entries?
Essentially, I wish to specify that my .csv are pipe delimited, but I am a little confused about the context in which the reader_options are found or should be placed into.

from here: How to configure a Pandas/S3 Datasource — great_expectations documentation

I attempted to drop them into the great_expectations.yml file but that just threw a great_expectations.exceptions.exceptions.InvalidDataContextConfigError: Error while processing DataContextConfig: reader_options error.

Which file do I add these arguments/specifications into or maybe could we add an example of the complete file showing them tucked into their proper places?

Topic		Replies	Views
How to configure a Pandas/filesystem Datasource Archive how-to , help-wanted	5	719	August 26, 2021
How to configure a PySpark datasource for accessing the data from AWS S3? Archive	1	1433	March 28, 2020
Creating Data source for s3 with pandas Archive how-to , help-wanted , s3	1	639	June 14, 2021
Configure datasource for json files Archive	3	1032	December 12, 2020
S3 Data Source Configures Successfully But Suite New Fails with New S3 Datasource Archive s3	2	568	May 3, 2021

How to configure a Pandas/S3 Datasource

Related topics