Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Schema Scrubbers/Validators - Combine and Enable MapType Recursive Cleansing #86

Open
GeekSheikh opened this issue Mar 26, 2021 · 4 comments
Labels
feature New feature or extension of existing feature (was "enhancement")

Comments

@GeekSheikh
Copy link
Contributor

AND

def scrubSchema(df: DataFrame): DataFrame = {

can be combined to accomplish all schema scrubbing needs (I think).

Additionally, MapTypes should be added to supported validations.

@GeekSheikh GeekSheikh added the feature New feature or extension of existing feature (was "enhancement") label Mar 26, 2021
@GeekSheikh
Copy link
Contributor Author

ArrayType was enabled with 0.5.0

@GeekSheikh
Copy link
Contributor Author

0.6.0.4 allows for the following syntax and much more robust schema scrubbing options. The updates are backwards compatible except the SchemaScrubber has moved from SchemaTools to its own object SchemaScrubber

df.scrubSchema
df.scrubSchema(scrubberOverrideRules)

@GeekSheikh GeekSheikh added this to the 0.6.1.2 milestone May 11, 2022
@GeekSheikh
Copy link
Contributor Author

GeekSheikh commented May 11, 2022

Will integrate this with the updated schema recursion strategy that is being defined. #397

@GeekSheikh
Copy link
Contributor Author

Also closely related to #395

@GeekSheikh GeekSheikh modified the milestones: 0.7.0.1, backlog Oct 5, 2022
@gueniai gueniai removed this from the backlog milestone Sep 26, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
feature New feature or extension of existing feature (was "enhancement")
Projects
None yet
Development

No branches or pull requests

2 participants