Skip to content

A curated list of awesome blogs, videos, tools and resources about Data Contracts

License

Notifications You must be signed in to change notification settings

nagask/awesome-data-contracts

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

11 Commits
 
 
 
 
 
 

Repository files navigation

Awesome Data Contracts

Data contracts is an amazing initiative to bring data producers and data consumers together. This is a curated list of awesome articles, tools, resources that will help you understand concepts and start your data contracts journey.

Articles and Blogs

  1. Improving data quality with data contracts (GoCardless) What are data contracts and GoCardless implementation of data contracts
  2. Interfaces and Breaking Stuff by Tristan Handy (CEO, DBT) Need of data contracts as data interfaces and analogy with software APIs
  3. Implementing data contracts at GoCardless Data contract specification examples from GoCardless systems
  4. 7 key learnings from implementing data contracts (Bar Moses and Andrew Jones) Sample data contract example and learnings from implementations
  5. A PM's thoughts on data contracts A product manager’s view on data contracts and challenges they can solve
  6. Engineer’s guide to data contracts part I - entities covers implementing data contracts for entities
  7. Engineer’s guide to data contracts part II - semantics cover enforcing and monitoring data contracts using transactional outbox pattern
  8. Data Contracts: The Mesh Glue Scope of data contract and how it can be a layer with data mesh architectures
  9. Yet another post on data contracts - part I Thoughts from David J. on what is data contracts and what it should contain
  10. Yet another post on data contracts - part II What kind of data should be covered by data contracts and some highlights of tooling that’s available
  11. Yet another post on data contracts - part III Deeper dive on current tooling that can be used for data contracts implementation
  12. Are we setting data contracts in the right place? Considerations and analogies from software architecture world while defining data contracts
  13. Implementing data contracts Specifications and real world architecture example of data contract implementation
  14. The best data contract is the pull request Datafold’s view on how the best data contract is pull request
  15. An open letter to data ninjas - yes, you need to implement data contracts View on implementations and specifications of data contracts
  16. Data person: attorney at law Interesting view on data contracts which turn data engineers into data lawyers by Stephen Bailey
  17. Demystifying event streams - New event streams model at Merit and event specifications being treated as data contracts - named OMG contract
  18. Fine let’s talk about data contracts - Different view on viability and usability of data contracts
  19. Brain, kidney, cancer research and data contracts (Schemata project) - Example of data contract explanation / discussion in simple terms for people outside of data space

Videos

  1. Inventing data contracts and building data for a FinTech unicorn with Andrew Jones of GoCardless
  2. Data contracts and domain ownership
  3. How data contracts can kill innovation
  4. Getting jiggy with jsonschema: The power of contracts for building data systems
  5. Data Contracts: Accountable Data Quality w/ Chad Sanderson - UDEM October 2022
  6. Data Contract Battle Royale w/ Chad Sanderson vs Ethan Aaron

Tools

  1. Avo.app - Avo is a data governance platform that error-proofs your analytics events and speeds up implementation.
  2. Reflekt - Define tracking plans as code for analytics
  3. Iteratively - Bring your data teams, product managers and engineers together to define, instrument, verify and collaborate on analytics tracking
  4. Buz - Buz collects, validates, and delivers schematized data to where it needs to bee
  5. Benthos - Benthos is a high performance and resilient stream processor, able to connect various sources and sinks in a range of brokering patterns and perform hydration, enrichments, transformations and filters on payloads.
  6. Schemata - Schemata is a schema modeling framework for decentralized domain-driven ownership of data. Schemata combines a set of standard metadata definitions for each schema & data field and a scoring algorithm to provide a feedback loop on how efficient the data modeling of your data warehouse is.

Related

  1. Establish formal data contracts between consumers and producers Older article from 2018 that explored the concept of data contracts (Older generation article)
  2. Data Mesh — A Data Movement and Processing Platform @ Netflix - Schema enforcement in Netflix’s data mesh platform

Contributing

Your contributions are most welcome! Please go ahead and create a pull request. We'll review each and every one. We will keep some pull requests open if we not sure whether the PRs are awesome or relevant, you could vote for them by adding 👍 to them. Pull requests will be merged when their votes reach 5.

About

A curated list of awesome blogs, videos, tools and resources about Data Contracts

Resources

License

Code of conduct

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published