data-engineer-handbook Public
Forked from DataExpert-io/data-engineer-handbook
This is a repo with links to everything you'd ever want to learn about data engineering
Jupyter Notebook Updated Dec 1, 2024
kpi-dashboard-plotly-dash Public
Forked from Mubeen31/kpi-sales-dashboard-in-python-by-plotly-dash
Python Updated Aug 25, 2022
awesome Public
Forked from sindresorhus/awesome
😎 Awesome lists about all kinds of interesting topics
Creative Commons Zero v1.0 Universal Updated May 13, 2022
cards-pytest Public
Forked from okken/cards
Project task tracking / todo list
Python MIT License Updated May 9, 2022
100DaysOfCode Public
Forked from pybites/100DaysOfCode
PyBites #100DaysOfCode
Jupyter Notebook Updated Feb 22, 2022
zero-administration-inference-with-aws-lambda-for-hugging-face Public
Forked from aws-samples/zero-administration-inference-with-aws-lambda-for-hugging-face
spacy-ner-aws-lambda 🤗
Python Other Updated Feb 21, 2022
medium-search-app Public
Forked from chiachong/medium-search-app
A simple search engine to search medium stories built with streamlit and elasticsearch.
Python Updated Dec 3, 2021
cs-video-courses Public
Forked from Developer-Y/cs-video-courses
List of Computer Science courses with video lectures.
Updated Sep 10, 2021
text_similarity Public
Forked from cr1m5onk1ng/text_similarity
An NLP library for text similarity based on Transformer models
Python Apache License 2.0 Updated Aug 3, 2021
aws-toolbox Public
Forked from dannysteenman/aws-toolbox
A collection of DevOps tools including shell & python scripts that automate the boring stuff in AWS.
Shell MIT License Updated Jun 2, 2021
DataProfiler Public
Forked from capitalone/DataProfiler
What's in your data? Extract schema, statistics and entities from datasets
Python Apache License 2.0 Updated Apr 23, 2021
entity_resolution Public
Forked from yifeihuang/entity_resolution
Example entity resolution workflow using PySpark
Python MIT License Updated Feb 20, 2021
amundsenfrontendlibrary Public
Forked from amundsen-io/amundsenfrontendlibrary
Front-end service library for Amundsen
TypeScript Apache License 2.0 Updated Dec 10, 2020
incubator-superset Public
Forked from apache/superset
Apache Superset is a Data Visualization and Data Exploration Platform
Python Apache License 2.0 Updated Dec 10, 2020
python-deequ Public
Forked from awslabs/python-deequ
Python API for Deequ
Jupyter Notebook Apache License 2.0 Updated Dec 1, 2020
multi-data-lineage-capture-py Public
Forked from IBM/multi-data-lineage-capture-py
IBM Multi-Lineage Data System
Python Apache License 2.0 Updated Nov 11, 2020
mobydq Public
Forked from ubisoft/mobydq
🐳 Tool to automate data quality checks on data pipelines
Vue Apache License 2.0 Updated Jun 18, 2020
Edator Public
Forked from kianweelee/Edator
A Python package that performs exploratory data analysis for users. Additionally, it generates 3 output files that comprise a cleaned CSV, plots and a text report.
Python MIT License Updated May 17, 2020
marquez-python Public
Forked from MarquezProject/marquez-python
Python client for Marquez
Python Apache License 2.0 Updated May 6, 2020
datacatalog-tag-manager Public
Forked from ricardolsmendes/datacatalog-tag-manager
Python package to manage Google Cloud Data Catalog tags, loading metadata from external sources -- currently supports the CSV file format
Python MIT License Updated May 1, 2020
BERT-for-RRC-ABSA Public
Forked from howardhsu/BERT-for-RRC-ABSA
code for our NAACL 2019 paper: "BERT Post-Training for Review Reading Comprehension and Aspect-based Sentiment Analysis"
Python Apache License 2.0 Updated Apr 27, 2020
marquez-airflow Public
Forked from MarquezProject/marquez-airflow
Airflow support for Marquez
Python Apache License 2.0 Updated Apr 24, 2020