Lists (2)
Sort Name ascending (A-Z)
Stars
Free Data Engineering course!
An open-source data logging library for machine learning models and data pipelines. 📚 Provides visibility into data quality & model performance over time. 🛡️ Supports privacy-preserving data collec…
This is a guide to PySpark code style presenting common situations and the associated best practices based on the most frequent recurring topics across the PySpark repos we've encountered.
Automatically jump-cut silent parts of your videos using Python
The minimal amount of CSS to replicate the GitHub Markdown style
OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team co…
Realtime Web Apps and Dashboards for Python and R
📝 create a webpage with just markdown
Platform to build admin panels, internal tools, and dashboards. Integrates with 25+ databases and any API.
Quadratic | Spreadsheet with Python, SQL, and AI
Fast, accurate and scalable probabilistic data linkage with support for multiple SQL backends
"crudtable" is an R package that provides an easy tabular data input user interface in Shiny web applications. With crudtable, all the user CRUD operations on dataset (Create, Read, Update, Delete)…
Setting up a new project directory in the terminal or R
A curated list of static web site generators.
Curated list of awesome open source healthcare software, libraries, tools and resources.
Machine Learning From Scratch. Bare bones NumPy implementations of machine learning models and algorithms with a focus on accessibility. Aims to cover everything from linear regression to deep lear…
An open source email campaign management tool for nonprofits
The MSR FastRDFStore Package is designed for creating an in-memory index of RDF triples, implemented as a WCF service in C#, and consists of server & client side code.
An expressive visual storytelling environment for presenting timelines on the web and in Power BI. Developed at Microsoft Research.
Knack - A Python command line interface framework
A host process for R that provides access and extensibility to it remotely over WebSocket and JSON.
Developer samples for Microsoft HealthVault
Advanced analytics samples and templates using SQL Server R Services
Data science and AI solution accelerator suite that provides templates for prototyping, reporting, and presenting data science analytics of specific domains