Stars
A blazing-fast query execution engine that speaks the Apache Spark language and has Arrow-DataFusion at its core.
openpilot is an operating system for robotics. Currently, it upgrades the driver assistance system on 275+ supported cars.
A small set of Python functions to draw pretty maps from OpenStreetMap data. Based on osmnx, matplotlib and shapely libraries.
Typer, build great CLIs. Easy to code. Based on Python type hints.
Prettify Python exception output to make it legible.
Scalene: a high-performance, high-precision CPU, GPU, and memory profiler for Python with AI-powered optimization proposals
Hummingbird compiles trained ML models into tensor computation for faster inference.
HiPlot makes understanding high-dimensional data easy.
Datasets of daily time-series data related to COVID-19 for over 20,000 distinct locations around the world.
We are building an open database of COVID-19 cases with chest X-ray or CT images.
Classification of lung cancer slide images using deep learning.
Terraform Best Practices - workshop materials
Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more
A game theoretic approach to explain the output of any machine learning model.
TensorFlow code and pre-trained models for BERT
Unsupervised text tokenizer for Neural Network-based text generation.
PyTorch Tutorial for Deep Learning Researchers
An Open-Source Package for Neural Relation Extraction (NRE)
Python package for dealing with whole slide images (.svs) for machine learning, particularly for fast prototyping. Includes patch sampling and storing using OpenSlide. Patches may be stored in LMDB…
Definition and DDLs for the OMOP Common Data Model (CDM)
A corpus of Biomedical papers annotated with mentions of UMLS entities.
extract text from any document. no muss. no fuss.
pdfrw is a pure Python library that reads and writes PDFs
MachineLearningSamples-BiomedicalEntityExtraction
Web application for distributing and browsing the Standardized Vocabularies for all instances of an OMOP CDM
Deep Learning Pipelines for Apache Spark
A Python script that allows the creation of a customized version of the Data Dictionary from Standards for Cancer Registries, Volume II: Data Standards and Data Dictionary.