Lists (1)
Sort Name ascending (A-Z)
Stars
Docker container image for Hadoop HDFS mini cluster, and a testcontainers libray API for using it
Container runtimes on macOS (and Linux) with minimal setup
This projects gives Kotlin bindings and several extensions for Apache Spark. We are looking to have this as a part of Apache Spark 3.x
A workshop to learn about SQL Server 2022
Quickly ingest messy CSV and XLS files. Export to clean pandas, SQL, parquet
Simple python app/script to test SQL connection
GitHub Action for Python Poetry setup and also the caching of dependencies and the Poetry binary.
🛠 Python project template generator with batteries included
scopt / scopt
Forked from jstrachan/scoptcommand line options parsing for Scala
Apache Spark Connector for SQL Server and Azure SQL
A sample project that exists for PyPUG's "Tutorial on Packaging and Distributing Projects"
Data validation library for PySpark 3.0.0
A simple mock implementation of the AWS S3 API startable as Docker image, TestContainer, JUnit 4 rule, JUnit Jupiter extension or TestNG listener
Examples of Databricks Asset Bundles
Simple Python wrapper to create Kerberos ticket-granting tickets (TGT).
Python module to create Kerberos keytabs
OpenPDF is a free Java library for creating and editing PDF files, with a LGPL and MPL open source license. OpenPDF is based on a fork of iText. We welcome contributions from other developers. Plea…
Apache Spark - A unified analytics engine for large-scale data processing
Base classes to use when writing tests with Spark
REST job server for Spark. Note that this is *not* the mainline open source version. For that, go to https://github.com/spark-jobserver/spark-jobserver. This fork now serves as a semi-private repo …
Apache DataFusion Comet Spark Accelerator
TP-Link Smarthome WiFi API