Skip to content
View ZenvilleErasmus's full-sized avatar
🚀
🚀

Block or report ZenvilleErasmus

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Databricks SDK for Java

Java 35 24 Updated Jan 7, 2025

Bash command line framework and CLI generator

Ruby 2,150 90 Updated Dec 30, 2024

Docker container image for Hadoop HDFS mini cluster, and a testcontainers libray API for using it

Java 4 1 Updated Jan 17, 2021

Container runtimes on macOS (and Linux) with minimal setup

Go 20,277 407 Updated Jan 7, 2025

This projects gives Kotlin bindings and several extensions for Apache Spark. We are looking to have this as a part of Apache Spark 3.x

Kotlin 465 35 Updated Jun 20, 2024

A workshop to learn about SQL Server 2022

Jupyter Notebook 204 81 Updated Sep 9, 2023

Quickly ingest messy CSV and XLS files. Export to clean pandas, SQL, parquet

Jupyter Notebook 195 46 Updated Jun 9, 2023

Simple python app/script to test SQL connection

Python 2 1 Updated Jan 12, 2022

GitHub Action for Python Poetry setup and also the caching of dependencies and the Poetry binary.

15 6 Updated Feb 29, 2024

🛠 Python project template generator with batteries included

Python 2,137 185 Updated Dec 23, 2024

command line options parsing for Scala

Scala 1,433 163 Updated Apr 12, 2024

Apache Spark Connector for SQL Server and Azure SQL

Scala 279 121 Updated Jul 26, 2024

A sample project that exists for PyPUG's "Tutorial on Packaging and Distributing Projects"

Python 5,156 1,731 Updated Nov 6, 2024

Databricks SDK for Go

Go 54 42 Updated Jan 7, 2025

Data validation library for PySpark 3.0.0

Python 34 5 Updated Nov 11, 2022

Python API for Deequ

Jupyter Notebook 736 139 Updated Oct 15, 2024

A simple mock implementation of the AWS S3 API startable as Docker image, TestContainer, JUnit 4 rule, JUnit Jupiter extension or TestNG listener

Kotlin 853 183 Updated Jan 7, 2025

Access other storage backends via the S3 API

Java 1,823 233 Updated Jan 3, 2025

Examples of Databricks Asset Bundles

Python 115 39 Updated Dec 30, 2024

Simple Python wrapper to create Kerberos ticket-granting tickets (TGT).

Python 6 5 Updated Jul 10, 2024

Python module to create Kerberos keytabs

Python 15 8 Updated Jul 12, 2023

OpenPDF is a free Java library for creating and editing PDF files, with a LGPL and MPL open source license. OpenPDF is based on a fork of iText. We welcome contributions from other developers. Plea…

Java 3,668 603 Updated Nov 27, 2024

Apache Spark - A unified analytics engine for large-scale data processing

Scala 40,298 28,434 Updated Jan 7, 2025

Base classes to use when writing tests with Spark

Scala 1,526 355 Updated Nov 3, 2024

REST job server for Spark. Note that this is *not* the mainline open source version. For that, go to https://github.com/spark-jobserver/spark-jobserver. This fork now serves as a semi-private repo …

Scala 344 134 Updated May 19, 2017

Apache DataFusion Comet Spark Accelerator

Rust 864 169 Updated Jan 7, 2025

Example code for doing DataOps

Python 47 28 Updated Jan 26, 2021

Maven plugin for Scalastyle

Java 23 38 Updated Apr 11, 2023

Scalafmt hook for pre-commit

Shell 13 6 Updated Apr 20, 2021

TP-Link Smarthome WiFi API

TypeScript 1,030 142 Updated Nov 15, 2023
Next