Skip to content
View fuenos01's full-sized avatar

Block or report fuenos01

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

MapReduce performance testing using teragen and terasort

Shell 18 40 Updated Aug 26, 2021

Terraform module to create Amazon Elastic Kubernetes (EKS) resources 🇺🇦

HCL 4,535 4,137 Updated Jan 22, 2025

Terragrunt is a flexible orchestration tool that allows Infrastructure as Code written in OpenTofu/Terraform to scale.

Go 8,377 1,003 Updated Jan 30, 2025

Terraform enables you to safely and predictably create, change, and improve infrastructure. It is a source-available tool that codifies APIs into declarative configuration files that can be shared …

Go 43,992 9,644 Updated Jan 25, 2025

a pyenv plugin to manage virtualenv (a.k.a. python-virtualenv)

Shell 6,448 410 Updated Jan 1, 2025

Simple Python version management

Roff 40,477 3,094 Updated Jan 19, 2025

Docker Apache Airflow

Shell 3,792 544 Updated Mar 1, 2023

Includes notes on using Apache Spark in general, notes on using Spark for Physics, how to run TPCDS on PySpark, how to create histograms with Spark, tools for performance testing CPUs, Jupyter note…

Jupyter Notebook 434 151 Updated Jan 27, 2025

🍺 The missing package manager for macOS (or Linux)

Ruby 42,267 9,940 Updated Jan 30, 2025

Adaptable, fast automation for all

Groovy 17,220 4,824 Updated Jan 30, 2025
Python 39 4 Updated Mar 4, 2019

A command-line tool for launching Apache Spark clusters.

Python 638 117 Updated Dec 13, 2024

Scripts used to setup a Spark cluster on EC2

Python 394 299 Updated Nov 22, 2017

Upserts, Deletes And Incremental Processing on Big Data.

Java 5,624 2,452 Updated Jan 29, 2025

Azkaban workflow manager.

Java 4,486 1,582 Updated Jul 3, 2024

All the things about TPC-DS in Apache Spark

Scala 105 40 Updated Jun 15, 2023

Use the TPC-DS benchmark to test Spark SQL performance

TSQL 176 95 Updated Apr 27, 2020

Essential Spark extensions and helper methods ✨😲

Scala 755 152 Updated Oct 24, 2024

Spark style guide

Jupyter Notebook 257 47 Updated Sep 30, 2024

A curated list of awesome Apache Spark packages and resources.

Shell 1,748 333 Updated Oct 24, 2024

A series of DAGs/Workflows to help maintain the operation of Airflow

Python 1,697 399 Updated Jun 18, 2024

Apache Airflow - A platform to programmatically author, schedule, and monitor workflows

Python 38,487 14,565 Updated Jan 30, 2025

PowerMock is a Java framework that allows you to unit test code normally regarded as untestable.

Java 4,178 586 Updated Jan 3, 2024

JVM Profiler Sending Metrics to Kafka, Console Output or Custom Reporter

Java 1,785 341 Updated Jan 29, 2024

Super S3 command line tool

Python 1,380 211 Updated Jul 21, 2024

Official s3cmd repo -- Command line tool for managing S3 compatible storage services (including Amazon S3 and CloudFront).

Python 4,636 906 Updated Jan 27, 2025

An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs

Scala 7,781 1,754 Updated Jan 29, 2025

Spark: The Definitive Guide's Code Repository

Scala 2,915 2,796 Updated Aug 26, 2020

command line options parsing for Scala

Scala 1,434 163 Updated Apr 12, 2024
Next