

Typesafe wrapper for Apache Spark DataFrame API

Scala 140 9 Updated Oct 23, 2024

The dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-hosted or cloud service with premium features.

HTML 1,988 175 Updated Feb 6, 2025

A topic-centric list of HQ open datasets.

61,905 10,014 Updated Nov 13, 2024

Records actions made in the AWS Management Console and outputs the equivalent CLI/SDK commands and CloudFormation/Terraform templates.

CSS 1,427 89 Updated Jan 24, 2021

CLI tool which enables you to log in and retrieve AWS temporary credentials using a SAML IDP

Go 2,102 567 Updated Jan 27, 2025

🔥 Simple AWS authentication CLI with support for MFA, secure credential storage and easy IAM role switching.

JavaScript 174 5 Updated Apr 9, 2024

State of the Art Natural Language Processing

Scala 3,911 719 Updated Feb 6, 2025

A giter8 template for Spark SBT projects

Scala 72 23 Updated Mar 20, 2021

Collection of open-source Spark tools & frameworks that have made the data engineering and data science teams at Swoop highly productive

Scala 186 34 Updated Feb 12, 2023

A team lead is a ❄️, because in every company they are unique and one of a kind.

Vue 5,069 496 Updated Dec 8, 2024

Spark style guide

Jupyter Notebook 257 47 Updated Sep 30, 2024

Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.

Scala 3,358 545 Updated Feb 5, 2025

A command-line tool for launching Apache Spark clusters.

Python 639 117 Updated Dec 13, 2024

Tekumara build of Apache PySpark with Hadoop 3.x and cloud jars for S3 access

Scala 3 Updated Feb 22, 2021

A Database Change Management tool for Snowflake

Python 538 234 Updated Jan 27, 2025

A collection of 3 lambda functions that are invoked by Amazon S3 or Amazon API Gateway to analyze uploaded images with Amazon Rekognition and save picture labels to ElasticSearch (written in Kotlin)

Kotlin 387 105 Updated Apr 17, 2020

Pure Go library for reading and writing Parquet files

Go 1,303 298 Updated Aug 17, 2024

Always know what to expect from your data.

Python 10,169 1,553 Updated Feb 6, 2025

Sampling CPU and HEAP profiler for Java featuring AsyncGetCallTrace + perf_events

C++ 7,917 878 Updated Feb 6, 2025

The easy-to-use open source Business Intelligence and Embedded Analytics tool that lets everyone work with data 📊

Clojure 40,392 5,328 Updated Feb 6, 2025

This is the development repository for sparkMeasure, a tool and library designed for efficient analysis and troubleshooting of Apache Spark jobs. It focuses on easing the collection and examination…

Scala 725 146 Updated Jan 22, 2025

Karabiner-Elements is a powerful tool for customizing keyboards on macOS

C++ 19,373 851 Updated Feb 2, 2025

Parquet file generator

Scala 22 7 Updated Apr 17, 2018

Qubole Sparklens tool for performance tuning Apache Spark

Scala 569 138 Updated Jun 26, 2024

Make Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data.

Python 26,851 4,418 Updated Feb 5, 2025

Project SnappyData - memory optimized analytics database, based on Apache Spark™ and Apache Geode™. Stream, Transact, Analyze, Predict in one cluster

Scala 1,038 203 Updated Nov 21, 2022

REST job server for Apache Spark

Scala 2,836 994 Updated Jan 4, 2025

Apache Spark - A unified analytics engine for large-scale data processing

Scala 40,474 28,478 Updated Feb 6, 2025

The Scala 3 compiler, also known as Dotty.

Scala 5,944 1,078 Updated Feb 6, 2025