-
Studio Management LLC (Hedge Fund)
- New Delhi, India
- in/siddhant-chadha-838136142
- https://medium.com/@siddhantchadha2121
Highlights
Stars
A lightweight timer/cron agents framework for Java applications
Feign makes writing java http clients easier
Always know what to expect from your data.
Open-Source Web UI for Apache Kafka Management
Java implementation of an Envoy gRPC control plane
Platform for building AI that can learn and answer questions over federated data.
Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.
Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.
Apache Superset is a Data Visualization and Data Exploration Platform
Rapid ETL/ELT-connectors/pipeline development leveraged on top of Apache Spark
A pattern-based approach for learning technical interview questions
Apache Spark - A unified analytics engine for large-scale data processing
DataSphereStudio is a one stop data application development& management portal, covering scenarios including data exchange, desensitization/cleansing, analysis/mining, quality measurement, visualiz…
Powerful event-bus optimized for high throughput in multi-threaded applications. Features: Sync and Async event publication, weak/strong references, event filtering, annotation driven
A distributed data integration framework that simplifies common aspects of big data integration such as data ingestion, replication, organization and lifecycle management for both streaming and bat…
Generic Data Ingestion & Dispersal Library for Hadoop
Dr. Elephant is a job and flow-level performance monitoring and tuning tool for Apache Hadoop and Apache Spark
REST job server for Apache Spark
A tool for secrets management, encryption as a service, and privileged access management
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
JVM Profiler Sending Metrics to Kafka, Console Output or Custom Reporter
Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.
former home of the groovy programming language, moved to https://github.com/apache/groovy
An extensible distributed system for reliable nearline data streaming at scale
A Directed Acyclic Graph task dependency scheduler designed to simplify complex distributed pipelines
Open source platform for X.509 certificate based service authentication and fine grained access control in dynamic infrastructures. Athenz supports provisioning and configuration (centralized autho…