- Beijing
-
starrocks Public
Forked from StarRocks/starrocks
StarRocks is a next-gen sub-second MPP database for full analytics scenarios, including multi-dimensional analytics, real-time analytics and ad-hoc query.
-
grpc-java Public
Forked from grpc/grpc-java
The Java gRPC implementation. HTTP/2 based RPC
Java Apache License 2.0 Updated Oct 18, 2024 -
trino Public
Forked from trinodb/trino
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
Java Apache License 2.0 Updated Mar 12, 2024 -
starrocks-connector-for-apache-flink Public
Forked from StarRocks/starrocks-connector-for-apache-flink
Java Apache License 2.0 Updated Mar 6, 2024 -
doris Public
Forked from apache/doris
Apache Doris is an easy-to-use, high performance and unified analytics database.
Java Apache License 2.0 Updated Jul 9, 2023 -
ByConity Public
Forked from ByConity/ByConity
ByConity is an open source cloud-native data warehouse
C++ Apache License 2.0 Updated Jun 5, 2023 -
openai-cookbook Public
Forked from openai/openai-cookbook
Examples and guides for using the OpenAI API
Jupyter Notebook MIT License Updated May 5, 2023 -
fe-plugins-auditloader Public
Forked from WuMenglong/fe-plugins-auditloader
Java Updated Dec 21, 2022 -
dolphinscheduler Public
Forked from apache/dolphinscheduler
Apache DolphinScheduler is a distributed and extensible workflow scheduler platform with powerful DAG visual interfaces, dedicated to solving complex job dependencies in the data pipeline and provi…
Java Apache License 2.0 Updated Sep 29, 2022 -
delta-sharing Public
Forked from delta-io/delta-sharing
An open protocol for secure data sharing
Scala Apache License 2.0 Updated Apr 25, 2022 -
delta Public
Forked from delta-io/delta
An open-source storage layer that brings scalable, ACID transactions to Apache Spark™ and big data workloads.
Scala Apache License 2.0 Updated Jan 4, 2022 -
spark Public
Forked from apache/spark
Apache Spark - A unified analytics engine for large-scale data processing
Scala Apache License 2.0 Updated Dec 7, 2021 -
cloudera-playbook Public
Forked from cloudera/cloudera-playbook
Cloudera deployment automation with Ansible
HTML Apache License 2.0 Updated Apr 8, 2021 -
SparkInternals Public
Forked from JerryLead/SparkInternals
Notes talking about the design and implementation of Apache Spark
Updated Dec 4, 2020 -
LearningSparkV2 Public
Forked from databricks/LearningSparkV2
This is the GitHub repo for Learning Spark: Lightning-Fast Data Analytics [2nd Edition]
Scala Apache License 2.0 Updated Nov 9, 2020 -
facebook-hive-udfs Public
Forked from brndnmtthws/facebook-hive-udfs
Facebook's Hive UDFs
Java Apache License 2.0 Updated Aug 15, 2020 -
kafka Public
Forked from apache/kafka
Mirror of Apache Kafka
Java Apache License 2.0 Updated Jul 9, 2020 -
nifi Public
Forked from apache/nifi
Mirror of Apache NiFi
Java Apache License 2.0 Updated Oct 29, 2019 -
flink-parquet-demo Public
A simple demo that writes files to HDFS in Parquet format.
-
fast-data-dev Public
Forked from lensesio/fast-data-dev
Kafka Docker for development. Kafka, Zookeeper, Schema Registry, Kafka-Connect, Landoop Tools, 20+ connectors
-
infoworld-post Public
Forked from dataArtisans/infoworld-post
Code examples for a blog post on infoworld.com
Java Apache License 2.0 Updated Dec 28, 2018 -
AthenaX Public
Forked from uber-archive/AthenaX
SQL-based streaming analytics platform at scale
Java Apache License 2.0 Updated Sep 18, 2018 -
medium-blog-kafka-udemy Public
Forked from simplesteph/medium-blog-kafka-udemy
Supporting repository for the blog post at https://medium.com/@stephane.maarek/how-to-use-apache-kafka-to-transform-a-batch-pipeline-into-a-real-time-one-831b48a6ad85
Java Updated May 30, 2018