-
Google
- Sunnyvale
-
unitycatalog Public
Forked from unitycatalog/unitycatalogOpen, Multi-modal Catalog for Data & AI
Java Apache License 2.0 UpdatedJun 13, 2024 -
OpenLineage Public
Forked from OpenLineage/OpenLineageAn Open Standard for lineage metadata collection
Java Apache License 2.0 UpdatedAug 27, 2023 -
-
velox Public
Forked from facebookincubator/veloxA new C++ vectorized database acceleration library aimed to optimizing query engines and data processing systems.
C++ Apache License 2.0 UpdatedSep 27, 2021 -
-
iceberg Public
Forked from apache/icebergApache Iceberg
Java Apache License 2.0 UpdatedSep 21, 2020 -
spark-1 Public
Forked from apache/sparkApache Spark - A unified analytics engine for large-scale data processing
Scala Apache License 2.0 UpdatedJul 29, 2020 -
parquet-mr Public
Forked from apache/parquet-javaApache Parquet
Java Apache License 2.0 UpdatedMay 29, 2020 -
datahub Public
Forked from datahub-project/datahubA Generalized Metadata Search & Discovery Tool
TypeScript Apache License 2.0 UpdatedFeb 26, 2020 -
-
inverting-proxy Public
Forked from google/inverting-proxyReverse proxy that inverts the direction of traffic
Go Apache License 2.0 UpdatedFeb 6, 2020 -
spark-on-k8s-operator Public
Forked from kubeflow/spark-operatorKubernetes operator for managing the lifecycle of Apache Spark applications on Kubernetes.
Go Apache License 2.0 UpdatedDec 12, 2019 -
datafu Public
Forked from apache/datafuMirror of Apache DataFu
Java Apache License 2.0 UpdatedOct 25, 2019 -
hive-bigquery-storage-handler Public
Forked from GoogleCloudDataproc/hive-bigquery-storage-handlerHive Storage Handler for interoperability between BigQuery and Apache Hive
Java Apache License 2.0 UpdatedOct 1, 2019 -
-
dataprocspawner Public
Forked from GoogleCloudDataproc/jupyterhub-dataprocspawnerPython Apache License 2.0 UpdatedAug 8, 2019 -
bigdata-interop Public
Forked from GoogleCloudDataproc/hadoop-connectorsLibraries and tools for interoperability between Hadoop-related open-source software and Google Cloud Platform.
Java Apache License 2.0 UpdatedAug 8, 2019 -
dataproc-initialization-actions Public
Forked from GoogleCloudDataproc/initialization-actionsRun in all nodes of your cluster before the cluster starts - lets you customize your cluster
Shell Apache License 2.0 UpdatedAug 7, 2019 -
professional-services Public
Forked from GoogleCloudPlatform/professional-servicesCommon solutions and tools developed by Google Cloud's Professional Services team
HTML Apache License 2.0 UpdatedAug 6, 2019 -
spark-bigquery-connector Public
Forked from GoogleCloudDataproc/spark-bigquery-connectorThe connector uses the Spark SQL Data Source API to read data from Google BigQuery.
Scala Apache License 2.0 UpdatedJul 31, 2019 -
cloud-sql-jdbc-socket-factory Public
Forked from GoogleCloudPlatform/cloud-sql-jdbc-socket-factoryJava Apache License 2.0 UpdatedJul 25, 2019 -
-
-
spydra Public
Forked from spotify/spydraEphemeral Hadoop clusters using Google Compute Platform
Java Apache License 2.0 UpdatedOct 9, 2018 -
presto Public
Forked from prestodb/prestoDistributed SQL query engine for big data
Java Apache License 2.0 UpdatedAug 31, 2018 -
hadoop Public
Forked from apache/hadoopMirror of Apache Hadoop
Java Apache License 2.0 UpdatedAug 10, 2018 -
Big-Data-Benchmark-for-Big-Bench Public
Forked from takeon8/Big-Data-Benchmark-for-Big-BenchBig Bench Workload Development
Shell Other UpdatedAug 10, 2018 -
hive Public
Forked from apache/hiveMirror of Apache Hive
Java Apache License 2.0 UpdatedAug 9, 2018 -
metabase Public
Forked from metabase/metabaseThe simplest, fastest way to get business intelligence and analytics to everyone in your company 😋
JavaScript GNU Affero General Public License v3.0 UpdatedJul 17, 2018 -
incubator-druid Public
Forked from apache/druidApache Druid (Incubating) - Column oriented distributed data store ideal for powering interactive applications
Java Apache License 2.0 UpdatedJul 16, 2018