cugwind
  • Wuhan

30 stars written in Scala

Apache Spark - A unified analytics engine for large-scale data processing

Scala 40,203 28,384 Updated Dec 17, 2024

A platform to build and run apps that are elastic, agile, and resilient. SDK, libraries, and hosted environments.

Scala 13,070 3,592 Updated Dec 6, 2024

CMAK is a tool for managing Apache Kafka clusters

Scala 11,853 2,505 Updated Aug 2, 2023

An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs

Scala 7,681 1,732 Updated Dec 17, 2024

Lightweight, modular, and extensible library for functional programming.

Scala 5,277 1,210 Updated Dec 13, 2024

Breeze is/was a numerical processing library for Scala.

Scala 3,448 691 Updated Aug 29, 2024

REST job server for Apache Spark

Scala 2,839 995 Updated Dec 14, 2024

Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.

Scala 2,115 919 Updated Dec 12, 2024

GeoMesa is a suite of tools for working with big geo-spatial data in a distributed fashion.

Scala 1,437 434 Updated Dec 16, 2024

GeoTrellis is a geographic data processing engine for high performance applications.

Scala 1,345 362 Updated Nov 14, 2024

Livy is an open source REST interface for interacting with Apache Spark from anywhere

Scala 1,008 314 Updated Oct 5, 2022

Apache Livy is an open source REST interface for interacting with Apache Spark from anywhere.

Scala 894 603 Updated Nov 12, 2024

Expressive types for Spark.

Scala 880 138 Updated Dec 12, 2024

Essential Spark extensions and helper methods ✨😲

Scala 754 153 Updated Oct 24, 2024

Mirror of Apache Toree (Incubating)

Scala 740 225 Updated Nov 8, 2024

Geo Spatial Data Analytics on Spark

Scala 533 149 Updated Aug 26, 2021

A Spark plugin for reading and writing Excel files

Scala 471 149 Updated Dec 5, 2024

A Spark-based news recommendation system, comprising a crawler project, a web site, and a Spark recommendation system

Scala 357 91 Updated Jun 21, 2022

Serverless proxy for Spark cluster

Scala 326 68 Updated Oct 29, 2020

Akka Http directives implementing the CORS specifications defined by W3C

Scala 255 38 Updated Aug 12, 2024

Collection of open-source Spark tools & frameworks that have made the data engineering and data science teams at Swoop highly productive

Scala 186 34 Updated Feb 12, 2023

Semi-automatic incremental construction and debugging of regular expressions for grok to parse logfiles for logstash http://logstash.net/ . Deployed at http://grokconstructor.appspot.com/ .

Scala 159 52 Updated Jan 16, 2024

Big Spatial Data Processing using Spark

Scala 145 56 Updated Mar 7, 2017

The Raster Foundry web application.

Scala 141 45 Updated Nov 7, 2022

Template projects for GeoSpark, GeoSpark-SQL, GeoSpark-Viz

Scala 65 33 Updated Dec 30, 2020

Scala implementation of the GeoScript API

Scala 47 14 Updated Apr 13, 2016

Support for operating on images via Apache Spark

Scala 26 12 Updated Jun 12, 2023

GeoTrellis PointCloud library to work with any pointcloud data on Spark

Scala 26 10 Updated Oct 5, 2020

Spark DataFrames for earth observation data

Scala 19 5 Updated May 1, 2018

MURS is a memory scheduler for in-memory computing, which tries to mitigate the memory pressure for multiple data processing tasks sharing the executor.

Scala 2 Updated May 29, 2017