cugwind

Zhang Ron cugwind

A computer science student is a learner who studies the principles and practices of computing and programming.

9 followers · 11 following

武汉

Stars

30 stars written in Scala

Clear filter

apache / spark

Apache Spark - A unified analytics engine for large-scale data processing

Scala 40,203 28,384 Updated Dec 17, 2024

akka / akka

A platform to build and run apps that are elastic, agile, and resilient. SDK, libraries, and hosted environments.

Scala 13,070 3,592 Updated Dec 6, 2024

yahoo / CMAK

CMAK is a tool for managing Apache Kafka clusters

Scala 11,853 2,505 Updated Aug 2, 2023

delta-io / delta

An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs

Scala 7,681 1,732 Updated Dec 17, 2024

typelevel / cats

Lightweight, modular, and extensible library for functional programming.

Scala 5,277 1,210 Updated Dec 13, 2024

scalanlp / breeze

Breeze is/was a numerical processing library for Scala.

Scala 3,448 691 Updated Aug 29, 2024

spark-jobserver / spark-jobserver

REST job server for Apache Spark

Scala 2,839 995 Updated Dec 14, 2024

apache / kyuubi

Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.

Scala 2,115 919 Updated Dec 12, 2024

locationtech / geomesa

GeoMesa is a suite of tools for working with big geo-spatial data in a distributed fashion.

Scala 1,437 434 Updated Dec 16, 2024

locationtech / geotrellis

GeoTrellis is a geographic data processing engine for high performance applications.

Scala 1,345 362 Updated Nov 14, 2024

cloudera / livy

Livy is an open source REST interface for interacting with Apache Spark from anywhere

Scala 1,008 314 Updated Oct 5, 2022

apache / incubator-livy

Apache Livy is an open source REST interface for interacting with Apache Spark from anywhere.

Scala 894 603 Updated Nov 12, 2024

typelevel / frameless

Expressive types for Spark.

Scala 880 138 Updated Dec 12, 2024

mrpowers-io / spark-daria

Essential Spark extensions and helper methods ✨😲

Scala 754 153 Updated Oct 24, 2024

apache / incubator-toree

Mirror of Apache Toree (Incubating)

Scala 740 225 Updated Nov 8, 2024

harsha2010 / magellan

Geo Spatial Data Analytics on Spark

Scala 533 149 Updated Aug 26, 2021

nightscape / spark-excel

A Spark plugin for reading and writing Excel files

Scala 471 149 Updated Dec 5, 2024

luochana / News_recommend

基于Spark的新闻推荐系统，包含爬虫项目、web网站以及spark推荐系统

Scala 357 91 Updated Jun 21, 2022

Hydrospheredata / mist

Serverless proxy for Spark cluster

Scala 326 68 Updated Oct 29, 2020

lomigmegard / akka-http-cors

Akka Http directives implementing the CORS specifications defined by W3C

Scala 255 38 Updated Aug 12, 2024

swoop-inc / spark-alchemy

Collection of open-source Spark tools & frameworks that have made the data engineering and data science teams at Swoop highly productive

Scala 186 34 Updated Feb 12, 2023

stoerr / GrokConstructor

Semi-automatic incremental construction and debugging of regular expressions for grok to parse logfiles for logstash http://logstash.net/ . Deployed at http://grokconstructor.appspot.com/ .

Scala 159 52 Updated Jan 16, 2024

syoummer / SpatialSpark

Big Spatial Data Processing using Spark

Scala 145 56 Updated Mar 7, 2017

raster-foundry / raster-foundry

The Raster Foundry web application.

Scala 141 45 Updated Nov 7, 2022

jiayuasu / GeoSparkTemplateProject

Template projects for GeoSpark, GeoSpark-SQL, GeoSpark-Viz

Scala 65 33 Updated Dec 30, 2020

dwins / geoscript.scala

Scala implementation of the GeoScript API

Scala 47 14 Updated Apr 13, 2016

microsoft / spark-images

Support for operating on images via Apache Spark

Scala 26 12 Updated Jun 12, 2023

geotrellis / geotrellis-pointcloud

GeoTrellis PointCloud library to work with any pointcloud data on Spark

Scala 26 10 Updated Oct 5, 2020

s22s / pre-lt-raster-frames

Spark DataFrames for earth observation data

Scala 19 5 Updated May 1, 2018

CGCL-codes / MURS

Forked from zx247549135/spark

MURS is a memory scheduler for in-memory computing, which tries to mitigate the memory pressure for multiple data processing tasks sharing the executor.

Scala 2 Updated May 29, 2017

Provide feedback

Saved searches

Use saved searches to filter your results more quickly