Stars
BitSail is a distributed high-performance data integration engine which supports batch, streaming and incremental scenarios. BitSail is widely used to synchronize hundreds of trillions of data ever…
Apache Doris is an easy-to-use, high performance and unified analytics database.
Apache Paimon is a lake format that enables building a Realtime Lakehouse Architecture with Flink and Spark for both streaming and batch operations.
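闻达 (Wenda): an LLM invocation platform. It aims at efficient content generation for specific environments, while taking into account the limited computing resources of individuals and small and medium-sized enterprises, as well as knowledge security and privacy concerns.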
dataService platform is a low-code platform on which writing SQL is all that is needed to develop API services; it unifies data services and facilitates the governance of data serv…
Big data comparison and data profiling platform: low code, data comparison and data profiling.
Cloud Shuffle Service (CSS) is a general-purpose remote shuffle solution for compute engines, including Spark, Flink and MapReduce.
Making large AI models cheaper, faster and more accessible
zck573693104 / ColossalAI
Forked from hpcaitech/ColossalAI
Colossal-AI: A Unified Deep Learning System for Large-Scale Parallel Training
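A summary of the knowledge that natural language processing (NLP) engineers need to build up, including interview questions, fundamentals, engineering skills and more, to strengthen core competitiveness.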
Open source annotation tool for machine learning practitioners.
zck573693104 / doccano
Forked from doccano/doccano
Open source annotation tool for machine learning practitioners.
Make stream processing easier! Easy-to-use streaming application development framework and operation platform.
👑 Easy-to-use and powerful NLP and LLM library with 🤗 Awesome model zoo, supporting wide-range of NLP tasks from research to industrial applications, including 🗂Text Classification, 🔍 Neural Search…
Alink is the Machine Learning algorithm platform based on Flink, developed by the PAI team of Alibaba computing platform.
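Flink learning blog. http://www.54tianzhisheng.cn/ Covers Flink basics, concepts, principles, hands-on practice, performance tuning, source code analysis and more. Includes learning examples for Flink Connector, Metrics, Library, DataStream API, Table API & SQL, as well as large-scale project case studies of Flink in production (PVUV, log storage, tens of billions of records in real time…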
zck573693104 / docker-hadoop-spark-workbench
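Forked from big-data-europe/docker-hadoop-spark-workbench
[EXPERIMENTAL] This repo includes deployment instructions for running HDFS/Spark inside Docker containers. Also includes spark-notebook and HDFS FileBrowser.
[Big Tech Interview Column] A technical guide for Java programmers, covering interview questions, system architecture, career tips, mainstream middleware and more, to help you become a better version of yourself!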