    Repositories list

    • Java Spring (Boot/Cloud..) backend playground | #SE
      JavaScript
      4 0 0 0 Updated Jun 20, 2024
    • Apache Airflow Website
      312 0 0 0 Updated Mar 20, 2024
    • prefect

      Public
      Prefect is a workflow orchestration tool empowering developers to build, observe, and react to data pipelines
      Python
      Apache License 2.0
      1.7k 0 0 0 Updated Mar 14, 2024
    • Apache Flink Training Exercises
      Java
      Apache License 2.0
      671 0 0 0 Updated Jan 18, 2024
    • This demo shows how to capture data changes from relational databases and stream them to Confluent Cloud.
      HCL
      4 0 0 0 Updated Dec 9, 2023
    • drone-fly

      Public
      A service which allows Hive Metastore Listeners to be deployed outside of the Hive Metastore Service
      Java
      Apache License 2.0
      3 0 0 0 Updated Jul 3, 2023
    • My CS learning : algorithm, data structure, and system design | #SE
      Python
      44 0 0 0 Updated Jun 11, 2023
    • metabase

      Public
      The simplest, fastest way to get business intelligence and analytics to everyone in your company 😋
      Clojure
      Other
      5.2k 0 0 0 Updated Mar 5, 2023
    • Amazon Redshift Utils contains utilities, scripts, and views that are useful in a Redshift environment
      Python
      Apache License 2.0
      1.3k 0 0 0 Updated Feb 10, 2023
    • Spark Structured Streaming / Kafka / Cassandra / Elastic
      Scala
      Apache License 2.0
      79 0 0 0 Updated Feb 7, 2023
    • datahub

      Public
      The Metadata Platform for the Modern Data Stack
      Java
      Apache License 2.0
      3k 0 0 0 Updated Feb 6, 2023
    • Free Data Engineering course!
      Jupyter Notebook
      5.6k 0 0 0 Updated Feb 5, 2023
    • Open-source data observability for analytics engineers.
      HTML
      Apache License 2.0
      169 0 0 0 Updated Jan 12, 2023
    • AWS libraries/modules for working with Kinesis aggregated record data
      Java
      Apache License 2.0
      153 0 0 0 Updated Jan 3, 2023
    • A library for scraping listings data from daft.ie
      Python
      MIT License
      6 0 0 0 Updated Dec 24, 2022
    • Fish-like autosuggestions for zsh
      Shell
      MIT License
      1.9k 0 0 0 Updated Dec 23, 2022
    • The software used to extract structured data from Wikipedia
      Scala
      271 0 0 0 Updated Nov 23, 2022
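    • Python web-scraping tutorial series: learn Python scraping from zero to one, including browser packet capture and mobile-app packet capture with tools such as fiddler and mitmproxy; the modules involved in scraping, such as requests, beautifulSoup, selenium, appium, and scrapy; IP proxies; CAPTCHA recognition; using MySQL and MongoDB databases from Python; multithreaded and multiprocess scrapers; reverse engineering CSS-based anti-scraping obfuscation; JS reverse engineering for scraping; distributed scrapers; and hands-on scraping project examples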
      Python
      MIT License
      3.8k 0 0 0 Updated Nov 23, 2022
    • Extract metadata from a video to an sqlite database
      Python
      Other
      3 0 0 0 Updated Oct 5, 2022
    • Process Common Crawl data with Python and Spark
      Python
      MIT License
      87 0 0 0 Updated Sep 21, 2022
    • Scala
      MIT License
      213 0 0 0 Updated Sep 4, 2022
    • Python
      MIT License
      68 0 0 0 Updated Aug 14, 2022
    • The official repository for the Rock the JVM ZIO course
      Scala
      49 0 0 0 Updated Jul 9, 2022
    • maxwell

      Public
      Maxwell's daemon, a mysql-to-json kafka producer
      Java
      Other
      1k 0 0 0 Updated Jul 7, 2022
    • Amazon Kinesis Data Analytics Flink Starter Kit helps you with the development of Flink Application with Kinesis Stream as a source and Amazon S3 as a sink. This demonstrates the use of Session Window with AggregateFunction.
      Java
      MIT No Attribution
      15 0 0 0 Updated Jun 30, 2022
    • REST job server for Apache Spark
      Scala
      Other
      995 0 0 0 Updated Jun 24, 2022
    • Beginner data engineering project - batch edition
      Shell
      MIT License
      144 0 0 0 Updated Jun 13, 2022
    • Welcome to the AWS Code Examples Repository. This repo contains code examples used in the AWS documentation, AWS SDK Developer Guides, and more. For more information, see the Readme.rst file below.
      Java
      Apache License 2.0
      5.7k 0 0 0 Updated Jun 7, 2022
    • spotify

      Public
      A Scala wrapper for the Spotify Web API
      Scala
      Apache License 2.0
      4 0 0 0 Updated Apr 24, 2022
    • Python
      27 0 0 0 Updated Apr 19, 2022