Skip to content

ppfenning/datahub

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Data Hub

Build Status Gitter

Data Hub

Introduction

Data Hub is Linkedin's generalized metadata search & discovery tool. To learn more about Data Hub, check out our Linkedin blog post and Strata presentation. This repository contains the complete source code to be able to build Data Hub's frontend & backend services.

Quickstart

  1. Install docker and docker-compose.
  2. Clone this repo and make sure you are at the datahub branch.
  3. Run below command to download and run all Docker containers in your local:
cd docker/quickstart && docker-compose pull && docker-compose up --build
  1. After you have all Docker containers running in your machine, run below command to ingest provided sample data to Data Hub:
docker build -t ingestion -f docker/ingestion/Dockerfile . && cd docker/ingestion && docker-compose up

Note: Make sure that you're using Java 8, we have a strict dependency to Java 8 for build.

  1. Finally, you can start Data Hub by typing http://localhost:9001 in your browser. You can sign in with datahub as username and password.

Quicklinks

Roadmap

  1. Add user profile page
  2. Deploy Data Hub to Azure Cloud

About

The Metadata Platform for the Modern Data Stack

Resources

License

Code of conduct

Security policy

Stars

Watchers

Forks

Packages

No packages published

Languages

  • Java 45.3%
  • Python 34.7%
  • TypeScript 17.6%
  • JavaScript 1.2%
  • Shell 0.4%
  • SCSS 0.3%
  • Other 0.5%