The scripts in this repository pull (some) data from the Ultimate Debian Database and convert them to a graph database, in particular Neo4j.
A detailed description of the process is available either on my blog (Part 1, Part 2, Part 3) or in debian-package-neo4j.md.
The scripts here are:
pull-udd.pl queries the UDD and downloads the two tables for packages and sources. It needs the DBD::Pg Perl module (the PostgreSQL driver for DBI). This script takes quite some time, as the server is not fast. Please be patient.
generate-graph is a Perl script that reads the two csv files generated by pull-udd.pl and generates csv files ready to be imported into Neo4j.
sort-uniq.sh ensures that duplicate lines are removed from the csvs.
build-db assembles the proper command line for neo4j-import.
build-all glues everything together.
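The deduplication step can be illustrated with a few lines of shell. This is only a minimal sketch of the idea, not the contents of sort-uniq.sh: the function name dedup_csv is hypothetical, and it assumes the csv files have no header row that must stay on the first line.

```shell
#!/bin/sh
# Sketch: remove duplicate lines from a csv file in place.
# dedup_csv is a hypothetical helper, not part of this repository.
# Assumption: the file has no header row that must stay on top,
# because sorting reorders every line.
dedup_csv() {
    file=$1
    tmp=$(mktemp) || return 1
    # LC_ALL=C gives byte-wise ordering that does not depend on the locale.
    # Write to a temporary file first so the input is not truncated mid-read.
    LC_ALL=C sort -u "$file" > "$tmp" && mv "$tmp" "$file"
}
```

It would be called once per generated file, for example dedup_csv nodes.csv, where nodes.csv stands in for whatever file names generate-graph actually writes.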
After the scripts have run, the generated files are in debian-neo4j and can be used as Neo4j database files.
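To give an idea of what assembling a neo4j-import command line with debian-neo4j as the target directory might look like, here is a small sketch. It is not the actual build-db script: the function name build_import_cmd and the csv file names are hypothetical, and it only uses the --into, --nodes and --relationships options of the neo4j-import tool shipped with Neo4j 3.x. The command is printed rather than executed, so it can be inspected first.

```shell
#!/bin/sh
# Sketch: print (do not run) a neo4j-import command line.
# build_import_cmd is a hypothetical helper; the real build-db
# most likely passes more files than the single pair shown here.
build_import_cmd() {
    into=$1    # target directory for the database files
    nodes=$2   # csv file with nodes (placeholder name)
    rels=$3    # csv file with relationships (placeholder name)
    printf 'neo4j-import --into %s --nodes %s --relationships %s\n' \
        "$into" "$nodes" "$rels"
}
```

Calling build_import_cmd debian-neo4j nodes.csv edges.csv prints a single command line that imports one node file and one relationship file into debian-neo4j.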
Comments and improvements are always welcome.
Copyright 2017-2018 Norbert Preining
License: GPL3+