Skip to content

astrotuna201/penguin

 
 

Repository files navigation

Penguin

Explore the ideas of data frames, accelerated compute, tables, data structures, and distributed data in Swift.

Documentation:

Roadmap

Below is the aspirational roadmap (at an extremely high level) for the high-level goals:

  • Parse from CSV.
  • Finish up base API & document it. (Incomplete list of needs: appending rows, support in-place mutations everywhere, more powerful sorting, remove unnecessary APIs)
  • Optimize PIndexSet representations.
  • Refactor internals around a parallelism model from PenguinParallel.
  • Optimize backing store for PTypedColumn, including adding support for larger-than-RAM & file formats like Arrow.
  • Add querying / group-by support (including support for a PTableGroup --- alternate names wanted).
  • Parallelize the implementation of the operators.
  • Investigate hardware acceleration & JIT code-gen.
  • Distributed orchestration of computation.

Extensions (help wanted):

  • Connect to databases.
  • Parse additional file formats.

This is not an officially supported Google product.

About

Data tables in Swift

Resources

License

Code of conduct

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Swift 99.6%
  • Other 0.4%