Stars
2 stars written in Scala
Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.
The Berkeley Document Summarizer is a learning-based, single-document summarization system that extracts source document content, exploits syntactic information to compress it, and uses coreference…