Facilitates Data I/O between Spark and IBM Object Storage services.
HBase secondary index and big data analytics application
PlayFramework 2.x module to fetch, cache, and display tweets from Twitter
Scala Map that uses binary search in memory mapped sorted file. It makes possible usage of data sets bigger than available memory as a Map.
Neuron DI is a new approach for dependency injection in Java and Scala.
Nonrecursive Datalog Rewriter for Linear TGDs and Conjunctive Queries
Store for immutable objects in S3
Distributed exome CNV analyzer. Apache 2 licensed.
Scala-Spark port of https://github.com/bmabey/pyLDAvis for Apache Spark LDA Topic Modelling Visualisation
A concurrent reactive programming framework.
Fork of dmlc/xgboost for RAPIDS + XGBoost integration
Scala web crawling and scraping using fs2 streams
Glicko2 (improved ELO) sports players rating system for the JVM
geographical name normalization (a.k.a. toponym resolution)
Clone of
CSV reader/writer with conversion to Scala case class
Scala port of ua-parser
Tiny library to create copies of case classes