-
apache/predictionio
PredictionIO, a machine learning server for developers and ML engineers.
Scala versions: 2.10 -
microsoft/synapseml
Simple and Distributed Machine Learning
Scala versions: 2.11 -
yotpoltd/metorikku
A simplified, lightweight ETL Framework based on Apache Spark
Scala versions: 2.11 2.12 -
delta-io/delta-sharing
An open protocol for secure data sharing
Scala versions: 2.12 -
hydrospheredata/mist
Serverless proxy for Spark cluster
Scala versions: 2.10 2.11 2.12 -
touk/nussknacker
A visual tool to define and run real-time decision algorithms. Brings agility to business teams, liberates developers to focus on technology.
Scala versions: 2.11 2.12 -
setl-framework/setl
A simple Spark-powered ETL framework that just works 🍺
Scala versions: 2.11 2.12 -
sparkling-graph/sparkling-graph
SparklingGraph provides easy to use set of features that will give you ability to proces large scala graphs using Spark and GraphX.
Scala versions: 2.10 2.11 -
clustering4ever/clustering4ever
C4E, a JVM friendly library written in Scala for both local and distributed (Spark) Clustering.
Scala versions: 2.11 -
swoop-inc/spark-records
Bulletproof Apache Spark jobs with fast root cause analysis of failures.
Scala versions: 2.12 -
locationtech-labs/geopyspark
GeoTrellis for PySpark
Scala versions: 2.11