28 results
-
microsoft/synapseml
Simple and Distributed Machine Learning
Scala versions: 2.11 -
h2oai/sparkling-water
Sparkling Water provides H2O functionality inside Spark cluster
Scala versions: 2.12 2.11 2.10 -
delta-io/delta-sharing
An open protocol for secure data sharing
Scala versions: 2.13 2.12 -
yotpoltd/metorikku
A simplified, lightweight ETL Framework based on Apache Spark
Scala versions: 2.12 2.11 -
touk/nussknacker
Low-code tool for automating actions on real time data | Stream processing for the users.
Scala versions: 2.13 2.12 2.11 -
hydrospheredata/mist
Serverless proxy for Spark cluster
Scala versions: 2.12 2.11 2.10 -
setl-framework/setl
A simple Spark-powered ETL framework that just works 🍺
Scala versions: 2.12 2.11 -
sparkling-graph/sparkling-graph
SparklingGraph provides easy to use set of features that will give you ability to proces large scala graphs using Spark and GraphX.
Scala versions: 2.11 2.10 -
clustering4ever/clustering4ever
C4E, a JVM friendly library written in Scala for both local and distributed (Spark) Clustering.
Scala versions: 2.11 -
swoop-inc/spark-records
Bulletproof Apache Spark jobs with fast root cause analysis of failures.
Scala versions: 2.12