26 results
microsoft/synapseml
Simple and Distributed Machine Learning
Scala Versions: 2.11
yotpoltd/metorikku
A simplified, lightweight ETL Framework based on Apache Spark
Scala Versions: 2.11 2.12
hydrospheredata/mist
Serverless proxy for Spark cluster
Scala Versions: 2.10 2.11 2.12
delta-io/delta-sharing
An open protocol for secure data sharing
Scala Versions: 2.12
touk/nussknacker
A visual tool to define and run real-time decision algorithms. Brings agility to business teams, liberates developers to focus on technology.
Scala Versions: 2.11 2.12
setl-framework/setl
A simple Spark-powered ETL framework that just works 🍺
Scala Versions: 2.11 2.12
sparkling-graph/sparkling-graph
SparklingGraph provides an easy-to-use set of features for processing large-scale graphs using Spark and GraphX.
Scala Versions: 2.10 2.11
clustering4ever/clustering4ever
C4E, a JVM-friendly library written in Scala for both local and distributed (Spark) clustering.
Scala Versions: 2.11
locationtech-labs/geopyspark
GeoTrellis for PySpark
Scala Versions: 2.11