-
microsoft/synapseml
Simple and Distributed Machine Learning
Scala versions: 2.11 -
haifengl/smile
Statistical Machine Intelligence & Learning Engine
Scala versions: 2.10 2.11 2.12 2.13 -
swoop-inc/spark-alchemy
Collection of open-source Spark tools & frameworks that have made the data engineering and data science teams at Swoop highly productive
Scala versions: 2.12 -
setl-framework/setl
A simple Spark-powered ETL framework that just works 🍺
Scala versions: 2.11 2.12 -
picnicml/doddle-model
🍰 doddle-model: machine learning in Scala.
Scala versions: 2.11 2.12 2.13 -
zenecture/neuroflow
Artificial Neural Networks for Scala
Scala versions: 2.11 2.12 -
streamnative/pulsar-spark
When Apache Pulsar meets Apache Spark
Scala versions: 2.11 2.12 -
pityka/nspl
Scala plotting library
Scala versions: 2.11 2.12 2.13 3.x
Scala.js versions: 0.6 1.x -
galliaproject/gallia-core
A schema-aware Scala library for data transformation
Scala versions: 2.12 2.13 -
facultyai/scala-plotly-client
Visualise your data from Scala using Plotly
Scala versions: 2.10 2.11 -
pityka/saddle
SADDLE: Scala Data Library
Scala versions: 2.11 2.12 2.13 3.x
Scala.js versions: 0.6 1.x -
catboost/catboost
A fast, scalable, high performance Gradient Boosting on Decision Trees library, used for ranking, classification, regression and other machine learning tasks for Python, R, Java, C++. Supports computation on CPU and GPU.
Scala versions: 2.11 2.12 2.13 -
whylabs/whylogs
The open standard for data logging
Scala versions: 2.12 -
h2oai/h2o-3
H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.
Scala versions: 2.10 2.11