-
pityka/lamp
deep learning and scientific computing framework with native CPU and GPU backend for the Scala programming language
Scala versions: 3.x 2.13 2.12 -
rzykov/fastml4j
Fast Scala and nd4j based machine learning framework
Scala versions: 2.11 -
s22s/pre-lt-raster-frames
Spark DataFrames for earth observation data
Scala versions: 2.11 -
florentf9/sparkml-som
:sparkles: Spark ML implementation of SOM algorithm (Kohonen self-organizing map)
Scala versions: 2.11 -
flipkart-incubator/spark-transformers
Spark-Transformers: Library for exporting Apache Spark MLLIB models to use them in any Java application with no other dependencies.
Scala versions: 2.11 2.10 -
mrdimosthenis/scala-synapses
A plug-and play library for neural networks written in Scala 3
Scala versions: 3.x -
iaja/scalaldavis
Scala-Spark port of https://github.com/bmabey/pyLDAvis for Apache Spark LDA Topic Modelling Visualisation
Scala versions: 2.11 -
eto-ai/rikai
Parquet-based ML data format optimized for working with unstructured data
Scala versions: 2.13 2.12 -
liquidsvm/liquidsvm
Support vector machines (SVMs) and related kernel-based learning algorithms are a well-known class of machine learning algorithms, for non-parametric classification and regression. liquidSVM is an implementation of SVMs whose key features are: fully integrated hyper-parameter selection, extreme speed on both small and large data sets, full flexibility for experts, and inclusion of a variety of different learning scenarios: multi-class classification, ROC, and Neyman-Pearson learning, and least-squares, quantile, and expectile regression.
Scala versions: 2.11 -
whylabs/whylogs
An open-source data logging library for machine learning models and data pipelines. 📚 Provides visibility into data quality & model performance over time. 🛡️ Supports privacy-preserving data collection, ensuring safety & robustness. 📈
Scala versions: 2.12 -
mrdimosthenis/synapses
A group of neural-network libraries for functional and mainstream languages
Scala versions: 2.13 -
catboost/catboost
A fast, scalable, high performance Gradient Boosting on Decision Trees library, used for ranking, classification, regression and other machine learning tasks for Python, R, Java, C++. Supports computation on CPU and GPU.
Scala versions: 2.13 2.12 2.11 -
tupol/spark-xkmeans
Extension to the standard K-Means implementation of Spark ML library
Scala versions: 2.11 -
alibaba/alink
Alink is the Machine Learning algorithm platform based on Flink, developed by the PAI team of Alibaba computing platform.
Scala versions: 2.11 -
h2oai/h2o-3
H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.
Scala versions: 2.11 2.10