-
sadikovi/spark-netflow 2.1.0
NetFlow data source for Spark SQL and DataFrames
Scala versions: 2.12 -
absaoss/spark-data-standardization 0.2.2
A library for Spark that helps to standardize any input data (DataFrame) to adhere to the provided schema.
Scala versions: 2.13 2.12 2.11 -
izhangzhihao/sbt-spark-submit 0.0.5
sbt plugin for spark-submit
-
exasol/spark-connector 1.1.0
A connector for Apache Spark to access Exasol
Scala versions: 2.12 -
logimethods/nats-connector-spark-scala 1.0.0
A Scala based Spark Publish/Subscribe NATS Connector
Scala versions: 2.11 -
flipkart-incubator/spark-transformers 0.4.0
Spark-Transformers: Library for exporting Apache Spark MLlib models for use in any Java application with no other dependencies.
Scala versions: 2.11 2.10 -
ozancicek/artan 0.5.1
Online latent state estimation with Spark
Scala versions: 2.12 -
anicolaspp/maprdbconnector 1.0.9
An independent MapR-DB Connector for Apache Spark that fully utilizes MapR-DB secondary indexes
Scala versions: 2.11 -
timvw/adobe-analytics-datafeed-datasource 0.1.0
Apache Spark data source for Adobe Analytics Data Feed
Scala versions: 2.12 -
ponkin/bloom 0.11
Java implementation of probabilistic data structures.
Scala versions: 2.11 -
piotr-kalanski/spark-local 0.6.0
API for switching between the Spark execution engine and a fast local implementation based on Scala collections.
Scala versions: 2.11 -
iaja/scalaldavis 0.1.2
Scala-Spark port of https://github.com/bmabey/pyLDAvis for Apache Spark LDA Topic Modelling Visualisation
Scala versions: 2.11 -
eto-ai/rikai 0.1.14
Parquet-based ML data format optimized for working with unstructured data
Scala versions: 2.13 2.12