-
azure/azure-event-hubs-spark
Enabling Continuous Data Processing with Apache Spark and Azure Event Hubs
Scala versions: 2.12 2.11 2.10 -
microsoft/mobius
C# and F# language binding and extensions to Apache Spark
Scala versions: 2.11 2.10 -
chermenin/spark-states
Custom state store providers for Apache Spark
Scala versions: 2.12 2.11 -
tupol/spark-utils
Basic framework utilities to quickly start writing production-ready Apache Spark applications
Scala versions: 2.12 2.11 -
agile-lab-dev/wasp
WASP is a framework to build complex real-time big data applications. It relies on a kind of Kappa/Lambda architecture mainly leveraging Kafka and Spark. If you need to ingest a huge amount of heterogeneous data and analyze it through complex pipelines, this is the framework for you.
Scala versions: 2.12 2.11 -
qubole/streaminglens
Qubole Streaminglens tool for tuning Spark Structured Streaming Pipelines
Scala versions: 2.11 -
qubole/s3-sqs-connector
A library for reading data from Amazon S3 with optimised listing using Amazon SQS using Spark SQL Streaming (or Structured Streaming).
Scala versions: 2.11 -
catalystcode/streaming-facebook
A library for reading social data from Facebook using Spark Streaming.
Scala versions: 2.11 -
ponkin/bloom
Probabilistic data structures Java implementation.
Scala versions: 2.11 -
catalystcode/streaming-reddit
A library for reading public search results from Reddit using Spark Streaming.
Scala versions: 2.11 -
cdapio/cdap
An open source framework for building data analytic applications.
Scala versions: 2.12 2.11 2.10