ADAM is a genomics analysis platform with specialized file formats built using Apache Avro, Apache Spark, and Apache Parquet. Apache 2 licensed.
Tools for working with genomic and high throughput sequencing data.
machine learning for genomic variants
Very large scale k-mer counting and analysis on Apache Spark.
Collection of Data Structures for working with genomic intervals