Open Source Tools for AI-Projects

Displaying 1 - 9 of 9

spark by apache


Apache Spark - A unified analytics engine for large-scale data processing

setl by SETL-Framework


A simple Spark-powered ETL framework that just works 🍺

analytics-zoo by intel-analytics


Distributed Tensorflow, Keras and PyTorch on Apache Spark/Flink & Ray

maggy by logicalclocks


Distribution transparent Machine Learning experiments on Apache Spark

deequ by awslabs


Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.

basin by basin-etl


Basin is a visual programming editor for building Spark and PySpark pipelines. Easily build, debug, and deploy complex ETL pipelines from your browser

pyspark2pmml by jpmml


Python library for converting Apache Spark ML pipelines to PMML

sparklyr2pmml by jpmml


R library for converting Apache Spark ML pipelines to PMML

mist by Hydrospheredata


Serverless proxy for Spark cluster