Open Source Tools for AI-Projects

Displaying 1 - 9 of 9

spark by apache


Apache Spark - A unified analytics engine for large-scale data processing

setl by SETL-Framework


A simple Spark-powered ETL framework that just works 🍺

analytics-zoo by intel-analytics


Distributed Tensorflow, Keras and PyTorch on Apache Spark/Flink & Ray

maggy by logicalclocks


Distribution transparent Machine Learning experiments on Apache Spark

deequ by awslabs


Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.

basin by basin-etl


Basin is a visual programming editor for building Spark and PySpark pipelines. Easily build, debug, and deploy complex ETL pipelines from your browser

sparklyr2pmml by jpmml


R library for converting Apache Spark ML pipelines to PMML

pyspark2pmml by jpmml


Python library for converting Apache Spark ML pipelines to PMML

mist by Hydrospheredata


Serverless proxy for Spark cluster