Open Source Tools for AI-Projects

Displaying 1 - 10 of 26

arrow by apache


Apache Arrow is a multi-language toolbox for accelerated data interchange and in-memory processing

edgedb by edgedb


A next-generation graph-relational database.

cudf by rapidsai


cuDF - GPU DataFrame Library

dagster by dagster-io


An orchestration platform for the development, production, and observation of data assets.

h2o-3 by h2oai


H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.

Neuraxle by Neuraxio


A Sklearn-like Framework for Hyperparameter Tuning and AutoML in Deep Learning projects. Finally have the right abstractions and design patterns to properly do AutoML. Let your pipeline steps have hyperparameter spaces. Enable checkpoints to cut duplicate calculations. Go from research to production environment easily.

bonobo by python-bonobo


Extract Transform Load for Python 3.5+

flyte by flyteorg


Kubernetes-native workflow automation platform for complex, mission-critical data and ML processes at scale. It has been battle-tested at Lyft, Spotify, Freenome, and others and is truly open-source.

quilt by quiltdata


Quilt is a self-organizing data hub for S3

modin by modin-project


Modin: Speed up your Pandas workflows by changing a single line of code