Apache Spark

Apache Spark is an open source distributed general-purpose cluster-computing framework. It provides an interface for programming entire clusters with implicit data parallelism and fault tolerance.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Apache Spark

Here are 248 public repositories matching this topic...

IBM / data-prep-kit

josephmachado / beginner_de_project

derrickburns / generalized-kmeans-clustering

jaceklaskowski / spark-workshop

lynnlangit / learning-hadoop-and-spark

databrickslabs / automl-toolkit

apache / spark-website

innat / ML-Resource

feng-li / Distributed-Statistical-Computing

rogaha / data-processing-pipeline

tikal-fuseday / delta-architecture

tdebatty / spark-knn-graphs

lresende / ansible-kubernetes-cluster

jukiewiczm / kaggle-predict-future-sales

arjones / bigdata-workshop-es

vmitchell85 / spark-kiosk-notify

shaojunying / Software-engineering-discipline-online-learning-platform-based-on-knowledge-map

marcelmittelstaedt / BigData

netease-bigdata / ne-spark-courseware

korolmi / dataeng

Related Topics