A June 2016 roundup of distributed Deep Learning projects on Apache Spark

1 minute read

Published: June 28, 2016

Here’s a quick roundup of distributed deep learning efforts running on Apache Spark. It only lists active(-ish) projects rather than academic experiments (of which there are too many to list). There are roughly two approaches:

Linking Spark with an existing framework

SparkNet from UC Berkeley connects Apache Spark with Caffe. You can read the paper.
CaffeOnSpark from Yahoo takes the same approach; see the blog post.
Arimo distributed TensorFlow on Spark for hyper-parameter tuning, but that was before the release of the distributed version of TensorFlow. Here’s a video from Spark Summit East.
Elephas connects Keras with Apache Spark.

Implementing a full-fledged framework

DeepDist (repo) is a framework for DBNs implementing downpour gradient descent. The approach is reminiscent of Splash.
DeepLearning4J is reimplementing a wide range of NNs, built on a fast Java array library. They run distributed on Spark, with GPU acceleration.
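
For readers unfamiliar with downpour gradient descent, here is a toy sketch of the idea: workers asynchronously pull weights from a shared parameter server, compute a gradient on their own data shard, and push it back without waiting for each other. This is not DeepDist's API. It assumes plain Python threads in place of Spark executors and a small linear model in place of a DBN, and every name in it (`ParameterServer`, `worker`, `train`, `make_data`) is hypothetical.

```python
import random
import threading


class ParameterServer:
    """Holds the shared weights; workers pull copies and push gradients."""

    def __init__(self, dim, lr):
        self.weights = [0.0] * dim
        self.lr = lr
        self._lock = threading.Lock()

    def pull(self):
        with self._lock:
            return list(self.weights)

    def push(self, grad):
        # Apply each gradient as soon as it arrives; no barrier across workers.
        with self._lock:
            for i, g in enumerate(grad):
                self.weights[i] -= self.lr * g


def gradient(weights, batch):
    """Mean squared-error gradient for a linear model y ~ w . x."""
    grad = [0.0] * len(weights)
    for x, y in batch:
        err = sum(w * xi for w, xi in zip(weights, x)) - y
        for i, xi in enumerate(x):
            grad[i] += 2.0 * err * xi / len(batch)
    return grad


def worker(server, shard, steps, batch_size, seed):
    rng = random.Random(seed)
    for _ in range(steps):
        weights = server.pull()  # may already be stale when we push
        batch = rng.sample(shard, batch_size)
        server.push(gradient(weights, batch))


def train(data, n_workers=4, steps=500, batch_size=8, lr=0.05):
    server = ParameterServer(len(data[0][0]), lr)
    shards = [data[i::n_workers] for i in range(n_workers)]  # one shard per worker
    threads = [
        threading.Thread(target=worker, args=(server, shard, steps, batch_size, i))
        for i, shard in enumerate(shards)
    ]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    return server.pull()


def make_data(n=400, seed=0):
    """Noise-free synthetic data generated from the weights [2.0, -3.0]."""
    rng = random.Random(seed)
    true_w = [2.0, -3.0]
    data = []
    for _ in range(n):
        x = [rng.uniform(-1, 1), rng.uniform(-1, 1)]
        data.append((x, sum(w * xi for w, xi in zip(true_w, x))))
    return data


if __name__ == "__main__":
    print(train(make_data()))
```

The point of the design is that a worker never blocks on its peers: gradients computed from slightly stale weights are tolerated in exchange for throughput. In a Spark-based system the threads would presumably be replaced by tasks running on executors, with the server living on the driver or a separate service.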

This is just a quick preview, and the criteria for notability are somewhat arbitrary: e.g. I chose not to include OpenDL, because it’s a seemingly unmaintained experiment based on Jeff Dean’s “Large Scale Distributed Deep Networks” paper. Feel free to mention anything I may have forgotten in the comments!



François Garillot
