Skip to main content

AI, Engineering

The Reversible Residual Network: Backpropagation Without Storing Activations

December 1, 2017 / Global

Share

Abstract

Residual Networks (ResNets) have demonstrated significant improvement over traditional Convolutional Neural Networks (CNNs) on image classification, increasing in performance as networks grow both deeper and wider. However, memory consumption becomes a bottleneck as one needs to store all the intermediate activations for calculating gradients using backpropagation. In this work, we present the Reversible Residual Network (RevNet), a variant of ResNets where each layer’s activations can be reconstructed exactly from the next layer’s. Therefore, the activations for most layers need not be stored in memory during backprop. We demonstrate the effectiveness of RevNets on CIFAR and ImageNet, establishing nearly identical performance to equally-sized ResNets, with activation storage requirements independent of depth.

Authors

Aidan N Gomez, Mengye Ren, Raquel Urtasun, Roger Grosse

Conference

NeurIPS 2017

Full Paper

‘The Reversible Residual Network: Backpropagation Without Storing Activations’ (PDF)

Uber ATG

Category:

Related articles

Engineering, AI, Data / ML

Scaling AI/ML Infrastructure at Uber

March 28 / Global

Engineering, AI, Backend, Data / ML

DataCentral: Uber’s Big Data Observability and Chargeback Platform

February 1 / Global

Engineering, AI

Palette Meta Store Journey

January 18 / Global

Interested in joining Uber Eng?

Engineering, AI, Backend

Cinnamon Auto-Tuner: Adaptive Concurrency in the Wild

December 7, 2023 / Global

Engineering, AI, Data / ML

Risk Entity Watch – Using Anomaly Detection to Fight Fraud

September 28, 2023 / Global

Engineering, AI, Backend, Culture

The Transformative Power of Generative AI in Software Development: Lessons from Uber’s Tech-Wide Hackathon

August 3, 2023 / Global

View more stories

Sign up to drive

Sign up to ride