Loading...
Please wait, while we are loading the content...
Similar Documents
Caffe con Troll: Shallow Ideas to Speed Up Deep Learning
| Content Provider | ACM Digital Library |
|---|---|
| Author | Abuzaid, Firas Hadjis, Stefan Zhang, Ce Ré, Christopher |
| Abstract | We present Caffe con Troll (CcT), a fully compatible end-to-end version of the popular framework Caffe with rebuilt internals. We built CcT to examine the performance characteristics of training and deploying general-purpose convolutional neural networks across different hardware architectures. We find that, by employing standard batching optimizations for CPU training, we achieve a 6:3× throughput improvement over Caffe on popular networks like CaffeNet. Moreover, with these improvements, the end-to-end training time for CNNs is directly proportional to the FLOPS delivered by the CPU, which enables us to efficiently train hybrid CPU-GPU systems for CNNs. |
| Starting Page | 1 |
| Ending Page | 4 |
| Page Count | 4 |
| File Format | |
| ISBN | 9781450337243 |
| DOI | 10.1145/2799562.2799641 |
| Language | English |
| Publisher | Association for Computing Machinery (ACM) |
| Publisher Date | 2015-05-31 |
| Publisher Place | New York |
| Access Restriction | Subscribed |
| Content Type | Text |
| Resource Type | Article |