ILP-M Conv: Optimize Convolution Algorithm for Single-Image Convolution Neural Network Inference on Mobile GPUs
| Content Provider | arXiv |
|---|---|
| Author | Ji, Zhuoran |
| Date of Submission | 2019-10-03 |
| Abstract | Convolution neural networks are widely used in mobile applications. However, GPU convolution algorithms are designed for mini-batch neural network training, and single-image convolution neural network inference on mobile GPUs is not well studied. After discussing the difference in usage and examining the existing convolution algorithms, we propose the HNTMP convolution algorithm. The HNTMP convolution algorithm achieves a $14.6 \times$ speedup over the most popular convolution algorithm, *im2col*, and a $2.30 \times$ speedup over direct convolution, which is, as far as we know, the fastest existing convolution algorithm. |
| Related Links | https://arxiv.org/pdf/1909.02765.pdf |
| arXiv | 1909.02765 |
| Language | English |
| Access Restriction | Open |
| Subject Keyword | Computer Science - Distributed, Parallel, and Cluster Computing; Computer Science - Computer Vision and Pattern Recognition; Computer Science - Performance; Computer Science |
| Content Type | Text |
| Resource Type | Article |
| Subject | Computer Science |