EfficientDNNs
A collection of recent methods on DNN compression and acceleration. There are mainly five kinds of methods for efficient DNNs (a short illustrative code sketch follows the list):
- neural architecture re-design or search (NAS)
  - maintain accuracy, less cost (e.g., #Params, #FLOPs): MobileNet, ShuffleNet, etc.
  - maintain cost, more accuracy: Inception, ResNeXt, Xception, etc.
- pruning (including structured and unstructured)
- quantization
- matrix/low-rank decomposition
- knowledge distillation (KD)
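To make the taxonomy above concrete, below is a minimal, illustrative sketch (not taken from any paper in this list) of three techniques this repo focuses on: unstructured magnitude pruning, post-training dynamic quantization, and a standard knowledge-distillation loss. It assumes PyTorch; the toy model, sparsity ratio, temperature `T`, and weight `alpha` are arbitrary choices for illustration, not recommendations.

```python
# Illustrative sketch only: magnitude pruning, dynamic quantization, and a KD loss.
import torch
import torch.nn as nn
import torch.nn.functional as F
import torch.nn.utils.prune as prune

# A toy model standing in for a real network.
model = nn.Sequential(nn.Linear(784, 256), nn.ReLU(), nn.Linear(256, 10))

# (1) Unstructured pruning: zero out the 50% smallest-magnitude weights per Linear layer.
for module in model.modules():
    if isinstance(module, nn.Linear):
        prune.l1_unstructured(module, name="weight", amount=0.5)
        prune.remove(module, "weight")  # fold the mask into the weight tensor

# (2) Post-training dynamic quantization: Linear weights stored and used in int8 at inference.
quantized = torch.quantization.quantize_dynamic(model, {nn.Linear}, dtype=torch.qint8)

# (3) Knowledge distillation: student matches the teacher's softened outputs
#     plus the usual cross-entropy on the hard labels.
def kd_loss(student_logits, teacher_logits, labels, T=4.0, alpha=0.9):
    soft = F.kl_div(F.log_softmax(student_logits / T, dim=1),
                    F.softmax(teacher_logits / T, dim=1),
                    reduction="batchmean") * (T * T)
    hard = F.cross_entropy(student_logits, labels)
    return alpha * soft + (1 - alpha) * hard

# Quick smoke test with random data.
x = torch.randn(8, 784)
teacher_logits = torch.randn(8, 10)   # stand-in for a real teacher's outputs
labels = torch.randint(0, 10, (8,))
print(quantized(x).shape)                          # torch.Size([8, 10])
print(kd_loss(quantized(x), teacher_logits, labels).item())
```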
Note: this repo is mainly about pruning (with the lottery ticket hypothesis, or LTH, as a sub-topic), KD, and quantization. For other topics such as NAS, see the more comprehensive collections listed in the Related Repos and Websites section at the end of this file. Welcome to send a pull request if you'd like to add any pertinent papers.
Other repos:
- LTH (lottery ticket hypothesis) and its broader version, pruning at initialization (PaI), are now at the frontier of network pruning. We single out the PaI papers into a dedicated repo. Welcome to check it out!
- Awesome-Efficient-ViT for a curated list of efficient vision transformers.
About abbreviations: in the list below, `o` stands for oral, `s` for spotlight, `b` for best paper, and `w` for workshop.
Surveys
- 1993-TNN-Pruning Algorithms -- A survey
- 2017-Proceedings of the IEEE-Efficient Processing of Deep Neural Networks: A Tutorial and Survey [2020 Book: Efficient Processing of Deep Neural Networks]
- 2017.12-A survey of FPGA-based neural network accelerator
- 2018-FITEE-Recent Advances in Efficient Computation of Deep Convolutional Neural Networks
- 2018-IEEE Signal Processing Magazine-Model compression and acceleration for deep neural networks: The principles, progress, and challenges [arXiv extension]
- 2018.8-A Survey on Methods and Theories of Quantized Neural Networks
- 2019-JMLR-Neural Architecture Search: A Survey
- 2020-MLSys-What is the State of Neural Network Pruning?
- 2019.02-The State of Sparsity in Deep Neural Networks
- 2021-TPAMI-Knowledge Distillation and Student-Teacher Learning for Visual Intelligence: A Review and New Outlooks
- 2021-IJCV-Knowledge Distillation: A Survey
- 2020-Proceedings of the IEEE-Model Compression and Hardware Acceleration for Neural Networks: A Comprehensive Survey
- 2020-Pattern Recognition-Binary neural networks: A survey
- 2021-TPDS-The Deep Learning Compiler: A Comprehensive Survey
- 2021-JMLR-Sparsity in Deep Learning: Pruning and growth for efficient inference and training in neural networks
- 2022-IJCAI-Recent Advances on Neural Network Pruning at Initialization
- 2021.6-Efficient Deep Learning: A Survey on Making Deep Learning Models Smaller, Faster, and Better
Papers [Pruning and Quantization]
1980s, 1990s
- 1988-NIPS-A back-propagation algorithm with optimal use of hidden units
- 1988-NIPS-Skeletonization: A Technique for Trimming the Fat from a Network via Relevance Assessment
- 1988-NIPS-What Size Net Gives Valid Generalization?
- 1989-NIPS-Dynamic Behavior of Constrained Back-Propagation Networks
- 1988-NIPS-Comparing Biases for Minimal Network Construction with Back-Propagation
- 1989-NIPS-Optimal Brain Damage
- 1990-NN-A simple procedure for pruning back-propagation trained neural networks
- 1992-NIPS-Second order derivatives for network pruning: Optimal Brain Surgeon
- 1993-ICNN-Optimal Brain Surgeon and general network pruning
2000s
- 2001-JMLR-Sparse Bayesian learning and the relevance vector machine
- 2007-Book-The minimum description length principle
2011
- 2011-JMLR-Learning with Structured Sparsity
- 2011-NIPSw-Improving the speed of neural networks on CPUs
2013
- 2013-NIPS-Predicting Parameters in Deep Learning
- 2013.08-Estimating or Propagating Gradients Through Stochastic Neurons for Conditional Computation
2014
- 2014-BMVC-Speeding up convolutional neural networks with low rank expansions
- 2014-INTERSPEECH-1-Bit Stochastic Gradient Descent and its Application to Data-Parallel Distributed Training of Speech DNNs
- 2014-NIPS-Exploiting Linear Structure Within Convolutional Networks for Efficient Evaluation
- 2014-NIPS-Do deep neural nets really need to be deep
- 2014.12-Memory bounded deep convolutional networks
2015
- 2015-ICLR-Speeding-up convolutional neural networks using fine-tuned CP-decomposition
- 2015-ICML-Compressing neural networks with the hashing trick
- 2015-INTERSPEECH-A Diversity-Penalizing Ensemble Training Method for Deep Learning
- 2015-BMVC-Data-free parameter pruning for deep neural networks
- 2015-BMVC-Learning the structure of deep architectures using l1 regularization
- 2015-NIPS-Learning both Weights and Connections for Efficient Neural Network
- 2015-NIPS-Binaryconnect: Training deep neural networks with binary weights during propagations
- 2015-NIPS-Structured Transforms for Small-Footprint Deep Learning
- 2015-NIPS-Tensorizing Neural Networks
- 2015-NIPSw-Distilling Intractable Generative Models
- 2015-NIPSw-Federated Optimization: Distributed Optimization Beyond the Datacenter
- 2015-CVPR-Efficient and Accurate Approximations of Nonlinear Convolutional Networks [2016 TPAMI version: Accelerating Very Deep Convolutional Networks for Classification and Detection]
- 2015-CVPR-Sparse Convolutional Neural Networks
- 2015-ICCV-An Exploration of Parameter Redundancy in Deep Networks with Circulant Projections
- 2015.12-Exploiting Local Structures with the Kronecker Layer in Convolutional Networks
2016
- 2016-ICLR-Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding [Best paper!]
- 2016-ICLR-All you need is a good init [Code]
- 2016-ICLR-Data-dependent Initializations of Convolutional Neural Networks [Code]
- 2016-ICLR-Convolutional neural networks with low-rank regularization [Code]
- 2016-ICLR-Diversity networks
- 2016-ICLR-Neural networks with few multiplications
- 2016-ICLR-Compression of deep convolutional neural networks for fast and low power mobile applications
- 2016-ICLRw-Randomout: Using a convolutional gradient norm to win the filter lottery
- 2016-CVPR-Fast algorithms for convolutional neural networks
- 2016-CVPR-Fast ConvNets Using Group-wise Brain Damage
- 2016-BMVC-Learning neural network architectures using backpropagation
- 2016-ECCV-Less is more: Towards compact CNNs
- 2016-EMNLP-Sequence-Level Knowledge Distillation
- 2016-NIPS-Learning Structured Sparsity in Deep Neural Networks [Caffe Code]
- 2016-NIPS-Dynamic Network Surgery for Efficient DNNs [Caffe Code]
- 2016-NIPS-Learning the Number of Neurons in Deep Neural Networks
- 2016-NIPS-Memory-Efficient Backpropagation Through Time
- 2016-NIPS-PerforatedCNNs: Acceleration through Elimination of Redundant Convolutions
- 2016-NIPS-LightRNN: Memory and Computation-Efficient Recurrent Neural Networks
- 2016-NIPS-CNNpack: packing convolutional neural networks in the frequency domain
- 2016-ISCA-Eyeriss: A Spatial Architecture for Energy-Efficient Dataflow for Convolutional Neural Networks
- 2016-ICASSP-Learning compact recurrent neural networks
- 2016-CoNLL-Compression of Neural Machine Translation Models via Pruning
- 2016.03-Adaptive Computation Time for Recurrent Neural Networks
- 2016.06-Structured Convolution Matrices for Energy-efficient Deep Learning