Abstract
Model compression is crucial for the deployment of neural networks on devices with limited computational and memory resources. Many different methods show comparable accuracy of the compressed model and similar compression rates. However, the majority of the compression methods are based on heuristics and offer no worst case guarantees on the tradeoff between the compression rate and the approximation error for an arbitrarily new sample. We propose the first efficient structured pruning algorithm with a provable tradeoff between its compression rate and the approximation error for any future test sample. Our method is based on the coreset framework, and it approximates the output of a layer of neurons/filters by a coreset of neurons/filters in the previous layer and discards the rest. We apply this framework in a layer-by-layer fashion from the bottom to the top. Unlike previous works, our coreset is data-independent, meaning that it provably guarantees the accuracy of the function for any input [Formula: see text], including an adversarial one.
| Original language | English |
|---|---|
| Pages (from-to) | 7829-7841 |
| Number of pages | 13 |
| Journal | IEEE Transactions on Neural Networks and Learning Systems |
| Volume | 33 |
| Issue number | 12 |
| DOIs | |
| State | Published - Dec 2022 |
Bibliographical note
Publisher Copyright:IEEE
Keywords
- Algorithms
- Data Compression
- Neural Networks, Computer
- Neurons
Fingerprint
Dive into the research topics of 'Data-Independent Structured Pruning of Neural Networks via Coresets'. Together they form a unique fingerprint.Related research output
- 1 Conference contribution
-
Data-Independent Structured Pruning of Neural Networks via Coresets
Mussay, B., Feldman, D., Zhou, S., Braverman, V. & Osadchy, M., 2020, 8th International Conference on Learning Representations, ICLR 2020. 24 p.Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review
Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver