SqueezeNet: AlexNet-level accuracy with 50x fewer parameters and <0.5MB model size. Iandola, F. N., Han, S., Moskewicz, M. W., Ashraf, K., Dally, W. J., & Keutzer, K. November 2016. arXiv:1602.07360 [cs]
Abstract: Recent research on deep convolutional neural networks (CNNs) has focused primarily on improving accuracy. For a given accuracy level, it is typically possible to identify multiple CNN architectures that achieve that accuracy level. With equivalent accuracy, smaller CNN architectures offer at least three advantages: (1) Smaller CNNs require less communication across servers during distributed training. (2) Smaller CNNs require less bandwidth to export a new model from the cloud to an autonomous car. (3) Smaller CNNs are more feasible to deploy on FPGAs and other hardware with limited memory. To provide all of these advantages, we propose a small CNN architecture called SqueezeNet. SqueezeNet achieves AlexNet-level accuracy on ImageNet with 50x fewer parameters. Additionally, with model compression techniques, we are able to compress SqueezeNet to less than 0.5MB (510× smaller than AlexNet).
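The parameter savings come from the paper's Fire module: a "squeeze" layer of 1x1 convolutions that cuts the channel count, feeding an "expand" layer of parallel 1x1 and 3x3 convolutions whose outputs are concatenated. Below is a minimal PyTorch sketch of that idea; PyTorch itself and the class and argument names are assumptions for illustration, while the fire2 channel counts (96 in, 16 squeeze, 64+64 expand) follow the paper's Table 1.

import torch
import torch.nn as nn

class Fire(nn.Module):
    """Sketch of a Fire module: squeeze (1x1), then expand (1x1 and 3x3 in parallel)."""
    def __init__(self, in_ch, squeeze_ch, expand1x1_ch, expand3x3_ch):
        super().__init__()
        # Squeeze: 1x1 convs reduce channels, so the expand convs see fewer inputs
        self.squeeze = nn.Conv2d(in_ch, squeeze_ch, kernel_size=1)
        # Expand: parallel 1x1 and 3x3 branches; 3x3 is padded to keep spatial size
        self.expand1x1 = nn.Conv2d(squeeze_ch, expand1x1_ch, kernel_size=1)
        self.expand3x3 = nn.Conv2d(squeeze_ch, expand3x3_ch, kernel_size=3, padding=1)
        self.relu = nn.ReLU(inplace=True)

    def forward(self, x):
        x = self.relu(self.squeeze(x))
        # Concatenate the two expand branches along the channel axis
        return torch.cat([self.relu(self.expand1x1(x)),
                          self.relu(self.expand3x3(x))], dim=1)

# fire2 from the paper: 96 input channels -> squeeze to 16 -> expand to 64 + 64 = 128
fire2 = Fire(96, 16, 64, 64)
out = fire2(torch.randn(1, 96, 55, 55))  # out.shape == (1, 128, 55, 55)

For scale on the compression claim: uncompressed AlexNet weights are roughly 240 MB, and 240 MB / 510 ≈ 0.47 MB, consistent with the <0.5MB figure.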
@misc{iandola_squeezenet_2016,
title = {{SqueezeNet}: {AlexNet}-level accuracy with 50x fewer parameters and {\textless}0.5{MB} model size},
shorttitle = {{SqueezeNet}},
url = {http://arxiv.org/abs/1602.07360},
abstract = {Recent research on deep convolutional neural networks (CNNs) has focused primarily on improving accuracy. For a given accuracy level, it is typically possible to identify multiple CNN architectures that achieve that accuracy level. With equivalent accuracy, smaller CNN architectures offer at least three advantages: (1) Smaller CNNs require less communication across servers during distributed training. (2) Smaller CNNs require less bandwidth to export a new model from the cloud to an autonomous car. (3) Smaller CNNs are more feasible to deploy on FPGAs and other hardware with limited memory. To provide all of these advantages, we propose a small CNN architecture called SqueezeNet. SqueezeNet achieves AlexNet-level accuracy on ImageNet with 50x fewer parameters. Additionally, with model compression techniques, we are able to compress SqueezeNet to less than 0.5MB (510× smaller than AlexNet).},
language = {en},
urldate = {2023-06-16},
publisher = {arXiv},
author = {Iandola, Forrest N. and Han, Song and Moskewicz, Matthew W. and Ashraf, Khalid and Dally, William J. and Keutzer, Kurt},
month = nov,
year = {2016},
note = {arXiv:1602.07360 [cs]},
keywords = {Computer Science - Artificial Intelligence, Computer Science - Computer Vision and Pattern Recognition},
}
{"_id":"uBaGtiJMnhqn7jiEv","bibbaseid":"iandola-han-moskewicz-ashraf-dally-keutzer-squeezenetalexnetlevelaccuracywith50xfewerparametersandtextless05mbmodelsize-2016","author_short":["Iandola, F. N.","Han, S.","Moskewicz, M. W.","Ashraf, K.","Dally, W. J.","Keutzer, K."],"bibdata":{"bibtype":"misc","type":"misc","title":"SqueezeNet: AlexNet-level accuracy with 50x fewer parameters and \\textless0.5MB model size","shorttitle":"SqueezeNet","url":"http://arxiv.org/abs/1602.07360","abstract":"Recent research on deep convolutional neural networks (CNNs) has focused primarily on improving accuracy. For a given accuracy level, it is typically possible to identify multiple CNN architectures that achieve that accuracy level. With equivalent accuracy, smaller CNN architectures offer at least three advantages: (1) Smaller CNNs require less communication across servers during distributed training. (2) Smaller CNNs require less bandwidth to export a new model from the cloud to an autonomous car. (3) Smaller CNNs are more feasible to deploy on FPGAs and other hardware with limited memory. To provide all of these advantages, we propose a small CNN architecture called SqueezeNet. SqueezeNet achieves AlexNet-level accuracy on ImageNet with 50x fewer parameters. Additionally, with model compression techniques, we are able to compress SqueezeNet to less than 0.5MB (510× smaller than AlexNet).","language":"en","urldate":"2023-06-16","publisher":"arXiv","author":[{"propositions":[],"lastnames":["Iandola"],"firstnames":["Forrest","N."],"suffixes":[]},{"propositions":[],"lastnames":["Han"],"firstnames":["Song"],"suffixes":[]},{"propositions":[],"lastnames":["Moskewicz"],"firstnames":["Matthew","W."],"suffixes":[]},{"propositions":[],"lastnames":["Ashraf"],"firstnames":["Khalid"],"suffixes":[]},{"propositions":[],"lastnames":["Dally"],"firstnames":["William","J."],"suffixes":[]},{"propositions":[],"lastnames":["Keutzer"],"firstnames":["Kurt"],"suffixes":[]}],"month":"November","year":"2016","note":"arXiv:1602.07360 [cs] rate: 5","keywords":"#CNN, #ICLR\\textgreater16, /unread, Computer Science - Artificial Intelligence, Computer Science - Computer Vision and Pattern Recognition, ⭐⭐⭐⭐⭐","bibtex":"@misc{iandola_squeezenet_2016,\n\ttitle = {{SqueezeNet}: {AlexNet}-level accuracy with 50x fewer parameters and {\\textless}0.{5MB} model size},\n\tshorttitle = {{SqueezeNet}},\n\turl = {http://arxiv.org/abs/1602.07360},\n\tabstract = {Recent research on deep convolutional neural networks (CNNs) has focused primarily on improving accuracy. For a given accuracy level, it is typically possible to identify multiple CNN architectures that achieve that accuracy level. With equivalent accuracy, smaller CNN architectures offer at least three advantages: (1) Smaller CNNs require less communication across servers during distributed training. (2) Smaller CNNs require less bandwidth to export a new model from the cloud to an autonomous car. (3) Smaller CNNs are more feasible to deploy on FPGAs and other hardware with limited memory. To provide all of these advantages, we propose a small CNN architecture called SqueezeNet. SqueezeNet achieves AlexNet-level accuracy on ImageNet with 50x fewer parameters. Additionally, with model compression techniques, we are able to compress SqueezeNet to less than 0.5MB (510× smaller than AlexNet).},\n\tlanguage = {en},\n\turldate = {2023-06-16},\n\tpublisher = {arXiv},\n\tauthor = {Iandola, Forrest N. and Han, Song and Moskewicz, Matthew W. and Ashraf, Khalid and Dally, William J. 
and Keutzer, Kurt},\n\tmonth = nov,\n\tyear = {2016},\n\tnote = {arXiv:1602.07360 [cs]\nrate: 5},\n\tkeywords = {\\#CNN, \\#ICLR{\\textgreater}16, /unread, Computer Science - Artificial Intelligence, Computer Science - Computer Vision and Pattern Recognition, ⭐⭐⭐⭐⭐},\n}\n\n\n\n","author_short":["Iandola, F. N.","Han, S.","Moskewicz, M. W.","Ashraf, K.","Dally, W. J.","Keutzer, K."],"key":"iandola_squeezenet_2016","id":"iandola_squeezenet_2016","bibbaseid":"iandola-han-moskewicz-ashraf-dally-keutzer-squeezenetalexnetlevelaccuracywith50xfewerparametersandtextless05mbmodelsize-2016","role":"author","urls":{"Paper":"http://arxiv.org/abs/1602.07360"},"keyword":["#CNN","#ICLR\\textgreater16","/unread","Computer Science - Artificial Intelligence","Computer Science - Computer Vision and Pattern Recognition","⭐⭐⭐⭐⭐"],"metadata":{"authorlinks":{}},"downloads":0,"html":""},"bibtype":"misc","biburl":"https://bibbase.org/zotero/zzhenry2012","dataSources":["nZHrFJKyxKKDaWYM8"],"keywords":["#cnn","#iclr\\textgreater16","/unread","computer science - artificial intelligence","computer science - computer vision and pattern recognition","⭐⭐⭐⭐⭐"],"search_terms":["squeezenet","alexnet","level","accuracy","50x","fewer","parameters","textless0","5mb","model","size","iandola","han","moskewicz","ashraf","dally","keutzer"],"title":"SqueezeNet: AlexNet-level accuracy with 50x fewer parameters and \\textless0.5MB model size","year":2016}