Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1612.01064
Cited By
Trained Ternary Quantization
4 December 2016
Chenzhuo Zhu
Song Han
Huizi Mao
W. Dally
MQ
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Trained Ternary Quantization"
50 / 509 papers shown
Title
Silenzio: Secure Non-Interactive Outsourced MLP Training
Jonas Sander
T. Eisenbarth
28
0
0
24 Apr 2025
BackSlash: Rate Constrained Optimized Training of Large Language Models
Jun Wu
Jiangtao Wen
Yuxing Han
34
0
0
23 Apr 2025
Tin-Tin: Towards Tiny Learning on Tiny Devices with Integer-based Neural Network Training
Yi Hu
Jinhang Zuo
Eddie Zhang
Bob Iannucci
Carlee Joe-Wong
24
0
0
13 Apr 2025
PARQ: Piecewise-Affine Regularized Quantization
Lisa Jin
Jianhao Ma
Zechun Liu
Andrey Gromov
Aaron Defazio
Lin Xiao
MQ
38
0
0
19 Mar 2025
Continual Quantization-Aware Pre-Training: When to transition from 16-bit to 1.58-bit pre-training for BitNet language models?
Jacob Nielsen
Peter Schneider-Kamp
Lukas Galke
MQ
61
1
0
17 Feb 2025
Histogram-Equalized Quantization for logic-gated Residual Neural Networks
Van Thien Nguyen
William Guicquero
Gilles Sicard
MQ
38
1
0
10 Jan 2025
DEX: Data Channel Extension for Efficient CNN Inference on Tiny AI Accelerators
Taesik Gong
F. Kawsar
Chulhong Min
59
3
0
09 Dec 2024
Optimizing Large Language Models through Quantization: A Comparative Analysis of PTQ and QAT Techniques
Jahid Hasan
MQ
25
1
0
09 Nov 2024
t-READi: Transformer-Powered Robust and Efficient Multimodal Inference for Autonomous Driving
Pengfei Hu
Yuhang Qian
Tianyue Zheng
Ang Li
Zhe Chen
Yue Gao
Xiuzhen Cheng
Jun-Jie Luo
26
0
0
13 Oct 2024
Gradient-Free Neural Network Training on the Edge
Dotan Di Castro
O. Joglekar
Shir Kozlovsky
Vladimir Tchuiev
Michal Moshkovitz
MQ
14
0
0
13 Oct 2024
Self-Masking Networks for Unsupervised Adaptation
Alfonso Taboada Warmerdam
Mathilde Caron
Yuki M. Asano
39
1
0
11 Sep 2024
Toward Efficient Convolutional Neural Networks With Structured Ternary Patterns
Christos Kyrkou
34
0
0
20 Jul 2024
Quality Scalable Quantization Methodology for Deep Learning on Edge
S. Khaliq
Rehan Hafiz
MQ
35
1
0
15 Jul 2024
BitsFusion: 1.99 bits Weight Quantization of Diffusion Model
Yang Sui
Yanyu Li
Anil Kag
Yerlan Idelbayev
Junli Cao
Ju Hu
Dhritiman Sagar
Bo Yuan
Sergey Tulyakov
Jian Ren
MQ
39
18
0
06 Jun 2024
TerDiT: Ternary Diffusion Models with Transformers
Xudong Lu
Aojun Zhou
Ziyi Lin
Qi Liu
Yuhui Xu
Renrui Zhang
Yafei Wen
Shuai Ren
Peng Gao
Junchi Yan
MQ
45
2
0
23 May 2024
From Algorithm to Hardware: A Survey on Efficient and Safe Deployment of Deep Neural Networks
Xue Geng
Zhe Wang
Chunyun Chen
Qing Xu
Kaixin Xu
...
Zhenghua Chen
M. Aly
Jie Lin
Min-man Wu
Xiaoli Li
33
1
0
09 May 2024
Lightweight Change Detection in Heterogeneous Remote Sensing Images with Online All-Integer Pruning Training
Chengyang Zhang
Weiming Li
Gang Li
Huina Song
Zhaohui Song
Xueqian Wang
Antonio Plaza
31
0
0
03 May 2024
Lightweight Deep Learning for Resource-Constrained Environments: A Survey
Hou-I Liu
Marco Galindo
Hongxia Xie
Lai-Kuan Wong
Hong-Han Shuai
Yung-Hui Li
Wen-Huang Cheng
50
48
0
08 Apr 2024
Tiny Machine Learning: Progress and Futures
Ji Lin
Ligeng Zhu
Wei-Ming Chen
Wei-Chen Wang
Song Han
36
51
0
28 Mar 2024
DaCapo: Accelerating Continuous Learning in Autonomous Systems for Video Analytics
Yoonsung Kim
Changhun Oh
Jinwoo Hwang
Wonung Kim
Seongryong Oh
Yubin Lee
Hardik Sharma
Amir Yazdanbakhsh
Jongse Park
33
7
0
21 Mar 2024
On the Convergence of Federated Learning Algorithms without Data Similarity
Ali Beikmohammadi
Sarit Khirirat
Sindri Magnússon
FedML
33
1
0
29 Feb 2024
Less is KEN: a Universal and Simple Non-Parametric Pruning Algorithm for Large Language Models
Michele Mastromattei
Fabio Massimo Zanzotto
VLM
23
1
0
05 Feb 2024
Effect of Weight Quantization on Learning Models by Typical Case Analysis
Shuhei Kashiwamura
Ayaka Sakata
Masaaki Imaizumi
MQ
17
1
0
30 Jan 2024
Learning Long Sequences in Spiking Neural Networks
Matei Ioan Stan
Oliver Rhodes
35
10
0
14 Dec 2023
PLUM: Improving Inference Efficiency By Leveraging Repetition-Sparsity Trade-Off
Sachit Kuhar
Yash Jain
Alexey Tumanov
MQ
52
0
0
04 Dec 2023
Neural Language Model Pruning for Automatic Speech Recognition
Leonardo Emili
Thiago Fraga-Silva
Ernest Pusateri
M. Nußbaum-Thom
Youssef Oualil
22
1
0
05 Oct 2023
SupeRBNN: Randomized Binary Neural Network Using Adiabatic Superconductor Josephson Devices
Z. Li
Geng Yuan
Tomoharu Yamauchi
Zabihi Masoud
Yanyue Xie
...
Xulong Tang
Nobuyuki Yoshikawa
Devesh Tiwari
Yanzhi Wang
O. Chen
MQ
4
4
0
21 Sep 2023
Real-Time Semantic Segmentation: A Brief Survey & Comparative Study in Remote Sensing
Clifford Broni-Bediako
Junshi Xia
Naoto Yokoya
36
9
0
12 Sep 2023
On-Chip Hardware-Aware Quantization for Mixed Precision Neural Networks
Wei Huang
Haotong Qin
Yangdong Liu
Jingzhuo Liang
Yifu Ding
Ying Li
Xianglong Liu
MQ
21
0
0
05 Sep 2023
Token-Scaled Logit Distillation for Ternary Weight Generative Language Models
Minsoo Kim
Sihwa Lee
Jangwhan Lee
S. Hong
Duhyeuk Chang
Wonyong Sung
Jungwook Choi
MQ
16
14
0
13 Aug 2023
Search-time Efficient Device Constraints-Aware Neural Architecture Search
Oshin Dutta
Tanu Kanvar
Sumeet Agarwal
28
3
0
10 Jul 2023
Learning Discrete Weights and Activations Using the Local Reparameterization Trick
G. Berger
Aviv Navon
Ethan Fetaya
MQ
20
0
0
04 Jul 2023
Data-Free Quantization via Mixed-Precision Compensation without Fine-Tuning
Jun Chen
Shipeng Bai
Tianxin Huang
Mengmeng Wang
Guanzhong Tian
Y. Liu
MQ
34
18
0
02 Jul 2023
Q-YOLO: Efficient Inference for Real-time Object Detection
Mingze Wang
H. Sun
Jun Shi
Xuhui Liu
Baochang Zhang
Xianbin Cao
ObjD
28
8
0
01 Jul 2023
Designing strong baselines for ternary neural network quantization through support and mass equalization
Edouard Yvinec
Arnaud Dapogny
Kévin Bailly
MQ
11
0
0
30 Jun 2023
CAMEL: Co-Designing AI Models and Embedded DRAMs for Efficient On-Device Learning
Sai Qian Zhang
Thierry Tambe
Nestor Cuevas
Gu-Yeon Wei
David Brooks
10
4
0
04 May 2023
Cuttlefish: Low-Rank Model Training without All the Tuning
Hongyi Wang
Saurabh Agarwal
Pongsakorn U-chupala
Yoshiki Tanaka
Eric P. Xing
Dimitris Papailiopoulos
OffRL
56
21
0
04 May 2023
TorchBench: Benchmarking PyTorch with High API Surface Coverage
Yueming Hao
Xu Zhao
Bin Bao
David Berard
William Constable
Adnan Aziz
Xu Liu
25
5
0
27 Apr 2023
Stable and low-precision training for large-scale vision-language models
Mitchell Wortsman
Tim Dettmers
Luke Zettlemoyer
Ari S. Morcos
Ali Farhadi
Ludwig Schmidt
MQ
MLLM
VLM
22
38
0
25 Apr 2023
AutoQNN: An End-to-End Framework for Automatically Quantizing Neural Networks
Cheng Gong
Ye Lu
Surong Dai
Deng Qian
Chenkun Du
Tao Li
MQ
27
0
0
07 Apr 2023
MadEye: Boosting Live Video Analytics Accuracy with Adaptive Camera Configurations
M. Wong
M. Ramanujam
Guha Balakrishnan
Ravi Netravali
27
4
0
04 Apr 2023
Q-DETR: An Efficient Low-Bit Quantized Detection Transformer
Sheng Xu
Yanjing Li
Mingbao Lin
Penglei Gao
Guodong Guo
Jinhu Lu
Baochang Zhang
MQ
18
23
0
01 Apr 2023
A Generative Framework for Low-Cost Result Validation of Machine Learning-as-a-Service Inference
Abhinav Kumar
Miguel A. Guirao Aguilera
R. Tourani
S. Misra
AAML
19
0
0
31 Mar 2023
Information-Theoretic GAN Compression with Variational Energy-based Model
Minsoo Kang
Hyewon Yoo
Eunhee Kang
Sehwan Ki
Hyong-Euk Lee
Bohyung Han
GAN
18
3
0
28 Mar 2023
Energy-efficient Task Adaptation for NLP Edge Inference Leveraging Heterogeneous Memory Architectures
Zirui Fu
Aleksandre Avaliani
M. Donato
39
1
0
25 Mar 2023
Mathematical Challenges in Deep Learning
V. Nia
Guojun Zhang
I. Kobyzev
Michael R. Metel
Xinlin Li
...
S. Hemati
M. Asgharian
Linglong Kong
Wulong Liu
Boxing Chen
AI4CE
VLM
35
1
0
24 Mar 2023
Scalable Object Detection on Embedded Devices Using Weight Pruning and Singular Value Decomposition
D. Ham
Jaeyeop Jeong
June-Kyoo Park
Raehyeon Jeong
S. Jeon
Hyeongjun Jeon
Ye-Eun Lim
8
0
0
05 Mar 2023
Ternary Quantization: A Survey
Danyang Liu
Xue Liu
MQ
16
3
0
02 Mar 2023
Structured Pruning for Deep Convolutional Neural Networks: A survey
Yang He
Lingao Xiao
3DPC
28
116
0
01 Mar 2023
Teacher Intervention: Improving Convergence of Quantization Aware Training for Ultra-Low Precision Transformers
Minsoo Kim
Kyuhong Shim
Seongmin Park
Wonyong Sung
Jungwook Choi
MQ
11
1
0
23 Feb 2023
1
2
3
4
...
9
10
11
Next