ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2305.14405
  4. Cited By
NeuralMatrix: Compute the Entire Neural Networks with Linear Matrix
  Operations for Efficient Inference

NeuralMatrix: Compute the Entire Neural Networks with Linear Matrix Operations for Efficient Inference

23 May 2023
Ruiqi Sun
Siwei Ye
Jie Zhao
Xin He
Yiran Li
An Zou
ArXivPDFHTML

Papers citing "NeuralMatrix: Compute the Entire Neural Networks with Linear Matrix Operations for Efficient Inference"

2 / 2 papers shown
Title
I-BERT: Integer-only BERT Quantization
I-BERT: Integer-only BERT Quantization
Sehoon Kim
A. Gholami
Z. Yao
Michael W. Mahoney
Kurt Keutzer
MQ
86
332
0
05 Jan 2021
GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language
  Understanding
GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding
Alex Jinpeng Wang
Amanpreet Singh
Julian Michael
Felix Hill
Omer Levy
Samuel R. Bowman
ELM
294
6,927
0
20 Apr 2018
1