Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1606.08415
Cited By
Gaussian Error Linear Units (GELUs)
27 June 2016
Dan Hendrycks
Kevin Gimpel
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Gaussian Error Linear Units (GELUs)"
50 / 753 papers shown
Title
Equivariant Machine Learning Decoder for 3D Toric Codes
Oliver Weissl
Evgenii Egorov
24
0
0
06 Sep 2024
Multi-Modal Adapter for Vision-Language Models
Dominykas Seputis
Serghei Mihailov
Soham Chatterjee
Zehao Xiao
VLM
28
1
0
03 Sep 2024
SMAFormer: Synergistic Multi-Attention Transformer for Medical Image Segmentation
Fuchen Zheng
Xuhang Chen
Weihuang Liu
Haolun Li
Yingtie Lei
Jiahui He
Chi-Man Pun
Shounjun Zhou
MedIm
29
11
0
31 Aug 2024
WavTokenizer: an Efficient Acoustic Discrete Codec Tokenizer for Audio Language Modeling
Shengpeng Ji
Ziyue Jiang
Xize Cheng
Yifu Chen
Minghui Fang
...
Rongjie Huang
Yidi Jiang
Qian Chen
Zhou Zhao
Zhou Zhao
VLM
54
33
0
29 Aug 2024
Function-Space MCMC for Bayesian Wide Neural Networks
Lucia Pezzetti
Stefano Favaro
Stefano Peluchetti
BDL
124
0
0
26 Aug 2024
P3P: Pseudo-3D Pre-training for Scaling 3D Masked Autoencoders
Xuechao Chen
Ying Chen
Jialin Li
Qiang Nie
Hanqiu Deng
Qixing Huang
Yang Li
Yang Li
3DPC
73
0
0
19 Aug 2024
Cross-View Geolocalization and Disaster Mapping with Street-View and VHR Satellite Imagery: A Case Study of Hurricane IAN
Hao Li
Fabian Deuser
Wenping Yina
Xuanshu Luo
Paul Walther
Gengchen Mai
Wei Huang
Martin Werner
30
4
0
13 Aug 2024
CROME: Cross-Modal Adapters for Efficient Multimodal LLM
Sayna Ebrahimi
Sercan Ö. Arik
Tejas Nama
Tomas Pfister
44
1
0
13 Aug 2024
LipidBERT: A Lipid Language Model Pre-trained on METiS de novo Lipid Library
Tianhao Yu
Cai Yao
Zhuorui Sun
Feng Shi
Lin Zhang
...
Xicheng Zhang
Jiali Zou
Wenshou Wang
C. Lai
Kai Wang
26
3
0
12 Aug 2024
Content-decoupled Contrastive Learning-based Implicit Degradation Modeling for Blind Image Super-Resolution
Jiang Yuan
Ji Ma
Bo Wang
Weiming Hu
30
0
0
10 Aug 2024
On the choice of the non-trainable internal weights in random feature maps
Pinak Mandal
Georg Gottwald
Nicholas Cranch
TPM
40
1
0
07 Aug 2024
Pose Magic: Efficient and Temporally Consistent Human Pose Estimation with a Hybrid Mamba-GCN Network
Xinyi Zhang
Qiqi Bao
Qinpeng Cui
Wenming Yang
Qingmin Liao
3DH
Mamba
28
1
0
06 Aug 2024
Table Transformers for Imputing Textual Attributes
Ting-Ruen Wei
Yuan Wang
Yoshitaka Inoue
Hsin-Tai Wu
Yi Fang
LMTD
32
0
0
04 Aug 2024
Why Rectified Power Unit Networks Fail and How to Improve It: An Effective Theory Perspective
Taeyoung Kim
Myungjoo Kang
25
0
0
04 Aug 2024
Unsupervised Representation Learning by Balanced Self Attention Matching
Daniel Shalam
Simon Korman
SSL
33
0
0
04 Aug 2024
DeMansia: Mamba Never Forgets Any Tokens
Ricky Fang
Mamba
19
0
0
04 Aug 2024
Active Learning for Neural PDE Solvers
Daniel Musekamp
Marimuthu Kalimuthu
David Holzmüller
Makoto Takamoto
Carlos Fernandez
AI4CE
45
4
0
02 Aug 2024
Semi-Supervised Teacher-Reference-Student Architecture for Action Quality Assessment
Wu Yun
Mengshi Qi
Fei Peng
Huadong Ma
46
1
0
29 Jul 2024
Depth-Wise Convolutions in Vision Transformers for Efficient Training on Small Datasets
Tianxiao Zhang
Wenju Xu
Bo Luo
Guanghui Wang
ViT
MDE
40
7
0
28 Jul 2024
DC is all you need: describing ReLU from a signal processing standpoint
Christodoulos Kechris
Jonathan Dan
Jose Miranda
David Atienza
33
1
0
23 Jul 2024
Stretching Each Dollar: Diffusion Training from Scratch on a Micro-Budget
Vikash Sehwag
Xianghao Kong
Jingtao Li
Michael Spranger
Lingjuan Lyu
DiffM
44
9
0
22 Jul 2024
PolyR-CNN: R-CNN for end-to-end polygonal building outline extraction
Weiqin Jiao
Claudio Persello
G. Vosselman
3DV
23
2
0
20 Jul 2024
MedMAE: A Self-Supervised Backbone for Medical Imaging Tasks
Anubhav Gupta
Islam I. Osman
Mohamed S. Shehata
John W. Braun
29
1
0
20 Jul 2024
EnergyDiff: Universal Time-Series Energy Data Generation using Diffusion Models
Nan Lin
Peter Palensky
Pedro P. Vergara
DiffM
29
0
0
18 Jul 2024
VD3D: Taming Large Video Diffusion Transformers for 3D Camera Control
Sherwin Bahmani
Ivan Skorokhodov
Aliaksandr Siarohin
Willi Menapace
Guocheng Qian
...
Chaoyang Wang
Jiaxu Zou
Andrea Tagliasacchi
David B. Lindell
Sergey Tulyakov
VGen
DiffM
85
42
0
17 Jul 2024
PartImageNet++ Dataset: Scaling up Part-based Models for Robust Recognition
Xiao-Li Li
Yining Liu
Na Dong
Sitian Qin
Xiaolin Hu
36
3
0
15 Jul 2024
Any-Property-Conditional Molecule Generation with Self-Criticism using Spanning Trees
Alexia Jolicoeur-Martineau
A. Baratin
Kisoo Kwon
Boris Knyazev
Yan Zhang
36
1
0
12 Jul 2024
Don't Fear Peculiar Activation Functions: EUAF and Beyond
Qianchao Wang
Shijun Zhang
Dong Zeng
Zhaoheng Xie
Hengtao Guo
Feng-Lei Fan
Tieyong Zeng
36
3
0
12 Jul 2024
BiGym: A Demo-Driven Mobile Bi-Manual Manipulation Benchmark
Nikita Chernyadev
Nicholas Backshall
Xiao Ma
Yunfan Lu
Younggyo Seo
Stephen James
22
11
0
10 Jul 2024
MambaVision: A Hybrid Mamba-Transformer Vision Backbone
Ali Hatamizadeh
Jan Kautz
Mamba
40
56
0
10 Jul 2024
HDKD: Hybrid Data-Efficient Knowledge Distillation Network for Medical Image Classification
Omar S. El-Assiouti
Ghada Hamed
Dina Khattab
H. M. Ebied
35
1
0
10 Jul 2024
Deconstructing What Makes a Good Optimizer for Language Models
Rosie Zhao
Depen Morwani
David Brandfonbrener
Nikhil Vyas
Sham Kakade
42
17
0
10 Jul 2024
Uni-ELF: A Multi-Level Representation Learning Framework for Electrolyte Formulation Design
Boshen Zeng
Sian Chen
Xinxin Liu
Changhong Chen
Bin Deng
Xiaoxu Wang
Zhifeng Gao
Yuzhi Zhang
Weinan E
Linfeng Zhang
13
1
0
08 Jul 2024
BiRoDiff: Diffusion policies for bipedal robot locomotion on unseen terrains
Gvs Mothish
Manan Tayal
Shishir Kolathaya
36
4
0
07 Jul 2024
MMAD: Multi-label Micro-Action Detection in Videos
Kun Li
Pengyu Liu
Pengyu Liu
Guoliang Chen
Zhiliang Wu
Hehe Fan
Meng Wang
40
7
0
07 Jul 2024
Improving ensemble extreme precipitation forecasts using generative artificial intelligence
Yingkai Sha
R. Sobash
David John Gagne II
25
0
0
05 Jul 2024
AWT: Transferring Vision-Language Models via Augmentation, Weighting, and Transportation
Yuhan Zhu
Yuyang Ji
Zhiyu Zhao
Gangshan Wu
Limin Wang
VLM
39
7
0
05 Jul 2024
Learning to (Learn at Test Time): RNNs with Expressive Hidden States
Yu Sun
Xinhao Li
Karan Dalal
Jiarui Xu
Arjun Vikram
...
Xinlei Chen
Xiaolong Wang
Sanmi Koyejo
Tatsunori Hashimoto
Carlos Guestrin
58
93
0
05 Jul 2024
MUSE-Net: Missingness-aware mUlti-branching Self-attention Encoder for Irregular Longitudinal Electronic Health Records
Zekai Wang
Tieming Liu
B. Yao
31
0
0
30 Jun 2024
LoPT: Low-Rank Prompt Tuning for Parameter Efficient Language Models
Shouchang Guo
Sonam Damani
Keng-hao Chang
VLM
28
1
0
27 Jun 2024
A Sanity Check for AI-generated Image Detection
Shilin Yan
Ouxiang Li
Jiayin Cai
Y. Hao
Xiaolong Jiang
Yao Hu
Weidi Xie
VLM
64
20
0
27 Jun 2024
MATE: Meet At The Embedding -- Connecting Images with Long Texts
Young Kyun Jang
Junmo Kang
Yong Jae Lee
Donghyun Kim
VLM
38
5
0
26 Jun 2024
Enhancing Monotonic Modeling with Spatio-Temporal Adaptive Awareness in Diverse Marketing
Bin Li
Jiayan Pei
Feiyang Xiao
Yifan Zhao
Zhixing Zhang
Diwei Liu
Hengxu He
Jia Jia
29
0
0
20 Jun 2024
UniZero: Generalized and Efficient Planning with Scalable Latent World Models
Yuan Pu
Yazhe Niu
Jiyuan Ren
Zhenjie Yang
Hongsheng Li
Yu Liu
OffRL
41
1
0
15 Jun 2024
An Empirical Study of Mamba-based Language Models
R. Waleffe
Wonmin Byeon
Duncan Riach
Brandon Norick
V. Korthikanti
...
Vartika Singh
Jared Casper
Jan Kautz
M. Shoeybi
Bryan Catanzaro
61
64
0
12 Jun 2024
Loss Gradient Gaussian Width based Generalization and Optimization Guarantees
A. Banerjee
Qiaobo Li
Yingxue Zhou
44
0
0
11 Jun 2024
Geometric sparsification in recurrent neural networks
Wyatt Mackey
Ioannis Schizas
Jared Deighton
David L. Boothe, Jr.
Vasileios Maroulas
28
0
0
10 Jun 2024
Aligning Agents like Large Language Models
Adam Jelley
Yuhan Cao
Dave Bignell
Sam Devlin
Tabish Rashid
LM&Ro
36
1
0
06 Jun 2024
Feature contamination: Neural networks learn uncorrelated features and fail to generalize
Tianren Zhang
Chujie Zhao
Guanyu Chen
Yizhou Jiang
Feng Chen
OOD
MLT
OODD
77
3
0
05 Jun 2024
Textless Acoustic Model with Self-Supervised Distillation for Noise-Robust Expressive Speech-to-Speech Translation
Min-Jae Hwang
Ilia Kulikov
Benjamin Peloquin
Hongyu Gong
Peng-Jen Chen
Ann Lee
27
1
0
04 Jun 2024
Previous
1
2
3
4
5
6
...
14
15
16
Next