Deep Learning of Representations: Looking Forward

2 May 2013
Yoshua Bengio

Papers citing "Deep Learning of Representations: Looking Forward"

50 / 196 papers shown
GPU-centric Communication Schemes for HPC and ML Applications
Naveen Namashivayam
GNN
35
0
0
31 Mar 2025
Learning disentangled representations for instrument-based music similarity
Yuka Hashizume
Li Li
Atsushi Miyashita
T. Toda
49
0
0
21 Mar 2025
On Neural Inertial Classification Networks for Pedestrian Activity Recognition
Zeev Yampolsky
Ofir Kruzel
Victoria Khalfin Fekson
Itzik Klein
39
0
0
23 Feb 2025
Enhancement of Neural Inertial Regression Networks: A Data-Driven Perspective
Victoria Khalfin Fekson
Nitsan Pri-Hadash
Netta Palez
Aviad Etzion
Itzik Klein
40
1
0
03 Jan 2025
Decoding Dark Matter: Specialized Sparse Autoencoders for Interpreting Rare Concepts in Foundation Models
Aashiq Muhamed
Mona Diab
Virginia Smith
45
2
0
01 Nov 2024
MoIN: Mixture of Introvert Experts to Upcycle an LLM
Ajinkya Tejankar
K. Navaneet
Ujjawal Panchal
Kossar Pourahmadi
Hamed Pirsiavash
MoE
29
0
0
13 Oct 2024
More Experts Than Galaxies: Conditionally-overlapping Experts With Biologically-Inspired Fixed Routing
Sagi Shaier
Francisco Pereira
K. Wense
Lawrence E Hunter
Matt Jones
MoE
46
0
0
10 Oct 2024
Reflections on Disentanglement and the Latent Space
Ludovica Schaerf
21
0
0
08 Oct 2024
FactorLLM: Factorizing Knowledge via Mixture of Experts for Large Language Models
Zhongyu Zhao
Menghang Dong
Rongyu Zhang
Wenzhao Zheng
Yunpeng Zhang
Huanrui Yang
Dalong Du
Kurt Keutzer
Shanghang Zhang
48
0
0
15 Aug 2024
Mixture of Nested Experts: Adaptive Processing of Visual Tokens
Gagan Jain
Nidhi Hegde
Aditya Kusupati
Arsha Nagrani
Shyamal Buch
Prateek Jain
Anurag Arnab
Sujoy Paul
MoE
48
7
0
29 Jul 2024
SA-DVAE: Improving Zero-Shot Skeleton-Based Action Recognition by Disentangled Variational Autoencoders
Sheng-Wei Li
Zi-Xiang Wei
Wei-Jie Chen
Yi-Hsin Yu
Chih-Yuan Yang
Jane Yung-jen Hsu
DRL
41
3
0
18 Jul 2024
Scaling Diffusion Transformers to 16 Billion Parameters
Zhengcong Fei
Mingyuan Fan
Changqian Yu
Debang Li
Junshi Huang
DiffM
MoE
59
16
0
16 Jul 2024
ColorwAI: Generative Colorways of Textiles through GAN and Diffusion Disentanglement
Ludovica Schaerf
Andrea Alfarano
Eric Postma
DiffM
31
2
0
16 Jul 2024
CiteME: Can Language Models Accurately Cite Scientific Claims?
Ori Press
Andreas Hochlehnert
Ameya Prabhu
Vishaal Udandarao
Ofir Press
Matthias Bethge
47
13
0
10 Jul 2024
Mixture of A Million Experts
Xu Owen He
MoE
41
25
0
04 Jul 2024
Turbo Sparse: Achieving LLM SOTA Performance with Minimal Activated Parameters
Yixin Song
Haotong Xie
Zhengyan Zhang
Bo Wen
Li Ma
Zeyu Mi
Haibo Chen
MoE
34
21
0
10 Jun 2024
Ego-Foresight: Agent Visuomotor Prediction as Regularization for RL
Manuel S. Nunes
Atabak Dehban
Y. Demiris
J. Santos-Victor
48
0
0
27 May 2024
Memory Mosaics
Jianyu Zhang
Niklas Nolte
Ranajoy Sadhukhan
Beidi Chen
Léon Bottou
VLM
73
3
0
10 May 2024
Improving Dictionary Learning with Gated Sparse Autoencoders
Senthooran Rajamanoharan
Arthur Conmy
Lewis Smith
Tom Lieberum
Vikrant Varma
János Kramár
Rohin Shah
Neel Nanda
RALM
35
79
0
24 Apr 2024
Tripod: Three Complementary Inductive Biases for Disentangled Representation Learning
Kyle Hsu
Jubayer Ibn Hamid
Kaylee Burns
Chelsea Finn
Jiajun Wu
CML
26
4
0
16 Apr 2024
Learning Multidimensional Disentangled Representations of Instrumental Sounds for Musical Similarity Assessment
Yuka Hashizume
Li Li
Atsushi Miyashita
T. Toda
30
3
0
10 Apr 2024
Mixture-of-Depths: Dynamically allocating compute in transformer-based language models
David Raposo
Sam Ritter
Blake A. Richards
Timothy Lillicrap
Peter C. Humphreys
Adam Santoro
MoE
40
69
0
02 Apr 2024
MerRec: A Large-scale Multipurpose Mercari Dataset for Consumer-to-Consumer Recommendation Systems
Lichi Li
Zainul Din
Zhen Tan
Sam London
Tianlong Chen
Ajay Daptardar
47
0
0
22 Feb 2024
Interpreting CLIP with Sparse Linear Concept Embeddings (SpLiCE)
Usha Bhalla
Alexander X. Oesterling
Suraj Srinivas
Flavio du Pin Calmon
Himabindu Lakkaraju
41
35
0
16 Feb 2024
Conditional Information Gain Trellis
Ufuk Can Biçici
Tuna Han Salih Meral
L. Akarun
29
2
0
13 Feb 2024
ReLU$^2$ Wins: Discovering Efficient Activation Functions for Sparse LLMs
Zhengyan Zhang
Yixin Song
Guanghui Yu
Xu Han
Yankai Lin
Chaojun Xiao
Chenyang Song
Zhiyuan Liu
Zeyu Mi
Maosong Sun
22
31
0
06 Feb 2024
CompeteSMoE -- Effective Training of Sparse Mixture of Experts via Competition
Quang-Cuong Pham
Giang Do
Huy Nguyen
TrungTin Nguyen
Chenghao Liu
...
Binh T. Nguyen
Savitha Ramasamy
Xiaoli Li
Steven C. H. Hoi
Nhat Ho
25
17
0
04 Feb 2024
Efficient Deweather Mixture-of-Experts with Uncertainty-aware Feature-wise Linear Modulation
Rongyu Zhang
Yulin Luo
Jiaming Liu
Huanrui Yang
Zhen Dong
...
Tomoyuki Okuno
Yohei Nakata
Kurt Keutzer
Yuan Du
Shanghang Zhang
MoMe
MoE
35
3
0
27 Dec 2023
Adaptive Computation Modules: Granular Conditional Computation For Efficient Inference
Bartosz Wójcik
Alessio Devoto
Karol Pustelnik
Pasquale Minervini
Simone Scardapane
23
5
0
15 Dec 2023
Statistical Perspective of Top-K Sparse Softmax Gating Mixture of Experts
Huy Nguyen
Pedram Akbarian
Fanqi Yan
Nhat Ho
MoE
41
16
0
25 Sep 2023
Lightweight Modeling of User Context Combining Physical and Virtual Sensor Data
M. Campana
Dimitris Chatzopoulos
Franca Delmastro
Pan Hui
14
5
0
28 Jun 2023
Neuro-Causal Factor Analysis
Alex Markham
Ming-Yu Liu
Bryon Aragam
Liam Solus
CML
28
3
0
31 May 2023
Disentanglement via Latent Quantization
Kyle Hsu
W. Dorrell
James C. R. Whittington
Jiajun Wu
Chelsea Finn
DRL
26
25
0
28 May 2023
ProtoVAE: Prototypical Networks for Unsupervised Disentanglement
Vaishnavi Patil
Matthew Evanusa
J. JáJá
BDL
DRL
24
0
0
16 May 2023
Towards Convergence Rates for Parameter Estimation in Gaussian-gated Mixture of Experts
Huy Nguyen
TrungTin Nguyen
Khai Nguyen
Nhat Ho
MoE
46
12
0
12 May 2023
Learning Disentangled Semantic Spaces of Explanations via Invertible Neural Networks
Yingji Zhang
Danilo S. Carvalho
André Freitas
DRL
26
7
0
02 May 2023
Correcting Flaws in Common Disentanglement Metrics
Louis Mahon
Lei Shah
Thomas Lukasiewicz
CoGe
DRL
34
3
0
05 Apr 2023
Scaling Vision-Language Models with Sparse Mixture of Experts
Sheng Shen
Z. Yao
Chunyuan Li
Trevor Darrell
Kurt Keutzer
Yuxiong He
VLM
MoE
18
62
0
13 Mar 2023
Inversion dynamics of class manifolds in deep learning reveals tradeoffs underlying generalisation
Simone Ciceri
Lorenzo Cassani
Matteo Osella
P. Rotondo
P. Pizzochero
M. Gherardi
31
7
0
09 Mar 2023
Spatial Mixture-of-Experts
Nikoli Dryden
Torsten Hoefler
MoE
34
9
0
24 Nov 2022
M$^3$ViT: Mixture-of-Experts Vision Transformer for Efficient Multi-task Learning with Model-Accelerator Co-design
Hanxue Liang
Zhiwen Fan
Rishov Sarkar
Ziyu Jiang
Tianlong Chen
Kai Zou
Yu Cheng
Cong Hao
Zhangyang Wang
MoE
36
81
0
26 Oct 2022
DOT-VAE: Disentangling One Factor at a Time
Vaishnavi Patil
Matthew Evanusa
J. JáJá
CoGe
DRL
CML
23
1
0
19 Oct 2022
Commutativity and Disentanglement from the Manifold Perspective
Frank Qiu
CoGe
25
0
0
14 Oct 2022
Formal Semantic Geometry over Transformer-based Variational AutoEncoder
Yingji Zhang
Danilo S. Carvalho
Ian Pratt-Hartmann
André Freitas
26
4
0
12 Oct 2022
Deep Double Descent via Smooth Interpolation
Matteo Gamba
Erik Englesson
Marten Bjorkman
Hossein Azizpour
63
10
0
21 Sep 2022
A Survey of Neural Trees
Haoling Li
Jie Song
Mengqi Xue
Haofei Zhang
Jingwen Ye
Lechao Cheng
Mingli Song
AI4CE
20
6
0
07 Sep 2022
Solving large-scale MEG/EEG source localization and functional connectivity problems simultaneously using state-space models
Jose M. Sanchez-Bornot
R. Sotero
J. Kelso
Damien Coyle
19
3
0
26 Aug 2022
Semi-Supervised Disentanglement of Tactile Contact Geometry from Sliding-Induced Shear
A. Gupta
Alex Church
Nathan Lepora
22
2
0
26 Aug 2022
Doge Tickets: Uncovering Domain-general Language Models by Playing Lottery Tickets
Yi Yang
Chen Zhang
Benyou Wang
Dawei Song
LRM
24
6
0
20 Jul 2022
Analysis of Branch Specialization and its Application in Image Decomposition
Jonathan Brokman
Guy Gilboa
10
2
0
12 Jun 2022