Deep Learning of Representations: Looking Forward

2 May 2013

Papers citing "Deep Learning of Representations: Looking Forward"

50 / 196 papers shown

Title
GPU-centric Communication Schemes for HPC and ML Applications Naveen Namashivayam GNN 35 0 0 31 Mar 2025
Learning disentangled representations for instrument-based music similarity Yuka Hashizume Li Li Atsushi Miyashita T. Toda 49 0 0 21 Mar 2025
On Neural Inertial Classification Networks for Pedestrian Activity Recognition Zeev Yampolsky Ofir Kruzel Victoria Khalfin Fekson Itzik Klein 39 0 0 23 Feb 2025
Enhancement of Neural Inertial Regression Networks: A Data-Driven Perspective Victoria Khalfin Fekson Nitsan Pri-Hadash Netta Palez Aviad Etzion Itzik Klein 40 1 0 03 Jan 2025
Decoding Dark Matter: Specialized Sparse Autoencoders for Interpreting Rare Concepts in Foundation Models Aashiq Muhamed Mona Diab Virginia Smith 45 2 0 01 Nov 2024
MoIN: Mixture of Introvert Experts to Upcycle an LLM Ajinkya Tejankar K. Navaneet Ujjawal Panchal Kossar Pourahmadi Hamed Pirsiavash MoE 29 0 0 13 Oct 2024
More Experts Than Galaxies: Conditionally-overlapping Experts With Biologically-Inspired Fixed Routing Sagi Shaier Francisco Pereira K. Wense Lawrence E Hunter Matt Jones MoE 46 0 0 10 Oct 2024
Reflections on Disentanglement and the Latent Space Ludovica Schaerf 21 0 0 08 Oct 2024
FactorLLM: Factorizing Knowledge via Mixture of Experts for Large Language Models Zhongyu Zhao Menghang Dong Rongyu Zhang Wenzhao Zheng Yunpeng Zhang Huanrui Yang Dalong Du Kurt Keutzer Shanghang Zhang 48 0 0 15 Aug 2024
Mixture of Nested Experts: Adaptive Processing of Visual Tokens Gagan Jain Nidhi Hegde Aditya Kusupati Arsha Nagrani Shyamal Buch Prateek Jain Anurag Arnab Sujoy Paul MoE 48 7 0 29 Jul 2024
SA-DVAE: Improving Zero-Shot Skeleton-Based Action Recognition by Disentangled Variational Autoencoders Sheng-Wei Li Zi-Xiang Wei Wei-Jie Chen Yi-Hsin Yu Chih-Yuan Yang Jane Yung-jen Hsu DRL 41 3 0 18 Jul 2024
Scaling Diffusion Transformers to 16 Billion Parameters Zhengcong Fei Mingyuan Fan Changqian Yu Debang Li Junshi Huang DiffM MoE 59 16 0 16 Jul 2024
ColorwAI: Generative Colorways of Textiles through GAN and Diffusion Disentanglement Ludovica Schaerf Andrea Alfarano Eric Postma DiffM 31 2 0 16 Jul 2024
CiteME: Can Language Models Accurately Cite Scientific Claims? Ori Press Andreas Hochlehnert Ameya Prabhu Vishaal Udandarao Ofir Press Matthias Bethge 47 13 0 10 Jul 2024
Mixture of A Million Experts Xu Owen He MoE 41 25 0 04 Jul 2024
Turbo Sparse: Achieving LLM SOTA Performance with Minimal Activated Parameters Yixin Song Haotong Xie Zhengyan Zhang Bo Wen Li Ma Zeyu Mi Haibo Chen MoE 34 21 0 10 Jun 2024
Ego-Foresight: Agent Visuomotor Prediction as Regularization for RL Manuel S. Nunes Atabak Dehban Y. Demiris J. Santos-Victor 48 0 0 27 May 2024
Memory Mosaics Jianyu Zhang Niklas Nolte Ranajoy Sadhukhan Beidi Chen Léon Bottou VLM 73 3 0 10 May 2024
Improving Dictionary Learning with Gated Sparse Autoencoders Senthooran Rajamanoharan Arthur Conmy Lewis Smith Tom Lieberum Vikrant Varma János Kramár Rohin Shah Neel Nanda RALM 35 79 0 24 Apr 2024
Tripod: Three Complementary Inductive Biases for Disentangled Representation Learning Kyle Hsu Jubayer Ibn Hamid Kaylee Burns Chelsea Finn Jiajun Wu CML 26 4 0 16 Apr 2024
Learning Multidimensional Disentangled Representations of Instrumental Sounds for Musical Similarity Assessment Yuka Hashizume Li Li Atsushi Miyashita T. Toda 30 3 0 10 Apr 2024
Mixture-of-Depths: Dynamically allocating compute in transformer-based language models David Raposo Sam Ritter Blake A. Richards Timothy Lillicrap Peter C. Humphreys Adam Santoro MoE 40 69 0 02 Apr 2024
MerRec: A Large-scale Multipurpose Mercari Dataset for Consumer-to-Consumer Recommendation Systems Lichi Li Zainul Din Zhen Tan Sam London Tianlong Chen Ajay Daptardar 47 0 0 22 Feb 2024
Interpreting CLIP with Sparse Linear Concept Embeddings (SpLiCE) Usha Bhalla Alexander X. Oesterling Suraj Srinivas Flavio du Pin Calmon Himabindu Lakkaraju 41 35 0 16 Feb 2024
Conditional Information Gain Trellis Ufuk Can Biçici Tuna Han Salih Meral L. Akarun 29 2 0 13 Feb 2024
ReLU$^2$ Wins: Discovering Efficient Activation Functions for Sparse LLMs Zhengyan Zhang Yixin Song Guanghui Yu Xu Han Yankai Lin Chaojun Xiao Chenyang Song Zhiyuan Liu Zeyu Mi Maosong Sun 22 31 0 06 Feb 2024
CompeteSMoE -- Effective Training of Sparse Mixture of Experts via Competition Quang-Cuong Pham Giang Do Huy Nguyen TrungTin Nguyen Chenghao Liu ... Binh T. Nguyen Savitha Ramasamy Xiaoli Li Steven C. H. Hoi Nhat Ho 25 17 0 04 Feb 2024
Efficient Deweather Mixture-of-Experts with Uncertainty-aware Feature-wise Linear Modulation Rongyu Zhang Yulin Luo Jiaming Liu Huanrui Yang Zhen Dong ... Tomoyuki Okuno Yohei Nakata Kurt Keutzer Yuan Du Shanghang Zhang MoMe MoE 35 3 0 27 Dec 2023
Adaptive Computation Modules: Granular Conditional Computation For Efficient Inference Bartosz Wójcik Alessio Devoto Karol Pustelnik Pasquale Minervini Simone Scardapane 23 5 0 15 Dec 2023
Statistical Perspective of Top-K Sparse Softmax Gating Mixture of Experts Huy Nguyen Pedram Akbarian Fanqi Yan Nhat Ho MoE 41 16 0 25 Sep 2023
Lightweight Modeling of User Context Combining Physical and Virtual Sensor Data M. Campana Dimitris Chatzopoulos Franca Delmastro Pan Hui 14 5 0 28 Jun 2023
Neuro-Causal Factor Analysis Alex Markham Ming-Yu Liu Bryon Aragam Liam Solus CML 28 3 0 31 May 2023
Disentanglement via Latent Quantization Kyle Hsu W. Dorrell James C. R. Whittington Jiajun Wu Chelsea Finn DRL 26 25 0 28 May 2023
ProtoVAE: Prototypical Networks for Unsupervised Disentanglement Vaishnavi Patil Matthew Evanusa J. JáJá BDL DRL 24 0 0 16 May 2023
Towards Convergence Rates for Parameter Estimation in Gaussian-gated Mixture of Experts Huy Nguyen TrungTin Nguyen Khai Nguyen Nhat Ho MoE 46 12 0 12 May 2023
Learning Disentangled Semantic Spaces of Explanations via Invertible Neural Networks Yingji Zhang Danilo S. Carvalho André Freitas DRL 26 7 0 02 May 2023
Correcting Flaws in Common Disentanglement Metrics Louis Mahon Lei Shah Thomas Lukasiewicz CoGe DRL 34 3 0 05 Apr 2023
Scaling Vision-Language Models with Sparse Mixture of Experts Sheng Shen Z. Yao Chunyuan Li Trevor Darrell Kurt Keutzer Yuxiong He VLM MoE 18 62 0 13 Mar 2023
Inversion dynamics of class manifolds in deep learning reveals tradeoffs underlying generalisation Simone Ciceri Lorenzo Cassani Matteo Osella P. Rotondo P. Pizzochero M. Gherardi 31 7 0 09 Mar 2023
Spatial Mixture-of-Experts Nikoli Dryden Torsten Hoefler MoE 34 9 0 24 Nov 2022
M$^3$ViT: Mixture-of-Experts Vision Transformer for Efficient Multi-task Learning with Model-Accelerator Co-design Hanxue Liang Zhiwen Fan Rishov Sarkar Ziyu Jiang Tianlong Chen Kai Zou Yu Cheng Cong Hao Zhangyang Wang MoE 36 81 0 26 Oct 2022
DOT-VAE: Disentangling One Factor at a Time Vaishnavi Patil Matthew Evanusa J. JáJá CoGe DRL CML 23 1 0 19 Oct 2022
Commutativity and Disentanglement from the Manifold Perspective Frank Qiu CoGe 25 0 0 14 Oct 2022
Formal Semantic Geometry over Transformer-based Variational AutoEncoder Yingji Zhang Danilo S. Carvalho Ian Pratt-Hartmann André Freitas 26 4 0 12 Oct 2022
Deep Double Descent via Smooth Interpolation Matteo Gamba Erik Englesson Marten Bjorkman Hossein Azizpour 63 10 0 21 Sep 2022
A Survey of Neural Trees Haoling Li Jie Song Mengqi Xue Haofei Zhang Jingwen Ye Lechao Cheng Mingli Song AI4CE 20 6 0 07 Sep 2022
Solving large-scale MEG/EEG source localization and functional connectivity problems simultaneously using state-space models Jose M. Sanchez-Bornot R. Sotero J. Kelso Damien Coyle 19 3 0 26 Aug 2022
Semi-Supervised Disentanglement of Tactile Contact Geometry from Sliding-Induced Shear A. Gupta Alex Church Nathan Lepora 22 2 0 26 Aug 2022
Doge Tickets: Uncovering Domain-general Language Models by Playing Lottery Tickets Yi Yang Chen Zhang Benyou Wang Dawei Song LRM 24 6 0 20 Jul 2022
Analysis of Branch Specialization and its Application in Image Decomposition Jonathan Brokman Guy Gilboa 10 2 0 12 Jun 2022