ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1707.00683
  4. Cited By
Modulating early visual processing by language
v1v2v3 (latest)

Modulating early visual processing by language

2 July 2017
H. D. Vries
Florian Strub
Jérémie Mary
Hugo Larochelle
Olivier Pietquin
Aaron Courville
ArXiv (abs)PDFHTML

Papers citing "Modulating early visual processing by language"

50 / 279 papers shown
Title
Edge-aware baselines for ogbn-proteins in PyTorch Geometric: species-wise normalization, post-hoc calibration, and cost-accuracy trade-offs
Edge-aware baselines for ogbn-proteins in PyTorch Geometric: species-wise normalization, post-hoc calibration, and cost-accuracy trade-offs
Aleksandar Stanković
Dejan Lisica
109
0
0
17 Nov 2025
IBNorm: Information-Bottleneck Inspired Normalization for Representation Learning
IBNorm: Information-Bottleneck Inspired Normalization for Representation Learning
Xiandong Zou
Pan Zhou
53
0
0
29 Oct 2025
FuncGNN: Learning Functional Semantics of Logic Circuits with Graph Neural Networks
FuncGNN: Learning Functional Semantics of Logic Circuits with Graph Neural Networks
Qiyun Zhao
GNN
118
1
0
07 Jun 2025
Walking the Weight Manifold: a Topological Approach to Conditioning Inspired by Neuromodulation
Walking the Weight Manifold: a Topological Approach to Conditioning Inspired by Neuromodulation
Ari S. Benjamin
Kyle Daruwalla
Christian Pehle
Abdul-Malik Zekri
Anthony M. Zador
174
0
0
29 May 2025
Challenges and Limitations of Generative AI in Synthesizing Wearable Sensor Data
Challenges and Limitations of Generative AI in Synthesizing Wearable Sensor Data
Flavio Di Martino
Franca Delmastro
250
0
0
20 May 2025
Legilimens: Performant Video Analytics on the System-on-Chip Edge
Legilimens: Performant Video Analytics on the System-on-Chip Edge
M. Ramanujam
Yinwei Dai
Kyle Jamieson
Ravi Netravali
213
0
0
29 Apr 2025
Plain Transformers Can be Powerful Graph Learners
Plain Transformers Can be Powerful Graph Learners
Liheng Ma
Soumyasundar Pal
Yingxue Zhang
Juil Sock
Mark Coates
280
0
0
17 Apr 2025
Hadamard product in deep learning: Introduction, Advances and Challenges
Hadamard product in deep learning: Introduction, Advances and ChallengesIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2025
Grigorios G. Chrysos
Yongtao Wu
Razvan Pascanu
Philip Torr
Volkan Cevher
AAML
305
12
0
17 Apr 2025
HingeRLC-GAN: Combating Mode Collapse with Hinge Loss and RLC Regularization
HingeRLC-GAN: Combating Mode Collapse with Hinge Loss and RLC RegularizationInternational Conference on Pattern Recognition (ICPR), 2025
Osman Goni
Himadri Saha Arka
Mithun Halder
Mir Moynuddin Ahmed Shibly
Swakkhar Shatabda
GAN
185
1
0
24 Mar 2025
HyperCLIP: Adapting Vision-Language models with Hypernetworks
HyperCLIP: Adapting Vision-Language models with Hypernetworks
Victor Akinwande
Mohammad Sadegh Norouzzadeh
Devin Willmott
Anna Bair
Madan Ravi Ganesh
J. Zico Kolter
CLIPVLM
265
2
0
21 Dec 2024
Conditional Latent Space Molecular Scaffold Optimization for Accelerated Molecular Design
Conditional Latent Space Molecular Scaffold Optimization for Accelerated Molecular Design
O. Boyar
Hiroyuki Hanada
I. Takeuchi
BDL
240
0
0
03 Nov 2024
PVContext: Hybrid Context Model for Point Cloud Compression
PVContext: Hybrid Context Model for Point Cloud Compression
Guoqing Zhang
Wenbo Zhao
Jian Liu
Yuanchao Bai
Junjun Jiang
Xianming Liu
3DPC
93
0
0
19 Sep 2024
HiFi-CS: Towards Open Vocabulary Visual Grounding For Robotic Grasping Using Vision-Language Models
HiFi-CS: Towards Open Vocabulary Visual Grounding For Robotic Grasping Using Vision-Language Models
V. Bhat
Prashanth Krishnamurthy
Ramesh Karri
Farshad Khorrami
409
9
0
16 Sep 2024
GANs Conditioning Methods: A Survey
GANs Conditioning Methods: A Survey
Anis Bourou
Valérie Mezger
Auguste Genovesio
EGVMAI4CE
375
3
0
28 Aug 2024
Towards the Spectral bias Alleviation by Normalizations in Coordinate
  Networks
Towards the Spectral bias Alleviation by Normalizations in Coordinate Networks
Zhicheng Cai
Hao Zhu
Qiu Shen
Xinran Wang
Xun Cao
277
6
0
25 Jul 2024
Improving Reward-Conditioned Policies for Multi-Armed Bandits using
  Normalized Weight Functions
Improving Reward-Conditioned Policies for Multi-Armed Bandits using Normalized Weight Functions
Kai Xu
Farid Tajaddodianfar
Ben Allison
162
0
0
16 Jun 2024
Meta-Learning Neural Procedural Biases
Meta-Learning Neural Procedural Biases
Christian Raymond
Qi Chen
Bing Xue
Mengjie Zhan
245
1
0
12 Jun 2024
FlexLoc: Conditional Neural Networks for Zero-Shot Sensor Perspective
  Invariance in Object Localization with Distributed Multimodal Sensors
FlexLoc: Conditional Neural Networks for Zero-Shot Sensor Perspective Invariance in Object Localization with Distributed Multimodal Sensors
Jason Wu
Ziqi Wang
Xiaomin Ouyang
Ho Lyun Jeong
Colin Samplawski
Lance M. Kaplan
Benjamin M. Marlin
Mani Srivastava
HAI
99
1
0
10 Jun 2024
GeminiFusion: Efficient Pixel-wise Multimodal Fusion for Vision
  Transformer
GeminiFusion: Efficient Pixel-wise Multimodal Fusion for Vision Transformer
Ding Jia
Jianyuan Guo
Kai Han
Han Wu
Chao Zhang
Chang Xu
Xinghao Chen
ViT
464
48
0
03 Jun 2024
On the Limits of Multi-modal Meta-Learning with Auxiliary Task
  Modulation Using Conditional Batch Normalization
On the Limits of Multi-modal Meta-Learning with Auxiliary Task Modulation Using Conditional Batch Normalization
Jordi Armengol-Estapé
Vincent Michalski
Ramnath Kumar
P. St-Charles
Doina Precup
Samira Ebrahimi Kahou
255
0
0
29 May 2024
GarmentDreamer: 3DGS Guided Garment Synthesis with Diverse Geometry and
  Texture Details
GarmentDreamer: 3DGS Guided Garment Synthesis with Diverse Geometry and Texture Details
Boqian Li
Xuan Li
Ying Jiang
Tianyi Xie
Feng Gao
Huamin Wang
Yin Yang
Jian Ren
225
22
0
20 May 2024
Just Shift It: Test-Time Prototype Shifting for Zero-Shot Generalization
  with Vision-Language Models
Just Shift It: Test-Time Prototype Shifting for Zero-Shot Generalization with Vision-Language Models
Elaine Sui
Xiaohan Wang
Serena Yeung-Levy
VLM
240
20
0
19 Mar 2024
Reimagining Anomalies: What If Anomalies Were Normal?
Reimagining Anomalies: What If Anomalies Were Normal?
Philipp Liznerski
Saurabh Varshneya
Ece Calikus
Sophie Fellenz
Matthias Kirchler
178
4
0
22 Feb 2024
Diffusion Model Conditioning on Gaussian Mixture Model and Negative
  Gaussian Mixture Gradient
Diffusion Model Conditioning on Gaussian Mixture Model and Negative Gaussian Mixture Gradient
Weiguo Lu
Xuan Wu
Deng Ding
Jinqiao Duan
Jirong Zhuang
Gangnan Yuan
DiffMVLM
292
2
0
20 Jan 2024
Content-Conditioned Generation of Stylized Free hand Sketches
Content-Conditioned Generation of Stylized Free hand Sketches
Jiajun Liu
Siyuan Wang
Guangming Zhu
Liang Zhang
Ning Li
Eryang Gao
GAN
121
0
0
09 Jan 2024
Modality Plug-and-Play: Elastic Modality Adaptation in Multimodal LLMs
  for Embodied AI
Modality Plug-and-Play: Elastic Modality Adaptation in Multimodal LLMs for Embodied AI
Kai Huang
Boyuan Yang
Wei Gao
176
2
0
13 Dec 2023
Apparate: Rethinking Early Exits to Tame Latency-Throughput Tensions in
  ML Serving
Apparate: Rethinking Early Exits to Tame Latency-Throughput Tensions in ML ServingSymposium on Operating Systems Principles (SOSP), 2023
Yinwei Dai
Rui Pan
Anand Iyer
Kai Li
Ravi Netravali
136
14
0
08 Dec 2023
Data-driven Crop Growth Simulation on Time-varying Generated Images
  using Multi-conditional Generative Adversarial Networks
Data-driven Crop Growth Simulation on Time-varying Generated Images using Multi-conditional Generative Adversarial Networks
L. Drees
Dereje T. Demie
Madhuri R. Paul
Johannes Leonhardt
Sabine J. Seidel
Thomas F. Döring
R. Roscher
155
11
0
06 Dec 2023
Surf-D: Generating High-Quality Surfaces of Arbitrary Topologies Using
  Diffusion Models
Surf-D: Generating High-Quality Surfaces of Arbitrary Topologies Using Diffusion ModelsEuropean Conference on Computer Vision (ECCV), 2023
Zhengming Yu
Bushi Liu
Xiaoxiao Long
Cheng Lin
Zekun Li
...
Taku Komura
Marc Habermann
Christian Theobalt
Xin Li
Wenping Wang
189
9
0
28 Nov 2023
3D Teeth Reconstruction from Panoramic Radiographs using Neural Implicit
  Functions
3D Teeth Reconstruction from Panoramic Radiographs using Neural Implicit FunctionsInternational Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI), 2023
Sihwa Park
Seongjun Kim
In-Seok Song
Seung Jun Baek
115
5
0
28 Nov 2023
Unified Batch Normalization: Identifying and Alleviating the Feature
  Condensation in Batch Normalization and a Unified Framework
Unified Batch Normalization: Identifying and Alleviating the Feature Condensation in Batch Normalization and a Unified Framework
Shaobo Wang
Xiangdong Zhang
Dongrui Liu
Junchi Yan
263
1
0
27 Nov 2023
Human Machine Co-Creation. A Complementary Cognitive Approach to
  Creative Character Design Process Using GANs
Human Machine Co-Creation. A Complementary Cognitive Approach to Creative Character Design Process Using GANs
Mohammad Lataifeh
Xavier A Carrascoa
Ashraf M Elnagara
Naveed Ahmeda
Imran N. Junejo
GAN
188
5
0
23 Nov 2023
Active Prompt Learning in Vision Language Models
Active Prompt Learning in Vision Language Models
Jihwan Bang
Sumyeong Ahn
Jae-Gil Lee
VLM
235
18
0
18 Nov 2023
Constraint-Conditioned Policy Optimization for Versatile Safe
  Reinforcement Learning
Constraint-Conditioned Policy Optimization for Versatile Safe Reinforcement LearningNeural Information Processing Systems (NeurIPS), 2023
Yi-Fan Yao
Zuxin Liu
Zhepeng Cen
Jiacheng Zhu
Wenhao Yu
Tingnan Zhang
Ding Zhao
OffRL
220
17
0
05 Oct 2023
Increasing diversity of omni-directional images generated from single
  image using cGAN based on MLPMixer
Increasing diversity of omni-directional images generated from single image using cGAN based on MLPMixerAsian Conference on Pattern Recognition (ACPR), 2023
Atsuya Nakata
Ryuto Miyazaki
Takao Yamanaka
196
1
0
15 Sep 2023
Exchanging-based Multimodal Fusion with Transformer
Exchanging-based Multimodal Fusion with Transformer
Renyu Zhu
Chengcheng Han
Yong Qian
Qiushi Sun
Xiang Li
Ming Gao
Xuezhi Cao
Yunsen Xian
142
5
0
05 Sep 2023
Deep Video Codec Control for Vision Models
Deep Video Codec Control for Vision Models
Christoph Reich
Biplob K. Debnath
Deep Patel
Tim Prangemeier
Daniel Cremers
S. Chakradhar
325
2
0
30 Aug 2023
Metadata Improves Segmentation Through Multitasking Elicitation
Metadata Improves Segmentation Through Multitasking Elicitation
Iaroslav Plutenko
Mikhail Papkov
K. Palo
L. Parts
D. Fishman
233
1
0
18 Aug 2023
Writer adaptation for offline text recognition: An exploration of neural
  network-based methods
Writer adaptation for offline text recognition: An exploration of neural network-based methods
Tobias van der Werff
Maruf A. Dhali
Lambert Schomaker
166
1
0
11 Jul 2023
MetaModulation: Learning Variational Feature Hierarchies for Few-Shot
  Learning with Fewer Tasks
MetaModulation: Learning Variational Feature Hierarchies for Few-Shot Learning with Fewer TasksInternational Conference on Machine Learning (ICML), 2023
Wenfang Sun
Yingjun Du
Xiantong Zhen
Fan Wang
Lingling Wang
Cees G. M. Snoek
158
7
0
17 May 2023
Lossless Adaptation of Pretrained Vision Models For Robotic Manipulation
Lossless Adaptation of Pretrained Vision Models For Robotic ManipulationInternational Conference on Learning Representations (ICLR), 2023
Mohit Sharma
Claudio Fantacci
Yuxiang Zhou
Skanda Koppula
N. Heess
Jonathan Scholz
Y. Aytar
VLM
226
37
0
13 Apr 2023
A Comprehensive and Versatile Multimodal Deep Learning Approach for
  Predicting Diverse Properties of Advanced Materials
A Comprehensive and Versatile Multimodal Deep Learning Approach for Predicting Diverse Properties of Advanced MaterialsAdvancement of science (Adv. Sci.), 2023
Shun Muroga
Yasuaki Miki
Kenji Hata
AI4CE
121
28
0
29 Mar 2023
MAIR: Multi-view Attention Inverse Rendering with 3D Spatially-Varying
  Lighting Estimation
MAIR: Multi-view Attention Inverse Rendering with 3D Spatially-Varying Lighting EstimationComputer Vision and Pattern Recognition (CVPR), 2023
Jun-Hyuk Choi
Seok-Kun Lee
Haesol Park
Seung‐Won Jung
Ig-Jae Kim
Junghyun Cho
3DV
205
13
0
22 Mar 2023
PartNeRF: Generating Part-Aware Editable 3D Shapes without 3D
  Supervision
PartNeRF: Generating Part-Aware Editable 3D Shapes without 3D SupervisionComputer Vision and Pattern Recognition (CVPR), 2023
Konstantinos Tertikas
Pascalidou Despoina
Boxiao Pan
Jeong Joon Park
Mikaela Angelina Uy
Ioannis Emiris
Yannis Avrithis
Leonidas Guibas
179
40
0
16 Mar 2023
Modular Deep Learning
Modular Deep Learning
Jonas Pfeiffer
Sebastian Ruder
Ivan Vulić
Edoardo Ponti
MoMeOOD
373
101
0
22 Feb 2023
Redes Generativas Adversarias (GAN) Fundamentos Teóricos y
  Aplicaciones
Redes Generativas Adversarias (GAN) Fundamentos Teóricos y Aplicaciones
J. D. L. Torre
GAN
144
1
0
18 Feb 2023
Fine-grained Cross-modal Fusion based Refinement for Text-to-Image
  Synthesis
Fine-grained Cross-modal Fusion based Refinement for Text-to-Image SynthesisChinese journal of electronics (CJE), 2023
Haoran Sun
Yang Wang
Haipeng Liu
Biao Qian
239
13
0
17 Feb 2023
Diffusion-based Conditional ECG Generation with Structured State Space
  Models
Diffusion-based Conditional ECG Generation with Structured State Space Models
Juan Miguel Lopez Alcaraz
Nils Strodthoff
DiffM
169
74
0
19 Jan 2023
ACQ: Improving Generative Data-free Quantization Via Attention
  Correction
ACQ: Improving Generative Data-free Quantization Via Attention CorrectionPattern Recognition (Pattern Recogn.), 2023
Jixing Li
Xiaozhou Guo
Benzhe Dai
Guoliang Gong
Min Jin
Gang Chen
Wenyu Mao
Huaxiang Lu
MQ
243
5
0
18 Jan 2023
PaDPaF: Partial Disentanglement with Partially-Federated GANs
PaDPaF: Partial Disentanglement with Partially-Federated GANs
Abdulla Jasem Almansoori
Samuel Horváth
Martin Takáč
FedML
150
0
0
07 Dec 2022
123456
Next