1810.09050
Cited By
v1
v2
v3 (latest)
A Comparison of Five Multiple Instance Learning Pooling Functions for Sound Event Detection with Weak Labeling
22 October 2018
Yun Wang
Juncheng Billy Li
Florian Metze
Papers citing
"A Comparison of Five Multiple Instance Learning Pooling Functions for Sound Event Detection with Weak Labeling"
50 / 86 papers shown
Self-Supervision Enhances Instance-based Multiple Instance Learning Methods in Digital Pathology: A Benchmark Study
Ali Mammadov
Loic Le Folgoc
Julien Adam
Anne Buronfosse
Gilles Hayem
Guillaume Hocquet
Pietro Gori
SSL
75
0
0
02 May 2025
Exploring Performance-Complexity Trade-Offs in Sound Event Detection Models
T. Morocutti
Florian Schmid
Jonathan Greif
Francesco Foscarin
Gerhard Widmer
74
0
0
14 Mar 2025
Variational autoencoders stabilise TCN performance when classifying weakly labelled bioacoustics data
Laia Garrobé Fonollosa
Douglas Gillespie
L. Stanković
V. Stanković
Luke Rendell
49
0
0
22 Oct 2024
MAT-SED: A Masked Audio Transformer with Masked-Reconstruction Based Pre-training for Sound Event Detection
Pengfei Cai
Yan Song
Kang Li
Haoyu Song
Ian Mcloughlin
76
6
0
16 Aug 2024
Automatic Labels are as Effective as Manual Labels in Biomedical Images Classification with Deep Learning
Niccolo Marini
S. Marchesin
Lluis Borras Ferris
Simon Püttmann
Marek Wodzinski
...
Filippo Fraggetta
Iris Nagtegaal
Gianmaria Silvello
Manfredo Atzori
Henning Muller
32
1
0
20 Jun 2024
Sound Event Bounding Boxes
Janek Ebbers
François Germain
Gordon Wichern
Jonathan Le Roux
83
13
0
06 Jun 2024
FedMM: Federated Multi-Modal Learning with Modality Heterogeneity in Computational Pathology
Yuanzhe Peng
Jieming Bian
Jie Xu
140
5
0
24 Feb 2024
Towards Weakly Supervised Text-to-Audio Grounding
Xuenan Xu
Ziyang Ma
Mengyue Wu
Kai Yu
AI4TS
79
9
0
05 Jan 2024
TagAlign: Improving Vision-Language Alignment with Multi-Tag Classification
Qinying Liu
Wei Wu
Kecheng Zheng
Zhan Tong
Jiawei Liu
Yu Liu
Wei Chen
Zilei Wang
Yujun Shen
VLM
89
6
0
21 Dec 2023
Grounded Image Text Matching with Mismatched Relation Reasoning
Yu Wu
Yan-Tao Wei
Haozhe Jasper Wang
Yongfei Liu
Sibei Yang
Xuming He
74
6
0
02 Aug 2023
Improving Audio-Text Retrieval via Hierarchical Cross-Modal Interaction and Auxiliary Captions
Yifei Xin
Yuexian Zou
121
9
0
28 Jul 2023
Smooth Attention for Deep Multiple Instance Learning: Application to CT Intracranial Hemorrhage Detection
Yunan Wu
Francisco M. Castro-Macías
Pablo Morales-Álvarez
Rafael Molina
Aggelos K. Katsaggelos
55
2
0
18 Jul 2023
Multimodal Imbalance-Aware Gradient Modulation for Weakly-supervised Audio-Visual Video Parsing
Jie Fu
Junyu Gao
Changsheng Xu
114
9
0
05 Jul 2023
Post-Processing Independent Evaluation of Sound Event Detection Systems
Janek Ebbers
Reinhold Haeb-Umbach
Romain Serizel
82
7
0
27 Jun 2023
Listen, Think, and Understand
Yuan Gong
Hongyin Luo
Alexander H. Liu
Leonid Karlinsky
James R. Glass
ELM
MLLM
LRM
130
161
0
18 May 2023
Universal Source Separation with Weakly Labelled Data
Qiuqiang Kong
Kai Chen
Haohe Liu
Xingjian Du
Taylor Berg-Kirkpatrick
Shlomo Dubnov
Mark D. Plumbley
82
22
0
11 May 2023
Multiscale Audio Spectrogram Transformer for Efficient Audio Classification
Wenjie Zhu
M. Omar
85
22
0
19 Mar 2023
AST-SED: An Effective Sound Event Detection Method Based on Audio Spectrogram Transformer
Kang Li
Yan Song
Lirong Dai
Ian Mcloughlin
Xin Fang
Lin Liu
78
22
0
07 Mar 2023
Play It Back: Iterative Attention for Audio Recognition
Alexandros Stergiou
Dima Damen
90
4
0
20 Oct 2022
Optimizing Temporal Resolution Of Convolutional Recurrent Neural Networks For Sound Event Detection
Wim Boes
Hugo Van hamme
25
1
0
18 Oct 2022
Impact of temporal resolution on convolutional recurrent networks for audio tagging and sound event detection
Wim Boes
Hugo Van hamme
40
0
0
26 Sep 2022
Joint Analysis of Acoustic Scenes and Sound Events with Weakly labeled Data
Shunsuke Tsubaki
Keisuke Imoto
Nobutaka Ono
27
2
0
10 Jul 2022
Deep Multiple Instance Learning For Forecasting Stock Trends Using Financial News
Yiqi Deng
Siu-Ming Yiu
AIFin
46
0
0
29 Jun 2022
Impact of Acoustic Event Tagging on Scene Classification in a Multi-Task Learning Framework
Rahil Parikh
Harshavardhan Sundar
Ming Sun
Chao Wang
Spyros Matsoukas
26
1
0
27 Jun 2022
Go Beyond Multiple Instance Neural Networks: Deep-learning Models based on Local Pattern Aggregation
Linpeng Jin
48
1
0
28 May 2022
Masked Spectrogram Prediction For Self-Supervised Audio Pre-Training
Dading Chong
Helin Wang
Peilin Zhou
Qingcheng Zeng
79
68
0
27 Apr 2022
Learning 3D Semantics from Pose-Noisy 2D Images with Hierarchical Full Attention Network
Yuhang He
Lin Chen
Jun-Zhou Xie
Long Chen
3DPC
67
3
0
17 Apr 2022
A Comparative Analysis of Decision-Level Fusion for Multimodal Driver Behaviour Understanding
Alina Roitberg
Kunyu Peng
Zdravko Marinov
C. Seibold
David Schneider
Rainer Stiefelhagen
99
19
0
10 Apr 2022
RaDur: A Reference-aware and Duration-robust Network for Target Sound Detection
Dongchao Yang
Helin Wang
Zhongjie Ye
Yuexian Zou
Wenwu Wang
57
0
0
05 Apr 2022
A Mixed supervised Learning Framework for Target Sound Detection
Dongchao Yang
Helin Wang
Yuexian Zou
Wenwu Wang
55
0
0
05 Apr 2022
AudioTagging Done Right: 2nd comparison of deep learning methods for environmental sound classification
Juncheng Billy Li
Shuhui Qu
Po-Yao (Bernie) Huang
Florian Metze
VLM
100
9
0
25 Mar 2022
Federated Self-Supervised Learning for Acoustic Event Classification
Meng Feng
Chieh-Chi Kao
Qingming Tang
Ming Sun
Viktor Rozgic
Spyros Matsoukas
Chao Wang
74
13
0
22 Mar 2022
CMKD: CNN/Transformer-Based Cross-Model Knowledge Distillation for Audio Classification
Yuan Gong
Sameer Khurana
Andrew Rouditchenko
James R. Glass
VLM
73
29
0
13 Mar 2022
Multi-Instance Causal Representation Learning for Instance Label Prediction and Out-of-Distribution Generalization
Weijia Zhang
Xuanhui Zhang
Hanwen Deng
Min-Ling Zhang
105
23
0
25 Feb 2022
NeuroView-RNN: It's About Time
C. Barberan
Sina Alemohammad
Naiming Liu
Randall Balestriero
Richard G. Baraniuk
AI4TS
HAI
80
2
0
23 Feb 2022
data2vec: A General Framework for Self-supervised Learning in Speech, Vision and Language
Alexei Baevski
Wei-Ning Hsu
Qiantong Xu
Arun Babu
Jiatao Gu
Michael Auli
SSL
VLM
ViT
123
863
0
07 Feb 2022
HTS-AT: A Hierarchical Token-Semantic Audio Transformer for Sound Classification and Detection
Ke Chen
Xingjian Du
Bilei Zhu
Zejun Ma
Taylor Berg-Kirkpatrick
Shlomo Dubnov
ViT
171
277
0
02 Feb 2022
Detect what you want: Target Sound Detection
Dongchao Yang
Helin Wang
Yuexian Zou
Fan Cui
Chao Weng
97
7
0
19 Dec 2021
MM-Pyramid: Multimodal Pyramid Attentional Network for Audio-Visual Event Localization and Video Parsing
Jiashuo Yu
Ying Cheng
Ruiwei Zhao
Rui Feng
Yuejie Zhang
99
61
0
24 Nov 2021
Conformer-Based Self-Supervised Learning for Non-Speech Audio Tasks
Sangeeta Srivastava
Yun Wang
Andros Tjandra
Anurag Kumar
Chunxi Liu
Kritika Singh
Yatharth Saraf
SSL
99
25
0
14 Oct 2021
Transferring Voice Knowledge for Acoustic Event Detection: An Empirical Study
Dawei Liang
Yangyang Shi
Yun Wang
Nayan Singhal
Alex Xiao
Jonathan Shaw
Edison Thomaz
Ozlem Kalinli
M. Seltzer
48
4
0
07 Oct 2021
Sound Event Detection Transformer: An Event-based End-to-End Model for Sound Event Detection
Zhi-qin Ye
Xiangdong Wang
Hong Liu
Yueliang Qian
Ruijie Tao
Long Yan
Kazushige Ouchi
ViT
60
16
0
05 Oct 2021
Do sound event representations generalize to other audio tasks? A case study in audio transfer learning
Anurag Kumar
Yun Wang
V. Ithapu
Christian Fuegen
76
3
0
21 Jun 2021
Audiovisual transfer learning for audio tagging and sound event detection
Wim Boes
Hugo Van hamme
CLIP
VLM
36
11
0
09 Jun 2021
ERANNs: Efficient Residual Audio Neural Networks for Audio Pattern Recognition
S. Verbitskiy
Vladimir Berikov
Viacheslav Vyshegorodtsev
109
75
0
03 Jun 2021
Voice activity detection in the wild: A data-driven approach using teacher-student training
Heinrich Dinkel
Shuai Wang
Xuenan Xu
Mengyue Wu
K. Yu
VLM
40
33
0
10 May 2021
Unsupervised Learning of Multi-level Structures for Anomaly Detection
Songmin Dai
Jide Li
Lu Wang
Congcong Zhu
Yifan Wu
Xiaoqiang Li
31
0
0
25 Apr 2021
CNN-based Discriminative Training for Domain Compensation in Acoustic Event Detection with Frame-wise Classifier
Tiantian Tang
Xinyuan Zhou
Yanhua Long
Yijie Li
Jiaen Liang
57
3
0
26 Mar 2021
Forward-Backward Convolutional Recurrent Neural Networks and Tag-Conditioned Convolutional Neural Networks for Weakly Labeled Semi-supervised Sound Event Detection
Janek Ebbers
Reinhold Haeb-Umbach
36
13
0
11 Mar 2021
Multi-Format Contrastive Learning of Audio Representations
Luyu Wang
Aaron van den Oord
95
59
0
11 Mar 2021