2004.09297
Cited By
MPNet: Masked and Permuted Pre-training for Language Understanding
20 April 2020
Kaitao Song
Xu Tan
Tao Qin
Jianfeng Lu
Tie-Yan Liu
Papers citing
"MPNet: Masked and Permuted Pre-training for Language Understanding"
50 / 116 papers shown
Title
Contextualizing the Limits of Model & Evaluation Dataset Curation on Semantic Similarity Classification Tasks
Daniel Theron
10
0
0
03 Nov 2023
Cultural Adaptation of Recipes
Yong Cao
Yova Kementchedjhieva
Ruixiang Cui
Antonia Karamolegkou
Li Zhou
Megan Dare
Lucia Donatelli
Daniel Hershcovich
18
5
0
26 Oct 2023
Kiki or Bouba? Sound Symbolism in Vision-and-Language Models
Morris Alper
Hadar Averbuch-Elor
28
10
0
25 Oct 2023
Multilingual estimation of political-party positioning: From label aggregation to long-input Transformers
Dmitry Nikolaev
Tanise Ceron
Sebastian Padó
13
1
0
19 Oct 2023
Semantic Scene Difference Detection in Daily Life Patroling by Mobile Robots using Pre-Trained Large-Scale Vision-Language Model
Yoshiki Obinata
Kento Kawaharazuka
Naoaki Kanazawa
N. Yamaguchi
Naoto Tsukamoto
Iori Yanokura
Shingo Kitagawa
Koki Shinjo
K. Okada
Masayuki Inaba
LM&Ro
15
5
0
28 Sep 2023
Graecia capta ferum victorem cepit. Detecting Latin Allusions to Ancient Greek Literature
Frederick Riemenschneider
Anette Frank
14
1
0
23 Aug 2023
ExpeL: LLM Agents Are Experiential Learners
Andrew Zhao
Daniel Huang
Quentin Xu
Matthieu Lin
Y. Liu
Gao Huang
LLMAG
20
192
0
20 Aug 2023
Three Ways of Using Large Language Models to Evaluate Chat
Ondřej Plátek
Vojtěch Hudeček
Patrícia Schmidtová
Mateusz Lango
Ondřej Dušek
ALM
19
5
0
12 Aug 2023
Detecting Spells in Fantasy Literature with a Transformer Based Artificial Intelligence
Marcel Moravek
Alexander Zender
Andreas Müller
10
0
0
07 Aug 2023
KoRC: Knowledge oriented Reading Comprehension Benchmark for Deep Text Understanding
Zijun Yao
Yantao Liu
Xin Lv
S. Cao
Jifan Yu
Lei Hou
Juanzi Li
30
10
0
06 Jul 2023
Utilizing ChatGPT Generated Data to Retrieve Depression Symptoms from Social Media
Ana-Maria Bucur
AI4MH
22
10
0
05 Jul 2023
GIO: Gradient Information Optimization for Training Dataset Selection
Dante Everaert
Christopher Potts
19
3
0
20 Jun 2023
WizMap: Scalable Interactive Visualization for Exploring Large Machine Learning Embeddings
Zijie J. Wang
Fred Hohman
Duen Horng Chau
26
20
0
15 Jun 2023
Learning to Mask and Permute Visual Tokens for Vision Transformer Pre-Training
Lorenzo Baraldi
Roberto Amoroso
Marcella Cornia
Lorenzo Baraldi
Andrea Pilzer
Rita Cucchiara
36
2
0
12 Jun 2023
RadLing: Towards Efficient Radiology Report Understanding
Rikhiya Ghosh
Sanjeev Kumar Karn
Manuela Danu
Larisa Micu
Ramya Vunikili
Oladimeji Farri
MedIm
16
6
0
04 Jun 2023
Description-Based Text Similarity
Shauli Ravfogel
Valentina Pyatkin
Amir D. N. Cohen
Avshalom Manevich
Yoav Goldberg
20
5
0
21 May 2023
Machine-Made Media: Monitoring the Mobilization of Machine-Generated Articles on Misinformation and Mainstream News Websites
Hans W. A. Hanley
Zakir Durumeric
DeLMO
16
29
0
16 May 2023
Going beyond research datasets: Novel intent discovery in the industry setting
Aleksandra Chrabrowa
Tsimur Hadeliya
D. Kajtoch
Robert Mroczkowski
Piotr Rybak
8
2
0
09 May 2023
Curating corpora with classifiers: A case study of clean energy sentiment online
M. V. Arnold
P. Dodds
C. Danforth
16
0
0
04 May 2023
Procedure-Aware Pretraining for Instructional Video Understanding
Honglu Zhou
Roberto Martín-Martín
Mubbasir Kapadia
Silvio Savarese
Juan Carlos Niebles
23
38
0
31 Mar 2023
Personalizing Task-oriented Dialog Systems via Zero-shot Generalizable Reward Function
A. B. Siddique
M. H. Maqbool
Kshitija Taywade
H. Foroosh
24
12
0
24 Mar 2023
Improving Content Retrievability in Search with Controllable Query Generation
Gustavo Penha
Enrico Palumbo
Maryam Aziz
Alice Wang
Hugues Bouchard
25
11
0
21 Mar 2023
Proactive Prioritization of App Issues via Contrastive Learning
Moghis Fereidouni
A. Mosharrof
Umar Farooq
A. B. Siddique
23
4
0
12 Mar 2023
IGB: Addressing The Gaps In Labeling, Features, Heterogeneity, and Size of Public Graph Datasets for Deep Learning Research
Arpandeep Khatua
Vikram Sharma Mailthody
Bhagyashree Taleka
Tengfei Ma
Xiang Song
Wen-mei W. Hwu
AI4CE
21
37
0
27 Feb 2023
In-context Example Selection with Influences
Nguyen Tai
Eric Wong
9
48
0
21 Feb 2023
Few-shot Multimodal Multitask Multilingual Learning
Aman Chadha
Vinija Jain
34
0
0
19 Feb 2023
ClusterLog: Clustering Logs for Effective Log-based Anomaly Detection
Chris Egersdoerfer
Dong Dai
Di Zhang
11
6
0
19 Jan 2023
A large-scale and PCR-referenced vocal audio dataset for COVID-19
Jobie Budd
Kieran Baker
E. Karoune
H. Coppock
Selina Patel
...
D. Pigoli
Stephen J. Roberts
Josef Packham
T. Thornley
Chris Holmes
16
4
0
15 Dec 2022
I2MVFormer: Large Language Model Generated Multi-View Document Supervision for Zero-Shot Image Classification
Muhammad Ferjad Naeem
Muhammad Gul Zain Ali Khan
Yongqin Xian
Muhammad Zeshan Afzal
D. Stricker
Luc Van Gool
F. Tombari
VLM
22
51
0
05 Dec 2022
Utilizing Background Knowledge for Robust Reasoning over Traffic Situations
Jiarui Zhang
Filip Ilievski
Aravinda Kollaa
Jonathan M Francis
Kaixin Ma
A. Oltramari
16
2
0
04 Dec 2022
Sonus Texere! Automated Dense Soundtrack Construction for Books using Movie Adaptations
Jaidev Shriram
Makarand Tapaswi
Vinoo Alluri
11
2
0
02 Dec 2022
Language Model Pre-training on True Negatives
Zhuosheng Zhang
Hai Zhao
Masao Utiyama
Eiichiro Sumita
22
2
0
01 Dec 2022
SciRepEval: A Multi-Format Benchmark for Scientific Document Representations
Amanpreet Singh
Mike D'Arcy
Arman Cohan
Doug Downey
Sergey Feldman
9
79
0
23 Nov 2022
Evaluating the Knowledge Dependency of Questions
Hyeongdon Moon
Yoonseok Yang
Jamin Shin
Hangyeol Yu
Seunghyun Lee
Myeongho Jeong
Juneyoung Park
Minsam Kim
Seungtaek Choi
AI4Ed
23
10
0
21 Nov 2022
Discord Questions: A Computational Approach To Diversity Analysis in News Coverage
Philippe Laban
Chien-Sheng Wu
Lidiya Murakhovs'ka
Xiang 'Anthony' Chen
Caiming Xiong
8
12
0
09 Nov 2022
Unsupervised Audio-Visual Lecture Segmentation
Darshan Singh
Anchit Gupta
C. V. Jawahar
Makarand Tapaswi
VOS
16
4
0
29 Oct 2022
Robustifying Sentiment Classification by Maximally Exploiting Few Counterfactuals
Maarten De Raedt
Fréderic Godin
Chris Develder
Thomas Demeester
6
1
0
21 Oct 2022
MTEB: Massive Text Embedding Benchmark
Niklas Muennighoff
Nouamane Tazi
L. Magne
Nils Reimers
21
369
0
13 Oct 2022
Noise-Robust De-Duplication at Scale
Emily Silcock
Luca D'Amico-Wong
Jinglin Yang
Melissa Dell
SyDa
18
20
0
09 Oct 2022
One-Shot Doc Snippet Detection: Powering Search in Document Beyond Text
Abhinav Java
Shripad Deshmukh
Milan Aggarwal
Surgan Jandial
Mausoom Sarkar
Balaji Krishnamurthy
30
3
0
12 Sep 2022
Towards explainable evaluation of language models on the semantic similarity of visual concepts
Maria Lymperaiou
George Manoliadis
Orfeas Menis-Mastromichalakis
Edmund Dervakos
Giorgos Stamou
AAML
16
5
0
08 Sep 2022
Evaluating Dense Passage Retrieval using Transformers
Nima Sadri
11
0
0
15 Aug 2022
Cause-and-Effect Analysis of ADAS: A Comparison Study between Literature Review and Complaint Data
Jackie Ayoub
Zifei Wang
Meitang Li
Huizhong Guo
Rini Sherony
Shan Bao
Feng Zhou
11
17
0
30 Jul 2022
Two-Pass Low Latency End-to-End Spoken Language Understanding
Siddhant Arora
Siddharth Dalmia
Xuankai Chang
Brian Yan
A. Black
Shinji Watanabe
VLM
13
19
0
14 Jul 2022
GateNLP-UShef at SemEval-2022 Task 8: Entity-Enriched Siamese Transformer for Multilingual News Article Similarity
Iknoor Singh
Yue Li
Melissa Thong
Carolina Scarton
20
3
0
31 May 2022
Happenstance: Utilizing Semantic Search to Track Russian State Media Narratives about the Russo-Ukrainian War On Reddit
Hans W. A. Hanley
Deepak Kumar
Zakir Durumeric
9
40
0
28 May 2022
What's in a Caption? Dataset-Specific Linguistic Diversity and Its Effect on Visual Description Models and Metrics
David M. Chan
Austin Myers
Sudheendra Vijayanarasimhan
David A. Ross
Bryan Seybold
John F. Canny
26
6
0
12 May 2022
Probing Cross-Lingual Lexical Knowledge from Multilingual Sentence Encoders
Ivan Vulić
Goran Glavaš
Fangyu Liu
Nigel Collier
E. Ponti
Anna Korhonen
12
8
0
30 Apr 2022
UMass PCL at SemEval-2022 Task 4: Pre-trained Language Model Ensembles for Detecting Patronizing and Condescending Language
David Koleczek
Alexander Scarlatos
Siddha Makarand Karkare
Preshma Linet Pereira
14
0
0
18 Apr 2022
What Matters in Language Conditioned Robotic Imitation Learning over Unstructured Data
Oier Mees
Lukas Hermann
Wolfram Burgard
LM&Ro
28
149
0
13 Apr 2022