Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2108.08877
Cited By
Sentence-T5: Scalable Sentence Encoders from Pre-trained Text-to-Text Models
19 August 2021
Jianmo Ni
Gustavo Hernández Ábrego
Noah Constant
Ji Ma
Keith B. Hall
Daniel Matthew Cer
Yinfei Yang
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Sentence-T5: Scalable Sentence Encoders from Pre-trained Text-to-Text Models"
50 / 284 papers shown
Title
VISLA Benchmark: Evaluating Embedding Sensitivity to Semantic and Lexical Alterations
Sri Harsha Dumpala
Aman Jaiswal
Chandramouli Shama Sastry
E. Milios
Sageev Oore
Hassan Sajjad
VLM
CoGe
35
0
0
25 Apr 2024
Learning representations of learning representations
Rita González-Márquez
Dmitry Kobak
17
0
0
12 Apr 2024
Event-enhanced Retrieval in Real-time Search
Yanan Zhang
Xiaoling Bai
Tianhua Zhou
21
1
0
09 Apr 2024
Contextual Chart Generation for Cyber Deception
David D. Nguyen
David Liebowitz
Surya Nepal
S. Kanhere
Sharif Abuadbba
33
0
0
07 Apr 2024
Dwell in the Beginning: How Language Models Embed Long Documents for Dense Retrieval
Joao Coelho
Bruno Martins
João Magalhães
Jamie Callan
Chenyan Xiong
RALM
26
4
0
05 Apr 2024
Simple Techniques for Enhancing Sentence Embeddings in Generative Language Models
Bowen Zhang
Kehua Chang
Chunping Li
32
10
0
05 Apr 2024
Transforming LLMs into Cross-modal and Cross-lingual Retrieval Systems
Frank Palma Gomez
Ramon Sanabria
Yun-hsuan Sung
Daniel Matthew Cer
Siddharth Dalmia
Gustavo Hernández Ábrego
VLM
18
3
0
02 Apr 2024
Generative Retrieval as Multi-Vector Dense Retrieval
Shiguang Wu
Wenda Wei
Mengqi Zhang
Zhumin Chen
Jun Ma
Zhaochun Ren
Maarten de Rijke
Pengjie Ren
3DV
18
7
0
31 Mar 2024
Gecko: Versatile Text Embeddings Distilled from Large Language Models
Jinhyuk Lee
Zhuyun Dai
Xiaoqi Ren
Blair Chen
Daniel Matthew Cer
...
Aditya Kusupati
Prateek Jain
Siddhartha Reddy Jonnalagadda
Ming-Wei Chang
Iftekhar Naim
RALM
VLM
SyDa
25
40
0
29 Mar 2024
RankMamba: Benchmarking Mamba's Document Ranking Performance in the Era of Transformers
Zhichao Xu
19
12
0
27 Mar 2024
Multilingual Sentence-T5: Scalable Sentence Encoders for Multilingual Applications
Chihiro Yano
Akihiko Fukuchi
Shoko Fukasawa
Hideyuki Tachibana
Yotaro Watanabe
26
2
0
26 Mar 2024
Cross-lingual Contextualized Phrase Retrieval
Huayang Li
Deng Cai
Zhi Qu
Qu Cui
Hidetaka Kamigaito
Lemao Liu
Taro Watanabe
29
0
0
25 Mar 2024
A Semantic Search Engine for Mathlib4
Guoxiong Gao
Haocheng Ju
Jiedong Jiang
Zihan Qin
Bin Dong
25
3
0
20 Mar 2024
Just Say the Name: Online Continual Learning with Category Names Only via Data Generation
Minhyuk Seo
Diganta Misra
Seongwon Cho
Minjae Lee
Jonghyun Choi
CLL
30
6
0
16 Mar 2024
ACLSum: A New Dataset for Aspect-based Summarization of Scientific Publications
Sotaro Takeshita
Tommaso Green
Ines Reinig
Kai Eckert
Simone Paolo Ponzetto
21
11
0
08 Mar 2024
Persona Extraction Through Semantic Similarity for Emotional Support Conversation Generation
Seunghee Han
Se Jin Park
Chae Won Kim
Y. Ro
17
1
0
07 Mar 2024
MeanCache: User-Centric Semantic Caching for LLM Web Services
Waris Gill
Mohamed Elidrisi
Pallavi Kalapatapu
Ammar Ahmed
Ali Anwar
Muhammad Ali Gulzar Virginia Tech
19
1
0
05 Mar 2024
A Decade of Privacy-Relevant Android App Reviews: Large Scale Trends
Omer Akgul
Sai Teja Peddinti
Nina Taft
Michelle L. Mazurek
Hamza Harkous
Animesh Srivastava
Benoit Seguin
23
5
0
04 Mar 2024
A Generative Approach for Wikipedia-Scale Visual Entity Recognition
Mathilde Caron
Ahmet Iscen
Alireza Fathi
Cordelia Schmid
16
5
0
04 Mar 2024
Predictions from language models for multiple-choice tasks are not robust under variation of scoring methods
Polina Tsvilodub
Hening Wang
Sharon Grosch
Michael Franke
14
8
0
01 Mar 2024
Is Crowdsourcing Breaking Your Bank? Cost-Effective Fine-Tuning of Pre-trained Language Models with Proximal Policy Optimization
Shuo Yang
Gjergji Kasneci
ALM
34
2
0
28 Feb 2024
Data-Efficient Learning via Clustering-Based Sensitivity Sampling: Foundation Models and Beyond
Kyriakos Axiotis
Vincent Cohen-Addad
Monika Henzinger
Sammy Jerome
Vahab Mirrokni
David Saulpic
David P. Woodruff
Michael Wunder
25
5
0
27 Feb 2024
OAG-Bench: A Human-Curated Benchmark for Academic Graph Mining
Fanjin Zhang
Shijie Shi
Yifan Zhu
Bo Chen
Yukuo Cen
...
Huihui Yuan
Jian Song
Xiaoyan Li
Yuxiao Dong
Jie Tang
18
15
0
24 Feb 2024
Repetition Improves Language Model Embeddings
Jacob Mitchell Springer
Suhas Kotha
Daniel Fried
Graham Neubig
Aditi Raghunathan
40
9
0
23 Feb 2024
Privacy-Preserving Instructions for Aligning Large Language Models
Da Yu
Peter Kairouz
Sewoong Oh
Zheng Xu
32
10
0
21 Feb 2024
UMBCLU at SemEval-2024 Task 1A and 1C: Semantic Textual Relatedness with and without machine translation
Ali Naseh
Sai Vallurupalli
LRM
19
2
0
20 Feb 2024
BGE Landmark Embedding: A Chunking-Free Embedding Method For Retrieval Augmented Long-Context Large Language Models
Kun Luo
Zheng Liu
Shitao Xiao
Kang Liu
21
10
0
18 Feb 2024
LoraRetriever: Input-Aware LoRA Retrieval and Composition for Mixed Tasks in the Wild
Ziyu Zhao
Leilei Gan
Guoyin Wang
Wangchunshu Zhou
Hongxia Yang
Kun Kuang
Fei Wu
MoMe
15
28
0
15 Feb 2024
How to Train Data-Efficient LLMs
Noveen Sachdeva
Benjamin Coleman
Wang-Cheng Kang
Jianmo Ni
Lichan Hong
Ed H. Chi
James Caverlee
Julian McAuley
D. Cheng
19
50
0
15 Feb 2024
Answer is All You Need: Instruction-following Text Embedding via Answering the Question
Letian Peng
Yuwei Zhang
Zilong Wang
Jayanth Srinivasa
Gaowen Liu
Zihan Wang
Jingbo Shang
37
6
0
15 Feb 2024
Interpretable Measures of Conceptual Similarity by Complexity-Constrained Descriptive Auto-Encoding
Alessandro Achille
Greg Ver Steeg
Tian Yu Liu
Matthew Trager
Carson Klingenberg
Stefano Soatto
17
1
0
14 Feb 2024
Multilingual E5 Text Embeddings: A Technical Report
Liang Wang
Nan Yang
Xiaolong Huang
Linjun Yang
Rangan Majumder
Furu Wei
11
24
0
08 Feb 2024
On the Emergence of Cross-Task Linearity in the Pretraining-Finetuning Paradigm
Zhanpeng Zhou
Zijun Chen
Yilan Chen
Bo-Wen Zhang
Junchi Yan
MoMe
11
9
0
06 Feb 2024
BGE M3-Embedding: Multi-Lingual, Multi-Functionality, Multi-Granularity Text Embeddings Through Self-Knowledge Distillation
Jianlv Chen
Shitao Xiao
Peitian Zhang
Kun Luo
Defu Lian
Zheng Liu
115
306
0
05 Feb 2024
PoCo: Policy Composition from and for Heterogeneous Robot Learning
Lirui Wang
Jialiang Zhao
Yilun Du
Edward H. Adelson
Russ Tedrake
56
26
0
04 Feb 2024
The RL/LLM Taxonomy Tree: Reviewing Synergies Between Reinforcement Learning and Large Language Models
M. Pternea
Prerna Singh
Abir Chakraborty
Y. Oruganti
M. Milletarí
Sayli Bapat
Kebei Jiang
OffRL
6
6
0
02 Feb 2024
Nomic Embed: Training a Reproducible Long Context Text Embedder
Zach Nussbaum
John X. Morris
Brandon Duderstadt
Andriy Mulyar
8
90
0
02 Feb 2024
An Information-Theoretic Approach to Analyze NLP Classification Tasks
Luran Wang
Mark J. F. Gales
Vatsal Raina
14
1
0
01 Feb 2024
Improving Semantic Control in Discrete Latent Spaces with Transformer Quantized Variational Autoencoders
Yingji Zhang
Danilo S. Carvalho
Marco Valentino
Ian Pratt-Hartmann
André Freitas
DRL
30
5
0
01 Feb 2024
Cross-lingual neural fuzzy matching for exploiting target-language monolingual corpora in computer-aided translation
M. Esplà-Gomis
Víctor M. Sánchez-Cartagena
J. A. Pérez-Ortiz
F. Sánchez-Martínez
6
3
0
16 Jan 2024
User Embedding Model for Personalized Language Prompting
Sumanth Doddapaneni
Krishna Sayana
Ambarish Jash
Sukhdeep S. Sodhi
Dima Kuzmin
RALM
27
8
0
10 Jan 2024
DepressionEmo: A novel dataset for multilabel classification of depression emotions
Abu Bakar Siddiqur Rahman
Hoang-Thang Ta
Lotfollah Najjar
A. Azadmanesh
A. Gonul
AI4MH
17
10
0
09 Jan 2024
Data-CUBE: Data Curriculum for Instruction-based Sentence Representation Learning
Yingqian Min
Kun Zhou
Dawei Gao
Wayne Xin Zhao
He Hu
Yaliang Li
14
1
0
07 Jan 2024
German Text Embedding Clustering Benchmark
Silvan Wehrli
Bert Arnrich
Christopher Irrgang
14
5
0
05 Jan 2024
Improving Text Embeddings with Large Language Models
Liang Wang
Nan Yang
Xiaolong Huang
Linjun Yang
Rangan Majumder
Furu Wei
SyDa
14
154
0
31 Dec 2023
Parameter Efficient Tuning Allows Scalable Personalization of LLMs for Text Entry: A Case Study on Abbreviation Expansion
Katrin Tomanek
Shanqing Cai
Subhashini Venugopalan
19
1
0
21 Dec 2023
LlaMaVAE: Guiding Large Language Model Generation via Continuous Latent Sentence Spaces
Yingji Zhang
Danilo S. Carvalho
Ian Pratt-Hartmann
André Freitas
VLM
11
1
0
20 Dec 2023
Vectorizing string entries for data processing on tables: when are larger language models better?
Léo Grinsztajn
Edouard Oyallon
Myung Jun Kim
Gaël Varoquaux
25
2
0
15 Dec 2023
CLIP as RNN: Segment Countless Visual Concepts without Training Endeavor
Shuyang Sun
Runjia Li
Philip H. S. Torr
Xiuye Gu
Siyang Li
VLM
CLIP
18
32
0
12 Dec 2023
Stellar: Systematic Evaluation of Human-Centric Personalized Text-to-Image Methods
Panos Achlioptas
Alexandros Benetatos
Iordanis Fostiropoulos
Dimitris Skourtis
10
8
0
11 Dec 2023
Previous
1
2
3
4
5
6
Next