Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1909.11942
Cited By
ALBERT: A Lite BERT for Self-supervised Learning of Language Representations
26 September 2019
Zhenzhong Lan
Mingda Chen
Sebastian Goodman
Kevin Gimpel
Piyush Sharma
Radu Soricut
SSL
AIMat
Re-assign community
ArXiv
PDF
HTML
Papers citing
"ALBERT: A Lite BERT for Self-supervised Learning of Language Representations"
50 / 2,911 papers shown
Title
SimVG: A Simple Framework for Visual Grounding with Decoupled Multi-modal Fusion
Ming Dai
Lingfeng Yang
Yihao Xu
Zhenhua Feng
Wankou Yang
ObjD
27
9
0
26 Sep 2024
Pre-trained Language Models Return Distinguishable Probability Distributions to Unfaithfully Hallucinated Texts
Taehun Cha
Donghun Lee
HILM
24
1
0
25 Sep 2024
dnaGrinder: a lightweight and high-capacity genomic foundation model
Qihang Zhao
Chi Zhang
Weixiong Zhang
26
0
0
24 Sep 2024
ToxiCraft: A Novel Framework for Synthetic Generation of Harmful Information
Zheng Hui
Zhaoxiao Guo
Hang Zhao
Juanyong Duan
Congrui Huang
25
6
0
23 Sep 2024
Data-centric NLP Backdoor Defense from the Lens of Memorization
Zhenting Wang
Zhizhi Wang
Mingyu Jin
Mengnan Du
Juan Zhai
Shiqing Ma
29
3
0
21 Sep 2024
Normalized Narrow Jump To Conclusions: Normalized Narrow Shortcuts for Parameter Efficient Early Exit Transformer Prediction
Amrit Diggavi Seshadri
14
1
0
21 Sep 2024
FAMOUS: Flexible Accelerator for the Attention Mechanism of Transformer on UltraScale+ FPGAs
Ehsan Kabir
Md. Arafat Kabir
Austin R. J. Downey
Jason D. Bakos
David Andrews
Miaoqing Huang
GNN
26
0
0
21 Sep 2024
Profiling Patient Transcript Using Large Language Model Reasoning Augmentation for Alzheimer's Disease Detection
Chin-Po Chen
Jeng-Lin Li
LM&MA
21
0
0
19 Sep 2024
Evaluation of pretrained language models on music understanding
Yannis Vasilakis
Rachel M. Bittner
Johan Pauwels
20
1
0
17 Sep 2024
OneEncoder: A Lightweight Framework for Progressive Alignment of Modalities
Bilal Faye
Hanane Azzag
M. Lebbah
ObjD
28
0
0
17 Sep 2024
Towards Data-Centric RLHF: Simple Metrics for Preference Dataset Comparison
Judy Hanwen Shen
Archit Sharma
Jun Qin
42
4
0
15 Sep 2024
Deep Fast Machine Learning Utils: A Python Library for Streamlined Machine Learning Prototyping
Fabi Prezja
AI4CE
35
0
0
14 Sep 2024
Multi-intent Aware Contrastive Learning for Sequential Recommendation
Junshu Huang
Zi Long
Xianghua Fu
Yin Chen
HAI
23
0
0
13 Sep 2024
A BERT-Based Summarization approach for depression detection
Hossein Salahshoor Gavalan
Mohmmad Naim Rastgoo
Bahareh Nakisa
25
1
0
13 Sep 2024
TheraGen: Therapy for Every Generation
Kartikey Doshi
Jimit Shah
Narendra Shekokar
AI4MH
20
0
0
12 Sep 2024
Enhancing adversarial robustness in Natural Language Inference using explanations
Alexandros Koulakos
Maria Lymperaiou
Giorgos Filandrianos
Giorgos Stamou
SILM
AAML
35
0
0
11 Sep 2024
DA-MoE: Towards Dynamic Expert Allocation for Mixture-of-Experts Models
Maryam Akhavan Aghdam
Hongpeng Jin
Yanzhao Wu
MoE
18
3
0
10 Sep 2024
DetoxBench: Benchmarking Large Language Models for Multitask Fraud & Abuse Detection
Joymallya Chakraborty
Wei Xia
Anirban Majumder
Dan Ma
Walid Chaabene
Naveed Janvekar
19
2
0
09 Sep 2024
Application Specific Compression of Deep Learning Models
Rohit Raj Rai
Angana Borah
Amit Awekar
24
0
0
09 Sep 2024
Driving with Prior Maps: Unified Vector Prior Encoding for Autonomous Vehicle Mapping
Shuang Zeng
Xinyuan Chang
Xinran Liu
Zheng Pan
Xing Wei
37
1
0
09 Sep 2024
Expanding Expressivity in Transformer Models with MöbiusAttention
Anna-Maria Halacheva
M. Nayyeri
Steffen Staab
25
1
0
08 Sep 2024
Achieving Peak Performance for Large Language Models: A Systematic Review
Z. R. K. Rostam
Sándor Szénási
Gábor Kertész
32
3
0
07 Sep 2024
An Effective Deployment of Diffusion LM for Data Augmentation in Low-Resource Sentiment Classification
Zhuowei Chen
Lianxi Wang
Yuben Wu
Xinfeng Liao
Yujia Tian
Junyang Zhong
DiffM
27
0
0
05 Sep 2024
Pre-Trained Language Models for Keyphrase Prediction: A Review
Muhammad Umair
Tangina Sultana
Young-Koo Lee
32
4
0
02 Sep 2024
From Prediction to Application: Language Model-based Code Knowledge Tracing with Domain Adaptive Pre-Training and Automatic Feedback System with Pedagogical Prompting for Comprehensive Programming Education
Unggi Lee
Jiyeong Bae
Yeonji Jung
Minji Kang
Gyuri Byun
...
Sookbun Lee
Jaekwon Park
Taekyung Ahn
Gunho Lee
Hyeoncheol Kim
AI4Ed
KELM
26
1
0
31 Aug 2024
Speaker Tagging Correction With Non-Autoregressive Language Models
Grigor Kirakosyan
Davit Karamyan
3DV
26
0
0
30 Aug 2024
Is Personality Prediction Possible Based on Reddit Comments?
Robert Deimann
Till Preidt
Shaptarshi Roy
Jan Stanicki
18
0
0
28 Aug 2024
A Survey of Large Language Models for European Languages
Wazir Ali
S. Pyysalo
39
2
0
27 Aug 2024
Shifted Window Fourier Transform And Retention For Image Captioning
J. Hu
Roberto Cavicchioli
Alessandro Capotondi
VLM
29
0
0
25 Aug 2024
Genetic Approach to Mitigate Hallucination in Generative IR
Hrishikesh Kulkarni
Nazli Goharian
O. Frieder
Sean MacAvaney
HILM
30
2
0
25 Aug 2024
Domain-specific long text classification from sparse relevant information
Célia DĆruz
J. Bereder
Frédéric Precioso
Michel Riveill
29
0
0
23 Aug 2024
VFM-Det: Towards High-Performance Vehicle Detection via Large Foundation Models
Wentao Wu
Fanghua Hong
Xiao Wang
Chenglong Li
Jin Tang
VLM
54
1
0
23 Aug 2024
MedDec: A Dataset for Extracting Medical Decisions from Discharge Summaries
Mohamed Elgaar
Jiali Cheng
Nidhi Vakil
Hadi Amiri
L. A. Celi
28
2
0
23 Aug 2024
Internal and External Knowledge Interactive Refinement Framework for Knowledge-Intensive Question Answering
Haowei Du
Dongyan Zhao
KELM
30
0
0
23 Aug 2024
Large Language Models are Good Attackers: Efficient and Stealthy Textual Backdoor Attacks
Ziqiang Li
Yueqi Zeng
Pengfei Xia
Lei Liu
Zhangjie Fu
Bin Li
SILM
AAML
42
2
0
21 Aug 2024
BURExtract-Llama: An LLM for Clinical Concept Extraction in Breast Ultrasound Reports
Yuxuan Chen
Haoyan Yang
Hengkai Pan
Fardeen Siddiqui
Antonio Verdone
Qingyang Zhang
S. Chopra
Chen Zhao
Yiqiu Shen
17
2
0
21 Aug 2024
Inside the Black Box: Detecting Data Leakage in Pre-trained Language Encoders
Yuan Xin
Z. Li
Ning Yu
Dingfan Chen
Mario Fritz
Michael Backes
Yang Zhang
PILM
MIACV
29
2
0
20 Aug 2024
Uniting contrastive and generative learning for event sequences models
Aleksandr Yugay
Alexey Zaytsev
AI4TS
32
1
0
19 Aug 2024
MegaFake: A Theory-Driven Dataset of Fake News Generated by Large Language Models
Lionel Z. Wang
Yiming Ma
Renfei Gao
Beichen Guo
Han Zhu
Wenqi Fan
Zexin Lu
Ka Chung Ng
SyDa
23
2
0
19 Aug 2024
A Psychology-based Unified Dynamic Framework for Curriculum Learning
Guangyu Meng
Qingkai Zeng
John P. Lalor
Hong-ye Yu
27
0
0
09 Aug 2024
Investigating a Benchmark for Training-set free Evaluation of Linguistic Capabilities in Machine Reading Comprehension
Viktor Schlegel
Goran Nenadic
R. Batista-Navarro
ELM
27
0
0
09 Aug 2024
Survey: Transformer-based Models in Data Modality Conversion
Elyas Rashno
Amir Eskandari
Aman Anand
F. Zulkernine
MedIm
33
0
0
08 Aug 2024
MDT-A2G: Exploring Masked Diffusion Transformers for Co-Speech Gesture Generation
Xiaofeng Mao
Zhengkai Jiang
Qilin Wang
Chencan Fu
Jiangning Zhang
Jiafu Wu
Yabiao Wang
Chengjie Wang
Wei Li
Mingmin Chi
72
4
0
06 Aug 2024
Dopamin: Transformer-based Comment Classifiers through Domain Post-Training and Multi-level Layer Aggregation
Nam Le Hai
Nghi D. Q. Bui
31
1
0
06 Aug 2024
Modelling Visual Semantics via Image Captioning to extract Enhanced Multi-Level Cross-Modal Semantic Incongruity Representation with Attention for Multimodal Sarcasm Detection
Sajal Aggarwal
Ananya Pandey
Dinesh Kumar Vishwakarma
41
1
0
05 Aug 2024
Large Language Model Aided QoS Prediction for Service Recommendation
Huiying Liu
Zekun Zhang
Honghao Li
Qilin Wu
Yiwen Zhang
18
1
0
05 Aug 2024
Recent Advances in Multi-Choice Machine Reading Comprehension: A Survey on Methods and Datasets
Shima Foolad
Kourosh Kiani
R. Rastgoo
FaML
37
0
0
04 Aug 2024
Effective Demonstration Annotation for In-Context Learning via Language Model-Based Determinantal Point Process
Peng Wang
Xiaobin Wang
Chao Lou
Shengyu Mao
Pengjun Xie
Yong-jia Jiang
52
0
0
04 Aug 2024
Cross-layer Attention Sharing for Large Language Models
Yongyu Mu
Yuzhang Wu
Yuchun Fan
Chenglong Wang
Hengyu Li
Qiaozhi He
Murun Yang
Tong Xiao
Jingbo Zhu
36
5
0
04 Aug 2024
Deep Learning based Visually Rich Document Content Understanding: A Survey
Muhammad Ali
Jean Lee
Salman Khan
34
6
0
02 Aug 2024
Previous
1
2
3
4
5
...
57
58
59
Next