HIBERT: Document Level Pre-training of Hierarchical Bidirectional Transformers for Document Summarization

Annual Meeting of the Association for Computational Linguistics (ACL), 2019
16 May 2019
Xingxing Zhang
Furu Wei
M. Zhou

Papers citing "HIBERT: Document Level Pre-training of Hierarchical Bidirectional Transformers for Document Summarization"

50 / 167 papers shown
Parallel Hierarchical Transformer with Attention Alignment for Abstractive Multi-Document Summarization
Ye Ma
Lu Zong
165
0
0
16 Aug 2022
Generating Coherent Narratives by Learning Dynamic and Discrete Entity States with a Contrastive Framework
AAAI Conference on Artificial Intelligence (AAAI), 2022
Jian Guan
Zhenyu Yang
Rongsheng Zhang
Zhipeng Hu
Shiyu Huang
240
11
0
08 Aug 2022
An Automatic and Efficient BERT Pruning for Edge AI Systems
IEEE International Symposium on Quality Electronic Design (ISQED), 2022
Shaoyi Huang
Ning Liu
Yueying Liang
Hongwu Peng
Hongjia Li
Dongkuan Xu
Mimi Xie
Caiwen Ding
205
24
0
21 Jun 2022
RoSGAS: Adaptive Social Bot Detection with Reinforced Self-Supervised GNN Architecture Search
ACM Transactions on the Web (TWEB), 2022
Yingguang Yang
Renyu Yang
Yangyang Li
Kai Cui
Zhiqin Yang
Yue Wang
Jie Xu
Haiyong Xie
179
64
0
14 Jun 2022
Scaling Vision Transformers to Gigapixel Images via Hierarchical Self-Supervised Learning
Computer Vision and Pattern Recognition (CVPR), 2022
Richard J. Chen
Chengkuan Chen
Yicong Li
Tiffany Y. Chen
A. Trister
Rahul G. Krishnan
Faisal Mahmood
ViT, MedIm
312
580
0
06 Jun 2022
Pre-training Transformer Models with Sentence-Level Objectives for Answer Sentence Selection
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Luca Di Liello
Siddhant Garg
Luca Soldaini
Alessandro Moschitti
159
18
0
20 May 2022
Knowledge-aware Document Summarization: A Survey of Knowledge, Embedding Methods and Architectures
Knowledge-Based Systems (KBS), 2022
Yutong Qu
Wei Emma Zhang
Jian Yang
Lingfei Wu
Hongzhi Zhang
AI4TS
163
8
0
24 Apr 2022
OTExtSum: Extractive Text Summarisation with Optimal Transport
Peggy Tang
Kun Hu
Rui Yan
Lei Zhang
Junbin Gao
Zhiyong Wang
OT
180
12
0
21 Apr 2022
Revisiting Transformer-based Models for Long Document Classification
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Xiang Dai
Ilias Chalkidis
Kenny Erleben
Desmond Elliott
VLM
230
90
0
14 Apr 2022
MHMS: Multimodal Hierarchical Multimedia Summarization
Jielin Qiu
Jiacheng Zhu
Mengdi Xu
Franck Dernoncourt
Trung Bui
Zhaowen Wang
Yue Liu
Ding Zhao
Hailin Jin
173
14
0
07 Apr 2022
Discovering material information using hierarchical Reformer model on financial regulatory filings
Francois Mercier
Makesh Narsimhan
AIFin, AI4TS
66
0
0
28 Mar 2022
ERNIE-SPARSE: Learning Hierarchical Efficient Transformer Through Regularized Self-Attention
Yang Liu
Jiaxiang Liu
L. Chen
Yuxiang Lu
Shi Feng
Zhida Feng
Yu Sun
Hao Tian
Huancheng Wu
Hai-feng Wang
172
13
0
23 Mar 2022
HIBRIDS: Attention with Hierarchical Biases for Structure-aware Long Document Summarization
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Shuyang Cao
Lu Wang
251
46
0
21 Mar 2022
Read Top News First: A Document Reordering Approach for Multi-Document News Summarization
Findings (Findings), 2022
Chao Zhao
Tenghao Huang
Somnath Basu Roy Chowdhury
Muthu Kumar Chandrasekaran
Kathleen McKeown
Snigdha Chaturvedi
MoMe
142
20
0
19 Mar 2022
HiStruct+: Improving Extractive Text Summarization with Hierarchical Structure Information
Findings (Findings), 2022
Qianqian Ruan
Malte Ostendorff
Georg Rehm
AILaw
176
61
0
17 Mar 2022
Long Document Summarization with Top-down and Bottom-up Inference
Findings (Findings), 2022
Bo Pang
Erik Nijkamp
Wojciech Kryściński
Silvio Savarese
Yingbo Zhou
Caiming Xiong
RALM, BDL
185
66
0
15 Mar 2022
Hierarchical BERT for Medical Document Understanding
Ning Zhang
Maciej Jankowski
148
12
0
11 Mar 2022
Who Should Review Your Proposal? Interdisciplinary Topic Path Detection for Research Proposals
Meng Xiao
Ziyue Qiao
Yanjie Fu
Hao Dong
Yi Du
Pengyang Wang
Dong Li
Yuan-Hong Zhou
170
4
0
07 Mar 2022
SciBERTSUM: Extractive Summarization for Scientific Documents
International Workshop on Document Analysis Systems (DAS), 2022
Athar Sefid
C. Lee Giles
153
13
0
21 Jan 2022
Pretrained Language Models for Text Generation: A Survey
ACM Computing Surveys (ACM CSUR), 2022
Junyi Li
Tianyi Tang
Wayne Xin Zhao
J. Nie
Ji-Rong Wen
AI4CE
519
263
0
14 Jan 2022
Adaptive Beam Search to Enhance On-device Abstractive Summarization
IEEE India Conference (INDICON), 2021
S. Harichandana B.S.
Sumit Kumar
79
1
0
22 Dec 2021
Human Guided Exploitation of Interpretable Attention Patterns in Summarization and Topic Segmentation
Raymond Li
Wen Xiao
Linzi Xing
Lanjun Wang
Gabriel Murray
Giuseppe Carenini
ViT
264
9
0
10 Dec 2021
Recommending Multiple Positive Citations for Manuscript via Content-Dependent Modeling and Multi-Positive Triplet
Yang Zhang
Qiang Ma
101
1
0
25 Nov 2021
DeepHelp: Deep Learning for Shout Crisis Text Conversations
D. Cahn
AI4MH
116
1
0
25 Oct 2021
Topic-Guided Abstractive Multi-Document Summarization
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Peng Cui
Le Hu
203
46
0
21 Oct 2021
HRKD: Hierarchical Relational Knowledge Distillation for Cross-domain Language Model Compression
Chenhe Dong
Yaliang Li
Ying Shen
Minghui Qiu
VLM
315
8
0
16 Oct 2021
Modeling Endorsement for Multi-Document Abstractive Summarization
Logan Lebanoff
Bingqing Wang
Z. Feng
Fei Liu
619
4
0
15 Oct 2021
HETFORMER: Heterogeneous Transformer with Sparse Attention for Long-Text Extractive Summarization
Ye Liu
Jianguo Zhang
Yao Wan
Congying Xia
Lifang He
Philip S. Yu
400
32
0
12 Oct 2021
Iterative Decoding for Compositional Generalization in Transformers
Luana Ruiz
Joshua Ainslie
Santiago Ontañón
134
7
0
08 Oct 2021
Attention Augmented Convolutional Transformer for Tabular Time-series
Sharath M. Shankaranarayana
D. Runje
LMTD, AI4TS
162
8
0
05 Oct 2021
Leveraging Information Bottleneck for Scientific Document Summarization
Jiaxin Ju
Ming Liu
Huan Yee Koh
Yuan Jin
Lan Du
Shirui Pan
250
16
0
04 Oct 2021
LawSum: A weakly supervised approach for Indian Legal Document Summarization
Vedant Vijay Parikh
Vidit Mathur
Parth Mehta
Namita Mittal
Prasenjit Majumder
AILaw
235
30
0
04 Oct 2021
Recursively Summarizing Books with Human Feedback
Jeff Wu
Long Ouyang
Daniel M. Ziegler
Nissan Stiennon
Ryan J. Lowe
Jan Leike
Paul Christiano
ALM
503
342
0
22 Sep 2021
Enriching and Controlling Global Semantics for Text Summarization
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Thong Nguyen
Anh Tuan Luu
Truc Lu
Tho Quan
121
38
0
22 Sep 2021
Investigating Crowdsourcing Protocols for Evaluating the Factual Consistency of Summaries
Xiangru Tang
Alexander R. Fabbri
Haoran Li
Ziming Mao
Griffin Adams
Borui Wang
Asli Celikyilmaz
Yashar Mehdad
Dragomir R. Radev
HILM
279
25
0
19 Sep 2021
MeLT: Message-Level Transformer with Masked Document Representations as Pre-Training for Stance Detection
Matthew Matero
Nikita Soni
Niranjan Balasubramanian
H. Andrew Schwartz
217
21
0
16 Sep 2021
Topic-Aware Contrastive Learning for Abstractive Dialogue Summarization
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Junpeng Liu
Yanyan Zou
Hainan Zhang
Hongshen Chen
Zhuoye Ding
Caixia Yuan
118
70
0
10 Sep 2021
Low-Resource Dialogue Summarization with Domain-Agnostic Multi-Source Pretraining
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Yicheng Zou
Bolin Zhu
Xingwu Hu
Tao Gui
231
33
0
09 Sep 2021
Code-switched inspired losses for generic spoken dialog representations
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2021
E. Chapuis
Pierre Colombo
Matthieu Labeau
Chloé Clavel
363
12
0
27 Aug 2021
Making Transformers Solve Compositional Tasks
Annual Meeting of the Association for Computational Linguistics (ACL), 2021
Santiago Ontañón
Joshua Ainslie
Vaclav Cvicek
Zachary Kenneth Fisher
227
84
0
09 Aug 2021
MemSum: Extractive Summarization of Long Documents Using Multi-Step Episodic Markov Decision Processes
Annual Meeting of the Association for Computational Linguistics (ACL), 2021
Nianlong Gu
Elliott Ash
Richard H. R. Hahnloser
RALM
173
39
0
19 Jul 2021
A Sentence-level Hierarchical BERT Model for Document Classification with Limited Labelled Data
IFIP Working Conference on Database Semantics (IWDS), 2021
Jinghui Lu
M. Henchion
Ivan Bacher
Brian Mac Namee
VLM
130
24
0
12 Jun 2021
A Survey of Transformers
AI Open (AO), 2021
Tianyang Lin
Yuxin Wang
Xiangyang Liu
Xipeng Qiu
ViT
445
1,386
0
08 Jun 2021
Hi-Transformer: Hierarchical Interactive Transformer for Efficient and Effective Long Document Modeling
Annual Meeting of the Association for Computational Linguistics (ACL), 2021
Chuhan Wu
Fangzhao Wu
Tao Qi
Yongfeng Huang
364
79
0
02 Jun 2021
VILA: Improving Structured Content Extraction from Scientific PDFs Using Visual Layout Groups
Transactions of the Association for Computational Linguistics (TACL), 2021
Zejiang Shen
Kyle Lo
Lucy Lu Wang
Bailey Kuehl
Daniel S. Weld
Doug Downey
VLM
284
42
0
01 Jun 2021
Iterative Hierarchical Attention for Answering Complex Questions over Long Documents
Haitian Sun
William W. Cohen
Ruslan Salakhutdinov
333
14
0
01 Jun 2021
Controllable Abstractive Dialogue Summarization with Sketch Supervision
Findings (Findings), 2021
Chien-Sheng Wu
Linqing Liu
Wenhao Liu
Pontus Stenetorp
Caiming Xiong
216
57
0
28 May 2021
Towards mental time travel: a hierarchical memory for reinforcement learning agents
Neural Information Processing Systems (NeurIPS), 2021
Andrew Kyle Lampinen
Stephanie C. Y. Chan
Andrea Banino
Felix Hill
309
58
0
28 May 2021
Nested Hierarchical Transformer: Towards Accurate, Data-Efficient and Interpretable Visual Understanding
AAAI Conference on Artificial Intelligence (AAAI), 2021
Zizhao Zhang
Han Zhang
Long Zhao
Ting Chen
Sercan O. Arik
Tomas Pfister
ViT
357
206
0
26 May 2021
Pretrained Language Models for Text Generation: A Survey
Junyi Li
Tianyi Tang
Wayne Xin Zhao
Ji-Rong Wen
LM&MA, VLM, SyDa
267
206
0
21 May 2021