Never Lost in the Middle: Improving Large Language Models via Attention Strengthening Question Answering

15 November 2023
Junqing He
Kunhao Pan
Xiaoqun Dong
Zhuoyang Song
LiuYiBo
Yuxin Liang
Hao Wang
Qianguosun
Enming Zhang
Zejian Xie
Jiaxing Zhang
    KELM
    RALM

Papers citing "Never Lost in the Middle: Improving Large Language Models via Attention Strengthening Question Answering"

6 / 6 papers shown
SEGMENT+: Long Text Processing with Short-Context Language Models
Wei Shi
Shuang Li
Kerun Yu
Jinglei Chen
Zujie Liang
...
Feng Wei
Bo Zheng
Jiaqing Liang
Jiangjie Chen
Yanghua Xiao
RALM
VLM
46
2
0
09 Oct 2024
Eliminating Position Bias of Language Models: A Mechanistic Approach
Ziqi Wang
Hanlin Zhang
Xiner Li
Kuan-Hao Huang
Chi Han
Shuiwang Ji
Sham Kakade
Hao Peng
Heng Ji
49
12
0
01 Jul 2024
Mitigate Position Bias in Large Language Models via Scaling a Single Dimension
Yijiong Yu
Huiqiang Jiang
Xufang Luo
Qianhui Wu
Chin-Yew Lin
Dongsheng Li
Yuqing Yang
Yongfeng Huang
L. Qiu
35
9
0
04 Jun 2024
Learning to Trust Your Feelings: Leveraging Self-awareness in LLMs for Hallucination Mitigation
Yuxin Liang
Zhuoyang Song
Hao Wang
Jiaxing Zhang
HILM
31
28
0
27 Jan 2024
Your Transformer May Not be as Powerful as You Expect
Shengjie Luo
Shanda Li
Shuxin Zheng
Tie-Yan Liu
Liwei Wang
Di He
52
50
0
26 May 2022
Train Short, Test Long: Attention with Linear Biases Enables Input Length Extrapolation
Ofir Press
Noah A. Smith
M. Lewis
242
690
0
27 Aug 2021