On Softmax Direct Preference Optimization for Recommendation
arXiv:2406.09215

13 June 2024
Yuxin Chen
Junfei Tan
An Zhang
Zhengyi Yang
Leheng Sheng
Enzhi Zhang
Xiang Wang
Tat-Seng Chua

Papers citing "On Softmax Direct Preference Optimization for Recommendation"

16 / 16 papers shown
Bridge the Domains: Large Language Models Enhanced Cross-domain Sequential Recommendation
Qidong Liu
Xiangyu Zhao
Yejing Wang
Zijian Zhang
Howard Zhong
Chong Chen
X. Li
Wei Huang
Feng Tian
AI4TS
17
0
0
25 Apr 2025
AdaViP: Aligning Multi-modal LLMs via Adaptive Vision-enhanced Preference Optimization
Jinda Lu
Jinghan Li
Yuan Gao
Junkang Wu
Jiancan Wu
X. Wang
Xiangnan He
31
0
0
22 Apr 2025
Process-Supervised LLM Recommenders via Flow-guided Tuning
Chongming Gao
Mengyao Gao
Chenxiao Fan
Shuai Yuan
Wentao Shi
Xiangnan He
68
2
0
10 Mar 2025
Language Representation Favored Zero-Shot Cross-Domain Cognitive Diagnosis
Shuo Liu
Zihan Zhou
Yuanhao Liu
Jing Zhang
Hong Qian
AI4Ed
51
1
0
18 Jan 2025
Unified Parameter-Efficient Unlearning for LLMs
Chenlu Ding
Jiancan Wu
Yancheng Yuan
Jinda Lu
Kai Zhang
Alex Su
Xiang Wang
Xiangnan He
MU
KELM
88
6
0
30 Nov 2024
Preference Diffusion for Recommendation
Shuo Liu
An Zhang
Guoqing Hu
Hong Qian
Tat-Seng Chua
40
1
0
17 Oct 2024
TPO: Aligning Large Language Models with Multi-branch & Multi-step Preference Trees
Weibin Liao
Xu Chu
Yasha Wang
LRM
36
6
0
10 Oct 2024
Customizing Language Models with Instance-wise LoRA for Sequential Recommendation
Xiaoyu Kong
Jiancan Wu
An Zhang
Leheng Sheng
Hui Lin
Xiang Wang
Xiangnan He
AI4TS
48
4
0
19 Aug 2024
Direct Preference Optimization with an Offset
Afra Amini
Tim Vieira
Ryan Cotterell
68
54
0
16 Feb 2024
Noise Contrastive Alignment of Language Models with Explicit Rewards
Huayu Chen
Guande He
Lifan Yuan
Ganqu Cui
Hang Su
Jun Zhu
46
37
0
08 Feb 2024
KTO: Model Alignment as Prospect Theoretic Optimization
Kawin Ethayarajh
Winnie Xu
Niklas Muennighoff
Dan Jurafsky
Douwe Kiela
153
437
0
02 Feb 2024
Self-Rewarding Language Models
Weizhe Yuan
Richard Yuanzhe Pang
Kyunghyun Cho
Xian Li
Sainbayar Sukhbaatar
Jing Xu
Jason Weston
ReLM
SyDa
ALM
LRM
215
291
0
18 Jan 2024
Silkie: Preference Distillation for Large Visual Language Models
Lei Li
Zhihui Xie
Mukai Li
Shunian Chen
Peiyi Wang
Liang Chen
Yazheng Yang
Benyou Wang
Lingpeng Kong
MLLM
96
67
0
17 Dec 2023
Chat-REC: Towards Interactive and Explainable LLMs-Augmented Recommender System
Yunfan Gao
Tao Sheng
Youlin Xiang
Yun Xiong
Haofen Wang
Jiawei Zhang
RALM
KELM
107
242
0
25 Mar 2023
Learning Vector-Quantized Item Representation for Transferable Sequential Recommenders
Yupeng Hou
Zhankui He
Julian McAuley
Wayne Xin Zhao
54
122
0
22 Oct 2022
Training language models to follow instructions with human feedback
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
...
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLM
ALM
301
11,730
0
04 Mar 2022