ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2410.04350
  4. Cited By
TIS-DPO: Token-level Importance Sampling for Direct Preference Optimization With Estimated Weights

TIS-DPO: Token-level Importance Sampling for Direct Preference Optimization With Estimated Weights

6 October 2024
Aiwei Liu
Haoping Bai
Zhiyun Lu
Yanchao Sun
Xiang Kong
Simon Wang
Jiulong Shan
Albin Madappally Jose
Xiaojiang Liu
Lijie Wen
Philip S. Yu
Meng Cao
ArXivPDFHTML

Papers citing "TIS-DPO: Token-level Importance Sampling for Direct Preference Optimization With Estimated Weights"

3 / 3 papers shown
Title
Large Language Model (LLM) for Software Security: Code Analysis, Malware Analysis, Reverse Engineering
Large Language Model (LLM) for Software Security: Code Analysis, Malware Analysis, Reverse Engineering
Hamed Jelodar
Samita Bai
Parisa Hamedi
Hesamodin Mohammadian
R. Razavi-Far
Ali Ghorbani
34
0
0
07 Apr 2025
On Benchmarking Code LLMs for Android Malware Analysis
On Benchmarking Code LLMs for Android Malware Analysis
Yiling He
Hongyu She
Xingzhi Qian
Xinran Zheng
Zhuo Chen
Z. Qin
Lorenzo Cavallaro
ELM
43
1
0
01 Apr 2025
A Contemporary Survey of Large Language Model Assisted Program Analysis
A Contemporary Survey of Large Language Model Assisted Program Analysis
Jiayimei Wang
Tao Ni
Wei-Bin Lee
Qingchuan Zhao
41
5
0
05 Feb 2025
1