ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2303.03751
  4. Cited By
Zeroth-Order Optimization Meets Human Feedback: Provable Learning via
  Ranking Oracles

Zeroth-Order Optimization Meets Human Feedback: Provable Learning via Ranking Oracles

7 March 2023
Zhiwei Tang
Dmitry Rybin
Tsung-Hui Chang
    ALM
    DiffM
ArXivPDFHTML

Papers citing "Zeroth-Order Optimization Meets Human Feedback: Provable Learning via Ranking Oracles"

12 / 12 papers shown
Title
Comparisons Are All You Need for Optimizing Smooth Functions
Comparisons Are All You Need for Optimizing Smooth Functions
Chenyi Zhang
Tongyang Li
AAML
18
1
0
19 May 2024
Deep Representation Learning for Multi-functional Degradation Modeling
  of Community-dwelling Aging Population
Deep Representation Learning for Multi-functional Degradation Modeling of Community-dwelling Aging Population
Suiyao Chen
Xinyi Liu
Yulei Li
Jing Wu
Handong Yao
22
5
0
08 Apr 2024
Leveraging Deep Learning and Xception Architecture for High-Accuracy MRI
  Classification in Alzheimer Diagnosis
Leveraging Deep Learning and Xception Architecture for High-Accuracy MRI Classification in Alzheimer Diagnosis
Shaojie Li
Haichen Qu
Xinqi Dong
Bo Dang
Hengyi Zang
Yulu Gong
31
10
0
24 Mar 2024
Advanced Feature Manipulation for Enhanced Change Detection Leveraging
  Natural Language Models
Advanced Feature Manipulation for Enhanced Change Detection Leveraging Natural Language Models
Zhenglin Li
Yangchen Huang
Mengran Zhu
Jingyu Zhang
Jinghao Chang
Houze Liu
21
4
0
23 Mar 2024
Development and Application of a Monte Carlo Tree Search Algorithm for
  Simulating Da Vinci Code Game Strategies
Development and Application of a Monte Carlo Tree Search Algorithm for Simulating Da Vinci Code Game Strategies
Ye Zhang
Mengran Zhu
Kailin Gui
Jiayue Yu
Yong Hao
Haozhan Sun
38
12
0
15 Mar 2024
FedLion: Faster Adaptive Federated Optimization with Fewer Communication
FedLion: Faster Adaptive Federated Optimization with Fewer Communication
Zhiwei Tang
Tsung-Hui Chang
21
5
0
15 Feb 2024
DeepGI: An Automated Approach for Gastrointestinal Tract Segmentation in
  MRI Scans
DeepGI: An Automated Approach for Gastrointestinal Tract Segmentation in MRI Scans
Ye Zhang
Yulu Gong
Dongji Cui
Xinrui Li
Xinyu Shen
35
31
0
27 Jan 2024
Sparsity-Guided Holistic Explanation for LLMs with Interpretable
  Inference-Time Intervention
Sparsity-Guided Holistic Explanation for LLMs with Interpretable Inference-Time Intervention
Zhen Tan
Tianlong Chen
Zhenyu (Allen) Zhang
Huan Liu
34
15
0
22 Dec 2023
ReConTab: Regularized Contrastive Representation Learning for Tabular
  Data
ReConTab: Regularized Contrastive Representation Learning for Tabular Data
Suiyao Chen
Jing Wu
N. Hovakimyan
Handong Yao
29
31
0
28 Oct 2023
Prompt-Tuning Decision Transformer with Preference Ranking
Prompt-Tuning Decision Transformer with Preference Ranking
Shengchao Hu
Li Shen
Ya-Qin Zhang
Dacheng Tao
OffRL
21
14
0
16 May 2023
Training language models to follow instructions with human feedback
Training language models to follow instructions with human feedback
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
...
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLM
ALM
301
11,730
0
04 Mar 2022
Low-rank Matrix Recovery With Unknown Correspondence
Low-rank Matrix Recovery With Unknown Correspondence
Zhiwei Tang
Tsung-Hui Chang
X. Ye
H. Zha
23
4
0
15 Oct 2021
1