ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2310.03013
  4. Cited By
SemiReward: A General Reward Model for Semi-supervised Learning

SemiReward: A General Reward Model for Semi-supervised Learning

4 October 2023
Siyuan Li
Weiyang Jin
Zedong Wang
Fang Wu
Zicheng Liu
Cheng Tan
Stan Z. Li
ArXivPDFHTML

Papers citing "SemiReward: A General Reward Model for Semi-supervised Learning"

8 / 8 papers shown
Title
FreeMatch: Self-adaptive Thresholding for Semi-supervised Learning
FreeMatch: Self-adaptive Thresholding for Semi-supervised Learning
Yidong Wang
Hao Chen
Qiang Heng
Wenxin Hou
Yue Fan
...
Marios Savvides
T. Shinozaki
Bhiksha Raj
Bernt Schiele
Xing Xie
172
251
0
15 May 2022
Harnessing Hard Mixed Samples with Decoupled Regularizer
Harnessing Hard Mixed Samples with Decoupled Regularizer
Zicheng Liu
Siyuan Li
Ge Wang
Cheng Tan
Lirong Wu
Stan Z. Li
39
17
0
21 Mar 2022
Training language models to follow instructions with human feedback
Training language models to follow instructions with human feedback
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
...
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLM
ALM
301
11,730
0
04 Mar 2022
FlexMatch: Boosting Semi-Supervised Learning with Curriculum Pseudo
  Labeling
FlexMatch: Boosting Semi-Supervised Learning with Curriculum Pseudo Labeling
Bowen Zhang
Yidong Wang
Wenxin Hou
Hao Wu
Jindong Wang
Manabu Okumura
T. Shinozaki
AAML
207
848
0
15 Oct 2021
Co-learning: Learning from Noisy Labels with Self-supervision
Co-learning: Learning from Noisy Labels with Self-supervision
Cheng Tan
Jun-Xiong Xia
Lirong Wu
Stan Z. Li
NoLa
61
114
0
05 Aug 2021
Localization Distillation for Dense Object Detection
Localization Distillation for Dense Object Detection
Zhaohui Zheng
Rongguang Ye
Ping Wang
Dongwei Ren
W. Zuo
Qibin Hou
Ming-Ming Cheng
ObjD
93
111
0
24 Feb 2021
Meta Pseudo Labels
Meta Pseudo Labels
Hieu H. Pham
Zihang Dai
Qizhe Xie
Minh-Thang Luong
Quoc V. Le
VLM
245
648
0
23 Mar 2020
Mean teachers are better role models: Weight-averaged consistency
  targets improve semi-supervised deep learning results
Mean teachers are better role models: Weight-averaged consistency targets improve semi-supervised deep learning results
Antti Tarvainen
Harri Valpola
OOD
MoMe
244
1,279
0
06 Mar 2017
1