ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2408.08072
  4. Cited By
I-SHEEP: Self-Alignment of LLM from Scratch through an Iterative
  Self-Enhancement Paradigm
v1v2 (latest)

I-SHEEP: Self-Alignment of LLM from Scratch through an Iterative Self-Enhancement Paradigm

15 August 2024
Yiming Liang
Ge Zhang
Xingwei Qu
Tianyu Zheng
Jiawei Guo
Xinrun Du
Zhenzhu Yang
Jiaheng Liu
Chenghua Lin
Lei Ma
Wenhao Huang
Jiajun Zhang
    ALM
ArXiv (abs)PDFHTMLHuggingFace (36 upvotes)

Papers citing "I-SHEEP: Self-Alignment of LLM from Scratch through an Iterative Self-Enhancement Paradigm"

14 / 14 papers shown
AutoMalDesc: Large-Scale Script Analysis for Cyber Threat Research
AutoMalDesc: Large-Scale Script Analysis for Cyber Threat Research
Alexandru Apostu
Andrei Preda
Alexandra Daniela Damir
Diana Bolocan
Radu Tudor Ionescu
Ioana Croitoru
Mihaela Găman
107
0
0
17 Nov 2025
A Survey on Efficient Large Language Model Training: From Data-centric Perspectives
A Survey on Efficient Large Language Model Training: From Data-centric PerspectivesAnnual Meeting of the Association for Computational Linguistics (ACL), 2025
Junyu Luo
Bohan Wu
Xiao Luo
Zhiping Xiao
Yiqiao Jin
...
Nan Yin
Yifan Wang
Jingyang Yuan
Wei Ju
Ming Zhang
183
8
0
29 Oct 2025
From Defender to Devil? Unintended Risk Interactions Induced by LLM Defenses
From Defender to Devil? Unintended Risk Interactions Induced by LLM Defenses
Xiangtao Meng
Tianshuo Cong
Li Wang
Wenyu Chen
Zheng Li
Shanqing Guo
Xiaoyun Wang
AAML
187
2
0
09 Oct 2025
SeaPO: Strategic Error Amplification for Robust Preference Optimization of Large Language Models
SeaPO: Strategic Error Amplification for Robust Preference Optimization of Large Language Models
Jun Rao
Yunjie Liao
Xuebo Liu
Zepeng Lin
Lian Lian
Dong Jin
Shengjun Cheng
Jun-chen Yu
Min Zhang
158
0
0
29 Sep 2025
Middo: Model-Informed Dynamic Data Optimization for Enhanced LLM Fine-Tuning via Closed-Loop Learning
Middo: Model-Informed Dynamic Data Optimization for Enhanced LLM Fine-Tuning via Closed-Loop Learning
Zinan Tang
Xin Gao
Qizhi Pei
Zhuoshi Pan
Mengzhang Cai
Jiang Wu
Conghui He
Lijun Wu
SyDa
369
2
0
29 Aug 2025
C2-Evo: Co-Evolving Multimodal Data and Model for Self-Improving Reasoning
C2-Evo: Co-Evolving Multimodal Data and Model for Self-Improving Reasoning
Xiuwei Chen
Wentao Hu
Hanhui Li
Jun Zhou
Zisheng Chen
...
Kui Zhang
Yu-Jie Yuan
J. N. Han
Hang Xu
Xiaodan Liang
SyDaLRM
286
6
0
22 Jul 2025
RECAST: Expanding the Boundaries of LLMs' Complex Instruction Following with Multi-Constraint Data
RECAST: Expanding the Boundaries of LLMs' Complex Instruction Following with Multi-Constraint Data
Wenhao Liu
Wenhao Liu
Mingchen Xie
Jingwen Xu
Zisu Huang
...
Changze Lv
He-Da Wang
Qi Zhang
Xiaoqing Zheng
Xuanjing Huang
511
1
0
25 May 2025
The Quest for Efficient Reasoning: A Data-Centric Benchmark to CoT Distillation
The Quest for Efficient Reasoning: A Data-Centric Benchmark to CoT Distillation
Ruichen Zhang
Rana Muhammad Shahroz Khan
Zhen Tan
Dawei Li
Song Wang
Tianlong Chen
LRM
301
2
0
24 May 2025
ReflectEvo: Improving Meta Introspection of Small LLMs by Learning Self-Reflection
ReflectEvo: Improving Meta Introspection of Small LLMs by Learning Self-ReflectionAnnual Meeting of the Association for Computational Linguistics (ACL), 2025
Jiaqi Li
Xinyi Dong
Yang Liu
Zhizhuo Yang
Quansen Wang
Xiaobo Wang
Songchun Zhu
Zixia Jia
Zilong Zheng
ObjDSyDaLRM
314
5
0
22 May 2025
Mitigating Tail Narrowing in LLM Self-Improvement via Socratic-Guided Sampling
Mitigating Tail Narrowing in LLM Self-Improvement via Socratic-Guided SamplingNorth American Chapter of the Association for Computational Linguistics (NAACL), 2024
Yiwen Ding
Zhiheng Xi
Wei He
Zhuoyuan Li
Yitao Zhai
Xiaowei Shi
Xunliang Cai
Tao Gui
Tao Gui
Qi Zhang
LRM
431
15
0
24 Feb 2025
Self-Improving Transformers Overcome Easy-to-Hard and Length Generalization Challenges
Self-Improving Transformers Overcome Easy-to-Hard and Length Generalization Challenges
Nayoung Lee
Ziyang Cai
Avi Schwarzschild
Kangwook Lee
Dimitris Papailiopoulos
ReLMVLMLRMAI4CE
493
23
0
03 Feb 2025
Aligning Instruction Tuning with Pre-training
Aligning Instruction Tuning with Pre-training
Yiming Liang
Tianyu Zheng
Xinrun Du
Ge Zhang
Qingbin Liu
...
Guoyin Wang
Rundong Wang
Wenhao Huang
Jiajun Zhang
Xiang Yue
730
9
0
16 Jan 2025
From Generation to Judgment: Opportunities and Challenges of LLM-as-a-judge
From Generation to Judgment: Opportunities and Challenges of LLM-as-a-judge
Dawei Li
Bohan Jiang
Liangjie Huang
Alimohammad Beigi
Chengshuai Zhao
...
Canyu Chen
Tianhao Wu
Kai Shu
Lu Cheng
Huan Liu
ELMAILaw
1.3K
363
0
25 Nov 2024
MentalArena: Self-play Training of Language Models for Diagnosis and Treatment of Mental Health Disorders
MentalArena: Self-play Training of Language Models for Diagnosis and Treatment of Mental Health Disorders
Cheng-rong Li
May Fung
Qingyun Wang
Chi Han
Pengfei Yu
Jindong Wang
Heng Ji
AI4MH
925
2
0
09 Oct 2024
1
Page 1 of 1