v1v2 (latest)

I-SHEEP: Self-Alignment of LLM from Scratch through an Iterative Self-Enhancement Paradigm

15 August 2024

Ge Zhang

Tianyu Zheng

Zhenzhu Yang

Jiaheng Liu

Chenghua Lin

Lei Ma

Wenhao Huang

Jiajun Zhang

ALM

ArXiv (abs)PDF HTML HuggingFace (36 upvotes)

Papers citing "I-SHEEP: Self-Alignment of LLM from Scratch through an Iterative Self-Enhancement Paradigm"

14 / 14 papers shown

AutoMalDesc: Large-Scale Script Analysis for Cyber Threat Research

Alexandru Apostu

Andrei Preda

Alexandra Daniela Damir

107

17 Nov 2025

A Survey on Efficient Large Language Model Training: From Data-centric PerspectivesAnnual Meeting of the Association for Computational Linguistics (ACL), 2025

...

183

29 Oct 2025

From Defender to Devil? Unintended Risk Interactions Induced by LLM Defenses

187

09 Oct 2025

SeaPO: Strategic Error Amplification for Robust Preference Optimization of Large Language Models

158

29 Sep 2025

Middo: Model-Informed Dynamic Data Optimization for Enhanced LLM Fine-Tuning via Closed-Loop Learning

369

29 Aug 2025

C2-Evo: Co-Evolving Multimodal Data and Model for Self-Improving Reasoning

...

286

22 Jul 2025

RECAST: Expanding the Boundaries of LLMs' Complex Instruction Following with Multi-Constraint Data

...

511

25 May 2025

The Quest for Efficient Reasoning: A Data-Centric Benchmark to CoT Distillation

Ruichen Zhang

Rana Muhammad Shahroz Khan

301

24 May 2025

ReflectEvo: Improving Meta Introspection of Small LLMs by Learning Self-ReflectionAnnual Meeting of the Association for Computational Linguistics (ACL), 2025

314

22 May 2025

Mitigating Tail Narrowing in LLM Self-Improvement via Socratic-Guided SamplingNorth American Chapter of the Association for Computational Linguistics (NAACL), 2024

431

24 Feb 2025

Self-Improving Transformers Overcome Easy-to-Hard and Length Generalization Challenges

Dimitris Papailiopoulos

ReLM VLM LRM AI4CE

493

03 Feb 2025

Aligning Instruction Tuning with Pre-training

...

730

16 Jan 2025

From Generation to Judgment: Opportunities and Challenges of LLM-as-a-judge

...

1.3K

363

25 Nov 2024

MentalArena: Self-play Training of Language Models for Diagnosis and Treatment of Mental Health Disorders

Cheng-rong Li

May Fung

Qingyun Wang

Chi Han

Pengfei Yu

Jindong Wang

Heng Ji

AI4MH

925

09 Oct 2024