v1v2 (latest)

GLM-130B: An Open Bilingual Pre-trained Model

International Conference on Learning Representations (ICLR), 2022

5 October 2022

Xiao Liu

Ming Ding

Yuxiao Dong

ArXiv (abs)PDF HTML HuggingFace (3 upvotes)Github (7683★)

Papers citing "GLM-130B: An Open Bilingual Pre-trained Model"

50 / 779 papers shown

One Attention, One Scale: Phase-Aligned Rotary Positional Embeddings for Mixed-Resolution Diffusion Transformer

181

24 Nov 2025

ADF-LoRA: Alternating Low-Rank Aggregation for Decentralized Federated Fine-Tuning

115

23 Nov 2025

HFL-FlowLLM: Large Language Models for Network Traffic Flow Classification in Heterogeneous Federated Learning

Jiazhuo Tian

Yachao Yuan

150

18 Nov 2025

BudgetLeak: Membership Inference Attacks on RAG Systems via the Generation Budget Side Channel

311

15 Nov 2025

CLIP is All You Need for Human-like Semantic Representations in Stable Diffusion

138

11 Nov 2025

Large language model-based task planning for service robots: A review

278

27 Oct 2025

KnowMol: Advancing Molecular Large Language Models with Multi-Level Chemical Knowledge

185

22 Oct 2025

DETree: DEtecting Human-AI Collaborative Texts via Tree-Structured Hierarchical Representation Learning

309

20 Oct 2025

Vision-Centric Activation and Coordination for Multimodal Large Language Models

427

16 Oct 2025

Enabling Doctor-Centric Medical AI with LLMs through Workflow-Aligned Tasks and Benchmarks

...

250

13 Oct 2025

Diversity Boosts AI-Generated Text Detection

Advik Raj Basani

Pin-Yu Chen

DeLMO

416

23 Sep 2025

EvoEmpirBench: Dynamic Spatial Reasoning with Agent-ExpVer

251

16 Sep 2025

ComputerRL: Scaling End-to-End Online Reinforcement Learning for Computer Use Agents

220

19 Aug 2025

MAC: A Live Benchmark for Multimodal Large Language Models in Scientific Understanding

184

14 Aug 2025

FPEdit: Robust LLM Fingerprinting through Localized Parameter Editing

312

04 Aug 2025

AMix-1: A Pathway to Test-Time Scalable Protein Foundation Model

...

350

11 Jul 2025

Small Batch Size Training for Language Models: When Vanilla SGD Works, and Why Gradient Accumulation Is Wasteful

498

09 Jul 2025

Improving Factuality for Dialogue Response Generation via Graph-Based Knowledge Augmentation

328

14 Jun 2025

AdaDecode: Accelerating LLM Decoding with Adaptive Layer Parallelism

414

04 Jun 2025

Look Within or Look Beyond? A Theoretical Comparison Between Parameter-Efficient and Full Fine-Tuning

235

28 May 2025

MiniLongBench: The Low-cost Long Context Understanding Benchmark for Large Language ModelsAnnual Meeting of the Association for Computational Linguistics (ACL), 2025

393

26 May 2025

Benchmarking and Rethinking Knowledge Editing for Large Language Models

294

24 May 2025

EvdCLIP: Improving Vision-Language Retrieval with Entity Visual Descriptions from Large Language ModelsAAAI Conference on Artificial Intelligence (AAAI), 2025

604

24 May 2025

L-MTP: Leap Multi-Token Prediction Beyond Adjacent Context for Large Language Models

494

23 May 2025

RBench-V: A Primary Assessment for Visual Reasoning Models with Multi-modal Outputs

...

448

22 May 2025

FedSEA-LLaMA: A Secure, Efficient and Adaptive Federated Splitting Framework for Large Language Models

494

21 May 2025

The Unreasonable Effectiveness of Entropy Minimization in LLM Reasoning

566

125

21 May 2025

Gated Attention for Large Language Models: Non-linearity, Sparsity, and Attention-Sink-Free

...

1.1K

143

10 May 2025

R-Bench: Graduate-level Multi-disciplinary Benchmarks for LLM & MLLM Complex Reasoning Evaluation

...

431

04 May 2025

Enhancing LLM-Based Agents via Global Planning and Hierarchical Execution

593

23 Apr 2025

Efficient Evaluation of Large Language Models via Collaborative Filtering

Xu-Xiang Zhong

Chao Yi

Han-Jia Ye

384

05 Apr 2025

Building Instruction-Tuning Datasets from Human-Written Instructions with Open-Weight Large Language Models

...

505

31 Mar 2025

Model Hemorrhage and the Robustness Limits of Large Language Models

371

31 Mar 2025

Not a nuisance but a useful heuristic: Outlier dimensions favor frequent tokens in language models

631

27 Mar 2025

J&H: Evaluating the Robustness of Large Language Models Under Knowledge-Injection Attacks in Legal DomainAAAI Conference on Artificial Intelligence (AAAI), 2025

390

24 Mar 2025

ComfyGPT: A Self-Optimizing Multi-Agent System for Comprehensive ComfyUI Workflow Generation

352

22 Mar 2025

Crab: A Unified Audio-Visual Scene Understanding Model with Explicit CooperationComputer Vision and Pattern Recognition (CVPR), 2025

302

17 Mar 2025

LLMSeR: Enhancing Sequential Recommendation via LLM-based Data Augmentation

352

16 Mar 2025

FedALT: Federated Fine-Tuning through Adaptive Local Training with Rest-of-World LoRA

549

14 Mar 2025

CalliReader: Contextualizing Chinese Calligraphy via an Embedding-Aligned Vision-Language Model

366

13 Mar 2025

LightPlanner: Unleashing the Reasoning Capabilities of Lightweight Large Language Models in Task Planning

329

11 Mar 2025

Compositional Translation: A Novel LLM-based Approach for Low-resource Machine Translation

A. Zebaze

Benoît Sagot

Rachel Bawden

336

06 Mar 2025

Adaptive Keyframe Sampling for Long Video UnderstandingComputer Vision and Pattern Recognition (CVPR), 2025

318

119

28 Feb 2025

Learning to Generate Structured Output with Schema Reinforcement LearningAnnual Meeting of the Association for Computational Linguistics (ACL), 2025

692

26 Feb 2025

GradientStabilizer:Fix the Norm, Not the Gradient

...

Qingsong Wen

Shiwei Liu

442

24 Feb 2025

PiCO: Peer Review in LLMs based on the Consistency Optimization

614

24 Feb 2025

EPERM: An Evidence Path Enhanced Reasoning Model for Knowledge Graph Question and AnsweringAAAI Conference on Artificial Intelligence (AAAI), 2025

307

22 Feb 2025

EvoP: Robust LLM Inference via Evolutionary Pruning

704

19 Feb 2025

A Survey of Personalized Large Language Models: Progress and Future Directions

397

17 Feb 2025

Refining Positive and Toxic Samples for Dual Safety Self-Alignment of LLMs with Minimal Human Interventions

393

08 Feb 2025