
Beyond Gemini-3-Pro: Revisiting LLM Routing and Aggregation at Scale

Shengji Tang
Weihao Lin
Peng Ye
Jingqi Ye
Hao Li
Yiqun Zhang
Xiaosong Wang
Bo Zhang
Shuyue Hu
Tao Chen
Lei Bai
Wanli Ouyang
Main: 8 pages · 12 figures · 8 tables · Bibliography: 3 pages · Appendix: 10 pages
Abstract

Large Language Models (LLMs) have advanced rapidly, with Gemini-3-Pro setting a new performance milestone. In this work, we explore collective intelligence as an alternative to monolithic scaling and demonstrate that collaboration among open-source LLMs can surpass Gemini-3-Pro. We first revisit LLM routing and aggregation at scale and identify three key bottlenecks: (1) current training-free routers are limited by a query-based paradigm that relies solely on textual similarity; (2) recent aggregation methods remain largely static, failing to select appropriate aggregators for different tasks; (3) the complementarity between routing and aggregation remains underutilized. To address these problems, we introduce JiSi, a novel framework designed to unlock the full potential of LLM collaboration through three innovations: (1) Query-Response Mixed Routing, which captures both semantic information and problem difficulty; (2) Support-Set-based Aggregator Selection, which jointly evaluates each aggregator's aggregation and domain capacity; (3) an Adaptive Routing-Aggregation Switch that dynamically exploits the respective advantages of routing and aggregation. Comprehensive experiments on nine benchmarks demonstrate that JiSi surpasses Gemini-3-Pro at only 47% of the cost by orchestrating ten open-source LLMs, while also outperforming mainstream baselines. This suggests that collective intelligence offers a promising path toward Artificial General Intelligence (AGI).
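To make the routing-vs-aggregation trade-off concrete, here is a minimal toy sketch of an adaptive switch: route a query to a single model when one model's estimated fit is high, and fall back to aggregating all responses otherwise. This is an illustration only; the function names, the scoring function, and the threshold rule are assumptions for exposition, not JiSi's actual method as described in the paper.

```python
# Hypothetical sketch of an adaptive routing-aggregation switch.
# All names and the scoring heuristic are illustrative assumptions,
# not the paper's actual algorithm.

def route_or_aggregate(query, models, score, aggregate, threshold=0.8):
    """Route to the single best-fit model when its estimated score is high;
    otherwise aggregate responses from all candidate models."""
    # Score each model's estimated fit for this query (e.g., derived from
    # query-response similarity on a support set -- assumed here).
    scores = {name: score(query, name) for name in models}
    best = max(scores, key=scores.get)
    if scores[best] >= threshold:
        # Routing branch: a single confident model answers, saving cost.
        return models[best](query)
    # Aggregation branch: query every model and combine their responses.
    responses = [m(query) for m in models.values()]
    return aggregate(query, responses)

# Toy usage with stub models and a majority-vote aggregator.
models = {"a": lambda q: "4", "b": lambda q: "4", "c": lambda q: "5"}
score = lambda q, name: 0.9 if name == "a" else 0.5
majority = lambda q, rs: max(set(rs), key=rs.count)
print(route_or_aggregate("2+2?", models, score, majority))  # routes to "a" -> "4"
```

With the default threshold, model "a" scores 0.9 and answers alone; raising the threshold above 0.9 forces the aggregation branch, where the majority vote over all three stub responses decides.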
