Evalverse: Unified and Accessible Library for Large Language Model Evaluation

1 April 2024
Jihoo Kim
Wonho Song
Dahyun Kim
Yunsu Kim
Yungi Kim
Chanjun Park
    ELM

Papers citing "Evalverse: Unified and Accessible Library for Large Language Model Evaluation"

5 / 5 papers shown
Do LLMs estimate uncertainty well in instruction-following?
Juyeon Heo
Miao Xiong
Christina Heinze-Deml
Jaya Narain
ELM
36
2
0
18 Oct 2024
Do LLMs "know" internally when they follow instructions?
Juyeon Heo
Christina Heinze-Deml
Oussama Elachqar
Shirley Ren
Udhay Nallasamy
Andy Miller
Kwan Ho Ryan Chan
Jaya Narain
31
3
0
18 Oct 2024
A Comprehensive Survey on Process-Oriented Automatic Text Summarization with Exploration of LLM-Based Methods
Hanlei Jin
Yang Zhang
Dan Meng
Jun Wang
Jinghua Tan
51
76
0
05 Mar 2024
LegalBench: Prototyping a Collaborative Benchmark for Legal Reasoning
Neel Guha
Daniel E. Ho
Julian Nyarko
Christopher Ré
AILaw
ELM
84
16
0
13 Sep 2022
Scaling Laws for Neural Language Models
Jared Kaplan
Sam McCandlish
T. Henighan
Tom B. Brown
B. Chess
R. Child
Scott Gray
Alec Radford
Jeff Wu
Dario Amodei
220
3,054
0
23 Jan 2020