BLEnD: A Benchmark for LLMs on Everyday Knowledge in Diverse Cultures and Languages

17 January 2025
Junho Myung
Nayeon Lee
Yi Zhou
Jiho Jin
Rifki Afina Putri
Dimosthenis Antypas
Hsuvas Borkakoty
Eunsu Kim
Carla Pérez-Almendros
A. Ayele
Víctor Gutiérrez-Basulto
Yazmín Ibáñez-García
Hwaran Lee
Shamsuddeen Hassan Muhammad
K. Park
A. Rzayev
Nina White
Seid Muhie Yimam
Mohammad Taher Pilehvar
N. Ousidhoum
Jose Camacho-Collados
Alice H. Oh

Papers citing "BLEnD: A Benchmark for LLMs on Everyday Knowledge in Diverse Cultures and Languages"

26 / 26 papers shown
An Evaluation of Cultural Value Alignment in LLM
Nicholas Sukiennik
Chen Gao
Fengli Xu
Y. Li
24
0
0
11 Apr 2025
Surveying Professional Writers on AI: Limitations, Expectations, and Fears
Anastasiia Ivanova
Natalia Fedorova
Sergey Tilga
Ekaterina Artemova
20
0
0
07 Apr 2025
Can LLMs Grasp Implicit Cultural Values? Benchmarking LLMs' Metacognitive Cultural Intelligence with CQ-Bench
Ziyi Liu
Priyanka Dey
Zhenyu Zhao
Jen-Tse Huang
Rahul Gupta
Y. Liu
Jieyu Zhao
28
0
0
01 Apr 2025
The Case for "Thick Evaluations" of Cultural Representation in AI
Rida Qadri
Mark Díaz
Ding Wang
Michael Madaio
37
2
0
24 Mar 2025
SaudiCulture: A Benchmark for Evaluating Large Language Models Cultural Competence within Saudi Arabia
Lama Ayash
Hassan Alhuzali
Ashwag Alasmari
Sultan Aloufi
36
0
0
21 Mar 2025
LLM Alignment for the Arabs: A Homogenous Culture or Diverse Ones?
Amr Keleg
46
0
0
19 Mar 2025
Towards properties of adversarial image perturbations
Egor Kuznetsov
Kirill Aistov
Maxim Koroteev
54
0
0
18 Mar 2025
Empowering Smaller Models: Tuning LLaMA and Gemma with Chain-of-Thought for Ukrainian Exam Tasks
Mykyta Syromiatnikov
Victoria Ruvinskaya
Nataliia Komleva
ALM
LRM
65
0
0
18 Mar 2025
Uncovering inequalities in new knowledge learning by large language models across different languages
Chenglong Wang
Haoyu Tang
Xiyuan Yang
Yueqi Xie
Jina Suh
...
Junming Huang
Yu Xie
Zhaoya Gong
Xing Xie
Fangzhao Wu
51
0
0
06 Mar 2025
Evaluating Polish linguistic and cultural competency in large language models
Sławomir Dadas
Małgorzata Grębowiec
Michał Perełkiewicz
Rafał Poświata
ELM
39
1
0
02 Mar 2025
Palm: A Culturally Inclusive and Linguistically Diverse Dataset for Arabic LLMs
Fakhraddin Alwajih
Abdellah El Mekki
Samar Magdy
AbdelRahim Elmadany
Omer Nacar
...
Anis Koubaa
Ismail Berrada
Mustafa Jarrar
Shady Shehata
Muhammad Abdul-Mageed
24
1
0
28 Feb 2025
Sailor2: Sailing in South-East Asia with Inclusive Multilingual LLMs
Longxu Dou
Qian Liu
Fan Zhou
Changyu Chen
Zili Wang
...
Tianyu Pang
Chao Du
Xinyi Wan
Wei Lu
Min Lin
82
1
0
18 Feb 2025
Diffusion Models Through a Global Lens: Are They Culturally Inclusive?
Zahra Bayramli
Ayhan Suleymanzade
Na Min An
Huzama Ahmad
Eunsu Kim
Junyeong Park
James Thorne
Alice H. Oh
89
0
0
13 Feb 2025
INCLUDE: Evaluating Multilingual Language Understanding with Regional Knowledge
Angelika Romanou
Negar Foroutan
Anna Sotnikova
Zeming Chen
Sree Harsha Nelaturu
...
Mike Zhang
Imanol Schlag
Marzieh Fadaee
Sara Hooker
Antoine Bosselut
ELM
92
5
0
29 Nov 2024
MM-Eval: A Multilingual Meta-Evaluation Benchmark for LLM-as-a-Judge and Reward Models
Guijin Son
Dongkeun Yoon
Juyoung Suk
Javier Aula-Blasco
Mano Aslan
Vu Trong Kim
Shayekh Bin Islam
Jaume Prats-Cristià
Lucía Tormo-Bañuelos
Seungone Kim
ELM
LRM
25
0
0
23 Oct 2024
Can Language Models Reason about Individualistic Human Values and Preferences?
Liwei Jiang
Taylor Sorensen
Sydney Levine
Yejin Choi
26
7
0
04 Oct 2024
CulturalBench: a Robust, Diverse and Challenging Benchmark on Measuring the (Lack of) Cultural Knowledge of LLMs
Yu Ying Chiu
Liwei Jiang
Bill Yuchen Lin
Chan Young Park
Shuyue Stella Li
...
Mehar Bhatia
Maria Antoniak
Yulia Tsvetkov
Vered Shwartz
Yejin Choi
ELM
ALM
39
18
0
03 Oct 2024
AraDiCE: Benchmarks for Dialectal and Cultural Capabilities in LLMs
Basel Mousi
Nadir Durrani
Fatema Ahmad
Md. Arid Hasan
Maram Hasanain
Tameem Kabbani
Fahim Dalvi
Shammur A. Chowdhury
Firoj Alam
31
0
0
17 Sep 2024
Evaluating Cultural Awareness of LLMs for Yoruba, Malayalam, and English
Fiifi Dawson
Zainab Mosunmola
Sahil Pocker
Raj Abhijit Dandekar
Rajat Dandekar
Sreedath Panat
28
3
0
14 Sep 2024
NativQA: Multilingual Culturally-Aligned Natural Query for LLMs
Md. Arid Hasan
Maram Hasanain
Fatema Ahmad
Sahinur Rahman Laskar
Sunaya Upadhyay
Vrunda N. Sukhadia
Mucahid Kutlu
Shammur A. Chowdhury
Firoj Alam
28
4
0
13 Jul 2024
CulturePark: Boosting Cross-cultural Understanding in Large Language Models
Cheng-rong Li
Damien Teney
Linyi Yang
Qingsong Wen
Xing Xie
Jindong Wang
37
4
0
24 May 2024
IndoCulture: Exploring Geographically-Influenced Cultural Commonsense Reasoning Across Eleven Indonesian Provinces
Fajri Koto
Rahmad Mahendra
Nurul Aisyah
Timothy Baldwin
LRM
59
16
0
02 Apr 2024
CLIcK: A Benchmark Dataset of Cultural and Linguistic Intelligence in Korean
Eunsu Kim
Juyoung Suk
Philhoon Oh
Haneul Yoo
James Thorne
Alice H. Oh
ELM
61
15
0
11 Mar 2024
CultureLLM: Incorporating Cultural Differences into Large Language Models
Cheng-rong Li
Mengzhou Chen
Jindong Wang
Sunayana Sitaram
Xing Xie
VLM
43
17
0
09 Feb 2024
A Predictive Factor Analysis of Social Biases and Task-Performance in Pretrained Masked Language Models
Yi Zhou
Jose Camacho-Collados
Danushka Bollegala
71
6
0
19 Oct 2023
Language Models as Knowledge Bases?
Fabio Petroni
Tim Rocktäschel
Patrick Lewis
A. Bakhtin
Yuxiang Wu
Alexander H. Miller
Sebastian Riedel
KELM
AI4MH
393
2,216
0
03 Sep 2019