Wiki-TabNER: Integrating Named Entity Recognition into Wikipedia Tables

7 March 2024
A. Koleva
Martin Ringsquandl
Ahmed Hatem
Thomas Runkler
Volker Tresp
Abstract

Interest in solving table interpretation tasks has grown over the years, yet research still relies on existing datasets that may be overly simplified. This potentially reduces the effectiveness of these datasets for thorough evaluation, and they may fail to accurately represent tables as they appear in the real world. To enrich the existing benchmark datasets, we extract and annotate a new, more challenging dataset. The proposed Wiki-TabNER dataset features complex tables containing several entities per cell, with named entities labeled using DBpedia classes. This dataset is specifically designed to address the named entity recognition (NER) task within tables, but it can also be used as a more challenging dataset for evaluating the entity linking task. In this paper we describe the distinguishing features of the Wiki-TabNER dataset and the labeling process. In addition, we propose a prompting framework for evaluating the new large language models on the within-table NER task. Finally, we perform qualitative analysis to gain insights into the challenges encountered by the models and to understand the limitations of the proposed dataset.
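
To make the idea of "several entities per cell" concrete, the following is a minimal sketch of how one table cell with span-level NER labels might be represented. It is not the dataset's actual schema: the names `EntitySpan`, `Cell`, and `span_of`, the example cell text, and the label strings `Person` and `Organisation` (chosen to resemble DBpedia class names) are all assumptions made for illustration.

```python
from dataclasses import dataclass, field
from typing import List, Tuple


@dataclass(frozen=True)
class EntitySpan:
    """Hypothetical span annotation: character offsets into the cell text."""
    start: int  # inclusive character offset
    end: int    # exclusive character offset
    label: str  # DBpedia-style class name (illustrative)


@dataclass
class Cell:
    """Hypothetical table cell that may hold several labeled entities."""
    row: int
    col: int
    text: str
    spans: List[EntitySpan] = field(default_factory=list)

    def entities(self) -> List[Tuple[str, str]]:
        # Recover (mention, label) pairs from the stored offsets.
        return [(self.text[s.start:s.end], s.label) for s in self.spans]


def span_of(text: str, mention: str, label: str) -> EntitySpan:
    # Locate the first occurrence of the mention; raises ValueError if absent.
    start = text.index(mention)
    return EntitySpan(start, start + len(mention), label)


# Illustrative cell (not taken from the dataset) with two entities.
text = "Marie Curie (Sorbonne)"
cell = Cell(
    row=0,
    col=1,
    text=text,
    spans=[
        span_of(text, "Marie Curie", "Person"),
        span_of(text, "Sorbonne", "Organisation"),
    ],
)
print(cell.entities())
```

Storing offsets rather than only the mention strings keeps each label tied to a specific position in the cell, which matters when the same surface form appears more than once.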

@article{koleva2025_2403.04577,
  title={Wiki-TabNER: Integrating Named Entity Recognition into Wikipedia Tables},
  author={Aneta Koleva and Martin Ringsquandl and Ahmed Hatem and Thomas Runkler and Volker Tresp},
  journal={arXiv preprint arXiv:2403.04577},
  year={2025}
}