Hallucinating (or poorly fed) LLMs?
The problem of data accuracy
DOI:
https://doi.org/10.6092/issn.1825-1927/18877Keywords:
Data scraping, Large Language Models, LLMs, data accuracy, AI, Artificial IntelligenceAbstract
Data scraping is crucial for large language models (LLMs) to gather substantial data for training. However, it raises concerns regarding accuracy. Web scraping systems lack filtering, leading to inaccurate and outdated information. Validating accuracy in large volumes is, however, technically demanding. Nevertheless, data accuracy is vital for output quality and user trust in LLMs. This presentation explores reconciling data scraping with accuracy, considering conflicting rights and interests at stake.
Downloads
Published
2024-01-12
How to Cite
Stringhi, E. (2023) “Hallucinating (or poorly fed) LLMs? The problem of data accuracy”, i-lex. Bologna, Italy, 16(2), pp. 54–63. doi: 10.6092/issn.1825-1927/18877.
Issue
Section
Articles
License
Copyright (c) 2023 Elisabetta Stringhi

This work is licensed under a Creative Commons Attribution 4.0 International License.