As artificial intelligence companies race to secure reliable and well-organized data for training large language models, ...