#AI is thwarting the study of human #language.
404media.co/project-analyzing-…
(#paywalled)
“The creator of an #OpenSource project that scraped the internet to determine the ever-changing popularity of different words in human language usage says that they are sunsetting the project because generative AI spam has poisoned the internet… ‘Now the web at large is full of #slop generated by #LLMs, written by no one to communicate nothing. Including this slop in the data skews the word frequencies.’”
Project Analyzing Human Language Usage Shuts Down Because ‘Generative AI Has Polluted the Data’
Wordfreq shuts down because “I don’t think anyone has reliable information about post-2021 language usage by humans.”
Jason Koebler (404 Media)