Working Paper: “The Attribution Crisis in LLM Search Results: Estimating Ecosystem Exploitation”
The working paper linked below was recently published by the Social Science Research Council (SSRC).
Title
The Attribution Crisis in LLM Search Results: Estimating Ecosystem Exploitation
Authors
Ilan Strauss
Social Science Research Council
Jangho Yang
University of Waterloo
Tim O’Reilly
Social Science Research Council
Sruly Rosenblat
Social Science Research Council
Isobel Moure
Social Science Research Council
Source
SSRC AI Disclosures Project Working Paper Series (SSRC AI WP 2025-06)
June 2025
Abstract
Web-enabled LLMs frequently answer queries without crediting the web pages they consume, creating an “attribution gap” – the difference between relevant URLs read and those actually cited. Drawing on approximately 14,000 real-world LMArena conversation logs with search-enabled LLM systems, we document three exploitation patterns: 1) No Search: 34% of Google Gemini and 24% of OpenAI GPT-4o responses are generated without explicitly fetching any online content; 2) No citation: Gemini provides no clickable citation source in 92% of answers; 3) High-volume, low-credit: Perplexity’s Sonar visits approximately 10 relevant pages per query but cites only three to four. A negative binomial hurdle model shows that the average query answered by Gemini or Sonar leaves about 3 relevant websites uncited, whereas GPT-4o’s tiny uncited gap is best explained by its selective log disclosures rather than by better attribution. Citation efficiency – extra citations provided per additional relevant web page visited – varies widely across models, from 0.19 to 0.45 on identical queries, underscoring that retrieval design, not technical limits, shapes ecosystem impact. We recommend a transparent LLM search architecture based on standardized telemetry and full disclosure of search traces and citation logs.
Direct to Full Text Working Paper
34 pages; PDF.
Filed under: Journal Articles, News
About Gary Price
Gary Price (gprice@gmail.com) is a librarian, writer, consultant, and frequent conference speaker based in the Washington D.C. metro area. He earned his MLIS degree from Wayne State University in Detroit. Price has won several awards including the SLA Innovations in Technology Award and Alumnus of the Year from the Wayne St. University Library and Information Science Program. From 2006-2009 he was Director of Online Information Services at Ask.com.



