May 26, 2022

Conference Paper: Scholarly Plagiarism: “Citation Sentence Reuse Behavior of Scientists: A Case Study on Massive Bibliographic Text Dataset of Computer Science”

The following paper will be presented and published in theŠ proceedings of ACM/IEEE-CS 2017 Joint Conference on Digital Libraries in Toronto.


Citation Sentence Reuse Behavior of Scientists: A Case Study on Massive Bibliographic Text Dataset of Computer Science


Mayank Singh
Indian Institutes of Technology (IIT)

Abhishek Niranjan
Indian Institutes of Technology (IIT)s

Divyansh Gupta
Indian Institutes of Technology (IIT)

Nikhil Angad Bakshi
Indian Institutes of Technology (IIT)

Animesh Mukherjee
Indian Institutes of Technology (IIT)

Pawan Goyal
Indian Institutes of Technology (IIT)


via arXiv


Our current knowledge of scholarly plagiarism is largely based on the similarity between full text research articles.

In this paper, we propose an innovative and novel conceptualization of scholarly plagiarism in the form of reuse of explicit citation sentences in scientific research articles. Note that while full-text plagiarism is an indicator of a gross-level behavior, copying of citation sentences is a more nuanced micro-scale phenomenon observed even for well-known researchers. ŒThe current work poses several interesting questions and attempts to answer them by empirically investigating a large bibliographic text dataset from computer science containing millions of lines of citation sentences.

In particular, we report evidences of massive copying behavior. We also present several striking real examples throughout the paper to showcase widespread adoption of this undesirable practice. In contrast to the popular perception, we find that copying tendency increases as an author matures. Œthe copying behavior is reported to exist in all €fields of computer science; however, the theoretical fi€elds indicate more copying than the applied fi€elds.

Direct to Full Text Article (5 pages; PDF)

About Gary Price

Gary Price ( is a librarian, writer, consultant, and frequent conference speaker based in the Washington D.C. metro area. Before launching INFOdocket, Price and Shirl Kennedy were the founders and senior editors at ResourceShelf and DocuTicker for 10 years. From 2006-2009 he was Director of Online Information Services at, and is currently a contributing editor at Search Engine Land.