Cyber-pirates are offering music CDs, video clips and books on the web in digital format to a large audience at virtually no cost. Content publishers such as Disney and Sony Records therefore expect to lose several billions of dollars in copyright revenues over the next few years. To address this problem, we propose building a copy detection system (CDS), where content publishers will register their valuable digital content. The CDS then crawls the web, compares the web content to the registered content, and notifies the content owners of illegal copies. In this dissertation, we discuss how to build such a system so it is accurate, scalable (e.g., to hundreds of gigabytes of data, or millions of web pages) and resilient to “attacks” (e.g., copying a resampled audio clip) from cyber-pirates. We also discuss two prototype CDS systems we have built as “proofs of concept:” (1) SCAM (Stanford Copy Analysis Mechanism) for text documents, and (2) DECEIVE (Detecting Copies of Internet Video) for video sequences.
Cited By
- Mou L, Huang T, Tian Y, Jiang M and Gao W (2013). Content-based copy detection through multimodal feature representation and temporal pyramid matching, ACM Transactions on Multimedia Computing, Communications, and Applications, 10:1, (1-20), Online publication date: 1-Dec-2013.
- Lin Y, Wu J, Gao P, Xia Y and Mao T LSH-based large scale chinese calligraphic character recognition Proceedings of the 13th ACM/IEEE-CS joint conference on Digital libraries, (323-330)
- Lee J, Lee S, Seo Y and Yoo W Robust video fingerprinting based on hierarchical symmetric difference feature Proceedings of the 20th ACM international conference on Information and knowledge management, (2089-2092)
- Malkin M and Venkatesan R Comparison of texts streams in the presence of mild adversaries Proceedings of the 2005 Australasian workshop on Grid computing and e-research - Volume 44, (179-186)
- Jamkhedkar P and Heileman G DRM as a layered system Proceedings of the 4th ACM workshop on Digital rights management, (11-21)
- Datar M, Immorlica N, Indyk P and Mirrokni V Locality-sensitive hashing scheme based on p-stable distributions Proceedings of the twentieth annual symposium on Computational geometry, (253-262)
- Kankanhalli M and Hau K (2019). Watermarking of Electronic Text Documents, Electronic Commerce Research, 2:1-2, (169-187), Online publication date: 1-Jan-2002.