NDLI: Autonomous link spam detection in purely collaborative environments

Please wait, while we are loading the content...

WP:clubhouse?: an exploration of Wikipedia's gender imbalance

Quality is a verb: the operationalization of data quality in a citizen science community

A meta-reflective wiki for collaborative design

Vandalism detection in Wikipedia: a high-performing, feature-rich model and its reduction through Lasso

Hot off the wiki: dynamics, practices, and structures in Wikipedia's coverage of the Tōhoku catastrophes

The success of corporate wiki systems: an end user perspective

Don't bite the newbies: how reverts affect the quantity and quality of Wikipedia work

Wikipedia category visualization using radial layout

Wikiotics: the interactive language instruction Wiki

Apples to oranges?: comparing across studies of open collaboration/peer production

WikiLit: collecting the wiki and Wikipedia literature

Doctoral symposium: participants and overview

Gender differences in Wikipedia editing

Online and offline interactions in online communities

Wiki grows up: arbitrary data models, access control, and beyond

Autonomous link spam detection in purely collaborative environments

Collective memory building in Wikipedia: the case of North African uprisings

ICKEwiki: requirements and concepts for an enterprise wiki for SMEs

Mentoring in Wikipedia: a clash of cultures

Wiki refactoring: an assisted approach based on ballots

PukiWiki-Java Connector, a simple API for saving data of Java programs on a wiki

Lessons from the classroom: successful techniques for teaching wikis using Wikipedia

Finding patterns in behavioral observations by automatically labeling forms of wikiwork in Barnstars

Don't leave me alone: effectiveness of a framed wiki-based learning activity

Design and implementation of the Sweble Wikitext parser: unlocking the structured data of Wikipedia

NICE: social translucence through UI intervention

Wikipedia world map: method and application of map-like wiki visualization

Wiki scaffolding: helping organizations to set up wikis

"How should I go from ___ to ___ without getting killed?": motivation and benefits in open collaboration

Visualizing author contribution statistics in Wikis using an edit significance metric

Collaborative video editing for Wikipedia

$5^{th}$ Workshop on Wikis for Software Engineering

What Wikipedia deletes: characterizing dangerous collaborative content

COLT: a proposed center for open teaching and learning

Wiki4EAM: using hybrid wikis for enterprise architecture management

Participation in Wikipedia's article deletion processes

Wiki architectures as social translucence enablers

Exploring underproduction in Wikipedia

TWiki a collaboration tool for the LHC

A scourge to the pillar of neutrality: a WikiProject fighting systemic bias

Places on the map and in the cloud: representations of locality and geography in Wikipedia

Exploring linguistic points of view of Wikipedia

Feedback mechanisms and their impact on motivation to contribute to wikis in higher education

CoSyne: a framework for multilingual content synchronization of wikis

Incentivizing the ASL-STEM forum

Wiki as business application platform: the MES showcase

Autonomous link spam detection in purely collaborative environments

Content Provider	ACM Digital Library
Author	Agrawal, Avantika Baker, Phillip Lee, Insup West, Andrew G. Exline, Brittney
Abstract	Collaborative models (e.g., wikis) are an increasingly prevalent Web technology. However, the open-access that defines such systems can also be utilized for nefarious purposes. In particular, this paper examines the use of collaborative functionality to add inappropriate hyperlinks to destinations outside the host environment (i.e., link spam). The collaborative encyclopedia, Wikipedia, is the basis for our analysis. Recent research has exposed vulnerabilities in Wikipedia's link spam mitigation, finding that human editors are latent and dwindling in quantity. To this end, we propose and develop an autonomous classifier for link additions. Such a system presents unique challenges. For example, low barriers-to-entry invite a diversity of spam types, not just those with economic motivations. Moreover, issues can arise with how a link is presented (regardless of the destination). In this work, a spam corpus is extracted from over 235,000 link additions to English Wikipedia. From this, 40+ features are codified and analyzed. These indicators are computed using wiki metadata, landing site analysis, and external data sources. The resulting classifier attains 64% recall at 0.5% false-positives (ROC-AUC= 0.97). Such performance could enable egregious link additions to be blocked automatically with low false-positive rates, while prioritizing the remainder for human inspection. Finally, a live Wikipedia implementation of the technique has been developed.
Starting Page	91
Ending Page	100
Page Count	10
File Format	PDF
ISBN	9781450309097
DOI	10.1145/2038558.2038574
Language	English
Publisher	Association for Computing Machinery (ACM)
Publisher Date	2011-10-03
Publisher Place	New York
Access Restriction	Subscribed
Subject Keyword	Collaboration Information security Intelligent routing Link spam Wikipedia Reputation Spatio-temporal features Machine-learning Collaborative security Spam mitigation
Content Type	Text
Resource Type	Article

Central Library (ISO-9001:2015 Certified)
Indian Institute of Technology Kharagpur
Kharagpur, West Bengal, India | PIN - 721302

See location in the Map
03222 282435
Mail: support@ndl.gov.in

Sl.	Authority	Responsibilities	Communication Details
1	Ministry of Education (GoI), Department of Higher Education	Sanctioning Authority	https://www.education.gov.in/ict-initiatives
2	Indian Institute of Technology Kharagpur	Host Institute of the Project: The host institute of the project is responsible for providing infrastructure support and hosting the project	https://www.iitkgp.ac.in
3	National Digital Library of India Office, Indian Institute of Technology Kharagpur	The administrative and infrastructural headquarters of the project	Dr. B. Sutradhar bsutra@ndl.gov.in
4	Project PI / Joint PI	Principal Investigator and Joint Principal Investigators of the project	Dr. B. Sutradhar bsutra@ndl.gov.in Prof. Saswat Chakrabarti will be added soon
5	Website/Portal (Helpdesk)	Queries regarding NDLI and its services	support@ndl.gov.in
6	Contents and Copyright Issues	Queries related to content curation and copyright issues	content@ndl.gov.in
7	National Digital Library of India Club (NDLI Club)	Queries related to NDLI Club formation, support, user awareness program, seminar/symposium, collaboration, social media, promotion, and outreach	clubsupport@ndl.gov.in
8	Digital Preservation Centre (DPC)	Assistance with digitizing and archiving copyright-free printed books	dpc@ndl.gov.in
9	IDR Setup or Support	Queries related to establishment and support of Institutional Digital Repository (IDR) and IDR workshops	idr@ndl.gov.in