NDLI: An end-to-end learning solution for assessing the quality of Wikipedia articles

What do Wikidata and Wikipedia Have in Common?: An Analysis of their Use of External References

Current and Alternate Approaches to Personalization in Online Learning

On the Relationship between Newcomer Motivations and Contribution Barriers in Open Source Projects

An Author Network to Classify Open Online Discussions

How are Open Source Practices Possible within a Medical Diagnostics Company?: Developing and Testing a Maturity Model of Inner Source Implementation

How Is Value Created Within An Inner Source Environment?

An end-to-end learning solution for assessing the quality of Wikipedia articles

Open Peer Review CMS Support

Before the Sense of 'We': Identity Work as a Bridge from Mass Collaboration to Group Emergence

Managing Risk in Business Centric Crowdfunding Platforms

Opening up new channels for scholarly review, dissemination, and assessment

Does Miner Pooling Impact Bitcoin's Ability to Stay Decentralized?

Everyday Creativity on a University Campus: Crafting a challenge to journey beyond the formal

Sharing Knowledge about Open Source Licenses at DLR

Implementing Federated Social Networking: Report from the Trenches

When to use Rewards in Charitable Crowdfunding

On licensing and other conditions for contributing to widely used open source projects: an exploratory analysis

SMW Based VRE for Addressing Multi-Layered Data Analysis: The Use Case of Classroom Interaction Interpretation

Social Identity and Social Media Activities in Equity Crowdfunding

QueryShare: Working Together to Facilitate Exploratory Multimedia Searches without Skill in Creating

Exploring the Application of Blockchain Technology to Combat the Effects of Social Loafing in Cross Functional Group Projects

A Glimpse into Babel: An Analysis of Multilinguality in Wikidata

The Lives and Deaths of Open Source Code Forges

Brazilian Public Software Portal: an integrated platform for collaborative development

Understanding Organization and Open Source Community Relations through the Attraction-Selection-Attrition Model

The many hats and the broken binoculars: State of the practice in developer community management

Interpolating Quality Dynamics in Wikipedia and Demonstrating the Keilana Effect

An end-to-end learning solution for assessing the quality of Wikipedia articles

Content Provider	ACM Digital Library
Author	Dang, Quang-Vinh; Ignat, Claudia-Lavinia
Abstract	Wikipedia is considered the largest knowledge repository in human history and plays a crucial role in modern daily life. Assigning the correct quality class to Wikipedia articles is an important task that provides guidance to both authors and readers. Manual review cannot keep pace with the editing speed of Wikipedia, so automatic classification of article quality is required. Most existing approaches rely on traditional machine learning with manual feature engineering, which requires considerable expertise and effort. Furthermore, it is known that there is no generally perfect feature set, because information leak always occurs in the feature extraction phase. In addition, a new feature set is required for each language edition of Wikipedia. In this paper, we present a deep learning approach to quality classification of Wikipedia articles. Our solution relies on Recurrent Neural Networks (RNNs), an end-to-end learning technique that eliminates the disadvantages of feature engineering. Our approach learns directly from raw data without human intervention and is language-neutral. Experimental results on English, French and Russian Wikipedia datasets show that our approach outperforms state-of-the-art solutions.
Starting Page	1
Ending Page	10
Page Count	10
File Format	PDF
ISBN	9781450351874
DOI	10.1145/3125433.3125448
Language	English
Publisher	Association for Computing Machinery (ACM)
Publisher Date	2017-08-23
Publisher Place	New York
Access Restriction	Subscribed
Subject Keyword	Deep learning; Document quality; End-to-end learning; Recurrent neural network (RNN); Wikipedia; Long short-term memory (LSTM)
Content Type	Text
Resource Type	Article
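
The end-to-end idea in the abstract (raw article text in, quality class out, no hand-built features) can be illustrated with a minimal, untrained sketch. Everything below is an assumption made for illustration and is not the authors' model: the class labels are the English Wikipedia assessment grades, the input is raw UTF-8 bytes, the network is a plain Elman-style RNN rather than the LSTM named in the keywords, and the names `TinyRNNClassifier`, `HIDDEN` and `VOCAB` are hypothetical. The weights are random, so the predictions carry no meaning; the sketch only shows the data flow.

```python
import math
import random

# Assumed label set: English Wikipedia assessment grades (not listed in the abstract).
CLASSES = ["FA", "GA", "B", "C", "Start", "Stub"]
HIDDEN = 16   # hypothetical hidden-state size
VOCAB = 256   # one input unit per byte value, so UTF-8 text in any language is accepted


def init_matrix(rows, cols, rng):
    """Small random weights; a real model would learn these by backpropagation."""
    return [[rng.uniform(-0.1, 0.1) for _ in range(cols)] for _ in range(rows)]


class TinyRNNClassifier:
    """Untrained Elman-style RNN: raw bytes in, class probabilities out."""

    def __init__(self, seed=0):
        rng = random.Random(seed)
        self.w_in = init_matrix(VOCAB, HIDDEN, rng)           # byte -> hidden
        self.w_rec = init_matrix(HIDDEN, HIDDEN, rng)         # hidden -> hidden
        self.w_out = init_matrix(HIDDEN, len(CLASSES), rng)   # hidden -> class scores

    def predict_proba(self, text):
        h = [0.0] * HIDDEN
        for byte in text.encode("utf-8"):
            # A one-hot byte input selects one row of w_in; no manual features are computed.
            h = [
                math.tanh(
                    self.w_in[byte][j]
                    + sum(h[i] * self.w_rec[i][j] for i in range(HIDDEN))
                )
                for j in range(HIDDEN)
            ]
        scores = [
            sum(h[i] * self.w_out[i][k] for i in range(HIDDEN))
            for k in range(len(CLASSES))
        ]
        # Numerically stable softmax over the scores from the final hidden state.
        top = max(scores)
        exps = [math.exp(s - top) for s in scores]
        total = sum(exps)
        return [e / total for e in exps]

    def predict(self, text):
        probs = self.predict_proba(text)
        return CLASSES[probs.index(max(probs))]


model = TinyRNNClassifier()
probs = model.predict_proba("Wikipedia est une encyclopédie.")
label = model.predict("Wikipedia est une encyclopédie.")
```

Because the input layer is indexed by byte value, the same code accepts English, French or Russian text unchanged, which is one simple way to read the abstract's "language-neutral" claim. A practical system would train the weights on labelled articles and would more likely use an LSTM from an established deep learning library.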

Similar Documents

A Novel Modeling Strategy of Weighted Mean Temperature in China Using RNN and LSTM

Application of Long Short-Term Memory (LSTM) Neural Network for Flood Forecasting

Hybrid LSTM Neural Network for Short-Term Traffic Flow Prediction

Multimedia Data Modelling Using Multidimensional Recurrent Neural Networks

Deep Learning (CNN, RNN) Applications for Smart Homes: A Systematic Review

Control of an active suspension system based on long short-term memory (LSTM) learning

Applying PCA to Deep Learning Forecasting Models for Predicting $PM_{2.5}$

Using a Long Short-Term Memory Recurrent Neural Network (LSTM-RNN) to Classify Network Attacks

Employing a Long-Short-Term Memory Neural Network to Improve Automatic Sleep Stage Classification of Pharmaco-EEG Profiles
