NDLI: Automatic and interactive rule inference without ground truth

Content Provider	IEEE Xplore Digital Library
Author	Carton, C.; Lemaitre, A.; Couasnon, B.
Copyright Year	2015
Description	Author affiliation: IRISA, Univ. Rennes 2, Rennes, France (Lemaitre, A.) || Univ. Eur. de Bretagne, Rennes, France (Carton, C.; Couasnon, B.)
Abstract	Designing a document recognition system from non-annotated documents is not an easy task. In general, statistical methods cannot learn without an annotated ground truth, whereas syntactical methods can. However, syntactical methods can work with non-annotated data only because a user writes the description manually. Adapting to a new kind of document is then tedious, as the whole manual knowledge-extraction process has to be redone. In this paper, we propose a method to extract knowledge and generate rules without any ground truth. Using a large volume of non-annotated documents, it is possible to study redundancies of some extracted elements in the document images. The redundancy is exploited through an automatic clustering algorithm. Interaction with the user brings semantics to the detected clusters. In this work, the extracted elements are keywords extracted with word spotting. This approach has been applied to field detection in old marriage records on the FamilySearch HIP2013 competition database. The results demonstrate that rules can be inferred automatically from non-annotated documents using the redundancy of elements extracted from the documents.
Starting Page	696
Ending Page	700
File Size	1526340
Page Count	5
File Format	PDF
e-ISBN	9781479918058
DOI	10.1109/ICDAR.2015.7333851
Language	English
Publisher	Institute of Electrical and Electronics Engineers, Inc. (IEEE)
Publisher Date	2015-08-23
Publisher Place	Tunisia
Access Restriction	Subscribed
Rights Holder	Institute of Electrical and Electronics Engineers, Inc. (IEEE)
Subject Keyword	Reliability; Learning automata; Niobium; Atmospheric modeling; Manuals
Content Type	Text
Resource Type	Article
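
Illustrative sketch (not taken from the paper): the abstract describes a pipeline in which keywords found by word spotting are clustered across many non-annotated pages, and a user then attaches meaning to the clusters. The record does not say which clustering algorithm or rule format the authors use, so the code below is only a minimal stand-in under stated assumptions: detections are given as (keyword, x, y) tuples with page-normalised coordinates, clustering is a simple greedy centroid scheme, and the names cluster_positions, infer_rules, and label_rules are hypothetical.

```python
from collections import defaultdict
from math import hypot


def cluster_positions(points, radius):
    """Greedy clustering (an assumption, not the paper's algorithm).

    Each point joins the first cluster whose running centroid lies within
    `radius`; otherwise it starts a new cluster.
    """
    clusters = []
    for x, y in points:
        for c in clusters:
            if hypot(x - c["cx"], y - c["cy"]) <= radius:
                c["members"].append((x, y))
                n = len(c["members"])
                # Incremental update of the centroid.
                c["cx"] += (x - c["cx"]) / n
                c["cy"] += (y - c["cy"]) / n
                break
        else:
            clusters.append({"cx": x, "cy": y, "members": [(x, y)]})
    return clusters


def infer_rules(detections, radius=0.05, min_support=2):
    """Turn redundant keyword positions into candidate rules.

    `detections` is an iterable of (keyword, x, y) tuples, where x and y are
    page-normalised coordinates in [0, 1]. Clusters seen fewer than
    `min_support` times are treated as noise and dropped.
    """
    by_keyword = defaultdict(list)
    for keyword, x, y in detections:
        by_keyword[keyword].append((x, y))

    rules = []
    for keyword, points in by_keyword.items():
        for c in cluster_positions(points, radius):
            if len(c["members"]) >= min_support:
                rules.append({
                    "keyword": keyword,
                    "x": c["cx"],
                    "y": c["cy"],
                    "support": len(c["members"]),
                    "label": None,  # filled in by the user afterwards
                })
    return rules


def label_rules(rules, ask):
    """Stand-in for the interactive step: `ask(rule)` returns a label."""
    for rule in rules:
        rule["label"] = ask(rule)
    return rules
```

In this sketch the only human input is the `ask` callback, which mirrors the abstract's point that the user supplies semantics after clustering rather than annotating documents beforehand. The `radius` and `min_support` values are arbitrary placeholders.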


Similar Documents

On Distinguishing between Reliable and Unreliable Sensors Without a Knowledge of the Ground Truth

A Reversible Automata Approach to Modeling Birdsongs

Semi-automatic ground truth annotation in videos: An interactive tool for polygon-based object annotation and segmentation

Automatic Model Inference of Web Applications for Security Testing

Learning from the Truth: Fully Automatic Ground Truth Generation for Training of Medical Deep Learning Networks*

Improving the Automatic Email Responding System for computer manufacturers via machine learning

Interactive Access Rule Learning: Generating Adapted Access Rule Sets (2010)
