NDLI: Injury Narrative Text Classification: A Preliminary Study

Please wait, while we are loading the content...

Parsing Clinical Text: How Good are the State-of-the-Art Parsers?

Discriminatory Analysis of Alzheimer's Disease through pathway Activity inference in the Resting-State brain

Grounded Feature Selection for Biomedical Relation Extraction by the Combinative Approach

Entity Linking for Biomedical Literature

Inference of Disease E3s from Integrated Functional Relation Network

An Exploration of the Collaborative Networks for Clinical and Academic Domains in AIDS Research: A Spatial Scientometric Approach

Biomedical Named Entity Recognition Based on the Combination of Regional and Global Text Features

Identification of Coexpressed Gene Modules across Multiple Brain Diseases by a Biclustering Analysis on Integrated Gene Expression Data

A Display of Conceptual Structures in the Epidemiologic Literature

Injury Narrative Text Classification: A Preliminary Study

Identifying Cancer Subtypes based on Somatic Mutation Profile

Inferring Undiscovered Public Knowledge by Using Text Mining-driven Graph Model

Systematic Identification of Context-dependent Conflicting Information in Biological Pathways

Identification of Genomic Features in the Classification of Loss- and Gain-of-Function Mutation: [Extended Abstract]

Mining the Main Health Trend of the General Public based on Opinion Mining of Korean Blogsphere

Identification of a Specific Base Sequence of Pathogenic E. Coli through a Genomic Analysis

Integrative Database for Exploring Compound Combinations of Natural Products for Medical Effects

Mining Context-Specific Rules from the Literature for Virtual Human Model Simulation

Visualization of Zoomable Network for Multi-Compounds and Multi-Targets Analysis

Construction of Multi-level Networks Incorporating Molecule, Cell, Organ and Phenotype Properties for Drug-induced Phenotype Prediction

Detecting Phosphorylation Determined Active Protein Interaction Network during Cancer Development by Robust Network Component Analysis

TILD: A Strategy to Identify Cancer-related Genes Using Title Information in Literature Data

Injury Narrative Text Classification: A Preliminary Study

Content Provider	ACM Digital Library
Author	Nayak, Richi Vallmuur, Kirsten Chen, Lin
Abstract	Description of a patient's injuries is recorded in narrative text form by hospital emergency departments. For statistical reporting, this text data needs to be mapped to pre-defined codes. Existing research in this field uses the Naïve Bayes probabilistic method to build classifiers for mapping. In this paper, we focus on providing guidance on the selection of a classification method. We build a number of classifiers belonging to different classification families such as decision tree, probabilistic, neural networks, and instance-based, ensemble-based and kernel-based linear classifiers. An extensive pre-processing is carried out to ensure the quality of data and, in hence, the quality classification outcome. The records with a null entry in injury description are removed. The misspelling correction process is carried out by finding and replacing the misspelt word with a soundlike word. Meaningful phrases have been identified and kept, instead of removing the part of phrase as a stop word. The abbreviations appearing in many forms of entry are manually identified and only one form of abbreviations is used. Clustering is utilised to discriminate between non-frequent and frequent terms. This process reduced the number of text features dramatically from about 28,000 to 5000. The medical narrative text injury dataset, under consideration, is composed of many short documents. The data can be characterized as high-dimensional and sparse, i.e., few features are irrelevant but features are correlated with one another. Therefore, Matrix factorization techniques such as Singular Value Decomposition (SVD) and Non Negative Matrix Factorization (NNMF) have been used to map the processed feature space to a lower-dimensional feature space. Classifiers with these reduced feature space have been built. In experiments, a set of tests are conducted to reflect which classification method is best for the medical text classification. The Non Negative Matrix Factorization with Support Vector Machine method can achieve 93% precision which is higher than all the tested traditional classifiers. We also found that TF/IDF weighting which works well for long text classification is inferior to binary weighting in short document classification. Another finding is that the Top-n terms should be removed in consultation with medical experts, as it affects the classification performance.
Starting Page	7
Ending Page	7
Page Count	1
File Format	PDF
ISBN	9781450312752
DOI	10.1145/2665970.2665976
Language	English
Publisher	Association for Computing Machinery (ACM)
Publisher Date	2014-11-07
Publisher Place	New York
Access Restriction	Subscribed
Subject Keyword	Narrative text classification
Content Type	Text
Resource Type	Article

Sl.	Authority	Responsibilities	Communication Details
1	Ministry of Education (GoI), Department of Higher Education	Sanctioning Authority	https://www.education.gov.in/ict-initiatives
2	Indian Institute of Technology Kharagpur	Host Institute of the Project: The host institute of the project is responsible for providing infrastructure support and hosting the project	https://www.iitkgp.ac.in
3	National Digital Library of India Office, Indian Institute of Technology Kharagpur	The administrative and infrastructural headquarters of the project	Dr. B. Sutradhar bsutra@ndl.gov.in
4	Project PI / Joint PI	Principal Investigator and Joint Principal Investigators of the project	Dr. B. Sutradhar bsutra@ndl.gov.in Prof. Saswat Chakrabarti will be added soon
5	Website/Portal (Helpdesk)	Queries regarding NDLI and its services	support@ndl.gov.in
6	Contents and Copyright Issues	Queries related to content curation and copyright issues	content@ndl.gov.in
7	National Digital Library of India Club (NDLI Club)	Queries related to NDLI Club formation, support, user awareness program, seminar/symposium, collaboration, social media, promotion, and outreach	clubsupport@ndl.gov.in
8	Digital Preservation Centre (DPC)	Assistance with digitizing and archiving copyright-free printed books	dpc@ndl.gov.in
9	IDR Setup or Support	Queries related to establishment and support of Institutional Digital Repository (IDR) and IDR workshops	idr@ndl.gov.in

A preliminary study of demonstratives in aklanon narratives.

Multidisciplinary techniques for Classifying Styles in Narrative Texts: a Preliminary Study

For patients with an acquired brain injury: a preliminary study.

Cross-lingual query classification: a preliminary study

Discursive Narrative Analysis: A Study of Online Autobiographical Accounts of Self-Injury

Discursive Narrative Analysis: A Study of Online Autobiographical Accounts of Self-Injury

The identification of narrative genres in Upper Tanana Athabascan : a preliminary study

Rehabilitation as a fight: A narrative case study of the first year after a spinal cord injury

Discursive Narrative Analysis: A Study of Online Autobiographical Accounts of Self-Injury

Injury Narrative Text Classification: A Preliminary Study

Similar Documents

A preliminary study of demonstratives in aklanon narratives.

Multidisciplinary techniques for Classifying Styles in Narrative Texts: a Preliminary Study

For patients with an acquired brain injury: a preliminary study.

Cross-lingual query classification: a preliminary study

Discursive Narrative Analysis: A Study of Online Autobiographical Accounts of Self-Injury

Discursive Narrative Analysis: A Study of Online Autobiographical Accounts of Self-Injury

The identification of narrative genres in Upper Tanana Athabascan : a preliminary study

Rehabilitation as a fight: A narrative case study of the first year after a spinal cord injury

Discursive Narrative Analysis: A Study of Online Autobiographical Accounts of Self-Injury

Injury Narrative Text Classification: A Preliminary Study