NDLI: Feature Selection Methods for Early Predictive Biomarker Discovery Using Untargeted Metabolomic Data

Please wait, while we are loading the content...

Frontiers in Aerospace Engineering

Frontiers in Aging

Frontiers in Aging Neuroscience

Frontiers in Agronomy

Frontiers in Allergy

Frontiers in Amphibian and Reptile Science

Frontiers in Analytical Science

Frontiers in Anesthesiology

Frontiers in Animal Science

Frontiers in Antennas and Propagation

Frontiers in Antibiotics

Frontiers in Applied Mathematics and Statistics

Frontiers in Aquaculture

Frontiers in Arachnid Science

Frontiers in Artificial Intelligence

Frontiers in Astronomy and Space Sciences

Frontiers in Audiology and Otology

Frontiers in Bacteriology

Frontiers in Batteries and Electrochemistry

Frontiers in Bee Science

Frontiers in Behavioral Economics

Frontiers in Behavioral Neuroscience

Frontiers in Big Data

Frontiers in Bioengineering and Biotechnology

Frontiers in Bioinformatics

Frontiers in Biomaterials Science

Frontiers in Biophysics

Frontiers in Bird Science

Frontiers in Blockchain

Frontiers in Built Environment

Frontiers in Carbon

Frontiers in Cardiovascular Medicine

Frontiers in Catalysis

Frontiers in Cell Death

Frontiers in Cell and Developmental Biology

Frontiers in Cellular Neuroscience

Frontiers in Cellular and Infection Microbiology

Frontiers in Ceramics

Frontiers in Chemical Biology

Frontiers in Chemical Engineering

Frontiers in Chemistry

Frontiers in Child and Adolescent Psychiatry

Frontiers in Climate

Frontiers in Clinical Diabetes and Healthcare

Frontiers in Coatings, Dyes and Interface Engineering

Frontiers in Cognition

Frontiers in Communication

Frontiers in Communications and Networks

Frontiers in Complex Systems

Frontiers in Computational Neuroscience

Frontiers in Computer Science

Frontiers in Conservation Science

Frontiers in Control Engineering

Frontiers in Dementia

Frontiers in Dental Medicine

Frontiers in Digital Health

Frontiers in Digital Humanities

Frontiers in Disaster and Emergency Medicine

Frontiers in Drug Delivery

Frontiers in Drug Discovery

Frontiers in Drug Safety and Regulation

Frontiers in Earth Science

Frontiers in Ecology and Evolution

Frontiers in Education

Frontiers in Electronic Materials

Frontiers in Electronics

Frontiers in Endocrinology

Frontiers in Energy Efficiency

Frontiers in Energy Research

Frontiers in Environmental Archaeology

Frontiers in Environmental Chemistry

Frontiers in Environmental Economics

Frontiers in Environmental Engineering

Frontiers in Environmental Health

Frontiers in Environmental Science

Frontiers in Epidemiology

Frontiers in Epigenetics and Epigenomics

Frontiers in Ethology

Frontiers in Food Science and Technology

Frontiers in Forests and Global Change

Frontiers in Freshwater Science

Frontiers in Fuels

Frontiers in Fungal Biology

Frontiers in Future Transportation

Frontiers in Gastroenterology

Frontiers in Genetics

Frontiers in Genome Editing

Frontiers in Geochemistry

Frontiers in Global Women's Health

Frontiers in Health Services

Frontiers in Hematology

Frontiers in High Performance Computing

Frontiers in Horticulture

Frontiers in Human Dynamics

Frontiers in Human Neuroscience

Frontiers in ICT

Frontiers in Imaging

Frontiers in Immunology

Frontiers in Industrial Engineering

Frontiers in Insect Science

Frontiers in Integrative Neuroscience

Frontiers in Lab on a Chip Technologies

Frontiers in Language Sciences

Frontiers in Lupus

Frontiers in Malaria

Frontiers in Mammal Science

Frontiers in Manufacturing Technology

Frontiers in Marine Science

Frontiers in Materials

Frontiers in Mechanical Engineering

Frontiers in Medical Engineering

Frontiers in Medical Technology

Frontiers in Medicine

Frontiers in Membrane Science and Technology

Frontiers in Metals and Alloys

Frontiers in Microbiology

Frontiers in Microbiomes

Frontiers in Molecular Biosciences

Year: 2023 Month: August Volume: 10

Year: 2023 Month: June Volume: 10

Year: 2023 Month: May Volume: 10

Year: 2023 Month: April Volume: 10

Year: 2023 Month: March Volume: 10

Year: 2023 Month: February Volume: 10

Year: 2023 Month: January Volume: 10

Year: 2023 Month: February Volume: 9

Year: 2023 Month: January Volume: 9

Year: 2022 Month: December Volume: 9

Year: 2022 Month: November Volume: 9

Year: 2022 Month: October Volume: 9

Year: 2022 Month: September Volume: 9

Year: 2022 Month: August Volume: 9

Year: 2022 Month: July Volume: 9

Year: 2022 Month: June Volume: 9

Year: 2022 Month: May Volume: 9

Year: 2022 Month: April Volume: 9

Year: 2022 Month: March Volume: 9

Year: 2022 Month: March Volume: 8

Year: 2022 Month: February Volume: 9

Year: 2022 Month: February Volume: 8

Year: 2022 Month: January Volume: 9

Year: 2022 Month: January Volume: 8

Year: 2021 Month: December Volume: 8

Year: 2021 Month: November Volume: 8

Year: 2021 Month: October Volume: 8

Year: 2021 Month: September Volume: 8

Year: 2021 Month: August Volume: 8

Year: 2021 Month: August Volume: 7

Year: 2021 Month: July Volume: 8

Year: 2021 Month: June Volume: 8

Year: 2021 Month: May Volume: 8

Year: 2021 Month: April Volume: 8

Year: 2021 Month: April Volume: 7

Year: 2021 Month: March Volume: 8

Year: 2021 Month: March Volume: 7

Year: 2021 Month: February Volume: 8

Year: 2021 Month: February Volume: 7

Year: 2021 Month: January Volume: 7

Year: 2020 Month: December Volume: 7

Year: 2020 Month: November Volume: 7

Year: 2020 Month: October Volume: 7

Year: 2020 Month: September Volume: 7

Year: 2020 Month: August Volume: 7

Year: 2020 Month: July Volume: 7

Year: 2020 Month: June Volume: 7

Year: 2020 Month: May Volume: 7

Year: 2020 Month: April Volume: 7

Year: 2020 Month: March Volume: 7

Year: 2020 Month: February Volume: 7

Year: 2020 Month: January Volume: 7

Year: 2020 Month: January Volume: 6

Year: 2019 Month: December Volume: 6

Year: 2019 Month: November Volume: 6

Year: 2019 Month: October Volume: 6

Year: 2019 Month: September Volume: 6

Year: 2019 Month: August Volume: 6

Year: 2019 Month: July Volume: 6

Year: 2019 Month: June Volume: 6

Year: 2019 Month: May Volume: 6

Year: 2019 Month: April Volume: 6

Year: 2019 Month: March Volume: 6

Year: 2019 Month: February Volume: 6

Year: 2019 Month: January Volume: 5

Year: 2018 Month: December Volume: 5

Year: 2018 Month: November Volume: 5

Year: 2018 Month: October Volume: 5

Year: 2018 Month: September Volume: 5

Year: 2018 Month: August Volume: 5

Year: 2018 Month: July Volume: 5

Year: 2018 Month: June Volume: 5

Year: 2018 Month: May Volume: 5

Year: 2018 Month: April Volume: 5

Year: 2018 Month: March Volume: 5

Year: 2018 Month: February Volume: 5

Year: 2018 Month: February Volume: 4

Year: 2018 Month: January Volume: 5

Year: 2018 Month: January Volume: 4

Year: 2017 Month: December Volume: 4

Year: 2017 Month: November Volume: 4

Year: 2017 Month: October Volume: 4

Year: 2017 Month: September Volume: 4

Year: 2017 Month: August Volume: 4

Year: 2017 Month: July Volume: 4

Year: 2017 Month: June Volume: 4

Year: 2017 Month: May Volume: 4

Year: 2017 Month: April Volume: 4

Year: 2017 Month: March Volume: 4

Year: 2017 Month: February Volume: 4

Year: 2017 Month: January Volume: 4

Year: 2017 Month: January Volume: 3

Year: 2016 Month: December Volume: 3

Year: 2016 Month: November Volume: 3

Year: 2016 Month: October Volume: 3

Year: 2016 Month: September Volume: 3

Year: 2016 Month: August Volume: 3

Year: 2016 Month: July Volume: 3

The Escherichia Coli Hfq Protein: An Unattended DNA-Transactions Regulator

PLS-Based and Regularization-Based Methods for the Selection of Relevant Variables in Non-targeted Metabolomics Data

The Verrucomicrobia LexA-Binding Motif: Insights into the Evolutionary Dynamics of the SOS Response

Conjugative DNA Transfer Is Enhanced by Plasmid R1 Partitioning Proteins

Age-Dependent Effects of Haptoglobin Deletion in Neurobehavioral and Anatomical Outcomes Following Traumatic Brain Injury

Feature Selection Methods for Early Predictive Biomarker Discovery Using Untargeted Metabolomic Data

Editorial: Function and Flexibility: Friend or Foe?

Year: 2016 Month: June Volume: 3

Year: 2016 Month: May Volume: 3

Year: 2016 Month: April Volume: 3

Year: 2016 Month: March Volume: 3

Year: 2016 Month: February Volume: 3

Year: 2016 Month: January Volume: 2

Year: 2015 Month: December Volume: 2

Year: 2015 Month: November Volume: 2

Year: 2015 Month: October Volume: 2

Year: 2015 Month: September Volume: 2

Year: 2015 Month: August Volume: 2

Year: 2015 Month: July Volume: 2

Year: 2015 Month: June Volume: 3

Year: 2015 Month: June Volume: 2

Year: 2015 Month: May Volume: 2

Year: 2015 Month: April Volume: 2

Year: 2015 Month: March Volume: 2

Year: 2015 Month: February Volume: 2

Year: 2015 Month: January Volume: 2

Year: 2014 Month: December Volume: 1

Year: 2014 Month: November Volume: 1

Frontiers in Molecular Medicine

Frontiers in Molecular Neuroscience

Frontiers in Nanotechnology

Frontiers in Natural Products

Frontiers in Nephrology

Frontiers in Network Physiology

Frontiers in Neural Circuits

Frontiers in Neuroanatomy

Frontiers in Neuroengineering

Frontiers in Neuroergonomics

Frontiers in Neuroimaging

Frontiers in Neuroinformatics

Frontiers in Neurology

Frontiers in Neurorobotics

Frontiers in Neuroscience

Frontiers in Nuclear Engineering

Frontiers in Nuclear Medicine

Frontiers in Nutrition

Frontiers in Ocean Sustainability

Frontiers in Oncology

Frontiers in Ophthalmology

Frontiers in Oral Health

Frontiers in Pain Research

Frontiers in Parasitology

Frontiers in Pediatrics

Frontiers in Pharmacology

Frontiers in Photonics

Frontiers in Physics

Frontiers in Physiology

Frontiers in Plant Science

Frontiers in Political Science

Frontiers in Psychiatry

Frontiers in Psychology

Frontiers in Public Health

Frontiers in Quantum Science and Technology

Frontiers in RNA Research

Frontiers in Radiology

Frontiers in Rehabilitation Sciences

Frontiers in Remote Sensing

Frontiers in Reproductive Health

Frontiers in Research Metrics and Analytics

Frontiers in Robotics and AI

Frontiers in Science

Frontiers in Sensors

Frontiers in Signal Processing

Frontiers in Sleep

Frontiers in Smart Grids

Frontiers in Sociology

Frontiers in Soft Matter

Frontiers in Soil Science

Frontiers in Space Technologies

Frontiers in Sports and Active Living

Frontiers in Stroke

Frontiers in Surgery

Frontiers in Sustainability

Frontiers in Sustainable Cities

Frontiers in Sustainable Energy Policy

Frontiers in Sustainable Food Systems

Frontiers in Sustainable Resource Management

Frontiers in Sustainable Tourism

Frontiers in Synaptic Neuroscience

Frontiers in Systems Biology

Frontiers in Systems Neuroscience

Frontiers in Thermal Engineering

Frontiers in Toxicology

Frontiers in Transplantation

Frontiers in Tropical Diseases

Frontiers in Urology

Frontiers in Veterinary Science

Frontiers in Virology

Frontiers in Virtual Reality

Frontiers in Water

Frontiers in the Internet of Things

Feature Selection Methods for Early Predictive Biomarker Discovery Using Untargeted Metabolomic Data

Content Provider	frontiers
Author	Grissa, Dhouha Pétéra, Mélanie Brandolini, Marion Napoli, Amedeo Comte, Blandine Pujos-Guillot, Estelle
Abstract	Untargeted metabolomics is a powerful phenotyping tool for better understanding biological mechanisms involved in human pathology development and identifying early predictive biomarkers. This approach, based on multiple analytical platforms, such as mass spectrometry, chemometrics and bioinformatics, generates massive and complex data that need appropriate analyses to extract the biologically meaningful information. Despite various tools available, it is still a challenge to handle such large and noisy datasets with limited number of individuals without risking overfitting. Moreover, when the objective is focused on the identification of early predictive makers of clinical outcome, few years before occurrence, it becomes essential to use the appropriate algorithms and workflow to be able to discover subtle effects among this large amount of data. In this context, this work consists in studying a workflow describing the general feature selection process, using knowledge discovery and data mining methodologies to propose advanced solutions for predictive biomarker discovery. The strategy was focused on evaluating a combination of numeric-symbolic approaches for feature selection with the objective of obtaining the best combination of metabolites producing an effective and accurate predictive model. Relying first on numerical approaches, and especially on machine learning methods (SVM-RFE, RF, RF-RFE) and on univariate statistical analyses (ANOVA), a comparative study was performed on the original metabolomic dataset and reduced subsets. As resampling method, LOOCV was applied to minimize the risk of overfitting. The best k-features obtained with different scores of importance from the combination of these different approaches were compared and allowed determining the variable stabilities using Formal Concept Analysis. The results revealed the interest of RF-Gini combined with ANOVA for feature selection as these two complementary methods allowed selecting the 48 best candidates for prediction. Using linear logistic regression on this reduced dataset enabled us to obtain the best performances in terms of prediction accuracy and number of false positive with a model including 5 top variables. Therefore, these results highlighted the interest of feature selection methods and the importance of working on reduced datasets for the identification of predictive biomarkers issued from untargeted metabolomics data.
ISSN	2296889X
DOI	10.3389/fmolb.2016.00030
Volume Number	3
Journal	Frontiers in Molecular Biosciences
Language	English
Publisher Date	2016-07-08
Access Restriction	Open
Subject Keyword	Feature Selection Formal Concept Analysis Machine learning Metabolomics Prediction Visualization Biomarker Discovery Univariate statistics
Content Type	Text
Resource Type	Article
Subject	Biochemistry Molecular Biology

Central Library (ISO-9001:2015 Certified)
Indian Institute of Technology Kharagpur
Kharagpur, West Bengal, India | PIN - 721302

See location in the Map
03222 282435
Mail: support@ndl.gov.in

Sl.	Authority	Responsibilities	Communication Details
1	Ministry of Education (GoI), Department of Higher Education	Sanctioning Authority	https://www.education.gov.in/ict-initiatives
2	Indian Institute of Technology Kharagpur	Host Institute of the Project: The host institute of the project is responsible for providing infrastructure support and hosting the project	https://www.iitkgp.ac.in
3	National Digital Library of India Office, Indian Institute of Technology Kharagpur	The administrative and infrastructural headquarters of the project	Dr. B. Sutradhar bsutra@ndl.gov.in
4	Project PI / Joint PI	Principal Investigator and Joint Principal Investigators of the project	Dr. B. Sutradhar bsutra@ndl.gov.in Prof. Saswat Chakrabarti will be added soon
5	Website/Portal (Helpdesk)	Queries regarding NDLI and its services	support@ndl.gov.in
6	Contents and Copyright Issues	Queries related to content curation and copyright issues	content@ndl.gov.in
7	National Digital Library of India Club (NDLI Club)	Queries related to NDLI Club formation, support, user awareness program, seminar/symposium, collaboration, social media, promotion, and outreach	clubsupport@ndl.gov.in
8	Digital Preservation Centre (DPC)	Assistance with digitizing and archiving copyright-free printed books	dpc@ndl.gov.in
9	IDR Setup or Support	Queries related to establishment and support of Institutional Digital Repository (IDR) and IDR workshops	idr@ndl.gov.in