NDLI: Collecting Conceptualized Relations from Terabytes of Web Texts for Understanding Unknown Terms

WI 2014 Title Page i

WI 2014 Title Page iii

WI 2014 Copyright Page

WI 2014 Preface - I

WI 2014 Non-Program Committee Reviewers - I

Quantum Cognition

Structural Vulnerability Analysis of Overlapping Communities in Complex Networks

Cloud Application Development Methodology

Multi-agent Information Diffusion Model for Twitter

Adaptive Landmark Selection Strategies for Fast Shortest Path Computation in Large Real-World Graphs

Enumerating Communities for a Deeper Understanding of Community Finding

Big Data - Characterizing an Emerging Research Field Using Topic Models

Efficient Spatio-textual Similarity Join Using MapReduce

Aspect-Based Similar Entity Search in Semantic Knowledge Graphs with Diversity-Awareness and Relaxation

Learning Concise Pattern for Interlinking with Extended Version Space

SORM: A Social Opinion Relevance Model

Collecting Conceptualized Relations from Terabytes of Web Texts for Understanding Unknown Terms

Person Identification between Different Online Social Networks

Obtaining Technology Insights from Large and Heterogeneous Document Collections

A New Sentence Similarity Method Based on a Three-Layer Sentence Representation

Multi-feature and DAG-Based Multi-tree Matching Algorithm for Automatic Web Data Mining

A Comparison of Mobile Rule Engines for Reasoning on Semantic Web Based Health Data

Locality-Sensitive Hashing for Massive String-Based Ontology Matching

Semantic Heuristic Search in Collaborative Networks: Measures and Contexts

An Approach for Learning and Construction of Expressive Ontology from Text in Natural Language

Mining Twitter Data with Resource Constraints

Conjunctive Query Programming: A Paradigm for Knowledge Engineering of Optimization Problems in the Semantic Web

Finding the Needle in the Haystack: Identifying Business Communities in Internet Traffic

NCDREC: A Decomposability Inspired Framework for Top-N Recommendation

Optimizing Personalized Ranking in Recommender Systems with Metadata Awareness

Improving Personalized Ranking in Recommender Systems with Multimodal Interactions

A Link Prediction Approach for Item Recommendation with Complex Number

SemanticSVD++: Incorporating Semantic Taste Evolution for Predicting Ratings

Propagating Users' Similarity towards Improving Recommender Systems

Online Recommender System for Radio Station Hosting: Experimental Results Revisited

Fuzziness and Ontology in Personalization of Selection Processes in the Semantic Web

Sentence-Based Plot Classification for Online Review Comments

Summarizing Search Results with Community-Based Question Answering

Lazy Walks Versus Walks with Backstep: Flavor of PageRank

A Machine Learning Approach to SPARQL Query Performance Prediction

Improving Biodiversity Data Retrieval through Semantic Search and Ontologies

Online Retweet Recommendation with Item Count Limits

Extracting Attributes and Synonymous Attributes from Online Encyclopedias

Credibility Microscope: Relating Web Page Credibility Evaluations to Their Textual Content

Harnessing Social Signals to Enhance a Search

Relevant Sources of Information Are Not Necessarily Popular Ones

Hybrid Algorithm for Precise Recommendation from Almost Infinite Set of Websites

Combining Query Terms Extension and Weight Correlative for Expert Finding

Recommender System for Crowdsourcing Platform Witology

Feature Selection and Term Weighting

Improving Collaborative Filtering Based Recommenders Using Topic Modelling

Dynamic Learning of Keyword-Based Preferences for News Recommendation

Link Prediction Based on Multi-steps Resource Allocation

Fuzzy Subjective Sentiment Phrases: A Context Sensitive and Self-Maintaining Sentiment Lexicon

On the Effectiveness of Emergent Task Allocation of Virtual Programmer Teams

Identification of Opinion Leaders Based on User Clustering and Sentiment Analysis

Indirect Keyword Recommendation

Tweet Sentiment Analytics with Context Sensitive Tone-Word Lexicon

An Ontology for Guiding Performance Testing

Ontology as a Base for State Space of Expert Systems

Learning of Legal Ontology Supporting the User Queries Satisfaction

Multi-agent Based System for Multilingual Ontologies Maintenance

An Approach to Join Ontologies and Their Reuse in the Construction of Application Ontologies

Interpreting Discovered Patterns in Terms of Ontology Concepts

Detecting and Correlating Video-Based Event Patterns: An Ontology Driven Approach

Mapping Word Senses of Middle Ancient Chinese to WordNet

Application of TextRank Algorithm for Credibility Assessment

Integrating Pinyin to Improve Spelling Errors Detection for Chinese Language

Exploratory Study of Relationships among Statement Credibility, Context, and Semantic Similarity

WI 2014 Author Index

WI 2014 Publisher's Information

Collecting Conceptualized Relations from Terabytes of Web Texts for Understanding Unknown Terms

Content Provider	ACM Digital Library
Author	Nakayama, Kotaro; Aramaki, Eiji; Hara, Takahiro; Nishio, Shojiro; Shirakawa, Masumi
Abstract	This paper describes our attempt to extract various relations between superordinate concepts from a terabyte-scale Web corpus for human-like speculation of the meaning of unknown terms. In order to discover various conceptualized relations, we focus on Web-scale text corpora and introduce a simple string-matching method to process them. To derive relations between concepts, our method first extracts relations between terms and then replaces each term with appropriate concepts using Wikipedia, WordNet, and YAGO knowledge. We extracted over 10 million relations between concepts in a day from more than 10 TB of Web texts using 100 machines. Experimental results revealed that the relations extracted by our method contained many more meaningless relations than those extracted by NLP-based methods. Nevertheless, they were useful in an application that speculates the meaning of unknown terms, improving the precision by more than 0.06 points and decreasing the recall by only 0.04 points (the improvement of the F1-measure was 0.03 points). We found from the results that the coverage of conceptualized relations is important for improving the precision in the application. This is because a lack of knowledge (conceptualized relations) leads to misunderstanding the meaning of unknown terms, just as we humans misunderstand things when our knowledge is insufficient.
Starting Page	86
Ending Page	93
Page Count	8
File Format	PDF
ISBN	9781479941438
DOI	10.1109/WI-IAT.2014.20
Language	English
Publisher	Association for Computing Machinery (ACM)
Publisher Date	2014-08-11
Access Restriction	Subscribed
Content Type	Text
Resource Type	Article
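
The two-step method summarized in the abstract (extract relations between terms by string matching, then replace each term with a superordinate concept) can be sketched roughly as below. This is a minimal illustration under stated assumptions, not the authors' implementation: the "term, connecting words, term" pattern, the tiny `TERM_TO_CONCEPTS` dictionary standing in for Wikipedia, WordNet, and YAGO knowledge, the two-sentence corpus, and all function names are hypothetical.

```python
from collections import Counter
from itertools import product

# Hypothetical term -> superordinate-concept lookup; a stand-in for
# knowledge drawn from Wikipedia, WordNet, and YAGO.
TERM_TO_CONCEPTS = {
    "aspirin": ["drug"],
    "ibuprofen": ["drug"],
    "headache": ["symptom"],
    "fever": ["symptom"],
}

def extract_term_relations(sentence, terms):
    """Yield (term, connecting words, term) triples found by plain string matching."""
    words = sentence.lower().rstrip(".").split()
    positions = [i for i, w in enumerate(words) if w in terms]
    # Pair each known term with the next known term in the sentence.
    for left, right in zip(positions, positions[1:]):
        middle = " ".join(words[left + 1:right])
        if middle:  # skip adjacent terms with no connecting words
            yield words[left], middle, words[right]

def conceptualize(relations, term_to_concepts):
    """Replace terms with their concepts and count each conceptualized relation."""
    counts = Counter()
    for subj, rel, obj in relations:
        for cs, co in product(term_to_concepts[subj], term_to_concepts[obj]):
            counts[(cs, rel, co)] += 1
    return counts

# Toy corpus (invented for this example).
corpus = [
    "Aspirin relieves headache.",
    "Ibuprofen relieves fever.",
]
term_relations = [
    r for s in corpus for r in extract_term_relations(s, TERM_TO_CONCEPTS)
]
concept_relations = conceptualize(term_relations, TERM_TO_CONCEPTS)
print(concept_relations)
```

In this toy run, two different term-level relations collapse into one concept-level relation, which is the kind of generalization that could let a system guess that an unknown term appearing before "relieves headache" is likely a drug. No parsing or other NLP step is involved, which is what makes this style of extraction cheap enough for very large corpora, at the cost of admitting noisy relations.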
