NDLI: Reducing information redundancy in search results

Spatial interpolation: an analytical comparison between kriging and RBF networks

A multiple feature vector framework for forest species recognition

An intelligent building that listens to your needs

Many-to-many interchangeable sets of values in CSPs

Out-of-bag discriminative graph mining

Evolutionary optimization of wetlands design

Leader-follower formation control of multiple nonholonomic robots based on backstepping

Heterogeneous data fusion via matrix factorization for augmenting item, group and friend recommendations

Computing semantic relatedness using word frequency and layout information of Wikipedia

Modeling I/O interference for data intensive distributed applications

Improving context interpretation by using fuzzy policies: the case of adaptive video streaming

Google Play is not a long tail market: an empirical analysis of app adoption on the Google Play app market

A semi-supervised graph-based algorithm for detecting outliers in online-social-networks

Privacy-friendly tasking and trading of energy in smart grids

The impact of user-browser interaction on web performance

Supporting distributed software development through context awareness on software artifacts: the DiSEN-CollaborAR approach

Continuous query processing with concurrency control: reading updatable resources consistently

Driver input selection for main-memory multi-way joins

XML search personalization strategies using query expansion, reranking and a search engine modification

Hierarchical visual filtering, pragmatic and epistemic actions for database visualization

WAVE-CIA: a novel CIA approach based on call graph mining

Energy consumption estimation of virtual machines

Accelerated robustness testing of state-based components using reverse execution

Advanced modularity for building SPL feature models: a model-driven approach

A conceptual approach to gene expression analysis enhanced by visual analytics

Internet of things: a process calculus approach

A framework for the intelligent delivery and user-adequate visualization of process information

LINK-GC: a preemptive approach for garbage collection in NAND flash storages

Implementing Java-like languages in Xtext with Xsemantics

Integrating memory management with a file system on a non-volatile main memory system

New exception interfaces for Java-like languages

Practical use of static composition of refactoring operations

Smart cities software architectures: a survey

Exploiting visual appearance to cluster and detect rogue software

Performance analysis of a rule-based SOA component for real-time applications

Credible recommendation exchange mechanism for P2P reputation systems

A framework for semantic annotation of digital evidence

Faster construction of ball-partitioning-based metric access methods

Towards skeleton biometric identification using the Microsoft Kinect sensor

A collective robotic architecture in search and rescue scenarios

Risk-neutral bounded max-sum for distributed constraint optimization

A supervised machine learning classification algorithm for research articles

Disguised malware script detection system using hybrid genetic algorithm

A Kalman filter based approach to probabilistic gas distribution mapping

Inferring user utility for query revision recommendation

Improved text annotation with Wikipedia entities

Building an on-demand virtual computing market in non-commercial communities

Hyphen: a hybrid protocol for generic overlay construction in P2P environments

iLauncher: an intelligent launcher for mobile apps based on individual usage patterns

Service-centric networking extensions

A combined structural and dynamic modelling approach for dependability analysis in smart grid

Exploiting emoticons in sentiment analysis

A publication-subscription interaction schema for desktop grid computing

Novelty detection algorithm for data streams multi-class problems

Adaptive memory-aware chunk sizing techniques for data-intensive queries over web services

Discovering unexpected information on the basis of popularity/unpopularity analysis of coordinate objects and their relationships

Assessment of a user centered interface for teleoperation and 3D environments

Representing dynamic pluggable software units

Energy-driven consolidation in digital home

Using cross-entropy for satisfiability

Aspect interaction chart - a UML approach for modularizing aspect interaction conflicts

Cross-lattice behavior of general ACO folding for proteins in the HP model

Reliable supervisory coordination of stochastic communicating processes with data

Towards data-aware constraints in Declare

Virtualization for safety-critical, deeply-embedded devices

The Ruby type checker

Analyzing resource interdependencies in multi-core architectures to improve scheduling decisions

Formal semantics and expressiveness of a web service composition language

Fine-grained annotations for pointcuts with a finer granularity

Derivation of domain-specific architectural knowledge views from governance and security compliance metadata

Lightweight energy consumption based intrusion detection system for wireless sensor networks

On the reconfiguration of software connectors

Composite trust-based public key management in mobile ad hoc networks

Building a scalable spatial OLAP system

Indoor localization using SLAM in parallel with a natural marker detector

Performance based task assignment in multi-robot patrolling

Dynamic virtual arc consistency

Discovering influential nodes from trust network

Automatic generation of evolutionary operators: a study with mutation strategies for the differential evolution

An investigation into the development of service-oriented robotic systems

Recommending insurance riders

Semantic news recommendation using WordNet and Bing similarities

A progress and profile-driven cloud-VM for resource-efficiency and fairness in e-science environments

Stheno, a real-time fault-tolerant P2P middleware platform for light-train systems

iPrevention: towards a novel real-time smartphone-based fall prevention system

Bounded gossip: a gossip protocol for large-scale datacenters

On the security of distributed power system state estimation under targeted attacks

Extending the web to support personal network services

Distributed dynamic data driven prediction based on reinforcement learning approach

Efficient data stream classification via probabilistic adaptive windows

Efficient XML duplicate detection using an adaptive two-level optimization

Reducing information redundancy in search results

Video shot representation based on histograms

A novel watermarking method for Java programs

Energy efficiency management in computational grids through energy-aware scheduling

Static analysis of list-manipulating programs via bit-vectors and numerical abstractions

A catalogue of functional software requirement patterns for the domain of content management systems

BenchDW: a generic framework for biological data warehouse benchmarking

A peer to peer agent coordination framework for IHE based cross-community health record exchange

Assessing the best-order for business process model refactoring

Energy-aware real-time task synchronization in multi-core embedded systems

Run-time checking of data- and protocol-oriented properties of Java programs: an industrial case study

Analysis of client/server interactions in a reservation-based system

Reliable scalable symbolic computation: the design of SymGridPar2

Exploiting points-to maps for de-/serialization code generation

Modeling dynamic adaptations using augmented feature models

EARs in the wild: large-scale analysis of execution after redirect vulnerabilities

Apprehensive QoS monitoring of Service choreographies

Estimating domain-based user influence in social networks

Convexity local contour sequences for gesture recognition

Towards solving an obstacle problem by the cooperation of UAVs and UGVs

Solving equations on words through boolean satisfiability

Incremental linear model trees on massive datasets: keep it simple, keep it fast

Using polynomial reductions to test the suitability of metaheuristics for solving NP-complete problems

A feasibility analysis on using bathymetry for navigation of autonomous underwater vehicles

Constructing and comparing user mobility profiles for location-based services

Environmental service discovery based on semantically annotated OGC service descriptions

Input data organization for batch processing in time window based computations

Understanding the quality of experience in modern distributed interactive multimedia applications in presence of failures: metrics and analysis

Predictive indoor navigation using commercial smart-phones

DoS-resilient virtual networks through multipath embedding and opportunistic recovery

Impact assessment of smart meter grouping on the accuracy of forecasting algorithms

Model words-driven approaches for duplicate detection on the web

HawkEye: a tool for collaborative business process modelling and verification

STONE: a stream-based DDoS defense framework

Abstract program slicing of database query languages

Predicting query reformulation type from user behavior

Interactive coffee table for exploration of personal photos and videos

An empirical study on developer interactions in StackOverflow

Meta-learning based architectural and algorithmic optimization for achieving green-ness in predictive workload analytics

The search for the laws of automatic random testing

A requirements catalog for mobile learning environments

A data warehouse as an infrastructure to mine molecular descriptors for virtual screening

Specifying and analysing reputation systems with a coordination language

Start time and duration distribution estimation in semi-structured processes

Sensor streams middleware for easy configuration and processing in hybrid sensor network

Meso: an object-oriented programming language for building strongly-typed internet-based network applications

Design analysis for real-time video transcoding on cloud systems

End-to-end latency computation in a multi-periodic design

An infrastructure for the life cycle management of multi product lines

Secure roaming and infrastructure sharing for multi-operator WMNs

An integrated framework for QoS-based adaptation and exception resolution in WS-BPEL scenarios

A framework for evaluating trust of service providers in cloud marketplaces

An evolutionary spline fitting algorithm for identifying filamentous cyanobacteria

Towards a domain specific modeling language for agent-based models in land use science

Model selection based product kernel learning for regression on graphs

A hybrid compact genetic algorithm applied to the multi-level capacitated lot sizing problem

Towards a software tool for ultrasound guided robotic hip resurfacing surgery

Mining frequent itemsets over tuple-evolving data streams

A software measurement task ontology

GCplace: geo-cloud based correlation aware data replica placement

Maximizing availability of content in disruptive environments by cross-layer optimization

Cross-platform model-driven development of mobile applications with $md^{2}$

A novel demand-aware fairness metric for IEEE 802.11 wireless networks

Demand response computation for future smart grids incorporating wind power

Detecting tip spam in location-based social networks

Random rules from data streams

CodeBlast: a two-stage algorithm for improved program similarity matching in large software repositories

Determining language variant in microblog messages

The CAS project: a general infrastructure for pervasive capture and access systems

A study of COTS integration projects: product characteristics, organization, and life cycle models

Comparing mobile applications' energy consumption

Test case generation from natural language requirements based on SCR specifications

Test intents: enhancing the semantics of requirements traceability links in test cases

Combining self-organisation, context-awareness and semantic reasoning: the case of resource discovery in opportunistic networks

IT evaluation in business groups: a maturity model

Improving the performance of message parsers for embedded systems

Concurrent typed intermediate language

Onion and pizza: new disk partitioning schemes for virtualization systems

@Java: annotations in freedom

A generic framework for deriving architecture modeling methods for large-scale software-intensive systems

Mobile-sandbox: having a deeper look into Android applications

Efficient data-intensive event-driven interaction in SOA

Gesture unit segmentation using support vector machines: segmenting gestures from rest positions

An algorithm for discovering clusters of different densities or shapes in noisy data sets

Optimization metaheuristics for minimizing variance in a real-world statistical application

Real time autonomous navigation and obstacle avoidance using a semi-global stereo method

Learning hybrid recommender models for heterogeneous semantic data

Enhancing scientific information systems with semantic annotations

High-resolution spatial interpolation on cloud platforms

MoSQL: an elastic storage engine for MySQL

LOCCAM - loosely coupled context acquisition middleware

A delivery method considering communication loads for sensor data stream with different collection cycles

Towards cosimulating network and electrical systems for performance evaluation in smart grid

Discovering local attractions from geo-tagged photos

An adaptive regression tree for non-stationary data streams

Using Maude rewriting system to modularize and extend SQL

gSVD++: supporting implicit feedback on recommender systems with metadata awareness

Adaptive video-aware FEC-based mechanism with unequal error protection scheme

Test-based SPL extraction: an exploratory study

A design method for modular energy-aware software

Mutation testing strategies using mutant classification

On the use of metamodeling for relating requirements and architectural design decisions

Constrained global types for dynamic checking of protocol conformance in multi-agent systems

Amending C-net discovery algorithms

An instruction-level fine-grained recovery approach for soft errors

A dynamically reconfigurable operating system for manycore systems

Online identification of frequently executed acyclic paths by leveraging data stream algorithms

The BRICS component model: a model-based development paradigm for complex robotics software systems

Malicious takeover of voting systems: arbitrary code execution on optical scan voting terminals

Disciplined structured communications with consistent runtime adaptation

A data reduction and organization approach for efficient image annotation

Comparing relational and non-relational algorithms for clustering propositional data

Horizontal partitioning of very-large data warehouses under dynamically-changing query workloads via incremental algorithms

Enhancing social matrix factorization with privacy

Ontology acquisition from web service descriptions

Matchmaking of IaaS cloud computing offers leveraging linked data

A multi-resource load balancing algorithm for cloud cache systems

Eliminating the XML overhead in embedded XML languages

A decentralized utility-based grid scheduling algorithm

Modeling fundamentals for smart grid enabled ecodistricts

Supporting entailment constraints in the context of collaborative web applications

Extracting differences between regular tree grammars

Effectiveness of state-of-the-art features for microblog search

CrowdVis: a framework for real time crowd visualization

A model to detect problems on scrum-based software development projects

Towards a definition of sustainability in and for software engineering

Common specification language for static and dynamic analysis of C programs

Dynamic decision tree for legacy use-case recovery

Probabilistic embedding: experiments with tuple-based probabilistic languages

Dynamic instance queuing in process-aware information systems

Throughput-constrained voltage and frequency scaling for real-time heterogeneous multiprocessors

Measuring similarity of windows applications using static and dynamic birthmarks

A preliminary assessment of Haskell's software transactional memory constructs

Applying software product line engineering in building web portals for supercomputing services

Verifying multicast-based security protocols using the inductive method

A flexible approach for considering interdependent security objectives in service composition

Speeding up graph clustering via modular decomposition based compression

Users segmentations for recommendation

Rank prediction for semantically annotated resources

Hospitality of cloud platforms

Adaptive monitoring of web-based applications: a performance study

Broadcast cancellation in search mechanisms

On the load balancing of virtual networks in distributed clouds

A scalable communication infrastructure for smart grid applications using multicast over public networks

Feature-based object identification for web automation

Reducing data transfer for charts on adaptive web sites

Lattice navigation for collaborative filtering by means of (fuzzy) formal concept analysis

A visual analytics tool for system logs adopting variable recommendation and feature-based filtering

An experiment specification language for goal-driven, automated performance evaluations

An interactive extension mechanism for reusing verified programs

Selecting among alternatives using dependencies: an NFR approach

Enterprise integration using event actor based event transformations

nuKernel: MicroKernel for multi-core DSP SoCs with load sharing and priority interrupts

An efficient similarity comparison based on core API calls

A model driven methodology for enabling autonomic reconfiguration of service oriented architecture

An empirical analysis of malicious internet banking software behavior

Monitoring SOA-based applications with business provenance

TNS: mining top-k non-redundant sequential rules

A mediator for statistical linked data

log2cloud: log-based prediction of cost-performance trade-offs for cloud deployments

Experience with a middleware infrastructure for service oriented financial applications

Sensor-field modeling based on in-network data prediction: an efficient strategy for answering complex queries in wireless sensor networks

A flow-based optimization model for throughput-oriented relay node placement in wireless sensor networks

Service farming: an ad-hoc and QoS-aware web service composition approach

Filtering XFD toward interoperability

Text clustering using one-mode projection of document-word bipartite graphs

Failure-detection capability analysis of implementing parallelism in adaptive random testing algorithms

Common criteria compliant software development (CC-CASD)

Generic support for RBAC break-glass policies in process-aware information systems

An FPGA-based multi-core approach for pipelining computing stages

Software plagiarism detection via the static API call frequency birthmark

An updated threat model for security ceremonies

User centric complex event processing based on service oriented architectures

Learning non-linear classifiers with a sparsity constraint using L1 regularization

Study on supporting technology for operational procedure design of IT systems in cloud-era datacenters

Identifying incompatible service implementations using pooled decision trees

Distributed and efficient algorithm for self-reconfiguration of MEMS microrobots

ABOI: a novel strategy to mitigate the blocking due to outdated information in OCS/OBS network

Designing a 3D widget library for WebGL enabled browsers

When entities meet query recommender systems: semantic search shortcuts

WSCCT: a tool for WS-BPEL compositions conformance testing

The role of NFRs when transforming i* requirements models into OO-method models

Data flow abstractions and adaptations through updatable process views

MLC-flash-friendly logging and recovery for databases

Operating system reliability from the quality of experience viewpoint: an exploratory study

Slicing droids: program slicing for smali code

A conceptual framework for collective adaptive systems

Empowering automatic data-center management with machine learning

Inter cloud capable dynamic resource management with model of behavior

Improving transaction abort rates without compromising throughput through judicious scheduling

Participatory sensing based traffic condition monitoring using horn detection

isBF: scalable in-packet bloom filter based multicast

Process-aware web programming with Jolie

Feature selections for authorship attribution

Automatic recognition of design motifs using semantic conditions

Configuration support for feature models with soft constraints

Data-aware process mining: discovering decisions in processes using alignments

A novel approach for interactive debugging of dynamic dataflow embedded applications

Computation offloading for real-time systems

Bring your own device, securely

Towards an approach for modeling and formalizing SOA design patterns with Event-B

Stream mining of frequent sets with limited memory

SCAling: SLA-driven cloud auto-scaling

Towards a ranking framework for software components

Towards a total recall: an activity tracking and recall mechanism for mobile devices

A backward-compatible protocol for inter-routing over heterogeneous overlay networks

Evaluating the utilization of Twitter messages as a source of security alerts

A quantitative approach for evaluating software maintenance services

Modeling the alignment between business and IS/IT: a requirements engineering perspective

On the exploitation of process mining for security audits: the process discovery case

Demand-based flash translation layer considering spatial locality

A tour recommendation service for electric vehicles based on a hybrid orienteering model

Run-time control flow authentication: an assessment on contemporary x86 platforms

Heterogeneous device interaction using an IPv6 enabled service-oriented architecture for building automation systems

Radio resource management in coordinated antenna system deployments

Towards a private vector space model for confidential documents

Multi-objective test case prioritization for GUI applications

$e^{3}$RoME: a value-based approach for method bundling

Kernel-level time composability for avionics applications

Enhancing security enforcement on unmodified Android

Supporting visual security cues for WebView-based Android apps

A systematic review on mining techniques for crosscutting concerns

Product-based business processes interoperability

Communication support at the OS level to enhance design space exploration in multiprocessed embedded systems

Protecting Android applications with steganography-based software watermarking

A hybrid bug triage algorithm for developer recommendation

Evaluating a process for developing a capability maturity model

Executing and debugging UML models: an fUML extension

i*Chameleon: a platform for developing multimodal application with comprehensive development cycle

Software effort prediction: a hyper-heuristic decision-tree based approach

Quantified extreme scenario based design approach

Evaluating the conventional wisdom in clone removal: a genealogy-based empirical study

OSDC: adapting ODC for developing more secure software

A model-based framework for flexible safety-critical software development: a design study

Notation-driven vs metamodel-driven development of domain-specific modeling languages: an empirical study

Estimating the size of data mart projects

Reducing information redundancy in search results

Content Provider	ACM Digital Library
Author	Stamou, Sofia; Plegas, Yannis
Abstract	It is well known that the web contains many duplicate and near-duplicate documents. Despite efforts to equip search engines with duplicate detection algorithms, the documents retrieved in response to web queries still sometimes contain redundant information. In this paper, we are concerned with effectively identifying and reducing redundant information in search results. In particular, we describe how we automatically detect content that is lexically and/or semantically duplicated across search results, and we introduce a novel algorithm that filters out redundant information upon detecting significant (i.e., above a given threshold) content duplication. Information filtering takes place in two steps, depending on whether we are dealing with documents of (nearly) identical lexical content or with documents of lexically distinct but semantically equivalent content. In the first case, our algorithm retains in the result list the document that is most relevant to the query intention and removes its duplicates. In the second case, our algorithm merges the documents carrying redundant information into a single text, which we call SuperText, in such a way that every document contributes diverse semantic content to the generated SuperText. Additionally, the algorithm re-ranks the remaining documents based on their contextual relevance to the query intention. The experimental evaluation of our approach demonstrates that it is very effective in identifying lexical and semantic information redundancy across search results. In addition, we have found that our algorithm successfully filters out content duplication from the results list, and that the SuperTexts it generates to reduce information redundancy are syntactically and semantically coherent.
Starting Page	886
Ending Page	893
Page Count	8
File Format	PDF
ISBN	9781450316569
DOI	10.1145/2480362.2480533
Language	English
Publisher	Association for Computing Machinery (ACM)
Publisher Date	2013-03-18
Publisher Place	New York
Access Restriction	Subscribed
Subject Keyword	Ranking; Semantics; Shingling; Removing redundant information; Web search
Content Type	Text
Resource Type	Article
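
The first filtering case in the abstract (near-identical lexical content, keep the most relevant document) can be illustrated with a minimal sketch. This is not the authors' algorithm: the record only lists "Shingling" as a keyword and mentions "a given threshold", so the use of word-level k-shingles, Jaccard similarity, the threshold value 0.8, the shingle size 3, and every function name below are assumptions made for illustration. The semantic case (merging documents into a SuperText) and the re-ranking step are not covered.

```python
from typing import List, Set, Tuple

Shingle = Tuple[str, ...]
Result = Tuple[str, float]  # (document text, relevance score); hypothetical shape


def shingles(text: str, k: int = 3) -> Set[Shingle]:
    """Return the set of k-word shingles (contiguous word windows) of a text."""
    words = text.lower().split()
    if len(words) < k:
        # Texts shorter than k words are treated as a single shingle.
        return {tuple(words)} if words else set()
    return {tuple(words[i:i + k]) for i in range(len(words) - k + 1)}


def jaccard(a: Set[Shingle], b: Set[Shingle]) -> float:
    """Jaccard similarity of two sets; 0.0 when both are empty."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def filter_lexical_duplicates(
    results: List[Result], threshold: float = 0.8, k: int = 3
) -> List[Result]:
    """Drop near-duplicates, keeping the most relevant document of each group.

    Assumed values: threshold=0.8 and k=3 are illustrative, not from the paper.
    """
    # Visit documents from most to least relevant, so the first member of
    # any near-duplicate group that we meet is the one that is retained.
    ranked = sorted(results, key=lambda r: r[1], reverse=True)
    kept: List[Result] = []
    kept_shingles: List[Set[Shingle]] = []
    for text, score in ranked:
        current = shingles(text, k)
        if any(jaccard(current, seen) >= threshold for seen in kept_shingles):
            continue  # lexically redundant with a more relevant document
        kept.append((text, score))
        kept_shingles.append(current)
    return kept


if __name__ == "__main__":
    demo = [
        ("the quick brown fox jumps over the lazy dog", 0.7),
        ("the quick brown fox jumps over the lazy dog", 0.9),
        ("an entirely different page about search engines", 0.5),
    ]
    for text, score in filter_lexical_duplicates(demo):
        print(score, text)
```

The greedy pass compares each document only against documents already kept, which keeps the sketch short but means the grouping depends on the relevance order; a production system would more likely use MinHash or a similar technique to avoid pairwise comparison.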
