NDLI: Two-stream indexing for spoken web search

Please wait, while we are loading the content...

Toward optimal vaccination strategies for probabilistic models

OntoTrix: a hybrid visualization for populated ontologies

Social media analytics: tracking, modeling and predicting the flow of information through networks

Eighth workshop on information integration on the web (IIWeb 2011)

The computer is the new sewing machine: benefits and perils of crowdsourcing

Addressing the RDFa publishing bottleneck

The Lwazi community communication service: design and piloting of a voice-based information service

Timestamp-based cache invalidation for search engines

Factal: integrating deep web based on trust and relevance

Distributed web retrieval

4th linked data on the web workshop (LDOW2011)

Social media: source of information or bunch of noise

Towards liquid service oriented architectures

Analyzing and accelerating web access in a school in peri-urban India

Towards automatic quality assurance in Wikipedia

Automatically building probabilistic databases from the web

WWW 2011 invited tutorial overview: latent variable models on the internet

USEWOD2011: 1st international workshop on usage analysis and the web of data

Connecting the next billion web users

Analysis and tracking of emotions in english and bengali texts: a computational approach

Design and implementation of contextual information portals

Measuring the effectiveness of display advertising: a time series approach

Exploratory search in multi-domain information spaces with liquid query

Social recommender systems

The 1st temporal web analytics workshop (TWAW)

Computational advertising: leveraging user interaction & contextual factors for improved ad retrieval & ranking

Location specific summarization of climatic and agricultural trends

A middleware for securing mobile mashups

LivePulse: tapping social media for sentiments in real-time

Ranking on large-scale graphs with rich metadata

First international workshop on social media engagement (SoME 2011)

Standing on the shoulders of ants: stigmergy in the web

Low-infrastructure methods to improve internet access for mobile users in emerging regions

Language independent identification of parallel sentences using Wikipedia

Filtering microblogging messages for social tv

Managing crowdsourced human computation: a tutorial

Second international workshop on RESTful design (WS-REST 2011)

Ranked answer graph construction for keyword queries on RDF graphs without distance neighbourhood restriction

Identifying enrichment candidates in textbooks

From actors, politicians, to CEOs: domain adaptation of relational extractors using a latent relational mapping

T-RecS: team recommendation system through expertise and cohesiveness

Citizen sensor data mining, social media analytics and development centric web applications

Joint WICOW/AIRWeb workshop on web quality (WebQuality 2011)

A politeness recognition tool for Hindi: with special emphasis on online texts

Traffic characterization and internet usage in rural Africa

Recommendations for the long tail by term-query graph

Accelerating instant question search with database techniques

Game theoretic models for social network analysis

SemSearch'11: the 4th semantic search workshop

Measurement and analysis of cyberlocker services

Two-stream indexing for spoken web search

Efficient diversification of search results using query logs

CoSi: context-sensitive keyword query interpretation on RDF databases

Speech and multimodal interaction in mobile search

Second international workshop on web science and information exchange in the medical web (MedEx 2011)

Fuzzy associative rule-based approach for pattern mining and identification and pattern-based classification

Assistive technology for vision-impairments: anagenda for the ICTD community

EntityTagger: automatically tagging entities with descriptive phrases

Blognoon: exploring a topic in the blogosphere

Scalable integration and processing of linked data

DiversiWeb 2011: first international workshop on knowledge diversity on the web

Performance enhancement of scheduling algorithms in clusters and grids using improved dynamic load balancing techniques

Web-scale entity-relation search architecture

Visual query system for analyzing social semantic web

Web-based open-domain information extraction

Workshop on online reputation: context, privacy, and reputation management

Wikipedia vandalism detection

Survivability-oriented self-tuning of web systems

Embedding MindMap as a service for user-driven composition of web applications

The web of things

PlayIT 2011: first international workshop on games for knowledge acquisition

Dynamic learning-based mechanism design for dependent valued exchange economies

Learning facial attributes by crowdsourcing in social media

Helix: online enterprise data analytics

Sentence-level contextual opinion retrieval

Generating summaries for ontology search

YAGO2: exploring and querying world knowledge in time, space, context, and many languages

The OXPath to success in the deep web

Enhancing web search with entity intent

A demo search engine for products

Cooperative anti-spam system based on multilayer agents

Query completion without query logs for song search

DIDO: a disease-determinants ontology from web sources

Summarization of archived and shared personal photo collections

OntoWiki mobile: knowledge management in your pocket

A tool for fast indexing and querying of graphs

Application of semantic web technologies for multimedia interpretation

HyLiEn: a hybrid approach to general list extraction on the web

A user-tunable approach to marketplace search

WonderWhat: real-time event determination from photos

Truthy: mapping the spread of astroturf in microblog streams

Identifying overlapping communities in folksonomies or tripartite hypergraphs

VoiSTV: voice-enabled social TV

Spammers' networks within online social networks: a case-study on Twitter

Adapting a map query interface for a gesturing touch screen interface

Networked hierarchies for web directories

OXPath: little language, little memory, great value

A study on the impact of product images on user clicks for online shopping

CONQUER: a system for efficient context-aware query suggestions

CELF++: optimizing the greedy algorithm for influence maximization in social networks

CATE: context-aware timeline for entity illustration

Rolling boles, optimal XML structure integrity for updating operations

Einstein: physicist or vegetarian? summarizing semantic type graphs for knowledge discovery

SmartInt: using mined attribute dependencies to integrate fragmented web databases

Trust analysis with clustering

Automatic sanitization of social network data to prevent inference attacks

Predicting popular messages in Twitter

Automatically generating labels based on unified click model

Allocating inverted index into flash memory for search engines

Domain-independent entity extraction from web search query logs

Ranking in context-aware recommender systems

Ranking related entities for web search queries

GeoVisualRank: a ranking method of geotagged imagesconsidering visual similarity and geo-location proximity

Anytime algorithm for QoS web service composition

Smart news feeds for social networks using scalable joint latent factor models

Finding influential mediators in social networks

Hypergraph-based inductive learning for generating implicit key phrases

Open and decentralized access across location-based services

Personalized search on Flickr based on searcher's preference prediction

A classification based framework for concept summarization

A feature-pair-based associative classification approach to look-alike modeling for conversion-oriented user-targeting in tail campaigns

Casting a web of trust over Wikipedia: an interaction-based approach

Mobile topigraphy: large-scale tag cloud visualization for mobiles

Unsupervised query segmentation using only query logs

Detecting group review spam

A self organizing document map algorithm for large scale hyperlinked data inspired by neuronal migration

Collaborative classification over P2P networks

Generalized fact-finding

Investigating topic models for social media user recommendation

On using the real-time web for news recommendation & discovery

Extracting events and event descriptions from Twitter

Understanding the functions of business accounts on Twitter

A framework for evaluating network measures for functional importance

Comparative study of clustering techniques for short text documents

Influence and passivity in social media

Web information extraction using Markov logic networks

Towards identifying arguments in Wikipedia pages

How to choose combinations in a join of search results

REACTOR: a framework for semantic relation extraction and tagging over enterprise data

Harnessing the wisdom of crowds: video event detection based on synchronous comments

ReadAlong: reading articles and comments together

Effective summarization of large collections of personal photos

Learning to tokenize web domains

Coverage patterns for efficient banner advertisement placement

Using complex network features for fast clustering in the web

Identifying primary content from web pages and its application to web search ranking

A non-syntactic approach for text sentiment classification with stopwords

Evaluation of valuable user generated content on social news web sites

Is pay-per-click efficient?: an empirical analysis of click values

Scalable spatio-temporal knowledge harvesting

Growing parallel paths for entity-page discovery

Finding our way on the web: exploring the role of waypoints in search interaction

An adaptive ontology-based approach to identify correlation between publications

Mining collective local knowledge from Google MyMaps

A kernel approach to addressing term mismatch

A probabilistic model for opinionated blog feed retrieval

A finegrained digestion of news webpages through Event Snippet Extraction

Caching intermediate result of SPARQL queries

Autopedia: automatic domain-independent Wikipedia article generation

Location relevance classification for travelogue digests

Mobile search pattern evolution: the trend and the impact of voice queries

Exploiting session-like behaviors in tag prediction

On computing text-based similarity in scientific literature

Hierarchical organization of unstructured consumer reviews

The freshman handbook: a hint for the server placement of social networks

Leveraging auxiliary text terms for automatic image annotation

Two-stream indexing for spoken web search

Content Provider	ACM Digital Library
Author	Mukherjea, Sougata Sahay, Shrey Ajmera, Jitendra Rajput, Nitendra Srivastava, Kundan Joshi, Anupam Shrivastava, Mayank
Abstract	This paper presents two-stream processing of audio to index the audio content for Spoken Web search. The first stream indexes the meta-data associated with a particular audio document. The meta-data is usually very sparse, but accurate. This therefore results in a high-precision, low-recall index. The second stream uses a novel language-independent speech recognition to generate text to be indexed. Owing to the multiple languages and the noise in user generated content on the Spoken Web, the speech recognition accuracy of such systems is not high, thus they result in a low-precision, high-recall index. The paper attempts to use these two complementary streams to generate a combined index to increase the precision-recall performance in audio content search. The problem of audio content search is motivated by the real world implication of the Web in developing regions, where due to literacy and affordability issues, people use Spoken Web which consists of interconnected VoiceSites, which have content in audio. The experiments are based on more than 20,000 audio documents spanning over seven live VoiceSites and four different languages. The results suggest significant improvement over a meta-data-only or a speech-recognitiononly system, thus justifying the two-stream processing approach. Audio content search is a growing problem area and this paper wishes to be a first step to solving this at a large scale, across languages, in a Web context.
Starting Page	503
Ending Page	512
Page Count	10
File Format	PDF
ISBN	9781450306379
DOI	10.1145/1963192.1963364
Language	English
Publisher	Association for Computing Machinery (ACM)
Publisher Date	2011-03-28
Publisher Place	New York
Access Restriction	Subscribed
Subject Keyword	Mobile phone Spoken web Audio search World wide telecom web Developing regions Literacy
Content Type	Text
Resource Type	Article

Central Library (ISO-9001:2015 Certified)
Indian Institute of Technology Kharagpur
Kharagpur, West Bengal, India | PIN - 721302

See location in the Map
03222 282435
Mail: support@ndl.gov.in

Sl.	Authority	Responsibilities	Communication Details
1	Ministry of Education (GoI), Department of Higher Education	Sanctioning Authority	https://www.education.gov.in/ict-initiatives
2	Indian Institute of Technology Kharagpur	Host Institute of the Project: The host institute of the project is responsible for providing infrastructure support and hosting the project	https://www.iitkgp.ac.in
3	National Digital Library of India Office, Indian Institute of Technology Kharagpur	The administrative and infrastructural headquarters of the project	Dr. B. Sutradhar bsutra@ndl.gov.in
4	Project PI / Joint PI	Principal Investigator and Joint Principal Investigators of the project	Dr. B. Sutradhar bsutra@ndl.gov.in Prof. Saswat Chakrabarti will be added soon
5	Website/Portal (Helpdesk)	Queries regarding NDLI and its services	support@ndl.gov.in
6	Contents and Copyright Issues	Queries related to content curation and copyright issues	content@ndl.gov.in
7	National Digital Library of India Club (NDLI Club)	Queries related to NDLI Club formation, support, user awareness program, seminar/symposium, collaboration, social media, promotion, and outreach	clubsupport@ndl.gov.in
8	Digital Preservation Centre (DPC)	Assistance with digitizing and archiving copyright-free printed books	dpc@ndl.gov.in
9	IDR Setup or Support	Queries related to establishment and support of Institutional Digital Repository (IDR) and IDR workshops	idr@ndl.gov.in