NDLI: Estimating Embedding Vectors for Queries

Estimating Embedding Vectors for Queries

Content Provider	ACM Digital Library
Author	Zamani, Hamed; Croft, W. Bruce
Abstract	Dense vector representations of vocabulary terms, also known as word embeddings, have been shown to be highly effective in many natural language processing tasks. Word embeddings have recently begun to be studied in a number of information retrieval (IR) tasks. One of the main steps in leveraging word embeddings for IR tasks is to estimate the embedding vectors of queries. This is a challenging task, since queries are not always available during the training phase of word embedding vectors. Previous work has considered the average or sum of embedding vectors of all query terms (AWE) to model the query embedding vectors, but no theoretical justification has been presented for such a model. In this paper, we propose a theoretical framework for estimating query embedding vectors based on the individual embedding vectors of vocabulary terms. We then provide a number of different implementations of this framework and show that the AWE method is a special case of the proposed framework. We also introduce pseudo query vectors, which are query embedding vectors estimated using pseudo-relevant documents. We further evaluate the proposed methods extrinsically using two well-known IR tasks: query expansion and query classification. The estimated query embedding vectors are evaluated via query expansion experiments over three newswire and web TREC collections, as well as query classification experiments over the KDD Cup 2005 test set. The experiments show that the introduced pseudo query vectors significantly outperform the AWE method.
Starting Page	123
Ending Page	132
Page Count	10
File Format	PDF
ISBN	9781450344975
DOI	10.1145/2970398.2970403
Language	English
Publisher	Association for Computing Machinery (ACM)
Publisher Date	2016-09-12
Publisher Place	New York
Access Restriction	Subscribed
Subject Keyword	Query classification; Pseudo query vector; Query expansion; Query embedding vector; Word embedding
Content Type	Text
Resource Type	Article
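
The AWE baseline named in the abstract (averaging the embedding vectors of all query terms) can be illustrated with a minimal sketch. This is not the paper's proposed framework or its pseudo query vectors, only the simple baseline the abstract describes. The function name `average_word_embedding`, the decision to skip out-of-vocabulary terms, and the toy 3-dimensional vectors are all assumptions made for illustration.

```python
from typing import Dict, List, Sequence


def average_word_embedding(
    query_terms: Sequence[str],
    embeddings: Dict[str, List[float]],
) -> List[float]:
    """Average the embedding vectors of the query terms found in `embeddings`.

    Terms without an embedding are skipped (an assumption of this sketch).
    """
    vectors = [embeddings[t] for t in query_terms if t in embeddings]
    if not vectors:
        raise ValueError("no query term has an embedding")
    dim = len(vectors[0])
    # Component-wise mean over the vectors of all in-vocabulary query terms.
    return [sum(v[i] for v in vectors) / len(vectors) for i in range(dim)]


# Toy 3-dimensional embeddings, invented for illustration only.
toy_embeddings = {
    "query": [1.0, 0.0, 2.0],
    "expansion": [3.0, 2.0, 0.0],
}

# "unseen" has no embedding and is ignored by this sketch.
query_vector = average_word_embedding(
    ["query", "expansion", "unseen"], toy_embeddings
)
print(query_vector)  # [2.0, 1.0, 1.0]
```

In practice the lookup table would come from a trained word embedding model rather than hand-written vectors.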

Similar Documents

Relevance-based Word Embedding

Embedding-based Query Language Models

Word embedding based query expansion

Automatic query expansion and word sense disambiguation with long and short queries using WordNet under vector model

Embedding-based Query Expansion for Weighted Sequential Dependence Retrieval Model

Word Embedding Models for Query Expansion in Answer Passage Retrieval

Query-drift prevention for robust query expansion

Toward Word Embedding for Personalized Information Retrieval

Probabilistic Query Expansion method using recommended past user queries

Estimating Embedding Vectors for Queries