NDLI: Implementation and evaluation of scalable data structure over HBase

Please wait, while we are loading the content...

An effective e-governance model for financial institutions in India

Cloudpress 2.0: a next generation news retrieval system on the cloud with a built-in summarizer

Sentiment dictionary for effective detection of web users' opinion

N-ary tree based key distribution in a network as a service provisioning model

Semantic broken link detection using structured tagging scheme

Web data mining trends and techniques

A novel framework for enterprise web services change management

Web analytics and metrics: a survey

Efficient evaluation of partial path queries over a XML compact storage structure

Skin melanoma segmentation by morphological approach

Error encoding and decoding model over communication channel for synthesis of proteins from DNA sequences

Neuro-fuzzy expert system for breast cancer diagnosis

Finding gene coherent patterns using PATSUB+

Capacity planning of telemedicine network through molecular assembly

Detection and measurement of bimalleolar fractures using Harris corner

Use of higher order spectrum in characterizing nonlinear interactions in human brain signals

CASH: context aware scheduler for Hadoop

Mining of classification patterns in clinical data through data mining algorithms

Resource optimization for processing of stream data in data warehouse environment

Removal and interpolation of missing values using wavelet neural network for heterogeneous data sets

Evaluation of approaches for designing secure data warehouse

Implementation and evaluation of scalable data structure over HBase

Distributed frequent itemset mining framework for incremental data using MPI-style WSRF services

Parameter-lite clustering algorithm based on MST and fuzzy similarity merging

Detecting dependencies in an anonymized dataset

Hierarchically clustered technical blogs

Dominators vs pure dominators on the accuracy of a classifier with a multi objective cultural algorithm

Applicability of data mining algorithms for recommendation system in e-learning

Sampling correctly for improving classification accuracy: a hybrid higher order neural classifier (HHONC) approach

Selection of evolutionary approach based hybrid data mining algorithms for decision support systems and business intelligence

An algorithm for fuzzy-based sentence-level document clustering for micro-level contradiction analysis

Efficient two dimensional clustering of microarray gene expression data by means of hybrid similarity measure

Far efficient K-means clustering algorithm

An evaluation of clustering technique over intrusion detection system

A self learning rough fuzzy neural network classifier for mining temporal patterns

An efficient approach for generating frequent patterns without candidate generation

Clustering and classifying informative attributes using rough set theory

Experiments on POS tagging and data driven dependency parsing for Telugu language

Identification of optimal cluster centroid of multi-variable functions for clustering concept-drift categorical data

On-line Hindi handwritten character recognition for mobile devices

Identity anonymization and secure data storage using group signature in private cloud

Neighbour based structural proximity measures for ontology matching systems

A framework for preserving privacy in cloud computing with user service dependent identity

SenSim: sentence similarity based on the concept of relevance

Scheduling using improved genetic algorithm in cloud computing for independent tasks

Correlation based multi-document summarization for scientific articles and news group

Pricing models and pricing schemes of IaaS providers: a comparison study

Analysis of double edge triggered clocked storage elements

V2C: a secure vehicle to cloud framework for virtualized and on-demand service provisioning

High speed VLSI implementation of lifting based DWT

Authenticated and persistent skip graph: a data structure for cloud based data-centric applications

FPGA design and implementation of truncated multipliers using bypassing technique

Flexible power consumption management in smart homes

Hardware-software co-design of AES on FPGA

A novel software system to facilitate better and easier communication for people with speaking disabilities

A hybrid embedded steganography technique: optimum pixel method and matrix embedding

Genetic algorithm based airlines booking terminal open/close decision system

Symmetric key based blocked oriented digital enveloping

Active machine learning technique for named entity recognition

An improved secure code encryption approach based on indexed table

FPGA based computing displacement of moving object in a real time video

An efficient and secure key agreement scheme using physiological signals in body area networks

Text extraction from videos using a hybrid approach

Multi-modal biometric approach to enable high security in mobile adhoc network

Object detection in video using Lorenz information measure and discrete wavelet transform

Ultra low power device to track environmental sensitive items in transit

VLSI architectures for lifting based DWT: a detailed survey

Locating and monitoring emergency responder using a wearable device

Time based agent garbage collection algorithm for multicore architectures

Control of computer process using image processing and computer vision for low-processing devices

Formal verification methodology considerations for network on chips

Efficient method for noise removal techniques and video object segmentation using color based fuzzy c means

Low power energy efficient pipelined multiply-accumulate architecture

Intelligent tutoring systems: a new proposed structure

Current differencing buffered amplifier an active element: a review of recent developments

Auto-clever fuzzy (ACF) based intelligent system for monitoring and controlling the hydrocarbons -air toxics emitted by the vehicle motors

Advanced adaptive call admission control for mobile cellular networks: cell breathing, load shedding and bandwidth degradation

An approach towards dynamic assembling of learning objects

Evaluation of mobile games using playability heuristics

n-Gram modeling of relevant features for lip-reading

Tuning transmission opportunity (TXOP) limits for providing bit-based fairness in IEEE 802.11p V2I networks

Upper and lower bound analysis of the OFDM system impaired by CFO over slow fading channels

Temporal characteristics of clustering in mobile ad hoc network

Microstrip patch antenna combining crown and Sierpinski fractal-shapes

A rate adaptive and multipath routing protocol to support video streaming in MANETs

A new multi-agent system for video objects segmentation and tracking based on spatio-temporal descriptor

Load balancer for energy efficient clustering protocol in MANETs

Medical image thresholding using WQPSO and maximum entropy

Wireless charging of lighting gadgets using low Q resonant coupling

Enhancement of co-authorship networks with content-similarity information

Identification, authentication and tracking algorithm for vehicles using VIN in distributed VANET

Block PIC technique for synchronous CI/MC-CDMA system using neural network

A hybrid defense mechanism for DDoS attacks using cluster analysis in MANET

Multimodal pattern-oriented software architecture for self-optimization and self-configuration in autonomic computing system using multi objective evolutionary algorithms

Modified weight function based network selection algorithm for 4G wireless networks

Fixed point results of transcendental superior antifractals

On energy consumption analysis for ad hoc routing protocols

Performance evaluation of motion estimation in H.264/AVC encoder

A novel algorithm for PAPR reduction in LTE system

Lifetime extension of wireless sensor network by selecting two cluster heads and hierarchical routing

Adaptive time synchronization for VHT wireless LAN

A survey on various improvements of hybrid zone routing protocol in MANET

Enhanced emergency communication using mobile sensing and MANET

Performance evaluation of regular cycle non-binary LDPC codes in AWGN channel

Higher layer issues in cognitive radio network

Source based trusted AODV routing protocol for mobile ad hoc networks

Resolving rate anomaly in IEEE 802.11p multi-rate vehicle-to-infrastructure networks using TXOP differentiation

A framework for optimizing GCC for ARM architecture

An efficient sleep protocol for lifetime enhancement in multi covered and multi connected WSNs

Performance analysis of cross layer protocols for wireless sensor networks

Diversity coded directed diffusion for WSN

An energy efficient audio compression scheme using wavelet with dynamic difference detection technique in wireless sensor network

A swarm intelligence based distributed localization technique for wireless sensor network

Performance comparison and node failure assessment of energy efficient two level balanced and progressive sensor networks

A fault tolerant approach for data aggregation in wireless sensor networks

Hexagonal groups based key management using deployment knowledge in wireless sensor networks

Analysis of IEEE 802.11 DCF for ad hoc networks under nonsaturation conditions

Extensions to wireless M-Bus protocol for smart metering and smart grid application

Inference of peer temperament in unstructured peer-to-peer networks by creating a virtual multi layer structure

Efficient dynamic itinerary and memory allocation for mobile agents

Online differential charging for blended services using service capability interaction manager in IMS network

A novel approach for developing JXTA peer-to-peer computing systems using aspect-oriented programming methodologies

Task scheduling using ACO-BP neural network in computational grids

Sybil resilient identity distribution in P2P networks

Cloud computing: from hype to reality: fast tracking cloud adoption

The role of psychophysics laws in quality of experience assessment: a video streaming case study

Implementing private cloud at IIT Roorkee: an initial experience

An improved biased random sampling algorithm for load balancing in cloud based systems

Securing cloud infrastructure against co-resident DoS attacks using game theoretic defense mechanisms

State-of-the-art cloud computing security taxonomies: a classification of security challenges in the present cloud computing environment

Cloud-based B2B systems integration for small-and-medium-sized enterprises

Implementation of next-generation traffic sign recognition system with two-tier classifier architecture

A new algorithm based on complex wavelet transform for protein sequence classification

A linear after-load model for a cardio-vascular pulse duplicator

An evaluation of hospital information systems integration approaches

Methodology to visualize electronic health record for chronic diseases on small display screens

A study on damping profile for prosthetic knee

A comprehensive machine learning approach to prognose pulmonary disease from home

Simulation of respiratory system under normal, hypoxia and hypercapnia conditions

An effective unsupervised network anomaly detection method

Certificateless strong designated verifier multisignature scheme using bilinear pairings

An efficient DoG based fingerprint enhancement scheme

Error management and detection in computer networks using Bloom filters

Securing fingerprint images using a hybrid technique

Genetic algorithm combined with support vector machine for building an intrusion detection system

A DCT-SVD based robust watermarking scheme for grayscale image

Synthesis of sustainability and secureness of software in public applications using deductive-nomological model

Efficient privacy-preserving data distribution in outsourced environments: a fragmentation-based approach

Intelligent intrusion detection system using fuzzy rough set based C4.5 algorithm

Secret information display based authentication technique towards preventing phishing attacks

Protecting web applications from SQL injection attacks by using framework and database firewall

FPGA based sliding window architecture for RC5 encryption

Resource efficient survivability approach for resilient WDM optical networks

Privacy rights management in multiparty multilevel DRM system

A novel AES-256 implementation on FPGA using co-processor based architecture

Layered approach for intrusion detection using naïve Bayes classifier

Network intrusion detection system using genetic network programming with support vector machine

A novel framework for preserving privacy of data using correlation analysis

A secure packet marking scheme for IP traceback in IPv6

A classification based framework for privacy preserving data mining

Exploiting second order statistics of the received signals in AF relay cooperative network for compound channel estimation

Performance comparison of M-DCSK schemes in MIMO Nakagami channels with and without diversity combining

Capacity analysis of LMF channels under different adaptation policies with and without diversity combining

Image retrieval using local and global properties of image regions with relevance feedback

Optimized trace transform based content based image retrieval algorithm

Detecting epileptic seizures using electroencephalogram: a novel frequency domain feature extraction technique for seizure classification using fast ANFIS

Towards improving automatic image annotation using improvised fractal SMOTE approach

Recognition of vocal emotions from acoustic profile

Monogenic scale space based region covariance matrix descriptor: an efficient and accurate face recognition algorithm

A particle swarm optimization method for tuning the parameters of multiscale retinex based color image enhancement

Handwritten character recognition system using a simple feature

Palmprint feature extraction approach using nonsubsampled contourlet transform and orthogonal moments

PAPR reduction for OFDM systems using clipping and square rooting techniques

Directional line edge binary pattern for texture image indexing and retrieval

Fractional Fourier transform: a survey

Improving the energy efficiency of power line communications by spectrum sensing

Implementation of dictation system for Malayalam office document

Generating OLAP queries from natural language specification

Reformulation of Telugu web query using word semantic relationships

The study of different similarity measure techniques in recognition of handwritten characters

Divergence patterns in machine translation between Malayalam and English

Novel approach based feature extraction for Marathi continuous speech recognition

A compact feature set for recognition of handwritten numerals and vowels in the Kanarese script

A new combined method for character recognizing in Farsi printed scripts using principal component analysis

Morphar+: an Arabic morphosyntactic analyzer

Subject and object identification in Malayalam text

Morphological analyzer and generator for Tulu language: a novel approach

Active learning technique for biomedical named entity extraction

Analysis and evaluation of stemming algorithms: a case study with Assamese

A deconverter framework for Malayalam

An approach to summarizing Bengali news documents

Example-based single image enhanced up-sampling

A review on binarization algorithms for camera based natural scene images

Shift-invariant texture retrieval using P- contourlet

Improved edge preserving lossy image compression using wavelet transform

Ranking importance based information on the world wide web

A three phase approach for mental task classification using EEG

Feature based retrieval for animation video

Towards a unified 3D animated dictionary for Saudi sign language

Palmprint identification based on wide principal lines

The current state of art: handwriting a behavioral biometric for person identification and verification

Multi-normalization: a new method for improving biometric fusion

Performance evaluation of subspace methods to tackle small sample size problem in face recognition

Implementation and evaluation of scalable data structure over HBase

Content Provider	ACM Digital Library
Author	Rao, G. V. Prabhakara Kumar, K. Arun Voruganti, Kaladhar Konishetty, Vamshi Krishna
Abstract	With the emergence of commodity hardware architectures and distributed open source software, users are performing analytics on more types of data. Web 2.0 applications like social networking sites have to deal with a lot of meta-data which in some cases can't fit into main memory. Currently, it is the responsibility of the application programmers to manually map these in-memory data structures into persistent storage systems like a database or file system. Ideally, the application programmers would like the underlying programming language/middle ware software to seamlessly manage the scalable data structures. It is increasingly becoming hard to use the traditional database or storage controller systems to store this metadata because of cost and scale reasons. Thus, new NoSQL database architectures are emerging that are built on commodity hardware architectures and they can scale to large sizes in an incremental manner. Thus, there is an opportunity for the builders of NoSQL systems to provide scalable in-memory data structures. However, currently, these types of data structure interfaces are not available in the popular Hadoop NoSQL infrastructure. In this paper, we show how to implement the Set data structure and its operations in a scalable manner on top of Hadoop HBase. We then propose and implement optimizations for three Set operations. We also discuss the limitations of implementing this data structure in the Hadoop ecosystem. We evaluate our algorithms and optimizations on a real Hadoop cluster. Our primary conclusion is that the Hadoop ecosystem provides an excellent framework to implement scalable data structures.
Starting Page	1010
Ending Page	1018
Page Count	9
File Format	PDF
ISBN	9781450311960
DOI	10.1145/2345396.2345559
Language	English
Publisher	Association for Computing Machinery (ACM)
Publisher Date	2012-08-03
Publisher Place	New York
Access Restriction	Subscribed
Subject Keyword	Nosql Distributed data strcture Hadoop Mapreduce Hbase Optimization
Content Type	Text
Resource Type	Article

Central Library (ISO-9001:2015 Certified)
Indian Institute of Technology Kharagpur
Kharagpur, West Bengal, India | PIN - 721302

See location in the Map
03222 282435
Mail: support@ndl.gov.in

Sl.	Authority	Responsibilities	Communication Details
1	Ministry of Education (GoI), Department of Higher Education	Sanctioning Authority	https://www.education.gov.in/ict-initiatives
2	Indian Institute of Technology Kharagpur	Host Institute of the Project: The host institute of the project is responsible for providing infrastructure support and hosting the project	https://www.iitkgp.ac.in
3	National Digital Library of India Office, Indian Institute of Technology Kharagpur	The administrative and infrastructural headquarters of the project	Dr. B. Sutradhar bsutra@ndl.gov.in
4	Project PI / Joint PI	Principal Investigator and Joint Principal Investigators of the project	Dr. B. Sutradhar bsutra@ndl.gov.in Prof. Saswat Chakrabarti will be added soon
5	Website/Portal (Helpdesk)	Queries regarding NDLI and its services	support@ndl.gov.in
6	Contents and Copyright Issues	Queries related to content curation and copyright issues	content@ndl.gov.in
7	National Digital Library of India Club (NDLI Club)	Queries related to NDLI Club formation, support, user awareness program, seminar/symposium, collaboration, social media, promotion, and outreach	clubsupport@ndl.gov.in
8	Digital Preservation Centre (DPC)	Assistance with digitizing and archiving copyright-free printed books	dpc@ndl.gov.in
9	IDR Setup or Support	Queries related to establishment and support of Institutional Digital Repository (IDR) and IDR workshops	idr@ndl.gov.in