NDLI: Segmentation of Bangla words in scene images

Are buildings only instances?: exploration in architectural style categories

Geometry directed browser for personal photographs

Heritage app: annotating images on mobile phones

High resolution 3-D MR image reconstruction from multiple views

Content level access to digital library of India pages

A method for motion detection and categorization in perfusion weighted MRI

Large-scale statistical modeling of motion patterns: a Bayesian nonparametric approach

FaSTIP: a new method for detection and description of space-time interest points for human activity classification

Salient object detection using a fuzzy theoretic approach

A finite mixture model based on pair-copula construction of multivariate distributions and its application to color image segmentation

Aerial scene recognition using efficient sparse representation

Local appearance based robust tracking via sparse representation

Semi-supervised multiple instance learning based domain adaptation for object detection

A grammar-based GUI for single view reconstruction

MAPS: midline analysis and propagation of segmentation

Accelerating non-local denoising with a patch based dictionary

Depth from images of external outdoor scenes

Viewpoint based mobile robotic exploration aiding object search in indoor environment

Realtime motion detection based on the spatio-temporal median filter using GPU integral histograms

Human gait recognition using depth camera: a covariance based approach

Distinguishing cognitive states using iterative classification

Vote based correspondence for 3D point-set registration

LBP-SURF descriptor with color invariant and texture based features for underwater images

How do warm colors affect visual attention?

Feature prominence-based weighting scheme for video tracking

Haptic rendering of variable density point cloud through local kernel bandwidth estimation

Estimation of the area of mouth opening during speech production

Synthesis of emotional expressions specific to facial structure

Recognizing facial expressions using a novel shape motion descriptor

Neti Neti: in search of deity

The role of spatial context in activity recognition

Increasing intensity resolution on a single display using spatio-temporal mixing

Improved quadric surface impostors for large bio-molecular visualization

Accurate and efficient rendering of detail using directional distance maps

Hybrid ray tracing and path tracing of Bezier surfaces using a mixed hierarchy

Tweening boundary curves of non-simple immersions of a disk

Efficient texture mapping by homogeneous patch discovery

A pipeline for building 3D models using depth cameras

A novel skull stripping technique for T1-weighted MRI human head scans

Matte based generation of land cover maps

3DTV view generation with virtual pan/tilt/zoom functionality

Distributed massive model rendering

HD-GraphViz: highly distributed graph visualization on tiled displays

Accurate reconstruction of engineered models with surfaces of revolution

Feature match: an efficient low dimensional PatchMatch technique

Detection and segmentation of approximate repetitive patterns in relief images

Motion pattern-based image features for glaucoma detection from retinal images

Enhanced eigenspace separation transform for classification

Digital restoration of damaged mural images

Joint MAP estimation for blind deconvolution: when does it work?

On the use of regions for semantic image segmentation

A stochastic image denoising algorithm using 3-D block filtering under a non-local means framework

Real time mosaicing and change detection system

A Bayes filter based adaptive floor segmentation with homography and appearance cues

Sparse discriminative Fisher vectors in visual classification

Detection of doctored images using correlations of PSF

A new denoising filter for brain MR images

Novel color Gabor-LBP-PHOG (GLP) descriptors for object and scene image classification

Automatic extraction of road segments from high resolution satellite images: a region and boundary based approach

Document image binarization using wavelets for OCR applications

A non-local MRF model for heritage architectural image completion

Table detection in document images using header and trailer patterns

Image recovery from partial wavelet coefficients via sparse representation

PCA based video denoising in a non-local means framework

An approach towards a full-reference-based benchmarking for quality-optimized endoscopic video stabilization systems

Automated petroglyph image segmentation with interactive classifier fusion

Fuzzy based diffusion coefficient function in anisotropic diffusion for impulse noise removal

Automatic crack detection in heritage site images for image inpainting

VLSI architecture for real time edge detection of monochrome video sequences

Segmentation of Bangla words in scene images

Image retargeting using controlled shrinkage

Objective evaluation of noisy multimodal medical image fusion using Daubechies complex wavelet transform

Wavelet-based contrast enhancement of dark images using dynamic stochastic resonance

A DCT based reversible data embedding scheme for MPEG-4 video using HVS characteristics

Fuzzy graph modeling for text segmentation from land map images

Contrast enhancement in wavelet domain for graph-based segmentation in medical imaging

Super resolution via sparse representation in $l_{1}$ framework

Multichannel texture image segmentation using local feature fitting based variational active contours

Steganographic algorithm based on parametric discrete cosine transform

Assessment of computational visual attention models on medical images

Segmentation of brain MR images using intuitionistic fuzzy clustering algorithm

Segmentation of Bangla words in scene images

Content Provider	ACM Digital Library
Author	Banik, Prakriti; Bhattacharya, Ujjwal; Parui, Swapan K.
Abstract	Some studies on extraction of Bangla texts from scene images are available in the literature. Also, OCR of printed Bangla texts has been extensively studied. However, the performance of available Bangla OCR on scene texts is not acceptable. In this article, we present our recent study of segmentation of characters or their parts from Bangla texts extracted from scene images. The proposed approach detects the background and text by a combination of two algorithms: unsupervised learning algorithm K-means clustering and Otsu's threshold selection. We propose a criterion to choose an optimal K value for K-means clustering. The text segmentation is based on region growing and extraction of both headline and baseline of such texts. These two lines divide a Bangla word into three horizontal zones. The present algorithm segments characters or their parts in each individual zone. This zone-based segmentation approach helps to reduce the number of symbols to be handled by the classifier in the next stage of the OCR system. Our algorithm can also detect an image having only numerals, avoiding zone detection in that case. Extracted scene texts are often affected by artifacts and our segmentation algorithm can remove them efficiently. Our algorithm has been tested on 2460 Bangla words extracted from 260 scene images.
Starting Page	1
Ending Page	7
Page Count	7
File Format	PDF
ISBN	9781450316606
DOI	10.1145/2425333.2425403
Language	English
Publisher	Association for Computing Machinery (ACM)
Publisher Date	2012-12-16
Publisher Place	New York
Access Restriction	Subscribed
Subject Keyword	RGB normalization; K-means clustering; Baseline detection; Character segmentation
Content Type	Text
Resource Type	Article
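
The abstract describes separating text from background with Otsu's threshold selection and then splitting a word into three horizontal zones at its headline and baseline. The sketch below illustrates both ideas on small grayscale data in plain Python. It is not the authors' implementation: the K-means stage and their criterion for choosing K are not covered, dark text on a light background is an assumption, and the projection-profile heuristics in `find_zone_lines` (headline as the densest row in the upper half, baseline as the row followed by the sharpest drop in text pixels) are hypothetical stand-ins for whatever the paper actually does. All function names are illustrative.

```python
def otsu_threshold(pixels, levels=256):
    """Return the gray level that maximizes between-class variance (Otsu)."""
    hist = [0] * levels
    for p in pixels:
        hist[p] += 1
    total = len(pixels)
    sum_all = sum(level * count for level, count in enumerate(hist))

    best_t, best_var = 0, -1.0
    w0 = 0    # number of pixels with gray level <= t
    sum0 = 0  # sum of the gray levels of those pixels
    for t in range(levels):
        w0 += hist[t]
        sum0 += t * hist[t]
        w1 = total - w0
        if w0 == 0:
            continue  # no pixels in the lower class yet
        if w1 == 0:
            break     # upper class is empty from here on
        mean0 = sum0 / w0
        mean1 = (sum_all - sum0) / w1
        var_between = w0 * w1 * (mean0 - mean1) ** 2
        if var_between > best_var:
            best_t, best_var = t, var_between
    return best_t


def binarize(pixels, threshold):
    """Mark pixels at or below the threshold as text (1).

    Assumption: text is darker than the background.
    """
    return [1 if p <= threshold else 0 for p in pixels]


def find_zone_lines(binary_rows):
    """Return (headline_row, baseline_row) from the row-wise text-pixel counts.

    Hypothetical heuristics, not taken from the paper:
    - headline: densest row in the upper half of the word image
    - baseline: row below the headline followed by the largest drop in density
    """
    profile = [sum(row) for row in binary_rows]
    height = len(profile)
    headline = max(range(max(1, height // 2)), key=lambda r: profile[r])
    baseline = height - 1  # fallback when no row lies below the headline
    best_drop = None
    for r in range(headline + 1, height - 1):
        drop = profile[r] - profile[r + 1]
        if best_drop is None or drop > best_drop:
            baseline, best_drop = r, drop
    return headline, baseline


def split_zones(binary_rows):
    """Split a binary word image into (upper, middle, lower) lists of rows."""
    headline, baseline = find_zone_lines(binary_rows)
    upper = binary_rows[:headline]
    middle = binary_rows[headline:baseline + 1]
    lower = binary_rows[baseline + 1:]
    return upper, middle, lower


# Tiny synthetic "word": one stroke above the headline, two rows below the baseline.
SAMPLE_WORD = [
    [0, 0, 1, 0, 0, 0],
    [1, 1, 1, 1, 1, 1],
    [1, 0, 1, 0, 1, 0],
    [1, 0, 1, 0, 1, 0],
    [1, 1, 1, 0, 1, 1],
    [0, 0, 1, 0, 0, 0],
    [0, 0, 1, 0, 0, 0],
]
```

For `SAMPLE_WORD`, `find_zone_lines` places the headline at row 1 and the baseline at row 4, so `split_zones` yields one upper row, four middle rows, and two lower rows. Real scene text would need the artifact removal and numeral-only check that the abstract mentions before heuristics like these could be trusted.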
