Sharif Digital Repository / Sharif University of Technology / Search result

An image annotation rectifying method based on deep features

, Article 2nd International Conference on Digital Signal Processing, ICDSP 2018, 25 February 2018 through 27 February 2018 ; 2018 , Pages 88-92 ; 9781450364027 (ISBN) Ghostan Khatchatoorian, A ; Jamzad, M ; Sharif University of Technology

Association for Computing Machinery 2018

Abstract

Automatic image annotation methods generate a list of tags for each test image and present it in a matrix structure. To achieve a more accurate annotation, we propose a method with the aim of correcting the tag list. In our method, we detect an indicator for each group of tags and use it to rectify the annotation results. To find a correct indicator, we apply a deep feature vector generated by the “AlexNet” model. Using this indicator, we determine the suitable tags for an image. The purposed method is independent of feature vector, dataset, and annotation method. It can be applied to the currently available annotation methods. Our experiments showed improvement in all annotation methods...

EEG Signal Processing in BCI Applications

, M.Sc. Thesis Sharif University of Technology Kheirandish, Malihe (Author) ; Haj Sadeghi, Khosrow (Supervisor)

Abstract

Brain-inspired methods are now widely used to process the data generated by the brain with the aim of improving our understanding of how the brain functions and produces the remarkable intelligence exhibited by humans, which is the source of all realizations, perception and actions. Therefore brain-computer interface (BCI) is one of the most challenging scientific problems which focuses scientists attention, in most cases these systems are based on EEG signals recorded from the surface of the scalp because this method of the brain activity monitoring is noninvasive, easy to use and quit inexpensive. Brain computer interface (BCI) systems analyse the EEG signals and translate person’s...

محتواي کتاب

From windows to logos: analyzing outdoor images to aid flyer classification

, Article 15th International Conference on Image Analysis and Recognition, ICIAR 2018, 27 June 2018 through 29 June 2018 ; Volume 10882 LNCS , 2018 , Pages 175-184 ; 03029743 (ISSN); 9783319929996 (ISBN) Pourashraf, P ; Tomuro, N ; Bagheri Shouraki, S ; Sharif University of Technology

Springer Verlag 2018

Abstract

The goal of this paper was to create a new method for analyzing the online real estate flyers based on their property types. We created an algorithm which identifies the buildings and windows from the buildings in order to extract some useful features for classifying the flyers. Our novel approach for building recognition has two main steps: 1- Building Detector 2- Region Growing. Our novel window detection algorithm uses vanishing point to identify nearly the best angle for applying window detection. It transforms the 2D image into 3D and rotates the 3D image around the z-axis and picks the appropriate angle based on the vanishing points. Using these two novel techniques we were be able to...

A new image texture extraction algorithm based on Matching Pursuit Gabor wavelets

, Article 2005 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP '05, Philadelphia, PA, 18 March 2005 through 23 March 2005 ; Volume II , 2005 , Pages II740-II744 ; 15206149 (ISSN); 0780388747 (ISBN); 9780780388741 (ISBN) Yaghoobi, M ; Rabiee, H. R ; Ghanbari, M ; Shamsollahi, M. B ; Sharif University of Technology

2005

Abstract

Feature vector extraction, based on local image texture, is a primitive algorithm for many other applications, like segmentation, clustering and identification. If these feature vectors are a good match to the human visual system (HVS), we can expect to get the appropriate results by using them. Gabor filters has been used for this purpose successfully. In this paper we introduce a novel refinement, with the use of Matching Pursuit (MP) to improve the Gabor based texture feature extractor. With this improvement, we show that the separability of different textures will increase. Another consideration in this work is computation complexity. Therefore, we limit the basis function set to reduce...

Wavelet transform and fusion of linear and non linear method for face recognition

, Article DICTA 2009 - Digital Image Computing: Techniques and Applications, 1 December 2009 through 3 December 2009, Melbourne ; 2009 , Pages 296-302 ; 9780769538662 (ISBN) Mazloom, M ; Kasaei, S ; Neissi, N. A ; Sharif University of Technology

Abstract

This work presents a method to increase the face recognition accuracy using a combination of Wavelet, PCA, KPCA, and RBF Neural Networks. Preprocessing, feature extraction and classification rules are three crucial issues for face recognition. This paper presents a hybrid approach to employ these issues. For preprocessing and feature extraction steps, we apply a combination of wavelet transform, PCA and KPCA. During the classification stage, the Neural Network (RBF) is explored to achieve a robust decision in presence of wide facial variations. At first derives a feature vector from a set of downsampled wavelet representation of face images, then the resulting PCA-based linear features and...

Content Based Image Retrieval Using Segmentation Similarity Measure

, M.Sc. Thesis Sharif University of Technology Farhadi, Marzyieh (Author) ; Jamzad, Mansour (Supervisor)

Abstract

Content Based Image Retrieval (CBIR) is a research area in computer vision. This area comprises of two main steps, low level feature extraction such as color, texture and shape extraction and also similarity measures for comparison of images. The challenge in this system is the existence semantic gap between the low level visual features and the high level image semantics. The aim of research in this field is to reduce this semantic gap. In this study the images are divided into regions using Meanshift method, for color segmentation and then moments of each region as color feature are calculated. Also for extracting texture the images are divided into regions using Jseg method, and then...

محتواي کتاب

The effect of a two steps searching mechanism Using Feature Vectors Related to Image Class in Improving the Performance of CBIR System

, M.Sc. Thesis Sharif University of Technology Sherafati, Shima (Author) ; Jamzad, Mansoor (Supervisor) ; Manzuri Shalmani, Mohammad Taghi (Co-Advisor)

Abstract

Nowadays, retrieval is an inseparable part of user activities and due to growing usage of Content-Based Image Retrieval (CBIR), it has become a hot and challenging research topic specially in the past decade. The most important challenge that retrieval systems (including CBIR systems) are facing is the semantic gap between abstractions in the user’s mind and what is searched. One of the ways of dealing with this challenge is getting more information from the user about what he needs and so decreasing the distance between user’s will and what he gives to search engine as the description of his need. In this research, the class of query image is supposed to be given. For using this...

محتواي کتاب

Estimating Stopping Time Using Function Approximation Algorithms in Reinforcement Learning

, M.Sc. Thesis Sharif University of Technology Daei Naby, Ali (Author) ; Alishahi, Kasra (Supervisor) ; Haji Mirsadeghi, Mir omid (Supervisor)

Abstract

We study the expected value of stopping times in stochastic processes. Since there is no rigorous solution for computing stopping times in many processes, our approach is based on estimation using well-known methods in the Reinforcement Learning literature. The primary method in this research is the temporal difference algorithm. With some modifications, we can study the role of some state features in determining the stopping time. Moreover, without a complicated mathematical analysis, we can find functions closely enough to the goal function.Furthermore, we compare our proposed algorithm to the well-known regression methods and show our algorithm's advantages and disadvantages. The primary...

محتواي کتاب

Universal image steganalysis using singular values of DCT coefficients

, Article 2013 10th International ISC Conference on Information Security and Cryptology ; 2013 Heidari, M ; Gaemmaghami, S ; Sharif University of Technology

IEEE Computer Society 2013

Abstract

We propose a blind image steganalysis method based on the Singular Value Decomposition (SVD) of the Discrete Cosine Transform (DCT) coefficients that are revisited in this work. We compute geometric mean, mean of log values, and statistical moments (mean, variance and skewness) of the SVDs of the DCT sub-blocks that are averaged over the whole image to construct a 480-element feature vector for steganalysis. These features are fed to a Support Vector Machine (SVM) classifier to discriminate between stego and cover images. Experimental results show that the proposed method outperforms most powerful steganalyzers when applied to some well-known steganography algorithms

MMRO: A feature selection criterion for MR images based on alpha stable filter responses

, Article 2011 7th Iranian Conference on Machine Vision and Image Processing, MVIP 2011 - Proceedings ; 2011 ; 9781457715358 (ISBN) Abbasi Asl, R ; Fatemizadeh, E ; Sharif University of Technology

Abstract

In feature-based image registration, feature selection and reduction methods play an important role in decreasing computational burden of these operations. In this paper, a new approach is introduced to reduce the dimension of extracted feature vectors of MR images. This approach is based on the selection of the maximum and minimum responses of the alpha stable filter for the MR images over the extracted features with different orientation in frequency domain. This algorithm selects the rotation invariant features which are suitable for image registration purposes. It has been shown that these features could efficiently describe the image elements. The discriminating ability of the features...

Video keyframe analysis using a segment-based statistical metric in a visually sensitive parametric space

, Article IEEE Transactions on Image Processing ; Volume 20, Issue 10 , Oct , 2011 , Pages 2730-2737 ; 10577149 (ISSN) Omidyeganeh, M ; Ghaemmaghami, S ; Shirmohammadi, S ; Sharif University of Technology

2011

Abstract

This paper addresses a new approach to the keyframe extraction problem employing generalized Gaussian density (GGD) parameters of wavelet transform subbands along with Kullback-Leibler distance (KLD) measurement. Shot and cluster boundaries are selected using KLDs between GGD feature vectors, and then keyframes are located based on similarity and dissimilarity criteria. Objective and subjective evaluations show the high accuracy of this new approach compared with traditional methods

Learning strengths and weaknesses of classifiers for RGB-D semantic segmentation

, Article 9th Iranian Conference on Machine Vision and Image Processing, 18 November 2015 through 19 November 2015 ; Volume 2016-February , 2015 , Pages 176-179 ; 21666776 (ISSN) ; 9781467385398 (ISBN) Fooladgar, F ; Kasaei, S ; Sharif University of Technology

IEEE Computer Society

Abstract

3D scene understanding is an open challenge in the field of computer vision. Most of the focus is on 2D methods in which the semantic labeling of each RGB pixel is considered. But, in this paper, the 3D semantic labeling of RGB-D images is considered. In the proposed method, to extract some meaningful features, the superpixel generation algorithm is applied to the RGB image to segment it into a set of disjoint pixels. After that, the set of three powerful classifiers are utilized to semantically label each superpixel. In the proposed method, the probability outputs of these classifiers are concatenated as the novel feature vector for each superpixel. Consequently, to analyze the strengths...

A new word clustering method for building n-gram language models in continuous speech recognition systems

, Article Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 18 June 2008 through 20 June 2008, Wroclaw ; Volume 5027 LNAI , 2008 , Pages 286-293 ; 03029743 (ISSN) ; 354069045X (ISBN); 9783540690450 (ISBN) Bahrani, M ; Sameti, H ; Hafezi, N ; Momtazi, S ; Sharif University of Technology

2008

Abstract

In this paper a new method for automatic word clustering is presented. We used this method for building n-gram language models for Persian continuous speech recognition (CSR) systems. In this method, each word is specified by a feature vector that represents the statistics of parts of speech (POS) of that word. The feature vectors are clustered by k-means algorithm. Using this method causes a reduction in time complexity which is a defect in other automatic clustering methods. Also, the problem of high perplexity in manual clustering methods is abated. The experimental results are based on "Persian Text Corpus" which contains about 9 million words. The extracted language models are evaluated...

Developing a Deep Neural Network for Bio-sequence Classification Capable of Optical Computing

, M.Sc. Thesis Sharif University of Technology Mohammadi, Amir Hossein (Author) ; Koohi, Somayyeh (Supervisor)

Abstract

The classification of biological sequences is an open issue for a variety of data sets, such as viral and metagenomics sequences. Therefore, many studies utilize neural network tools, as the well-known methods in this field, and focus on designing customized network structures. However, a few works focus on more effective factors, such as input encoding method or implementation technology, to address accuracy and efficiency issues in this area. Therefore, in this work, we propose an image-based encoding method, called as WalkIm, whose adoption, even in a simple neural network, provides competitive accuracy and superior efficiency, compared to the existing classification methods (e.g. VGDC,...

محتواي کتاب

Feature-Based Online Pricing

, M.Sc. Thesis Sharif University of Technology Naderi Khahan, Farnaz (Author) ; Alishahi, Kasra (Supervisor)

Abstract

Nowadays, the online markets can easily change and adjust price of the product to an optimal price to increase the profit from the sale of their products. Because of this pricing flexibility, there are many applications of online pricing in online markets and so on.We study the problem of online pricing and specifically feature-based online pricing as an online learning problem in which a seller receives highly differentiated products online and prices them with the goal of obtaining the highest possible profit.The seller does not initially know the values of the different features, but can learn the values of the features based on whether products were sold at the posted prices in the...

محتواي کتاب

Hippocampal shape analysis in the Laplace Beltrami feature space for temporal lobe epilepsy diagnosis and lateralization

, Article Proceedings - International Symposium on Biomedical Imaging ; 2012 , Pages 150-153 ; 19457928 (ISSN) ; 9781457718588 (ISBN) Shishegar, R ; Gandomkar, Z ; Soltaman Zadeh, H ; Moghadasi, S. R ; Sharif University of Technology

IEEE 2012

Abstract

Shape analysis plays an important role in many medical imaging studies. One of the recent shape analysis methods uses the Laplace Beltrami operator which is also used in this paper for hippocampal shape comparison. We proposed a feature vector which consists of size measures and shape descriptors based on Laplace Beltrami eigenvalues and eigenfunctions. The aforementioned feature space is utilised for automatic differentiating normal subjects from epileptic patients as well as distinguishing epileptic patients with left temporal lobe epilepsy (LTLE) from patients with right temporal lobe epilepsy (RTLE). Achieved results are diagnostic accuracy of 93% with 95% sensitivity and lateralization...

Support vector data description for spoken digit recognition

, Article BIOSIGNALS 2012 - Proceedings of the International Conference on Bio-Inspired Systems and Signal Processing ; 2012 , Pages 32-37 ; 9789898425898 (ISBN) Tavanaei, A ; Ghasemi, A ; Tavanaei, M ; Sameti, H ; Manzuri, M. T ; Inst. Syst. Technol. Inf., Control Commun. (INSTICC) ; Sharif University of Technology

2012

Abstract

A classifier based on Support Vector Data Description (SVDD) is proposed for spoken digit recognition. We use the Mel Frequency Discrete Wavelet Coefficients (MFDWC) and the Mel Frequency cepstral Coefficients (MFCC) as the feature vectors. The proposed classifier is compared to the HMM and results are promising and we show the HMM and SVDD classifiers have equal accuracy rates. The performance of the proposed features and SVDD classifier with several kernel functions are evaluated and compared in clean and noisy speech. Because of multi resolution and localization of the Wavelet Transform (WT) and using SVDD, experiments on the spoken digit recognition systems based on MFDWC features and...

Mel-scaled Discrete Wavelet Transform and dynamic features for the Persian phoneme recognition

, Article 2011 International Symposium on Artificial Intelligence and Signal Processing, AISP 2011, 15 June 2011 through 16 June 2011 ; June , 2011 , Pages 138-140 ; 9781424498345 (ISBN) Tavanaei, A ; Manzuri, M. T ; Sameti, H ; Sharif University of Technology

2011

Abstract

In this paper we use a feature vector consisting of the Mel Frequency Discrete Wavelet Coefficients to recognize spoken phonemes in the Persian language. The purpose of using wavelet in feature extraction is to benefit from its multi resolution analysis and localization property in time and frequency domains. The MFDWCs are obtained by applying the Discrete Wavelet Transform (DWT) to the Mel-scaled log filter bank energies of a speech frame. Feature vectors are used for the HMM-based phoneme recognition on a portion of the FarsDat Persian language database consisting of 35 hour recorded data for training and 15 hour for testing. We evaluate the performance of new features for clean speech...

Car type recognition in highways based on wavelet and contourlet feature extraction

, Article Proceedings of the 2010 International Conference on Signal and Image Processing, ICSIP 2010, 15 December 2010 through 17 December 2010, Chennai ; 2010 , Pages 353-356 ; 9781424485949 (ISBN) Arzani, M. M ; Jamzad, M ; Sharif University of Technology

2010

Abstract

Recently many works focus on the vehicle type recognition because it is important in security and authentication systems. Computational complexity and low recognition rate especially when the system has to recognize among a large number of vehicles, are two major problems in vehicle type recognition. In recent years wavelet and contourlet transform have been applied in the recognition tasks successfully. In this paper we proposed a method for recognizing vehicle type in different lighting conditions. We used wavelet and contourlet as tools for feature extraction. These features are powerful and robust to illumination and scale variation. We reduced the dimension of feature vector by resizing...

Off-line Arabic/Farsi handwritten word recognition using RBF neural network and genetic algorithm

, Article Proceedings - 2010 IEEE International Conference on Intelligent Computing and Intelligent Systems, ICIS 2010, 29 October 2010 through 31 October 2010, Xiamen ; Volume 3 , 2010 , Pages 352-357 ; 9781424465835 (ISBN) Bahmani, Z ; Alamdar, F ; Azmi, R ; Haratizadeh, S ; Sharif University of Technology

2010

Abstract

In this paper an off-line ArabiclFarsi handwritten recognition Algorithm on a subset of Farsi name is proposed. In this system, There is no sub-word segmentation phase. Script database includes 3300 images of 30 Farsi common names. The features are wavelet coefficients extracted from smoothed word image profiles in four standard directions. The Centers of competitive layer of RBF neural network have been determined by combining GA and K-Means clustering algorithm. Weights of supervised layer has been trained by using LMS rule and the distances of feature vector of each sample to the centre of RBF network have been computed based on warping function. Experimental results show advantages of...