Sharif Digital Repository / Sharif University of Technology / Search result

A multi-view-group non-negative matrix factorization approach for automatic image annotation

, Article Multimedia Tools and Applications ; 2017 , Pages 1-21 ; 13807501 (ISSN) Rad, R ; Jamzad, M ; Sharif University of Technology

Abstract

In automatic image annotation (AIA) different features describe images from different aspects or views. Part of information embedded in some views is common for all views, while other parts are individual and specific. In this paper, we present the Mvg-NMF approach, a multi-view-group non-negative matrix factorization (NMF) method for an AIA system which considers both common and individual factors. The NMF framework discovers a latent space by decomposing data into a set of non-negative basis vectors and coefficients. The views divided into homogeneous groups and latent spaces are extracted for each group. After mapping the test images into these spaces, a unified distance matrix is...

Multi-modal deep distance metric learning

, Article Intelligent Data Analysis ; Volume 21, Issue 6 , 2017 , Pages 1351-1369 ; 1088467X (ISSN) Roostaiyan, S. M ; Imani, E ; Soleymani Baghshah, M ; Sharif University of Technology

IOS Press 2017

Abstract

In many real-world applications, data contain heterogeneous input modalities (e.g., web pages include images, text, etc.). Moreover, data such as images are usually described using different views (i.e. different sets of features). Learning a distance metric or similarity measure that originates from all input modalities or views is essential for many tasks such as content-based retrieval ones. In these cases, similar and dissimilar pairs of data can be used to find a better representation of data in which similarity and dissimilarity constraints are better satisfied. In this paper, we incorporate supervision in the form of pairwise similarity and/or dissimilarity constraints into...

Steganalysis of LSB based image steganography using spatial and frequency domain features

, Article 2009 IEEE International Conference on Multimedia and Expo, ICME 2009, New York, NY, 28 June 2009 through 3 July 2009 ; 2009 , Pages 1744-1747 ; 9781424442911 (ISBN) Malekmohamadi, H ; Ghaemmaghami, S ; Sharif University of Technology

2009

Abstract

In this paper, we propose a method for steganalysis of grayscale images using both spatial and Gabor features. The basis of our work is to use Gabor filter coefficients and statistics of the graylevel co-occurrence matrix of images to train a support vector machine. We show that this feature set works well in steganalysis of grayscale images steganographied by LSB matching and S-tools. ©2009 IEEE

Content based image retrieval using the knowledge of texture, color and binary tree structure

, Article 2009 Canadian Conference on Electrical and Computer Engineering, CCECE '09, St. Johns, NL, 3 May 2009 through 6 May 2009 ; 2009 , Pages 999-1003 ; 08407789 (ISSN); 9781424435081 (ISBN) Mansoori, Z ; Jamzad, M ; Sharif University of Technology

2009

Abstract

Content base image retrieval is an important research field with many applications. In this paper we presents a new approach for finding similar images to a given query, in a general-purpose image database using content-based image retrieval. Color and texture are used as basic features to describe images. In addition, a binary tree structure is used to describe higher level features of an image. It has been used to keep information about separate segments of the images. The performance of the proposed system has been compared with the SIMPLIcity system using COREL image database. Our experimental results showed that among 10 image categories available in COREL database, our system had a...

Secure steganography based on embedding capacity

, Article International Journal of Information Security ; Volume 8, Issue 6 , 2009 , Pages 433-445 ; 16155262 (ISSN) Sajedi, H ; Jamzad, M ; Sharif University of Technology

2009

Abstract

Mostly the embedding capacity of steganography methods is assessed in non-zero DCT coefficients. Due to unequal distribution of non-zero DCT coefficients in images with different contents, images with the same number of non-zero DCT coefficients may have different actual embedding capacities. This paper introduces embedding capacity as a property of images in the presence of multiple steganalyzers, and discusses a method for computing embedding capacity of cover images. Using the capacity constraint, embedding can be done more secure than the state when the embedder does not know how much data can be hidden securely in an image. In our proposed approach, an ensemble system that uses...

High rate data hiding in speech using voicing diversity in an adaptive MBE scheme

, Article 2008 IEEE Region 10 Conference, TENCON 2008, Hyderabad, 19 November 2008 through 21 November 2008 ; 2008 ; 1424424089 (ISBN); 9781424424085 (ISBN) Jahangiri, E ; Ghaemmaghami, S ; Sharif University of Technology

2008

Abstract

This paper addresses a new approach to data hiding that leads to a high data embedding rate of tens of kbps in a typical digital voice file transmission scheme. The purpose of the proposed method is restricted to offline voice transmission that uses stego speech files in wave format. The basic idea of the algorithm is to embed encrypted covert message in the unvoiced bands of spectrum of the cover speech. Inaudibility of the proposed hiding scheme is investigated through both support vector machines (SVM)-based steganalysis and the ITU-T P.862 PESQ standard speech quality assessment. The results assure imperceptibility and transparency of the stego speech

Automatic Image Annotation by Multi-view Non-negative Matrix Factorization

, Ph.D. Dissertation Sharif University of Technology Rad, Roya (Author) ; Jamzad, Mansour (Supervisor)

Abstract

Nowadays the number of digital images has largely increased because of progress in internet technology. Management of this volume of data needs an efficient system for browsing, categorizing, and searching the images. The goal of this research is to design a system for automatic annotation of unobserved images for better search in image data bases. Automatic image annotation is a multi-label classification problem with many labels which suggests some words for describing the content of an image. Designing AIA systems faces chanllenges like semantic gap between low level image features and high level human expressions (tags), incompelete tags and imbalance images per tags in the datasets....

محتواي کتاب

A Self-Tag Rectifier Model for Automatic Image Annotation

, Ph.D. Dissertation Sharif University of Technology Ghostan Khatchatoorian, Artin (Author) ; Jamzad, Mansour (Supervisor) ; Beigy, Hamid (Co-Supervisor)

Abstract

Automatic image annotation is an image retrieval mechanism to extract relative semantic tags from visual contents. The number of digital images uploaded in the virtual world is rapidly growing every day. Most of those images are not assigned with proper tags or labels. Although automatic image annotation methods are developed to assign proper tags to images, most of these methods assign some irrelevant tags and also sometimes a few relevant tags are missing. So far, the improvements of accuracy in newly developed automatic image annotation methods have been about one or two percent in F1-score compared to the previous methods. To reach much better performance, we analyzed most of the...

محتواي کتاب

Robust algorithm for brain magnetic resonance image (MRI) classification based on GARCH variances series

, Article Biomedical Signal Processing and Control ; Volume 8, Issue 6 , 2013 , Pages 909-919 ; 17468094 (ISSN) Kalbkhani, H ; Shayesteh, M. G ; Zali Vargahan, B ; Sharif University of Technology

2013

Abstract

In this paper, a robust algorithm for disease type determination in brain magnetic resonance image (MRI) is presented. The proposed method classifies MRI into normal or one of the seven different diseases. At first two-level two-dimensional discrete wavelet transform (2D DWT) of input image is calculated. Our analysis show that the wavelet coefficients of detail sub-bands can be modeled by generalized autoregressive conditional heteroscedasticity (GARCH) statistical model. The parameters of GARCH model are considered as the primary feature vector. After feature vector normalization, principal component analysis (PCA) and linear discriminant analysis (LDA) are used to extract the proper...

Large-scale image annotation using prototype-based models

, Article ISPA 2011 - 7th International Symposium on Image and Signal Processing and Analysis ; 2011 , Pages 449-454 ; 9789531841597 (ISBN) Amiri, S. H ; Jamzad, M ; European Association for Signal Processing (EURASIP); IEEE Signal Processing Society; IEEE Region 8; IEEE Croatia Section; IEEE Croatia Section Signal Processing Chapter ; Sharif University of Technology

Abstract

Automatic image annotation is a challenging problem in the field of image retrieval. Dealing with large databases makes the annotation problem more difficult and therefore an effective approach is needed to manage such databases. In this work, an annotation system has been developed which considers images in separate categories and constructs a profiling model for each category. To describe an image, we propose a new feature extraction method based on color and texture information that describes image content using discrete distribution signatures. Image signatures of one category are partitioned using spectral clustering and a prototype is determined for each cluster by solving an...

Car type recognition in highways based on wavelet and contourlet feature extraction

, Article Proceedings of the 2010 International Conference on Signal and Image Processing, ICSIP 2010, 15 December 2010 through 17 December 2010, Chennai ; 2010 , Pages 353-356 ; 9781424485949 (ISBN) Arzani, M. M ; Jamzad, M ; Sharif University of Technology

2010

Abstract

Recently many works focus on the vehicle type recognition because it is important in security and authentication systems. Computational complexity and low recognition rate especially when the system has to recognize among a large number of vehicles, are two major problems in vehicle type recognition. In recent years wavelet and contourlet transform have been applied in the recognition tasks successfully. In this paper we proposed a method for recognizing vehicle type in different lighting conditions. We used wavelet and contourlet as tools for feature extraction. These features are powerful and robust to illumination and scale variation. We reduced the dimension of feature vector by resizing...

Tensor-based face representation and recognition using multi-linear subspace analysis

, Article 2009 14th International CSI Computer Conference, CSICC 2009, 20 October 2009 through 21 October 2009, Tehran ; 2009 , Pages 658-663 ; 9781424442621 (ISBN) Mohseni, H ; Kasaei, S ; Sharif University of Technology

Abstract

Discriminative subspace analysis is a popular approach for a variety of applications. There is a growing interest in subspace learning techniques for face recognition. Principal component analysis (PCA) and eigenfaces are two important subspace analysis methods have been widely applied in a variety of areas. However, the excessive dimension of data space often causes the curse of dimensionality dilemma, expensive computational cost, and sometimes the singularity problem. In this paper, a new supervised discriminative subspace analysis is presented by encoding face image as a high order general tensor. As face space can be considered as a nonlinear submanifold embedded in the tensor space, a...

A fusion-based gender recognition method using facial images

, Article 26th Iranian Conference on Electrical Engineering, ICEE 2018, 8 May 2018 through 10 May 2018 ; 2018 , Pages 1493-1498 ; 9781538649169 (ISBN) Ghojogh, B ; Bagheri Shouraki, S ; Mohammadzade, H ; Iranmehr, E ; Sharif University of Technology

Institute of Electrical and Electronics Engineers Inc 2018

Abstract

This paper proposes a fusion-based gender recognition method which uses facial images as input. Firstly, this paper utilizes pre-processing and a landmark detection method in order to find the important landmarks of faces. Thereafter, four different frameworks are proposed which are inspired by state-of-the-art gender recognition systems. The first framework extracts features using Local Binary Pattern (LBP) and Principal Component Analysis (PCA) and uses back propagation neural network. The second framework uses Gabor filters, PCA, and kernel Support Vector Machine (SVM). The third framework uses lower part of faces as input and classifies them using kernel SVM. The fourth framework uses...

A multi-view-group non-negative matrix factorization approach for automatic image annotation

, Article Multimedia Tools and Applications ; Volume 77, Issue 13 , 2018 , Pages 17109-17129 ; 13807501 (ISSN) Rad, R ; Jamzad, M ; Sharif University of Technology

Springer New York LLC 2018

Abstract

In automatic image annotation (AIA) different features describe images from different aspects or views. Part of information embedded in some views is common for all views, while other parts are individual and specific. In this paper, we present the Mvg-NMF approach, a multi-view-group non-negative matrix factorization (NMF) method for an AIA system which considers both common and individual factors. The NMF framework discovers a latent space by decomposing data into a set of non-negative basis vectors and coefficients. The views divided into homogeneous groups and latent spaces are extracted for each group. After mapping the test images into these spaces, a unified distance matrix is...

Partial discharges pattern recognition of transformer defect model by LBP & HOG features

, Article IEEE Transactions on Power Delivery ; 2018 ; 08858977 (ISSN) Firuzi, K ; Vakilian, M ; Phung, B. T ; Blackburn, T. R ; Sharif University of Technology

Institute of Electrical and Electronics Engineers Inc 2018

Abstract

Partial discharge (PD) measurement and identification have great importance to condition monitoring of power transformers. In this paper a new method for recognition of single and multi-source of PD based on extraction of high level image features have been introduced. A database, involving 365 samples of phase-resolved PD (PRPD) data, is developed by measurement carried out on transformer artificial defect models (having different sizes of defect) under a specific applied voltage, to be used for proposed algorithm validation. In the first step, each set of PRPD data is converted into grayscale images to represent different PD defects. Two “image feature extraction” methods, the Local Binary...

Using geometrical routing for overlay networking in MMOGs

, Article Multimedia Tools and Applications ; Volume 45, Issue 1-3 , 2009 , Pages 61-81 ; 13807501 (ISSN) Hariri, B ; Pakravan, M. R ; Shirmohammadi, S ; Alavi, M. H ; Sharif University of Technology

2009

Abstract

At a first glance, transmitting update information to a geographic region in the virtual space seems to be an attractive primitive in Massively Multiplayer Online Gaming (MMOG) applications where players are constantly moving and need to send updates to their neighbors who are in the same region of the virtual space. The system would become more scalable if entities did not need to keep track of each other or send messages directly to one another. Rather, an entity could just send a message to a specific region in the virtual space (its area of effect), as opposed to sending packets to specific IP addresses, significantly reducing tracking and routing overhead. Fundamentally speaking, update...

Partial discharges pattern recognition of transformer defect model by LBP & HOG features

, Article IEEE Transactions on Power Delivery ; Volume 34, Issue 2 , 2019 , Pages 542-550 ; 08858977 (ISSN) Firuzi, K ; Vakilian, M ; Phung, B. T ; Blackburn, T. R ; Sharif University of Technology

Institute of Electrical and Electronics Engineers Inc 2019

Abstract

Partial discharge (PD) measurement and identification have great importance to condition monitoring of power transformers. In this paper, a new method for recognition of single and multi-source of PD based on extraction of high level image features has been introduced. A database, involving 365 samples of phase-resolved PD (PRPD) data, is developed by measurement carried out on transformer artificial defect models (having different sizes of defect) under a specific applied voltage, to be used for proposed algorithm validation. In the first step, each set of PRPD data is converted into grayscale images to represent different PD defects. Two 'image feature extraction' methods, the Local Binary...

Fuzzy support vector machine: An efficient rule-based classification technique for microarrays

, Article BMC Bioinformatics ; Volume 14, Issue SUPPL13 , 2013 ; 14712105 (ISSN) Hajiloo, M ; Rabiee, H. R ; Anooshahpour, M ; Sharif University of Technology

2013

Abstract

Background: The abundance of gene expression microarray data has led to the development of machine learning algorithms applicable for tackling disease diagnosis, disease prognosis, and treatment selection problems. However, these algorithms often produce classifiers with weaknesses in terms of accuracy, robustness, and interpretability. This paper introduces fuzzy support vector machine which is a learning algorithm based on combination of fuzzy classifiers and kernel machines for microarray classification.Results: Experimental results on public leukemia, prostate, and colon cancer datasets show that fuzzy support vector machine applied in combination with filter or wrapper feature selection...

Autoregressive video modeling through 2D Wavelet Statistics

, Article Proceedings - 2010 6th International Conference on Intelligent Information Hiding and Multimedia Signal Processing, IIHMSP 2010, 15 October 2010 through 17 October 2010 ; October , 2010 , Pages 272-275 ; 9780769542225 (ISBN) Omidyeganeh, M ; Ghaemmaghami, S ; Shirmohammadi, S ; Sharif University of Technology

2010

Abstract

We present an Autoregressive (AR) modeling method for video signal analysis based on 2D Wavelet Statistics. The video signal is assumed to be a combination of spatial feature time series that are temporally approximated by the AR model. The AR model yields a linear approximation to the temporal evolution of a stationary stochastic process. Generalized Gaussian Density (GGD) parameters, extracted from 2D wavelet transform subbands, are used as the spatial features. Wavelet transform efficiently resembles the Human Visual System (HVS) characteristics and captures more suitable features, as compared to color histogram features. The AR model describes each spatial feature vector as a linear...

Hierarchical concept score post-processing and concept-wise normalization in CNN based video event recognition

, Article IEEE Transactions on Multimedia ; Volume: 21 , Issue: 1 , Jan , 2019 , 157 - 172 ; 15209210 (ISSN) Soltanian, M ; Ghaemmaghami, S ; Sharif University of Technology

Institute of Electrical and Electronics Engineers Inc 2018

Abstract

This paper is focused on video event recognition based on frame level CNN descriptors. Using transfer learning, the image trained descriptors are applied to the video domain to make event recognition feasible in scenarios with limited computational resources. After fine-tuning of the existing Convolutional Neural Network (CNN) concept score extractors, pre-trained on ImageNet, the output descriptors of the different fully connected layers are employed as frame descriptors. The resulting descriptors are hierarchically post-processed and combined with novel and efficient pooling and normalization methods. As major contributions of this work to the video event recognition, we present a...