Sharif Digital Repository / Sharif University of Technology / Search result

Automatic image annotation using semi-supervised generative modeling

, Article Pattern Recognition ; Volume 48, Issue 1 , January , 2015 , Pages 174-188 ; 00313203 (ISSN) Amiri, S. H ; Jamzad, M ; Sharif University of Technology

Elsevier Ltd 2015

Abstract

Image annotation approaches need an annotated dataset to learn a model for the relation between images and words. Unfortunately, preparing a labeled dataset is highly time consuming and expensive. In this work, we describe the development of an annotation system in semi-supervised learning framework which by incorporating unlabeled images into training phase reduces the system demand to labeled images. Our approach constructs a generative model for each semantic class in two main steps. First, based on Gamma distribution, a generative model is constructed for each semantic class using labeled images in that class. The second step incorporates the unlabeled images by using a modified EM...

Automatic Image Annotation Using Deep Learning

, M.Sc. Thesis Sharif University of Technology Bahramipoor, Misagh (Author) ; Jamzad, Mansour (Supervisor)

Abstract

With the advances in technology, Nowadays digital cameras are everywhere. As a result very large amount of images are on the web. Searching through these images intelligently and purposively is an essential need. Recently the possibility of retrieving images with some conceptual words along side of content based image retrieval has been studied in computer vision. For this purpose it’s required that for each image several words that describe its content be assigned automatically. One of the main problems for this task is semantic gap, meaning that the low level features such as color, texture,… don’t have the ability to describe the high level concepts in images which are comprehensible by...

محتواي کتاب

Architecture to improve the accuracy of automatic image annotation systems

, Article IET Computer Vision ; Volume 14, Issue 5 , August , 2020 , Pages 214-223 Khatchatoorian, A. G ; Jamzad, M ; Sharif University of Technology

Institution of Engineering and Technology 2020

Abstract

Automatic image annotation (AIA) is an image retrieval mechanism to extract relative semantic tags from visual content. So far, the improvement of accuracy in newly developed such methods have been about 1 or 2% in the F1-score and the architectures seem to have room for improvement. Therefore, the authors designed a more detailed architecture for AIA and suggested new algorithms for its main parts. The proposed architecture has three main parts: feature extraction, learning, and annotation. They designed a novel learning method using machine learning and probability bases. In the annotation part, they suggest a novel method that gains the maximum benefit from the learning part. The...

A novel semantic statistical model for automatic image annotation using the relationship between the regions based on multi-criteria decision making

, Article International Journal of Electrical and Computer Engineering ; Vol. 4, issue. 1 , 2014 , p. 37-51 Deljooi, H ; Eskandari, A. R ; Sharif University of Technology

Abstract

Automatic image annotation has emerged as an important research topic due to the existence of the semantic gap and in addition to its potential application on image retrieval and management. In this paper we present an approach which combines regional contexts and visual topics to automatic image annotation. Regional contexts model the relationship between the regions, whereas visual topics provide the global distribution of topics over an image. Conventional image annotation methods neglected the relationship between the regions in an image, while these regions are exactly explanation of the image semantics, therefore considering the relationship between them are helpful to annotate the...

Efficient multi-modal fusion on supergraph for scalable image annotation

, Article Pattern Recognition ; Volume 48, Issue 7 , July , 2015 , Pages 2241-2253 ; 00313203 (ISSN) Amiri, S. H ; Jamzad, M ; Sharif University of Technology

Elsevier Ltd 2015

Abstract

Different types of visual features provide multi-modal representation for images in the annotation task. Conventional graph-based image annotation methods integrate various features into a single descriptor and consider one node for each descriptor on the learning graph. However, this graph does not capture the information of individual features, making it unsuitable for propagating the labels of annotated images. In this paper, we address this issue by proposing an approach for fusing the visual features such that a specific subgraph is constructed for each visual modality and then subgraphs are connected to form a supergraph. As the size of supergraph grows linearly with the number of...

Automatic image annotation by a loosely joint non-negative matrix factorisation

, Article IET Computer Vision ; Volume 9, Issue 6 , November , 2015 , Pages 806-813 ; 17519632 (ISSN) Rad, R ; Jamzad, M ; Sharif University of Technology

Institution of Engineering and Technology 2015

Abstract

Nowadays, the number of digital images has increased so that the management of this volume of data needs an efficient system for browsing, categorising and searching. Automatic image annotation is designed for assigning tags to images for more accurate retrieval. Non-negative matrix factorisation (NMF) is a traditional machine learning technique for decomposing a matrix into a set of basis and coefficients under the non-negative constraints. In this study, the authors propose a two-step algorithm for designing an automatic image annotation system that employs the NMF framework for its first step and a variant of K-nearest neighbourhood as its second step. In the first step, a new multimodal...

A multi-view-group non-negative matrix factorization approach for automatic image annotation

, Article Multimedia Tools and Applications ; 2017 , Pages 1-21 ; 13807501 (ISSN) Rad, R ; Jamzad, M ; Sharif University of Technology

Abstract

In automatic image annotation (AIA) different features describe images from different aspects or views. Part of information embedded in some views is common for all views, while other parts are individual and specific. In this paper, we present the Mvg-NMF approach, a multi-view-group non-negative matrix factorization (NMF) method for an AIA system which considers both common and individual factors. The NMF framework discovers a latent space by decomposing data into a set of non-negative basis vectors and coefficients. The views divided into homogeneous groups and latent spaces are extracted for each group. After mapping the test images into these spaces, a unified distance matrix is...

Automatic image annotation using tag relations and graph convolutional networks

, Article 5th International Conference on Pattern Recognition and Image Analysis, IPRIA 2021, 28 April 2021 through 29 April 2021 ; 2021 ; 9781665426596 (ISBN) Lotfi, F ; Jamzad, M ; Beigy, H ; Sharif University of Technology

Institute of Electrical and Electronics Engineers Inc 2021

Abstract

Automatic image annotation is a mechanism to assign a list of appropriate tags that describe the visual content of a given image. Most methods only focus on the content of the images and ignore the relationship between the tags in vocabulary. In this work, we propose a new deep learning-based automatic image annotation architecture, which considers label dependencies in a graph convolution neural network structure and extracts tag descriptors to re-weight the output class scores based on their relationships. The proposed architecture has three main parts: feature extraction, graph convolutional network, and annotation. In graph convolutional network, we apply one layer convolution on...

Unsupervised estimation of conceptual classes for semantic image annotation

, Article 2011 19th Iranian Conference on Electrical Engineering, ICEE 2011, 17 May 2011 through 19 May 2011 ; May , 2011 ; 9789644634284 (ISBN) Teimoori, F ; Esmaili, H ; Shirazi, A. A. B ; Sharif University of Technology

2011

Abstract

A probabilistic formulation for semantic image annotation and retrieval is proposed. Annotation and retrieval are posed as classification problems where each class is defined as the group of database images labeled with a common semantic label. It is shown that, by establishing this one-to-one correspondence between semantic labels and semantic classes, a minimum probability of error annotation and retrieval are feasible with algorithms that are 1) conceptually simple and 2) computationally efficient. In this article, a content-based image retrieval and annotation architecture is proposed. Its attitude is decreasing the semantic gap by partitioning the image to its semantic regions and using...

A multi-view-group non-negative matrix factorization approach for automatic image annotation

, Article Multimedia Tools and Applications ; Volume 77, Issue 13 , 2018 , Pages 17109-17129 ; 13807501 (ISSN) Rad, R ; Jamzad, M ; Sharif University of Technology

Springer New York LLC 2018

Abstract

In automatic image annotation (AIA) different features describe images from different aspects or views. Part of information embedded in some views is common for all views, while other parts are individual and specific. In this paper, we present the Mvg-NMF approach, a multi-view-group non-negative matrix factorization (NMF) method for an AIA system which considers both common and individual factors. The NMF framework discovers a latent space by decomposing data into a set of non-negative basis vectors and coefficients. The views divided into homogeneous groups and latent spaces are extracted for each group. After mapping the test images into these spaces, a unified distance matrix is...

An image annotation rectifying method based on deep features

, Article 2nd International Conference on Digital Signal Processing, ICDSP 2018, 25 February 2018 through 27 February 2018 ; 2018 , Pages 88-92 ; 9781450364027 (ISBN) Ghostan Khatchatoorian, A ; Jamzad, M ; Sharif University of Technology

Association for Computing Machinery 2018

Abstract

Automatic image annotation methods generate a list of tags for each test image and present it in a matrix structure. To achieve a more accurate annotation, we propose a method with the aim of correcting the tag list. In our method, we detect an indicator for each group of tags and use it to rectify the annotation results. To find a correct indicator, we apply a deep feature vector generated by the “AlexNet” model. Using this indicator, we determine the suitable tags for an image. The purposed method is independent of feature vector, dataset, and annotation method. It can be applied to the currently available annotation methods. Our experiments showed improvement in all annotation methods...

Automatic Image Annotation by Multi-view Non-negative Matrix Factorization

, Ph.D. Dissertation Sharif University of Technology Rad, Roya (Author) ; Jamzad, Mansour (Supervisor)

Abstract

Nowadays the number of digital images has largely increased because of progress in internet technology. Management of this volume of data needs an efficient system for browsing, categorizing, and searching the images. The goal of this research is to design a system for automatic annotation of unobserved images for better search in image data bases. Automatic image annotation is a multi-label classification problem with many labels which suggests some words for describing the content of an image. Designing AIA systems faces chanllenges like semantic gap between low level image features and high level human expressions (tags), incompelete tags and imbalance images per tags in the datasets....

محتواي کتاب

A Self-Tag Rectifier Model for Automatic Image Annotation

, Ph.D. Dissertation Sharif University of Technology Ghostan Khatchatoorian, Artin (Author) ; Jamzad, Mansour (Supervisor) ; Beigy, Hamid (Co-Supervisor)

Abstract

Automatic image annotation is an image retrieval mechanism to extract relative semantic tags from visual contents. The number of digital images uploaded in the virtual world is rapidly growing every day. Most of those images are not assigned with proper tags or labels. Although automatic image annotation methods are developed to assign proper tags to images, most of these methods assign some irrelevant tags and also sometimes a few relevant tags are missing. So far, the improvements of accuracy in newly developed automatic image annotation methods have been about one or two percent in F1-score compared to the previous methods. To reach much better performance, we analyzed most of the...

محتواي کتاب

Large-scale image annotation using prototype-based models

, Article ISPA 2011 - 7th International Symposium on Image and Signal Processing and Analysis ; 2011 , Pages 449-454 ; 9789531841597 (ISBN) Amiri, S. H ; Jamzad, M ; European Association for Signal Processing (EURASIP); IEEE Signal Processing Society; IEEE Region 8; IEEE Croatia Section; IEEE Croatia Section Signal Processing Chapter ; Sharif University of Technology

Abstract

Automatic image annotation is a challenging problem in the field of image retrieval. Dealing with large databases makes the annotation problem more difficult and therefore an effective approach is needed to manage such databases. In this work, an annotation system has been developed which considers images in separate categories and constructs a profiling model for each category. To describe an image, we propose a new feature extraction method based on color and texture information that describes image content using discrete distribution signatures. Image signatures of one category are partitioned using spectral clustering and a prototype is determined for each cluster by solving an...

Image annotation using multi-view non-negative matrix factorization with different number of basis vectors

, Article Journal of Visual Communication and Image Representation ; Volume 46 , 2017 , Pages 1-12 ; 10473203 (ISSN) Rad, R ; Jamzad, M ; Sharif University of Technology

Academic Press Inc 2017

Abstract

Automatic Image Annotation (AIA) helps image retrieval systems by predicting tags for images. In this paper, we propose an AIA system using Non-negative Matrix Factorization (NMF) framework. The NMF framework discovers a latent space, by factorizing data into a set of non-negative basis and coefficients. To model the images, multiple features are extracted, each one represents images from a specific view. We use multi-view graph regularization NMF and allow NMF to choose a different number of basis vectors for each view. For tag prediction, each test image is mapped onto the multiple latent spaces. The distances of images in these spaces are used to form a unified distance matrix. The...

Suggesting an integration system for image annotation

, Article Multimedia Tools and Applications ; 2022 ; 13807501 (ISSN) Ghostan Khatchatoorian, A ; Jamzad, M ; Sharif University of Technology

Springer 2022

Abstract

The number of digital images uploaded in the virtual world is rapidly growing every day. Therefore, an automatic image annotation system that can retrieve information from these images seems to be in high demand. One of the challenges in this field is the imbalanced data sets and the difficulty of successfully learning tags from them. Even if a nearly balanced data set exists for image annotation, it is unlikely to find a single learner, which could learn all tags with the same accuracy. In this paper, we suggest a novel integration system that selects an elite group of models from all existing annotation models and then combines them to take the best advantage of each model’s learning...

Image Annotation Using Semi-supervised Learning

, Ph.D. Dissertation Sharif University of Technology Amiri, Hamid (Author) ; Jamzad, Mansour (Supervisor)

Abstract

Aautomatic image annotation that assigns some labels to input images and provides a textual description for the contents of images has become an active field in machine vision community. To design an annotation system, we need a dataset that contains images and labels for them. However, a large amount of manual efforts is required to annotate all images in a dataset. To reduce the demand of annotation systems on the labeled images, one solution is to exploit useful information embedded into the unlabeled images and incorporate them into learning process. In machine learning community, semi-supervised learning (SSL) has been introduced with the aim of incorporating unlabeled samples into the...

محتواي کتاب

Leveraging multi-modal fusion for graph-based image annotation

, Article Journal of Visual Communication and Image Representation ; Volume 55 , 2018 , Pages 816-828 ; 10473203 (ISSN) Amiri, S. H ; Jamzad, M ; Sharif University of Technology

Academic Press Inc 2018

Abstract

Considering each of the visual features as one modality in image annotation task, efficient fusion of different modalities is essential in graph-based learning. Traditional graph-based methods consider one node for each image and combine its visual features into a single descriptor before constructing the graph. In this paper, we propose an approach that constructs a subgraph for each modality in such a way that edges of subgraph are determined using a search-based approach that handles class-imbalance challenge in the annotation datasets. Multiple subgraphs are then connected to each other to have a supergraph. This follows by introducing a learning framework to infer the tags of...

Toward real-time image annotation using marginalized coupled dictionary learning

, Article Journal of Real-Time Image Processing ; Volume 19, Issue 3 , 2022 , Pages 623-638 ; 18618200 (ISSN) Roostaiyan, S. M ; Hosseini, M. M ; Mohammadi Kashani, M ; Amiri, S. H ; Sharif University of Technology

Springer Science and Business Media Deutschland GmbH 2022

Abstract

In most image retrieval systems, images include various high-level semantics, called tags or annotations. Virtually all the state-of-the-art image annotation methods that handle imbalanced labeling are search-based techniques which are time-consuming. In this paper, a novel coupled dictionary learning approach is proposed to learn a limited number of visual prototypes and their corresponding semantics simultaneously. This approach leads to a real-time image annotation procedure. Another contribution of this paper is that utilizes a marginalized loss function instead of the squared loss function that is inappropriate for image annotation with imbalanced labels. We have employed a marginalized...

3D Image segmentation with sparse annotation by self-training and internal registration

, Article IEEE Journal of Biomedical and Health Informatics ; 2020 Bitarafan, A ; Nikdan, M ; Soleymanibaghshah, M ; Sharif University of Technology

Institute of Electrical and Electronics Engineers Inc 2020

Abstract

Anatomical image segmentation is one of the foundations for medical planning. Recently, convolutional neural networks (CNN) have achieved much success in segmenting volumetric (3D) images when a large number of fully annotated 3D samples are available. However, rarely a volumetric medical image dataset containing a sufficient number of segmented 3D images is accessible since providing manual segmentation masks is monotonous and time-consuming. Thus, to alleviate the burden of manual annotation, we attempt to effectively train a 3D CNN using a sparse annotation where ground truth on just one 2D slice of the axial axis of each training 3D image is available. To tackle this problem, we propose...