Sharif Digital Repository / Sharif University of Technology / Search result

Fault Detection and Smart Monitoring of Industrial Fans Based on Vibration Signals

, M.Sc. Thesis Sharif University of Technology Moeeni, Hamed (Author) ; Manzuri Shalmani, Mohammad Taghi (Supervisor)

Abstract

Data Oriented Smart Monitoring for Industrial Machineries include approaches for fault detection and prognosis which only rely on non-stationary signals sampled from sensors and do not rely on physical model of machineries nor expert knowledge. Fault detection is task of determining state of machinery in present moment using past data. But in Prognosis focus is on predicting future state of machinery using past data. Most researches in this category are based on supervised algorithms, but in many applications labeling data is expensive. In this thesis some approaches for semi-superviseddiagnosis, based on markov random walk an K-NN have been implemented, also some improvements for K-NN have...

محتواي کتاب

An Efficient semi-supervised multi-label classifier capable of handling missing labels

, Article IEEE Transactions on Knowledge and Data Engineering ; Volume 31, Issue 2 , 2019 , Pages 229-242 ; 10414347 (ISSN) Hosseini Akbarnejad, A ; Soleymani Baghshah, M ; Sharif University of Technology

IEEE Computer Society 2019

Abstract

Multi-label classification has received considerable interest in recent years. Multi-label classifiers usually need to address many issues including: handling large-scale datasets with many instances and a large set of labels, compensating missing label assignments in the training set, considering correlations between labels, as well as exploiting unlabeled data to improve prediction performance. To tackle datasets with a large set of labels, embedding-based methods represent the label assignments in a low-dimensional space. Many state-of-the-art embedding-based methods use a linear dimensionality reduction to map the label assignments to a low-dimensional space. However, by doing so, these...

Network-based direction of movement prediction in financial markets

, Article Engineering Applications of Artificial Intelligence ; Volume 88 , February , 2020 Kia, A. N ; Haratizadeh, S ; Shouraki, S. B ; Sharif University of Technology

Elsevier Ltd 2020

Abstract

Market prediction has been an important research problem for decades. Having better predictive models that are both more accurate and faster has been attractive for both researchers and traders. Among many approaches, semi-supervised graph-based prediction has been used as a solution in recent researches. Based on this approach, we present two prediction models. In the first model, a new network structure is introduced that can capture more information about markets’ direction of movements compared to the previous state of the art methods. Based on this novel network, a new algorithm for semi-supervised label propagation is designed that is able to prediction the direction of movement faster...

Regularization from the Machine Learning Point of View

, M.Sc. Thesis Sharif University of Technology Ghaemi, Mohammad Sajjad (Author) ; Daneshgar, Amir (Supervisor)

Abstract

In traditional machine learning approaches to classification, one uses only a labeled set to train the classifier. Labeled instances however are often difficult, expensive, or time consuming to obtain, as they require the efforts of experienced human annotators. Meanwhile unlabeled data may be relatively easy to collect, but there has been few ways to use them. Semi-supervised learning addresses this problem by using large amount of unlabeled data, together with the labeled data, to build better classifiers. Because semi-supervised learning requires less human effort and gives higher accuracy.Formally, this intuition corresponds to estimating a label function f on the graph so that it...

محتواي پايان نامه

Self-Supervised Image Representation Learning

, M.Sc. Thesis Sharif University of Technology Aghababazadeh, Arash (Author) ; Kasaei, Shohreh (Supervisor)

Abstract

Self-supervied learning is a method to reduce the need for large labeled datasets in supervised learning. In self-supervised learning, the goal is to design a pretext task that can be trained without any labels. This pretext task results in learning a representation of data that can reduce the need for labels when used for different tasks. In the domain of images, data augmenting transformations which are often a composition of simple transformations such as random cropping and color jitter have been used for the design of pretext tasks. These simple transformations can cause information loss in some datasets which limits the usage of the learned representations for various downstream tasks....

محتواي کتاب

Manifold coarse graining for online semi-supervised learning

, Article Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 5 September 2011 through 9 September 2011 ; Volume 6911 LNAI, Issue PART 1 , September , 2011 , Pages 391-406 ; 03029743 (ISSN) ; 9783642237799 (ISBN) Farajtabar, M ; Shaban, A ; Rabiee, H. R ; Rohban, M. H ; Sharif University of Technology

2011

Abstract

When the number of labeled data is not sufficient, Semi-Supervised Learning (SSL) methods utilize unlabeled data to enhance classification. Recently, many SSL methods have been developed based on the manifold assumption in a batch mode. However, when data arrive sequentially and in large quantities, both computation and storage limitations become a bottleneck. In this paper, we present a new semi-supervised coarse graining (CG) algorithm to reduce the required number of data points for preserving the manifold structure. First, an equivalent formulation of Label Propagation (LP) is derived. Then a novel spectral view of the Harmonic Solution (HS) is proposed. Finally an algorithm to reduce...

Incremental evolving domain adaptation

, Article IEEE Transactions on Knowledge and Data Engineering ; Volume 28, Issue 8 , 2016 , Pages 2128-2141 ; 10414347 (ISSN) Bitarafan, A ; Soleymani Baghshah, M ; Gheisari, M ; Sharif University of Technology

IEEE Computer Society

Abstract

Almost all of the existing domain adaptation methods assume that all test data belong to a single stationary target distribution. However, in many real world applications, data arrive sequentially and the data distribution is continuously evolving. In this paper, we tackle the problem of adaptation to a continuously evolving target domain that has been recently introduced. We assume that the available data for the source domain are labeled but the examples of the target domain can be unlabeled and arrive sequentially. Moreover, the distribution of the target domain can evolve continuously over time. We propose the Evolving Domain Adaptation (EDA) method that first finds a new feature space...

Semi-supervised Learning and its Application to Image Categorization

, M.Sc. Thesis Sharif University of Technology Farajtabar, Mehrdad (Author) ; Rabiee, Hamid Reza (Supervisor)

Abstract

Traditional methods for data classiﬁcation only make use of the labeled data. However, in most of the applications, labeling the unlabeled data is expensive, time consuming and requires expert knowledge. To overcome these problems, Semi-supervised Learning (SSL) methods have become an area of recent research that aim to eﬀectively addressing the problem of limited labeled data.One of the recently introduced SSL methods is the classiﬁcation based on geometric structure of the data, namely the data manifold. In this approach unlabeled data is utilized to recover the underlying structure of the data. The common assumption is that despite of being represented in a high dimensional space, data...

محتواي پايان نامه

Automatic image annotation using semi-supervised generative modeling

, Article Pattern Recognition ; Volume 48, Issue 1 , January , 2015 , Pages 174-188 ; 00313203 (ISSN) Amiri, S. H ; Jamzad, M ; Sharif University of Technology

Elsevier Ltd 2015

Abstract

Image annotation approaches need an annotated dataset to learn a model for the relation between images and words. Unfortunately, preparing a labeled dataset is highly time consuming and expensive. In this work, we describe the development of an annotation system in semi-supervised learning framework which by incorporating unlabeled images into training phase reduces the system demand to labeled images. Our approach constructs a generative model for each semantic class in two main steps. First, based on Gamma distribution, a generative model is constructed for each semantic class using labeled images in that class. The second step incorporates the unlabeled images by using a modified EM...

An ensemble of cluster-based classifiers for semi-supervised classification of non-stationary data streams

, Article Knowledge and Information Systems ; Volume 46, Issue 3 , 2016 , Pages 567-597 ; 02191377 (ISSN) Hosseini, M. J ; Gholipour, A ; Beigy, H ; Sharif University of Technology

Springer-Verlag London Ltd

Abstract

Recent advances in storage and processing have provided the possibility of automatic gathering of information, which in turn leads to fast and continuous flows of data. The data which are produced and stored in this way are called data streams. Data streams are produced in large size, and much dynamism and have some unique properties which make them applicable to model many real data mining applications. The main challenge of streaming data is the occurrence of concept drift. In addition, regarding the costs of labeling of instances, it is often assumed that only a small fraction of instances are labeled. In this paper, we propose an ensemble algorithm to classify instances of non-stationary...

Transductive multi-label learning from missing data using smoothed rank function

, Article Pattern Analysis and Applications ; Volume 23, Issue 3 , 2020 , Pages 1225-1233 Esmaeili, A ; Behdin, K ; Fakharian, M. A ; Marvasti, F ; Sharif University of Technology

Springer 2020

Abstract

In this paper, we propose two new algorithms for transductive multi-label learning from missing data. In transductive matrix completion (MC), the challenge is prediction while the data matrix is partially observed. The joint MC and prediction tasks are addressed simultaneously to enhance accuracy in comparison with separate tackling of each. In this setting, the labels to be predicted are modeled as missing entries inside a stacked matrix along the feature-instance data. Assuming the data matrix is of low rank, we propose a new recommendation method for transductive MC by posing the problem as a minimization of the smoothed rank function with non-affine constraints, rather than its convex...

The ensemble approach in comparison with the diverse feature selection techniques for estimating NPPs parameters using the different learning algorithms of the feed-forward neural network

, Article Nuclear Engineering and Technology ; Volume 53, Issue 12 , 2021 , Pages 3944-3951 ; 17385733 (ISSN) Moshkbar Bakhshayesh, K ; Sharif University of Technology

Korean Nuclear Society 2021

Abstract

Several reasons such as no free lunch theorem indicate that there is not a universal Feature selection (FS) technique that outperforms other ones. Moreover, some approaches such as using synthetic dataset, in presence of large number of FS techniques, are very tedious and time consuming task. In this study to tackle the issue of dependency of estimation accuracy on the selected FS technique, a methodology based on the heterogeneous ensemble is proposed. The performance of the major learning algorithms of neural network (i.e. the FFNN-BR, the FFNN-LM) in combination with the diverse FS techniques (i.e. the NCA, the F-test, the Kendall's tau, the Pearson, the Spearman, and the Relief) and...

Exploiting multiview properties in semi-supervised video classification

, Article 2012 6th International Symposium on Telecommunications, IST 2012 ; 2012 , Pages 837-842 ; 9781467320733 (ISBN) Karimian, M ; Tavassolipour, M ; Kasaei, S ; Sharif University of Technology

Abstract

In large databases, availability of labeled training data is mostly prohibitive in classification. Semi-supervised algorithms are employed to tackle the lack of labeled training data problem. Video databases are the epitome for such a scenario; that is why semi-supervised learning has found its niche in it. Graph-based methods are a promising platform for semi-supervised video classification. Based on the multiview characteristic of video data, different features have been proposed (such as SIFT, STIP and MFCC) which can be utilized to build a graph. In this paper, we have proposed a new classification method which fuses the results of manifold regularization over different graphs. Our...

Isograph: Neighbourhood graph construction based on geodesic distance for semi-supervised learning

, Article Proceedings - IEEE International Conference on Data Mining, ICDM, 11 December 2011 through 14 December 2011 ; December , 2011 , Pages 191-200 ; 15504786 (ISSN) ; 9780769544083 (ISBN) Ghazvininejad, M ; Mahdieh, M ; Rabiee, H. R ; Roshan, P. K ; Rohban, M. H ; Sharif University of Technology

2011

Abstract

Semi-supervised learning based on manifolds has been the focus of extensive research in recent years. Convenient neighbourhood graph construction is a key component of a successful semi-supervised classification method. Previous graph construction methods fail when there are pairs of data points that have small Euclidean distance, but are far apart over the manifold. To overcome this problem, we start with an arbitrary neighbourhood graph and iteratively update the edge weights by using the estimates of the geodesic distances between points. Moreover, we provide theoretical bounds on the values of estimated geodesic distances. Experimental results on real-world data show significant...

Combining Supervised and Semi-Supervised Learning in the Design of a New Identifier for NPPs Transients

, Article IEEE Transactions on Nuclear Science ; Volume 63, Issue 3 , 2016 , Pages 1882-1888 ; 00189499 (ISSN) Moshkbar Bakhshayesh, K ; Ghofrani, M. B ; Sharif University of Technology

Institute of Electrical and Electronics Engineers Inc 2016

Abstract

This study introduces a new identifier for nuclear power plants (NPPs) transients. The proposed identifier performs its function in two steps. First, the transient is identified by the previously developed supervised classifier combining ARIMA model and EBP algorithm. In the second step, the patterns of unknown transients are fed to the identifier based on the semi-supervised learning (SSL). The transductive support vector machine (TSVM) as a semi-supervised algorithm is trained by the labeled data of transients to predict some unlabeled data. The labeled and newly predicted data is then used to train the TSVM for another portion of unlabeled data. Training and prediction is continued until...

Leveraging multi-modal fusion for graph-based image annotation

, Article Journal of Visual Communication and Image Representation ; Volume 55 , 2018 , Pages 816-828 ; 10473203 (ISSN) Amiri, S. H ; Jamzad, M ; Sharif University of Technology

Academic Press Inc 2018

Abstract

Considering each of the visual features as one modality in image annotation task, efficient fusion of different modalities is essential in graph-based learning. Traditional graph-based methods consider one node for each image and combine its visual features into a single descriptor before constructing the graph. In this paper, we propose an approach that constructs a subgraph for each modality in such a way that edges of subgraph are determined using a search-based approach that handles class-imbalance challenge in the annotation datasets. Multiple subgraphs are then connected to each other to have a supergraph. This follows by introducing a learning framework to infer the tags of...

Video Classification Usinig Semi-supervised Learning Methods

, M.Sc. Thesis Sharif University of Technology Karimian, Mahmood (Author) ; Kasaei, Shohreh (Supervisor)

Abstract

In large databases, availability of labeled training data is mostly prohibitive in classification. Semi-supervised algorithms are employed to tackle the lack of labeled training data problem. Video databases are the epitome for such a scenario; that is why semi-supervised learning has found its niche in it. Graph-based methods are a promising platform for semi-supervised video classification. Based on the multiview characteristic of video data, different features have been proposed (such as SIFT, STIP and MFCC) which can be utilized to build a graph. In this project, we have proposed a new classification method which fuses the results of manifold regularization over different graphs. Our...

محتواي پايان نامه

Application of Semi-Supervised Learning in Image Processing

, M.Sc. Thesis Sharif University of Technology Mianjy, Poorya (Author) ; Rabiee, Hamidreza (Supervisor)

Abstract

In recent years, the emergence of semi-supervised learning methods has broadened the scope of machine learning, especially for pattern classification. Besides obviating the need for experts to label the data, efficient use of unlabeled data causes a significant improvement in supervised learning methods in many applications. With the advent of statistical learning theory in the late 80's, and the emergence of the concept of regularization, kernel learning has always been in deep concentration. In recent years, semi-supervised kernel learning, which is a combination of the two above-mentioned viewpoints, has been considered greatly.
Large number of dimensions of the input data along with...

محتواي پايان نامه

Molecular Property Prediction Using a Graph based Deep Learning Method

, M.Sc. Thesis Sharif University of Technology Shahcheraghi, Shamim (Author) ; Hossein Khalaj, Babak (Supervisor) ; Soleymani, Mahdieh (Supervisor)

Abstract

The goal of drug design is to identify new molecules with a set of desirable properties. The molecular search space is large, discrete, and unstructured, which results in a prolonged construction and testing process of new compounds and requires significant costs. Furthermore, there is a wide variety of appealing options to choose from. Recent advances in the field of machine learning have led to the emergence of generative models that, after training on real examples, can suggest suitable molecules with less time and cost. One of the stages that should be considered in the path of drug production is predicting the properties of the chemical molecule and its effect on the desired protein. By...

محتواي کتاب

Efficient kernel learning from constraints and unlabeled data

, Article Proceedings - International Conference on Pattern Recognition, 23 August 2010 through 26 August 2010, Istanbul ; 2010 , Pages 3364-3367 ; 10514651 (ISSN) ; 9780769541099 (ISBN) Soleymani Baghshah, M ; Bagheri Shouraki, S ; Sharif University of Technology

2010

Abstract

Recently, distance metric learning has been received an increasing attention and found as a powerful approach for semi-supervised learning tasks. In the last few years, several methods have been proposed for metric learning when must-link and/or cannot-link constraints as supervisory information are available. Although many of these methods learn global Mahalanobis metrics, some recently introduced methods have tried to learn more flexible distance metrics using a kernel-based approach. In this paper, we consider the problem of kernel learning from both pairwise constraints and unlabeled data. We propose a method that adapts a flexible distance metric via learning a nonparametric kernel...