Search for: gene-expression
Total 123 records

    A Semi-Supervised Algorithms for Clustering Microarray Data

    , M.Sc. Thesis Sharif University of Technology Eslamzadeh, Habibollah (Author) ; Mahdavi Amiri, Nezamoddin (Supervisor) ; Madadkar Sobhani, Armin (Supervisor)
    Abstract
    A microarray, also known as a biochip, is a flat glass substrate about 1 × 1 cm in size on which a large number of biosensors are placed in an array format. DNA microarrays are used to measure the expression levels of thousands of genes. Repeating these experiments under different conditions can reveal patterns of expression. After preparation, the fluorescent sample is hybridized with the sensors on the microarray surface, and the fluorescence intensities of the spots are measured by a special CCD camera. The resulting images are examined by a computer, and the spot intensities are converted into numerical data by image processing algorithms. Putting these numbers into matrices of size m×n is... 

    Using Statistical Pattern Recognition on Gene Expression Data for Prediction of Cancer

    , M.Sc. Thesis Sharif University of Technology Hajiloo, Mohsen (Author) ; Rabiee, Hamid Reza (Supervisor)
    Abstract
    The classification of different tumor types is of great importance in cancer diagnosis and drug discovery. However, most previous cancer classification studies are clinically based and have limited diagnostic ability. Gene expression data is known to contain the keys for addressing the fundamental problems relating to cancer diagnosis. The recent advent of the DNA microarray technique has made simultaneous monitoring of thousands of gene expressions possible. With this abundance of gene expression data, researchers have started to explore the possibilities of cancer classification using gene expression data, and quite a number of pattern recognition approaches have been... 

    Using Transductive Learning Classification in Bioinformatics

    , M.Sc. Thesis Sharif University of Technology Tajari, Hossein (Author) ; Beigy, Hamid (Supervisor)
    Abstract
    Classification is one of the most important problems in machine learning. Reliable and successful classification is essential for diagnosing patients for further treatment. In many applications, such as bioinformatics, unlabeled data is abundant and readily available, whereas labeled data is much more difficult and expensive to obtain. This dissertation presents a novel transductive approach for the development of robust microarray data classification. The transduction problem is to estimate the value of the classification function at the given points in the working set. This contrasts with the standard inductive learning problem of estimating the classification function at all possible values and... 

    Prediction of DNA/RNA Sequence Binding Site to Protein with the Ability to Implement on GPU

    , M.Sc. Thesis Sharif University of Technology Fatemeh Tabatabaei (Author) ; Koohi, Sommaye (Supervisor)
    Abstract
    Given the importance of DNA/RNA-binding proteins in different cellular processes, finding their binding sites plays a crucial role in many applications, such as drug and vaccine design, protein design, and cancer control. Many studies target this issue and try to improve prediction accuracy with three strategies: complex neural-network structures, various types of inputs, and ML methods to extract input features. However, due to the growing volume of sequences, these methods face serious processing challenges. This paper therefore presents KDeep, which is based on CNN-LSTM and takes the primary form of DNA/RNA sequences as input. As the key feature improving the prediction accuracy, we propose a new encoding... 

    Isoform Function Prediction Using Deep Neural Network

    , M.Sc. Thesis Sharif University of Technology Ghazanfari, Sara (Author) ; Motahari, Abolfazl (Supervisor) ; Soleymani, Mahdieh (Supervisor)
    Abstract
    Isoforms are mRNAs that are produced from the same gene site through the phenomenon called alternative splicing. Studies have shown that more than 95% of multiexon genes in humans undergo alternative splicing. Although the changes in the mRNA sequence are few, they may have a systematic effect on cell function and regulation. It is widely reported that isoforms of a gene have distinct or even contrasting functions. Most studies have shown that alternative splicing plays a significant role in human health and disease. Despite the wide range of gene function studies, there is little information about isoforms’ functionalities. Recently, some computational methods based on Multiple Instance... 

    Distributed Processing of Next Generation Sequencing Data Set

    , M.Sc. Thesis Sharif University of Technology Hadadian Nejad Yousefi, Mostafa (Author) ; Goudarzi, Maziar (Supervisor) ; Motahari, Abolfazl (Supervisor)
    Abstract
    DNA analysis plays a significant role in fields such as pharmacy, agriculture, genealogy, and forensics. Next generation sequencing datasets cover a gene several times due to the large number of reads. Therefore, the initial data volume is several times the amount of memory required to store the DNA strand. First, the DNA sequence of a sample must be assembled from the primary data, and then the differences are found by comparing the sample DNA sequence with the reference DNA sequence. From these differences, one can extract the characteristics of the tested species. The extracted properties are valuable for genetics researchers. For example, they can produce drugs that are... 

    Bayesian Filtering Approach to Improve Gene Regulatory Networks Inference Using Gene Expression Time Series

    , M.Sc. Thesis Sharif University of Technology Fouladi, Ramouna (Author) ; Fatemizadeh, Emadoddin (Supervisor) ; Arab, Shahriar (Co-Advisor)
    Abstract
    Modeling gene regulation in different species is one of the main aims of bioinformatics. Given the limitations of the available data and the perspectives that should be taken into account when modeling such networks, the methods proposed so far have not succeeded in yielding a comprehensive model. In one recent study, the gene regulation process is considered a nonlinear dynamic stochastic process and is described by state space equations. Extended Kalman filtering is then used to estimate the unknown parameters. In this thesis, first of all, gene complexes are taken into consideration instead of genes, and afterwards, Extended Kalman... 

    Modelling Cell's State in Different Cell Types

    , M.Sc. Thesis Sharif University of Technology Saberi, Amir Hossein (Author) ; Hossein Khalaj, Babak (Supervisor) ; Motahari, Abolfazl (Co-Supervisor)
    Abstract
    The heterogeneity of vital tissues in complex multicellular organisms such as mammals, and of fatal tissues such as cancer, on the one hand, and the limited access to the biological properties of their components on the other, make the study of these tissue traits one of the most interesting fields in bioinformatics. One of the hottest subjects in this field is the recognition of the functional components of these tissues using bulk data extracted from the whole tissue. Almost every method that aims to achieve this, particularly using gene expression data, assumes that all of the cell types that constitute the studied tissue have a deterministic expression profile. In this thesis we... 

    Modeling of Genetic Mutations Associated with Protein Pathway Common in Alzheimer, Parkinson and Macular Degeneration Diseases

    , M.Sc. Thesis Sharif University of Technology Ghahremani, Amin (Author) ; Jahed, Mehran (Supervisor) ; Hossein Khalaj, Babak (Supervisor) ; Shahpasand, Kourosh (Co-Supervisor)
    Abstract
    Extensive studies have been performed on the genetic variations involved in common neurodegenerative diseases such as Alzheimer's, macular degeneration, and Parkinson's. In most cases, no specific gene has been identified that points to a distinct pathogenic pathway; therefore, this study mainly aims to find genes common to the aforementioned diseases by determining a specific pathogenic protein pathway. In this study, we reached a deep understanding of the function of the nervous system and the discovery of the causative agents of the diseases by applying information from genome datasets in bioinformatics analysis. The utilized database comprises the classification of... 

    Identifying Core Genes in Estimation of Missing Gene Expressions

    , M.Sc. Thesis Sharif University of Technology Darvish Shafighi, Shadi (Author) ; Motahari, Abolfazl (Supervisor)
    Abstract
    Characterizing cellular states in response to various disease conditions is an important issue that is addressed by different methods such as large-scale gene expression profiling. One of the most important challenges facing bioinformaticians is missing data, because expression profiling is still very expensive. It is understood that profiling a group of selected genes could be enough to understand the entire gene expression profile. In this research, we propose a fast method for estimating the missing values in low-rank matrices. We consider the highly correlated expression profiles as a low-rank matrix. Then, we use this new method in a proposed algorithm which will select... 

    Identification of Driver Genes in Glioblastoma Based on Single-Cell Gene Expression Data Utilizing the Concept of Pseudotime and Phylogenetic Analysis

    , M.Sc. Thesis Sharif University of Technology Mirza Abolhassani, Fatemeh (Author) ; Foroughmand Aarabi, Mohammad Hadi (Supervisor) ; Kavousi, Kaveh (Co-Supervisor) ; Zare Mirakabad, Fatemeh (Co-Supervisor)
    Abstract
    Genetic heterogeneity within a tumor, which occurs during cancer evolution, is one of the reasons for treatment failure and increased chances of drug resistance. Cancer cells initially derive from a mutated progenitor cell, resulting in shared mutated genes. Throughout the course of tumor formation and progression, the occurrence of new mutations is possible, leading to the generation of cancer cells with various mutated genes. An appropriate approach is to identify the sequence of mutations that have occurred in the tumor, which can be inferred from single-cell sequencing data. Single-cell data provides valuable information about branching events in the evolution of a cancerous tumor. In... 

    Identifying Cancer-related Genes Via Network Feature Learning and Multi-Omics Data Integration

    , M.Sc. Thesis Sharif University of Technology Safari, Monireh (Author) ; Rabiee, Hamid Reza (Supervisor)
    Abstract
    The highly developed biological data collection methods enable scientists to capture protein-protein interaction (PPI) in the human body, which could be analyzed as biological networks such as protein-protein interaction networks. These networks reveal essential information about the biological process in human cells and can be used to identify genes associated with cancers. Effectively identifying disease-related genes would contribute to improving the treatment and diagnosis of various diseases. Current methods for identifying disease-related genes mainly focus on the hypothesis of guilt-by-association and do not consider the global information in the PPI network. Besides, most methods pay... 

    Detection and Estimation of Key Parameters in Traffic Models Using Data Mining Tools

    , M.Sc. Thesis Sharif University of Technology Moadab, Amir Hossein (Author) ; Khedmati, Majid (Supervisor)
    Abstract
    Nowadays, investigating the factors affecting traffic models from different aspects such as metropolitan planning according to the present conditions can help high-level decision-makers and also, at the micro-level, help the travelers to make appropriate decisions for scheduling affairs, route selection, and vehicle type selection. Given the importance of this topic, a framework will be presented in this study that will evaluate the impact of some identified factors such as travel distance, climate, and urban events, and then all these factors will be presented in mathematical formulas. In the end, based on the model, the travel time will be predicted. In this framework, gene expression... 

    Identification of the Set of Single Nucleotide Variants in Genome Responsible for the Differentiation of Expression of Genes

    , M.Sc. Thesis Sharif University of Technology Khatami, Mahshid (Author) ; Rabiee, Hamid Reza (Supervisor) ; Beigi, Hamid (Supervisor)
    Abstract
    Single nucleotide polymorphisms are changes caused by a mutation in a single nucleotide of the DNA sequence. They are the most common type of genetic variation. Some of these changes have little or no effect on cells, while others cause significant changes in the expression of cell genes that can lead to disease or resistance to certain diseases. Because of the importance of these changes and their effect on cell function, the relationships between these changes are also important. Over the past decade, thousands of disease-related single nucleotide polymorphisms have been identified in genome-related studies. Studies in this field have shown that the expression of... 

    Semi-supervised Breast Cancer Subtype Clustering Using Microarray Datasets

    , M.Sc. Thesis Sharif University of Technology Vasei, Hamed (Author) ; Motahhari, Abolfazl (Supervisor)
    Abstract
    Gene expression microarrays can be used for precision medicine and targeted therapies. The data generated by microarrays are high-dimensional, making statistical inference of any parameter a daunting task. In this thesis, it is shown that despite the high dimensionality of the datasets produced by microarrays, the inference can be robust in the sense that random selection of features results in the same conclusion as long as the number of selected features is chosen appropriately. Stratifying patients with breast cancer based on their gene expression levels shows that patient subtypes are almost independent of the feature selection strategy. Moreover, using less noisy datasets coming from RNAseq... 

    Estimation of Pressure Fluctuation Coefficient in Stilling Basins Using Computational Intelligent Models

    , M.Sc. Thesis Sharif University of Technology Mazandarani, Mahan (Author) ; Shamsai, Abolfazl (Supervisor)
    Abstract
    Hydraulic jump is a significant hydraulic phenomenon that occurs in stilling basins and causes energy dissipation of water flow. Due to the severe pressure fluctuations, cavitation, and fatigue damage to concrete materials, hydraulic jump can cause damage to the stilling basin and its related components. Therefore, studying pressure fluctuations is one of the essential topics in the safe design and operation of stilling basins. Due to the nonlinear relationship between the effective variables in the pressure fluctuation phenomenon, the use of computational intelligent models that can extract the relationship between the effective variables is necessary. In this study, laboratory data... 

    Single-Cell RNA-seq Dropout Imputation and Noise Reduction by Machine Learning

    , M.Sc. Thesis Sharif University of Technology Moinfar, Amir Ali (Author) ; Soleymani Baghshah, Mahdih (Supervisor) ; Sharifi Zarchi, Ali (Supervisor) ; Goodarzi, Hani (Co-Supervisor)
    Abstract
    Single-cell RNA sequencing (scRNA-seq) technologies have empowered us to study gene expression at single-cell resolution. These technologies are based on barcoding single cells and sequencing the transcriptome using next-generation sequencing technologies. Achieving this single-cell resolution is especially important when the target population is complex or heterogeneous, which is the case for most biological samples, including tissue samples and tumor biopsies. Single-cell technologies suffer from high amounts of noise and missing values, generally known as dropouts. This complexity can affect a number of key downstream analyses such as differential expression analysis,... 

    Drug Synergy Prediction on Diverse Cancer Cell-Lines Using Deep Learning

    , M.Sc. Thesis Sharif University of Technology Labbaf, Farzaneh (Author) ; Hossein Khalaj, Babak (Supervisor)
    Abstract
    Despite significant progress in cancer treatment, drug resistance remains a major challenge. Synergistic drug combinations offer a promising approach to overcome drug resistance and reduce side effects. Still, despite high-throughput testing technologies, existing drug combination databases suffer from biases and a lack of diversity in tested cancer cell lines, which challenges the prediction of drug response on novel cell targets. To address this critical need, we designed a two-level deep learning method that uses large-scale gene expression datasets to estimate the score and synergy of drug compounds on a wide variety of cancer cell lines. Our model includes an auto-encoder that trains on... 

    Analysis and Design of Single-cell RNA Sequencing Data Normalization Algorithms

    , M.Sc. Thesis Sharif University of Technology Mohseni, Sepideh (Author) ; Hossein Khalaj, Babak (Supervisor)
    Abstract
    Single-cell RNA sequencing (scRNA-seq) data provides more information about gene expression at the cellular level. However, because of the noise and sparsity in scRNA-seq data, its analysis faces obstacles. A global normalization approach cannot correctly resolve missing data that arise from technical variability, so it introduces bias and leads to incorrect conclusions about cell type. In this study, we review some models for scRNA-seq data imputation, explain a new method for filtering genes and clustering data, and use a matrix completion algorithm for data imputation.

    Analyzing Cancer Cell Identity and Appropriative Subnetworks using Machine Learning

    , M.Sc. Thesis Sharif University of Technology Saberi, Ali (Author) ; Rabiee, Hamid Reza (Supervisor) ; Sharifi Zarchi, Ali (Supervisor)
    Abstract
    Cancer has long threatened human health, and researchers have been grappling with the phenomenon for many years. In the annals of this struggle, cancer victims have outnumbered survivors to such an extent that, until recently, suffering from cancer was perceived to be equivalent to death. This persistent defeat against cancer stems from an incomplete understanding of the phenomenon. In recent years, with the advent of technologies to extract information from the heart of cells at the genome and transcriptome levels, researchers have been able to acquire a deeper understanding of cancer, its behavior and operation. Now that cancer is regarded as a genetic disease,...