Loading...
Search for: open-reading-frame
0.005 seconds

    The performances of the chi-square test and complexity measures for signal recognition in biological sequences

    , Article Journal of Theoretical Biology ; Volume 251, Issue 2 , 2008 , Pages 380-387 ; 00225193 (ISSN) Pirhaji, L ; Kargar, M ; Sheari, A ; Poormohammadi, H ; Sadeghi, M ; Pezeshk, H ; Eslahchi, C ; Sharif University of Technology
    2008
    Abstract
    With large amounts of experimental data, modern molecular biology needs appropriate methods to deal with biological sequences. In this work, we apply a statistical method (Pearson's chi-square test) to recognize the signals appear in the whole genome of the Escherichia coli. To show the effectiveness of the method, we compare the Pearson's chi-square test with linguistic complexity on the complete genome of E. coli. The results suggest that Pearson's chi-square test is an efficient method for distinguishing genes (coding regions) form pseudogenes (noncoding regions). On the other hand, the performance of the linguistic complexity is much lower than the chi-square test method. We also use the... 

    A novel pattern matching algorithm for genomic patterns related to protein motifs

    , Article Journal of Bioinformatics and Computational Biology ; Volume 18, Issue 1 , 2020 Foroughmand Araabi, M. H ; Goliaei, S ; Goliaei, B ; Sharif University of Technology
    World Scientific Publishing Co. Pte Ltd  2020
    Abstract
    Patterns on proteins and genomic sequences are vastly analyzed, extracted and collected in databases. Although protein patterns originate from genomic coding regions, very few works have directly or indirectly dealt with coding region patterns induced from protein patterns. Results: In this paper, we have defined a new genomic pattern structure suitable for representing induced patterns from proteins. The provided pattern structure, which is called "Consecutive Positions Scoring Matrix (CPSSM)", is a replacement for protein patterns and profiles in the genomic context. CPSSMs can be identified, discovered, and searched in genomes. Then, we have presented a novel pattern matching algorithm...