Requirement or exclusion of inverted repeat sequences with cruciform-forming potential in Escherichia coli revealed by genome-wide analyses

Osamu Miura, Toshihiro Ogake, Takashi Ohyama

    Research output: Article

    1 Citation (Scopus)

    Abstract

    Inverted repeat (IR) sequences are DNA sequences that read the same from 5′ to 3′ in each strand. Some IRs can form cruciforms under the stress of negative supercoiling, and these IRs are widely found in genomes. However, their biological significance remains unclear. The aim of the current study is to explore this issue further. We constructed the first Escherichia coli genome-wide comprehensive map of IRs with cruciform-forming potential. Based on the map, we performed detailed and quantitative analyses. Here, we report that IRs with cruciform-forming potential are statistically enriched in the following five regions: the adjacent regions downstream of the stop codon-coding sites (referred to as the stop codons), on and around the positions corresponding to mRNA ends (referred to as the gene ends), ~20 to ~45 bp upstream of the start codon-coding sites (referred to as the start codons) within the 5′-UTR (untranslated region), ~25 to ~60 bp downstream of the start codons, and promoter regions. For the adjacent regions downstream of the stop codons and on and around the gene ends, most of the IRs with a repeat unit length of ≥ 8 bp and a spacer size of ≤ 8 bp were parts of the intrinsic terminators, regardless of the location, and presumably used for Rho-independent transcription termination. In contrast, fewer IRs were present in the small region preceding the start codons. In E. coli, IRs with cruciform-forming potential are actively placed or excluded in the regulatory regions for the initiation and termination of transcription and translation, indicating their deep involvement or influence in these processes.
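To make the abstract's definition concrete, the following is a minimal Python sketch of how a perfect inverted repeat could be detected: a left arm followed, after a short spacer, by its own reverse complement. This is not the authors' mapping procedure, which the abstract does not describe. The function names and the fixed-arm scanning strategy are assumptions made for illustration, and the default thresholds (arm of 8 bp, spacer of at most 8 bp) are borrowed from the terminator-associated IRs mentioned in the abstract, not from the study's search criteria.

```python
# Illustrative sketch only; hypothetical helper names, not the authors' pipeline.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence (A/C/G/T only)."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def find_inverted_repeats(seq: str, arm: int = 8, max_spacer: int = 8):
    """Return (start, arm_length, spacer_length) for each perfect IR.

    A hit means seq[start:start+arm] is followed, after `spacer`
    bases, by its exact reverse complement. Arms are not extended
    beyond `arm` bases, and mismatches are not tolerated.
    """
    seq = seq.upper()
    hits = []
    for start in range(len(seq) - 2 * arm + 1):
        target = reverse_complement(seq[start:start + arm])
        for spacer in range(max_spacer + 1):
            right = start + arm + spacer
            # A slice running past the end is shorter than `arm`
            # and therefore can never equal `target`.
            if seq[right:right + arm] == target:
                hits.append((start, arm, spacer))
    return hits


if __name__ == "__main__":
    # Toy sequence: 8-bp arm, 3-bp spacer, reverse-complement arm.
    toy = "AAAACCCC" + "ATA" + "GGGGTTTT"
    print(find_inverted_repeats(toy))
```

On the toy sequence the scan reports a single hit at position 0 with a 3-bp spacer. A genome-scale analysis would additionally need arm extension, handling of both strands' annotations, and a statistical background model, none of which this sketch attempts.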

    Original language: English
    Pages (from-to): 1-14
    Number of pages: 14
    Journal: Current Genetics
    DOI: 10.1007/s00294-018-0815-y
    Publication status: Accepted/In press - 27 Feb 2018

    Fingerprint

    Inverted Repeat Sequences
    Initiator Codon
    Terminator Codon
    Genome
    Escherichia coli
    5' Untranslated Regions
    Nucleic Acid Regulatory Sequences
    Genetic Promoter Regions
    Genes
    Messenger RNA

    ASJC Scopus subject areas

    • Genetics

    Cite this

    @article{c6eea78fff1c4ee592bfe6e2ee36a655,
    title = "Requirement or exclusion of inverted repeat sequences with cruciform-forming potential in Escherichia coli revealed by genome-wide analyses",
    abstract = "Inverted repeat (IR) sequences are DNA sequences that read the same from 5′ to 3′ in each strand. Some IRs can form cruciforms under the stress of negative supercoiling, and these IRs are widely found in genomes. However, their biological significance remains unclear. The aim of the current study is to explore this issue further. We constructed the first Escherichia coli genome-wide comprehensive map of IRs with cruciform-forming potential. Based on the map, we performed detailed and quantitative analyses. Here, we report that IRs with cruciform-forming potential are statistically enriched in the following five regions: the adjacent regions downstream of the stop codon-coding sites (referred to as the stop codons), on and around the positions corresponding to mRNA ends (referred to as the gene ends), ~ 20 to ~45 bp upstream of the start codon-coding sites (referred to as the start codons) within the 5′-UTR (untranslated region), ~ 25 to ~ 60 bp downstream of the start codons, and promoter regions. For the adjacent regions downstream of the stop codons and on and around the gene ends, most of the IRs with a repeat unit length of ≥ 8 bp and a spacer size of ≤ 8 bp were parts of the intrinsic terminators, regardless of the location, and presumably used for Rho-independent transcription termination. In contrast, fewer IRs were present in the small region preceding the start codons. In E. coli, IRs with cruciform-forming potential are actively placed or excluded in the regulatory regions for the initiation and termination of transcription and translation, indicating their deep involvement or influence in these processes.",
    keywords = "Cruciform, E. coli, Genome-wide distribution, Intrinsic terminator, Inverted repeat (IR) sequence",
    author = "Osamu Miura and Toshihiro Ogake and Takashi Ohyama",
    year = "2018",
    month = "2",
    day = "27",
    doi = "10.1007/s00294-018-0815-y",
    language = "English",
    pages = "1--14",
    journal = "Current Genetics",
    issn = "0172-8083",
    publisher = "Springer Verlag",

    }

    TY - JOUR

    T1 - Requirement or exclusion of inverted repeat sequences with cruciform-forming potential in Escherichia coli revealed by genome-wide analyses

    AU - Miura, Osamu

    AU - Ogake, Toshihiro

    AU - Ohyama, Takashi

    PY - 2018/2/27

    Y1 - 2018/2/27

    AB - Inverted repeat (IR) sequences are DNA sequences that read the same from 5′ to 3′ in each strand. Some IRs can form cruciforms under the stress of negative supercoiling, and these IRs are widely found in genomes. However, their biological significance remains unclear. The aim of the current study is to explore this issue further. We constructed the first Escherichia coli genome-wide comprehensive map of IRs with cruciform-forming potential. Based on the map, we performed detailed and quantitative analyses. Here, we report that IRs with cruciform-forming potential are statistically enriched in the following five regions: the adjacent regions downstream of the stop codon-coding sites (referred to as the stop codons), on and around the positions corresponding to mRNA ends (referred to as the gene ends), ~ 20 to ~45 bp upstream of the start codon-coding sites (referred to as the start codons) within the 5′-UTR (untranslated region), ~ 25 to ~ 60 bp downstream of the start codons, and promoter regions. For the adjacent regions downstream of the stop codons and on and around the gene ends, most of the IRs with a repeat unit length of ≥ 8 bp and a spacer size of ≤ 8 bp were parts of the intrinsic terminators, regardless of the location, and presumably used for Rho-independent transcription termination. In contrast, fewer IRs were present in the small region preceding the start codons. In E. coli, IRs with cruciform-forming potential are actively placed or excluded in the regulatory regions for the initiation and termination of transcription and translation, indicating their deep involvement or influence in these processes.

    KW - Cruciform

    KW - E. coli

    KW - Genome-wide distribution

    KW - Intrinsic terminator

    KW - Inverted repeat (IR) sequence

    UR - http://www.scopus.com/inward/record.url?scp=85042547386&partnerID=8YFLogxK

    UR - http://www.scopus.com/inward/citedby.url?scp=85042547386&partnerID=8YFLogxK

    U2 - 10.1007/s00294-018-0815-y

    DO - 10.1007/s00294-018-0815-y

    M3 - Article

    C2 - 29484452

    AN - SCOPUS:85042547386

    SP - 1

    EP - 14

    JO - Current Genetics

    JF - Current Genetics

    SN - 0172-8083

    ER -