Trisha Shetty (Editor)

Repeated sequence (DNA)

Updated on
Edit
Like
Comment
Share on FacebookTweet on TwitterShare on LinkedInShare on Reddit

Repeated sequences (also known as repetitive elements, or repeats) are patterns of nucleic acids (DNA or RNA) that occur in multiple copies throughout the genome. Repetitive DNA was first detected because of its rapid reassociation kinetics.

Contents

In many organisms, a significant fraction of the genomic DNA is highly repetitive, with over two-thirds of the sequence consisting of repetitive elements in human.

Repetitive elements found in genomes fall into different classes, depending on their mode of multiplication and/or structure. The disposition of repetitive elements consists either in arrays of tandemly repeated sequences, or in repeats dispersed throughout the genome (see below).

Functions

Debates regarding the potential functions of these elements have been long standing. Controversial references to ‘junk’ or ‘selfish’ DNA were put forward early on, implying that repetitive DNA segments are remainders from past evolution or autonomous self-replicating sequences hacking the cell machinery to proliferate. Originally discovered by Barbara McClintock, dispersed repeats have been increasingly recognized as a potential source of genetic variation and regulation. Together with these regulatory roles, a structural role of repeated DNA in shaping the 3D folding of genomes has also been proposed. This hypothesis is only supported by a limited set of experimental evidence. For instance in human, mouse and fly, several classes of repetitive elements present a high tendency for co-localization within the nuclear space, suggesting that DNA repeats positions can be used by the cell as a genome folding map.

Main types

There are 3 major categories of repeated sequence or repeats:

  • Terminal repeats
  • Tandem repeats: copies which lie adjacent to each other, either directly or inverted
  • Satellite DNA - typically found in centromeres and heterochromatin
  • Minisatellite - repeat units from about 10 to 60 base pairs, found in many places in the genome, including the centromeres
  • Microsatellite - repeat units of less than 10 base pairs; this includes telomeres, which typically have 6 to 8 base pair repeat units
  • Interspersed repeats (aka. interspersed nuclear elements)
  • Transposable elements
  • DNA transposons
  • retrotransposons
  • LTR-retrotranposons (HERVs)
  • non LTR-retrotranposons
  • SINEs (Short Interspersed Nuclear Elements)
  • LINEs (Long Interspersed Nuclear Elements)
  • SVAs
  • In primates, the majority of LINEs are LINE-1 and the majority of SINEs are Alu's. SVAs are hominoid specific.

    In prokaryotes, CRISPR are arrays of alternating repeats and spacers.

    Other types

    Note: The following are covered in detail in "Computing for Comparative Microbial Genomics".

  • Direct repeats
  • Global direct repeat
  • Local direct simple repeats
  • Local direct repeats
  • Local direct repeats with spacer
  • Inverted repeats
  • Global inverted repeat
  • Local inverted repeat
  • Inverted repeat with spacer
  • Palindromic repeat
  • Mirror and everted repeats
  • References

    Repeated sequence (DNA) Wikipedia