- Svs Creative Composition Download Torrent 2017
- Svs Creative Composition Download Torrent Pc
- Creative Composition Orland Ca
The site does not give electronic versions of products, and is engaged only in a collecting and cataloguing of the references sent and published at a forum by our readers. 1337x is a search engine to find your favorite torrents. 1337x main domain is 1337x.to, if this domain is blocked for you try to use 1337x.is, for full list of alternative domains see about page. Watch your torrents without waiting. Convert files, play in HD & on the go. Protect your PC from viruses & malware. CyberGhost VPN: be anonymous online. $4.95 /annual. By clicking 'I accept', you accept use of our cookies, and by continuing to use our services.
It’s meant to eventually replace the original CC class. I couldn’t figure out how to watch it without paying $150, but it says you are supposed to be able to stream it with an active subscription. Download Adobe Creative Suite 5 Torrent at TorrentFunk. We have 206 Adobe Creative Suite 5 Other torrents for you! Aug 3, 2017 - How do I download paid Android apps for free other than torrents? You can make innovative and creative apps for your business. Additionally, your use of certain Services and Content may also be subject to. Ideas, opinions, photographs, images, video, musical compositions (including lyrics),. New Product Development; Editorial Services; Creative Packaging; Composition; Project Management; Creative Design; Portfolio. Bestsellers; Authentic/IBS; B&H Publishing Group. Apologetics Study Bible; Apologetics Study Bible for Students; Baker Publishing Group. A to Z Guide to Finding It in the Bible; Household Money Organizer; Life.
Abstract
The human genome is laden with both non-LTR (long-terminal repeat) retrotransposons and microsatellite repeats. Both types of sequences are able to, either actively or passively, mutagenize the genomes of human individuals and are therefore poised to dynamically alter the human genomic landscape across generations. Non-LTR retrotransposons, such as L1 and Alu, are a major source of new microsatellites, which are born both concurrently and subsequently to L1 and Alu integration into the genome. Likewise, the mutation dynamics of microsatellite repeats have a direct impact on the fitness of their non-LTR retrotransposon parent owing to microsatellite expansion and contraction. This review explores the interactions and dynamics between non-LTR retrotransposons and microsatellites in the context of genomic variation and evolution.
Introduction: Non-LTR Retrotransposons and Microsatellites
The human genome is laden with repetitive sequences in the form of transposable elements (TEs) and tandem repeats, which comprise 45% and 3% of the human genome, respectively. Using new methods capable of annotating repetitive sequences that have substantially diverged from known TE sequences, recent analyses indicate that up to 66–69% of the human genome may be composed of repetitive sequences, predominately in the form of TEs.
TEs, as their name implies, are able to move about in the genome and create interspersed repeats. TEs include DNA transposons, long-terminal repeat (LTR) retrotransposons and non-LTR retrotransposons. In this review, we will focus on non-LTR retrotransposons, which are the only active TEs in the human genome. Non-LTR retrotransposons include long interspersed elements (LINEs) and short interspersed elements (SINEs).- Both mobilize via a “copy and paste” mechanism, which requires the transcription of a donor element into an RNA intermediate and the subsequent reverse transcription and incorporation of the RNA intermediate into the genome as a new DNA copy (i.e., the retrotransposition process). Among them, LINE-1s (L1s) are autonomous; full-length L1s encode two proteins, ORF1 and ORF2, which are essential for their mobilization (Fig. 1A). In contrast, SINEs are non-autonomous and their mobilization relies on L1 proteins. There are two main classes of SINEs., The first class includes human Alu (Fig. 1B) and mouse B1, both of which are derived from 7SL RNA; the second class is typified by mouse B2 elements, which are derived from tRNAs. The quantity and mobility of non-LTR retrotransposons have made them important drivers of genomic diversity and evolution.,
Figure 1. Non-LTR retrotransposons are hotbeds for new microsatellites. (A) The locations of mononucleotide microsatellite seed sequences (i.e., 5–7 bp proto-microsatellites) are diagramed along the length of the consensus human L1 sequence. Five proto-microsatellites within the sequence are 7 units long, only one nucleotide below the threshold for mononucleotide microsatellites. A long poly(A) tail is highlighted at the 3′ end of the L1 element (in bold font; only 7 A’s shown due to space limitation). (B) The locations of 3–6 bp mononucleotide proto-microsatellites are diagramed along the length of the consensus Alu sequence. Unlike L1, Alu is G/C rich and only two poly(A) proto-microsatellites are found in the middle linker region between left and right monomers. Over time, the short proto-microsatellites may expand beyond the microsatellite threshold. A long poly(A) tail is highlighted at the 3′ end of the Alu element (in bold font; only 7 A’s shown due to space limitation). Note the 10-fold difference in scale between L1 and Alu elements.
In contrast to interspersed repeats, tandem repeats consist of sequences that are sequentially repeated. Tandem repeats are classified by the size of their repeating unit: those with a repeating unit length of 1–6 base pairs (bp) are commonly referred to as microsatellites (Fig. 2A).- However, the boundary between microsatellites and minisatellites is often disputed, where the latter typically refer to tandem repeats of greater unit sizes. As they are highly polymorphic, both microsatellites and minisatellites have been instrumental in genetic mapping, evolutionary and phylogenetic studies as well as DNA forensics. This review focuses on microsatellites, which encompass mono-, di-, tri-, tetra-, penta- and hexa-nucleotide repeats. Notably, mononucleotide A repeats (i.e., poly(A) DNA tracts) represent the most abundant microsatellite class with an overall length of 12 bp in the human genome. Due to the repeating nature of these DNA tracts, they are prone to rapid and variable contraction and expansion of repeat length (Fig. 2B).
Figure 2. The anatomy and lifecycle of a microsatellite. (A) Microsatellite anatomy and terminology. A mononucleotide microsatellite and a trinucleotide microsatellite are depicted. Each repeating unit is depicted as a box. The repeat unit length, the number of repeating units, and the overall length are noted for each microsatellite. Both microsatellites have the same number of repeating units, but the unit length and the overall length vary among them. (B) The birth and death lifecycle for microsatellites. Two pathways lead to microsatellite birth. The first involves LINE/SINE retrotransposition; a microsatellite, typically a mature poly(A) repeat, is born into the genome concurrently upon LINE/SINE integration. The second pathway involves birth from random sequences (not shown) and/or proto-microsatellites via base substitution, indel and replication slippage. When a poly(A) tract reaches 8–9 A’s, it is considered a mature microsatellite. As the length increases, length contractions outweigh expansions, or vice versa, forming an indefinite loop (thus, bypassing the path to death). Microsatellite sequences can also be interrupted by mutations. The interrupted microsatellites may recover and continue expanding (not shown) or may be further mutated and eventually die.
Collectively, the dynamics of these two entities are a major source of genomic variation and serve as the architectural framework for protein-coding genes. Recent work has revealed an intimate relationship between non-LTR retrotransposons and microsatellites. On one hand, non-LTR retrotransposons are a major source of new microsatellites as they give birth to microsatellites both concurrent and subsequent to their integration into the genome. On the other hand, the inherent instability of microsatellite repeats has a direct impact on the fitness of their non-LTR retrotransposon parent. This review will focus on the interactions and mutation dynamics between non-LTR retrotransposons and microsatellites as well as the variability and rates of mutation of microsatellite repeats that make them instigators of genomic evolution.
Non-LTR Retrotransposons Give Birth to Microsatellites
Relatively little attention has been paid to the evolutionary origins of microsatellites. It is proposed that each microsatellite locus possesses a lifecycle that begins with its “birth” into the genome and ends with its “death” (Fig. 2B). Most studies focus on the intermediate “growth” phase, a period that is punctuated by contractions and expansions, which dynamically shorten and lengthen the overall microsatellite length. The most commonly proposed mechanism for microsatellite contractions and expansions is replication slippage by DNA polymerases. The main factors involved in replication slippage will be discussed in the next section. Two other mutation mechanisms, base substitution and indel slippage, are also thought to play an important role in microsatellite birth and death., Overall, two distinct pathways have been postulated for microsatellite genesis (Fig. 2B). One is through de novo retrotransposition by non-LTR retrotransposons, while the other involves the birth of proto-microsatellites from random sequences., It is the goal of this review to integrate these two pathways and highlight the role of non-LTR retrotransposons in microsatellite genesis.
Non-LTR retrotransposons can give birth to microsatellites concurrently to their integration into the genome (i.e., the first pathway; Fig. 2B). The association between A-rich microsatellites and non-LTR retrotransposons has been repeatedly observed by many independent studies.- A recent analysis of microsatellites at orthologous loci in three primate genomes estimated that upon retrotransposition, Alus and L1s contribute to 36% of mono-, di, tri- and tetranucleotide microsatellites in human, chimpanzee and orangutan genomes. This pathway is best illustrated by the birth of poly(A) microsatellites. To mobilize, non-LTR retrotransposons require a poly(A) tail at the 3′ end of their sequence (discussed later)., Thus, a common feature of Alu and L1 insertions that have integrated into the genome is the presence of a 3′ poly(A) DNA tail (Fig. 1). These A-rich sequences are born into the genome as mature microsatellites, far above the threshold size (Fig. 3). In humans, 90% of these poly(A) repeats are derived from poly(A) tails of L1 and Alu sequences. Even this may be an underestimate because L1 can integrate a pure poly(A) DNA tract into the genome if the insertion is severely 5′ truncated; such events would typically be undetected. The abundance of L1 and Alu sequences in the primate genomes likely accounts for the overrepresentation of poly(A)/poly(T) repeats compared with poly(C)/poly(G) repeats. In contrast, dinucleotide microsatellites are most abundant in rodents, and AC repeats are the most frequent dinucleotide repeat in all vertebrates. Not surprisingly, 15% of AC repeats are derived from the AC-rich tails of B2 SINE elements found in the mouse genome.
Figure 3. Non-LTR retrotransposons can give birth to microsatellites both concurrently and subsequently to their integration in the genome. (A) Concurrent birth occurs via the integration of a long poly(A) tract, which is part of the Alu or L1 element. Additionally, the integrated element carries proto-microsatellites, depicted by the middle A stretch. Here, the waved line represents genomic DNA target, and the gray box represents the body of an Alu or L1 element. (B) Subsequent to integration, both the poly(A) tail microsatellite and the internal proto-microsatellites may expand or contract due to DNA polymerase slippage. This process creates a variety of microsatellite length polymorphisms in somatic and germ cells. (C) Subsequent to integration, the poly(A) tail microsatellite may also be mutated via base substitution or indel to form a new repeating unit, here depicted as TA at the end of the poly(A) tract. The combination of mutational forces may eventually lead to the formation a new dinucleotide microsatellite at the 3′ end of the retrotransposon.
The second pathway posits that microsatellites initially arise from random sequences via mutation as proto-microsatellites, small tandem repeated sequences that are below the length threshold for mature microsatellites (Fig. 2B)., Theoretically, these initiating loci can be found everywhere in the genome. However, the overwhelming abundance of TEs in the human genome made it the perfect hotbed for microsatellite births., It has been suggested that the conversion of AAA to NAA is a major mechanism for the generation of all A-rich trinucleotide repeats in the human genome as over 60% of GAA, CAA and TAA repeats with repeat number of 8 or higher are located within Alu poly(A) tails. Furthermore, poly(A) tails have also been implicated in the birth of dinucleotide and tetranucleotide microsatellites., Aside from their 3′ tails, both Alu and L1 have internal proto-microsatellite sequences that can birth new microsatellites subsequent to their integration (Fig. 1; Fig. 3). An analysis of microsatellites at orthologous loci in three primate genomes indicates that 26% of microsatellite births and 24% of microsatellite deaths occur within Alu and L1 sequences subsequent to retrotransposition. It is noteworthy that microsatellites are found in distinct distributions over the length of L1 and Alu elements, likely reflecting differences in their sequence compositions. L1 sequences are AT-rich and give rise to microsatellites during and after their insertion along their entire length (Fig. 1A). In contrast, Alus are GC-rich and microsatellite births and deaths are enriched at the 3′ poly(A) tail and the middle A stretch that connects the left and the right monomers (Fig. 1B).,,, The middle A-rich region of Alu elements tends to expand and mature into poly(A) microsatellites and can also give rise to GAA repeats. Approximately 2% of GAA repeats with a repeat number of 8 or longer are located in the Alu middle A tract; one of such GAA microsatellite is positioned in intron 1 of the FRDA gene and its expansion is responsible for the neuromuscular disorder Friedreich’s ataxia.
Microsatellite Mutation Dynamics
An important characteristic of microsatellites is their high mutation rates. Classic studies on microsatellite mutation rates utilize pedigree analysis, usually in cancer patients with microsatellite instability. Several model systems, including E. coli, yeast, human and mouse cells have also been described and used to obtain in vivo mutation rates. In addition, computational and mathematical models have been utilized to describe the behavior of microsatellites by comparing genomic sequences from large scale sequencing efforts and from evolutionarily related species. The majority of computational studies and in vivo studies focus exclusively on pure or perfect repeats, which contain only nucleotides pertaining to the repeating unit. Imperfect repeats, in contrast, have additional nucleotides that do not belong to the repeating units. Overall, the rate of microsatellite mutation is highly variable, and it is influenced by several factors, including the number of repeating units, unit repeat length, sequence composition, genomic loation and the secondary structures of the repeat.,,
The number of repeating units is a major factor in microsatellite mutation rates. Specifically, microsatellites become highly mutable when the number of repeating units exceeds a threshold value.,, Although there is disagreement on the exact threshold value, the consensus is that the threshold represents the size that allows the sequence to acquire slippage mutations at a higher rate than the genomic average. The threshold values for mono-, di-, tri- and tetranucleotide repeats are estimated to be 8–9, 4–5, 3.4–4 and 4 respectively.- In fact, the distinct mutational behaviors manifested by microsatellite before and after they reach the threshold have prompted some researchers to consider microsatellites only as those that exceed their respective threshold., Once a tandem repeat has reached its threshold, experimental studies with human and mouse models have shown that the mutation rate increases exponentially as the number of repeating units increases (Table 1).,,- The same trend is also observed in computational studies.,, However, due to the differences in the experimental systems used, it is difficult to compare across these studies. The exponential increase in mutation rates is not seen in mismatch repair deficient cells, indicating that mismatch repair proteins are implicated in the expansion process (Table 1).
Svs Creative Composition Download Torrent 2017
Biological system | Repeat size | Repeat sequence | Rate | Units | Ref. |
---|---|---|---|---|---|
Mononucleotide repeats | |||||
Human lymphoblastoid cell line | 10 | C | 2.9E-05 | per generation | 12 |
Human pedigree | ~40 | A | 6.8E-02 | per locus per generation | 38 |
Mouse pedigree with L1 transgene | > > 50 | A | 1.33 to 1.48E-01a | per locus per generation | 39 |
Dinucleotide repeats | |||||
Human lymphoblastoid cell line | 11 | GA | 3.2E-06 | per generation | 12 |
Mouse fibroblast cell line (CAK-Stu3) | 17 | GA | 3.0E-05 | per cell per generation | 40 |
H6 cells (colon tumor origin), mismatch repair deficient | 17 | GA | 1.6E-04 | per locus per generation | 41 |
Human lymphoblastoid cell line | 17 | GA | 9.8E-06 | per generation | 12 |
Human lymphoblastoid cell line | 20 | GA | 2.1E-04 | per generation | 12 |
Mouse fibroblast cell line (CAK-Stu3) | 8 | CA | 3.7E-06 (7.9E-08)b | per cell per generation | 42 |
Human lymphoblastoid cell line | 10 | CA | 2.1E-07 | per generation | 43 |
Human lymphoblastoid cell line | 13 | CA | 6.9E-07 | per generation | 43 |
Human lymphoblastoid cell line | 16 | CA | 3.4E-06 | per generation | 12 |
Mouse fibroblast cell line (CAK-Stu3) | 17 | CA | 2.4E-05 (1.7E-05)b | per cell per generation | 42 |
Mouse fibroblast cell line (CAK-Stu3) | 30 | CA | 1.6E-04 (3.8E-05)b | per cell per generation | 42 |
Human pedigree analysis; 15 autosomal locations | variable | CA | 5.6E-04 | per generation | 44 |
Trinucleotide repeats | |||||
Human embryonic kidney cells (293T) | 33 | CAG | 5.7E-03 | per generation | 45 |
Human embryonic kidney cells (293T) | 33 | CAG | 5.0E-04 | per generation | 45 |
Tetranucleotide repeats | |||||
CAK mouse cells | 17 | GAAA | 1.2E-06 | per cell per generation | 46 |
H6 cells (colon tumor origin), mismatch repair deficient | 17 | GAAA | 3.3E-05 | per cell per generation | 46 |
Pedigree analysis; Y chromosome sperm | 8 to 15 | GATA | 4.0E-03 | per generation | 47 |
Pedigree analysis; autosomal locations | variable | GATA | 2.1E-03 | per generation | 44 |
human lymphoblastoid cell line | 9 | TTCC | 5.6E-06 | per generation | 12 |
human lymphoblastoid cell line | 9 | TTTC | 3.5E-05 | per generation | 12 |
human lymphoblastoid cell line | 9 | TCTA | 4.8E-05 | per generation | 12 |
a Contraction rate; bNumber in parenthesis is the contraction rate for the same repeat; cTrinucleotide studies focus primarily on mismatch repair deficient cells. Trinucleotide repeats that cause disease tend to be more unstable in the germline than in somatic cell lines. This review does not focus on the mismatch repair component of microsatellite slippage.
Sequence composition also affects mutability. For similar sized repeats, GA repeats are more mutable than CA repeats and sequences with higher GC content have lower rates of mutation (Table 1)., DNA structure and thermodynamic stability have been implicated in explaining this phenomenon., Stability increases with the complexity of the repeating pattern, as demonstrated in tetranucleotide repeat studies. Despite a doubling of repeat unit length, di- and tetranucleotide repeats have similar stabilities (Table 1;TC vs. TTCC). In contrast, mononucleotide microsatellites, such as those associated with retrotransposons, are highly unstable and hypermutable in the germline., For example, the human BAT-40 locus is an intronic poly(A) tract of 40 A’s in the 3-β-hydroxysteroid dehydrogenase gene (HSD3B1). Its instability is well documented in tumor cells deficient in mismatch repair. This locus is also highly polymorphic among healthy individuals. In fact, pedigree analysis indicates BAT-40 is inherently unstable in the germline. Five-fold more mutant alleles are found in sperm than in leukocytes of the same individual. The BAT-40 poly(A) microsatellite has not been identified to be associated with non-LTR retrotransposons; here, we report that it is in fact the poly(A) tail of an AluJb element.
Despite experimental focus on the expansion dynamics of microsatellites, expansion is not indefinite; instead, longer alleles show a bias toward contraction. The mutation dynamics for very long (> 50 bp) mononucleotide repeats had been frequently overlooked because of their rarity in the genome. Using a transgenic L1 mouse model, we characterized the contraction dynamics of long poly(A) tracts associated with nascent L1 insertions. These poly(A) microsatellites show rapid and variable contraction to lengths below 50 bp in both somatic and germ cells at frequencies of 14.8% and 13.3%, respectively. Of note, these values represent the highest mutation frequencies hitherto reported for all microsatellite repeats, highlighting the extraordinary mutability of long poly(A) tracts (Table 1). Thus, long poly(A) microsatellites birthed into the genome via retrotransposition are predicted to shorten in length rapidly within the first few generations. This prediction is consistent with observations from surveys of genomic resident Alu and L1 sequences, which demonstrated an inverse correlation of poly(A) tract length with the evolutionary age of the elements.,,
One prominent form of microsatellite death is interruption by a non-repeat base, which can arise from base substitutions or short indel mutations.,,, In particular, a comparison of human, chimpanzee and orangutan genomes suggests that substitutions are the primary cause of deaths for microsatellites of all lengths. Although not widely recognized, non-LTR retrotransposons may also act as a mechanism for microsatellite interruption. Such interruption can occur when new copies of non-LTR retrotransposons insert into pre-existing microsatellites. Mammalian L1s are members of a diverse and widely distributed group of autonomous non-LTR retrotransposons that encode an endonuclease with homology to apurinic/apyrimidinic endonucleases (APE). A small subset of these APE-type elements insert in a sequence-specific manner into microsatellites: (1) ACAY microsatellites are targeted by three Waldo elements (WaldoAg1,WaldoAg2 in the African malaria mosquito A. gambiae and WalsoFs1 in the earwig F. scudderi); (2) AC microsatellites are targeted by a Mino element in A. gambiae (MinoAg1); (3) TC microsatellites are targeted by four Kibi elements (KibiDr1 and KibiDr2 in zebrafish D. rerio, KibiFr1 in the torafugu F. rubripes, and KibiTn1 in the green spotted pufferfish T. nigroviridis); (4) TTC microsatellites are targeted by two Koshi elements (KoshiFr1 in F. rubripes and KoshiTn1 in T. nigroviridis); (5) TAA microsatellites are targeted by three Dong elements (Dong in the silkworm B. mori, DongAg in A. gambiae and DongBg in the freshwater snail B. glabrata)., The molecular mechanisms underlying microsatellite targeting by these APE-type non-LTR retrotransposons are not well understood. It is believed that the sequence-specific cleavage by the endonuclease plays a major role in active targeting. In addition, two APE-type elements (Waldo-A and Waldo-B in the fruitfly D. melanogaster) are frequently found within or close to CA microsatellites, most likely as a result of passive accumulation in these low recombination regions., Microsatellites may also be targeted by DNA transposons; examples include MINE-1 helitrons and Micron MITEs. The former insert at GAAA microsatellites in the genomes of the European corn borer O. nubilalis, the silkworm B. mori and the pink bollworm P. gossypiella, while the latter insert into TA microsatellites in the rice genome., In contrast, mammalian L1s have a weak target site preference with a consensus sequence 5′-TTAAAA-3′.,- Although they may fortuitously land in or near certain microsatellite sites, they are unlikely to serve as a major factor in disrupting microsatellites.
Microsatellite Instability Affects Non-LTR Retrotransposon Mobility
The relationship between microsatellites and non-LTR retrotransposons is not unidirectional. While both L1s and Alus give birth to microsatellites, especially poly(A) mononucleotide microsatellites, these microsatellite sequences can also affect the fitness of their “parent” due to their unusually high mutation rates. The impact of microsatellite instability on non-LTR retrotransposons depends on the location of the microsatellite, i.e., whether it is internal or at the 3′ terminal of the element. The effect of variation in microsatellites internal to non-LTR retrotransposons is less understood. Newly inserted L1 and Alu copies carry many mononucleotide proto-microsatellites (Fig. 1). As microsatellites are predicted to mutate faster than the genomic average, expansion and contraction of microsatellite loci internal to L1s may introduce frameshift mutations, abolishing L1 coding capacity (Fig. 4A).
Figure 4. Microsatellite instability alters the retrotransposition potential of non-LTR retrotransposons. (A) Effect of internal microsatellite loci. Expansions or contractions of proto-microsatellite loci inside an L1 element can cause frameshift mutations, leading to loss of ORF1 and/or OFR2 function. The mutagenized L1 element would be unable to mobilize itself and to support Alu retrotransposition. (B) Effect of the 3′ poly(A) microsatellite on Alu elements. The contraction or expansion of an Alu’s poly(A) tail diminishes or stimulates its retrotransposition, respectively, presumably because the length of its poly(A) RNA tail is positively correlated with the efficiency of target-primed reverse transcription (TPRT). (C) Effect of the 3′ poly(A) microsatellite on L1 elements. Due to the random nature of integration, an L1 sequence may not have a GT rich region downstream to its polyadenylation signal. However, there is evidence that the L1 poly(A) tail exerts a crucial role in ensuring proper polyadenylation of L1 mRNA in the absence of a strong GT-rich downstream sequence. The relative efficiency in mobilization is depicted by a difference in size of L1 mRNA and protein products.
The 3′ poly(A) tail of an L1 or Alu element is a crucial component of the retrotransposition process and therefore, its length directly affects their retrotransposition potential. Distinct cellular processes are responsible for the poly(A) tail formation in L1 and Alu elements. L1 elements are transcribed by RNA polymerase II and poly(A) polymerases produce a poly(A) RNA tail as the 3′ end of an L1 mRNA. In contrast, Alu elements are transcribed by RNA polymerase III and the resulting transcripts are not polyadenylated. However, active Alu elements have a poly(A) DNA tract, which is transcribed as part of the Alu RNA. The 3′ poly(A) RNA tail is predicted to serve two important roles during retrotransposition. First, the initial base pairing of these A’s with the T-rich DNA sequence at the target site may be required for efficient first-strand cDNA synthesis during target-primed reverse transcription (TPRT).- The L1 ORF1 protein is an RNA-binding protein with nucleic acid chaperone activity and it may facilitate this strand transfer and annealing process. Second, increasing evidence indicates that these poly(A) RNA tails are bound by poly(A) binding proteins (PABPs) and that this interaction is critical for the formation of the ribonucleoprotein complex between L1 proteins and L1/Alu RNAs., PABPC1 may also facilitate the nuclear import of L1 RNP., Indeed, retrotransposition assays demonstrate that the poly(A) tail is strictly required for Alu mobilization and that its retrotransposition activity is positively correlated with the length of poly(A) tails., In parallel, poly(A) tail shortening in expressed L1 mRNAs impairs RNP formation and retrotransposition.
Therefore, the high mutability of poly(A) tails has direct impact on L1 and Alu retrotransposition. Once captured in genomic DNA, the initial long 3′ poly(A) tract undergoes rapid shortening in the first few generations. In fact, genome-wide, the length of the poly(A) tract is inversely correlated with the evolutionary age of Alu and L1 subfamilies., As the length of the poly(A) tail is a key determinant of Alu activity, A-tail expansion due to microsatellite instability is predicted to promote its retrotransposition and, conversely, its contraction would predict the extinction of an Alu’s ability to mobilize (Fig. 4B). Indeed, newly generated Alu copies typically have longer poly(A) tails than the original donor elements. Additionally, a nascent retrotransposition-competent Alu element is able to propagate its newfound mobility to progeny copies through poly(A) tail expansion during reverse transcription.
What impact does poly(A) tail instability have on L1 retrotransposition? It is generally accepted that the L1 poly(A) tail is regenerated during each round of retrotransposition via reverse transcription of the polyadenylated L1 mRNA. Thus, the length of the poly(A) tail in the donor L1 element may affect L1 retrotransposition only if it alters transcription and polyadenylation. A canonical polyadenylation signal contains a conserved AATAAA hexamer and a downstream GT-rich region. The L1 polyadenylation signal is atypical in that a poly(A) tail separates the AATAAA hexamer from any downstream sequence; the sequence composition of the latter is determined by chance depending on where the donor element is inserted during the previous round of retrotransposition. Such a configuration may predispose an L1 element to bypass its own polyadenylation signal and instead to use a stronger polyadenylation signal fortuitously situated further downstream. Nevertheless, the poly(A) tail appears to exert a critical role in proper polyadenylation of L1 mRNA in the absence of a strong GT-rich downstream sequence (Fig. 4C). Although full length transcription is possible in the context of a weak polyadenylation context, the presence of a string of 17 A’s brings about significant increases in the transcriptional efficiency. Although its presence and length is not as critical for L1 mobility as for Alu mobility, the poly(A) tail’s dynamics still subtly influence the efficiency of its parent in a disadvantageous genomic neighborhood.
Genomic Distribution and Polymorphisms
The availability of complete genome sequences for many organisms has enabled genome-wide analyses of repeat distribution. Microsatellites with overall repeat length of 12 bp or longer make up 1.3% of the reference human genome (i.e., 13.3 kb per Mb genome, excluding hexameric repeats at telomeres). Although the overall density of microsatellites is relatively uniform in all human chromosomes, the distribution of each repeat class (mono-, di-, tri- etc.) varies: (1) Mononucleotide microsatellites show a uniform distribution among exonic, intronic and intergenic regions (note “exonic regions” herein include not only protein-coding sequences but also 5′ and 3′ untranslated regions); (2) Di-, tetra- and pentanucleotide microsatellites are relatively depleted in exonic regions (74%, 75% and 87% of the genome average, respectively); (3) Tri- and hexanucleotide microsatellites are enriched in exonic regions (191% and 146% of the genome average, respectively). The vast majority of human microsatellites that are conserved during vertebrate evolution are found in intergenic and intronic regions, consistent with the prediction that microsatellites in exonic regions are more disruptive to gene function.
In comparison, the two major types of non-LTR retrotransposons, L1s and Alus, display contrasting distribution patterns relative to genes. Alus typically are overrepresented inside genes (i.e., residing anywhere in a transcription unit) as well as within 30 kb from annotated genes. In addition, Alus display no orientation bias relative to genes., In contrast, L1s show prominent orientation bias within genes: while antisense L1s are close to the predicted density, sense L1s are strongly depleted within genes., In addition, both sense and antisense L1s are significantly underrepresented within 5 kb from annotated genes. The differential distribution of Alus and L1s relative to genes is not a result of target preference because both are retrotransposed by L1 proteins. Rather it is consistent with the predicted impact of sense-oriented L1s on premature polyadenylation, cryptic splicing and/or inhibition of transcriptional elongation.- This idea is further supported by the random chromosomal distribution of recent endogenous L1 and Alu copies in the human genome,- and de novo insertions from L1 transgenes in mice., Furthermore, the large catalog of polymorphic Alu and L1 insertions clearly shows that exons can be targeted in proportion to their abundance in the genome.,
Due to their dynamic nature, microsatellite loci are highly polymorphic among individuals. By comparing the reference human genome to a second genome, a survey of variation in microsatellites with at least three perfect repeats was conducted. Overall, 2.7% of the microsatellite loci are polymorphic, a frequency that is 45-fold higher than single nucleotide polymorphism (0.06%). Exonic, especially the coding exonic sequences, show the least amount of microsatellite variation as compared with intergenic and intronic sequences, reflecting increased selective constraints on exonic sequences. Polymorphism incidence increases with repeat number, an observation that is reproduced by surveying polymorphic microsatellite loci among individual genomes from the 1000-genome project., There is remarkable heterogeneity in the levels of variation at specific microsatellite loci. For example, the trinucleotide repeat in the 5′UTR of the fragile X mental retardation 1 (FMR1) gene is one of the most studied microsatellite loci with respect to polymorphism in ethnically diverse, healthy individuals. The average number of repeat variants is approximately 9.4 per 100 individuals surveyed. The range of repeat number varies from population to population; the overall range is 6 to 87 in surveyed populations. A repeat number of > 200 is considered a full mutation, which is typically associated with intellectual and developmental disabilities.
Whole-genome as well as targeted sequencing has also started to reveal the extent of genetic variations caused by non-LTR retrotransposons. Polymorphisms in non-LTR retrotransposons are manifested as indels (i.e., insertions in one genome and deletions in another genome), a common form of structural variations (other forms include tandem duplications, inversions and translocations). Recent whole-genome sequencing efforts have cataloged a large number of structural variations among diverse human populations. Detailed analyses of mutational mechanisms indicate 20–30% of SVs are caused by non-LTR retrotransposons.- On average, each human genome contains 1000–2000 polymorphic non-LTR retrotransposons, with 79–85% Alu, 12–17% L1s and 3% SVAs.,,, Non-LTR retrotransposons are actively modifying individual genomes. Based on the number of polymorphic insertions and the estimated genetic distance between individual genomes, the frequency of retrotransposition has been calculated as one Alu insertion in 21 births and one L1 insertion in 108 to 212 births.,, Most intriguingly, an analysis of the 1000 Genomes Project pilot data set suggests a recent increase in retrotransposition rates in humans. These insertions continuously generate new germline polymorphisms that can be passed across generations (Fig. 5). As compared with humans, TEs play an even bigger role in generating structural variations in the mouse genome.- This discrepancy can be largely explained by the presence of a large number of active LTR retrotransposons in the mouse genome besides non-LTR retrotransposons. Together, retrotransposon insertions contribute to ¾ of structural variations between two inbred mouse lines, with 52%, 43% and 5% contribution from LTR retrotransposons, L1s and SINEs, respectively. In addition to retrotransposition in the germline, non-LTR retrotransposons are active in somatic tissues, including normal brain tissues and most epithelial cancers (further discussed in the next section). Despite the significant progress made thus far, it should be noted that the discovery of structural variations mediated by TEs remains a challenge because of the technical difficulty in de novo assembling repetitive sequences from short sequence reads as well as the high sequence depth that is required for detecting low-frequency alleles.
Figure 5. Polymorphism and population dynamics of poly(A) microsatellites. The schematic illustrates the mutation dynamics of poly(A) microsatellites across generations, beginning with an individual who has a newly acquired long poly(A) microsatellite from L1 or Alu retrotransposition. Color is used to distinguish the size and mutability of the microsatellite, where darker color indicates longer poly(A) tails that are more mutable. A long poly(A) microsatellite can expand or contract between generations and give rise to variations in the offspring. However, when a poly(A) microsatellite mutates below the threshold repeat number, the rate of mutation decreases to the average genomic mutation rate.
Functional Impacts on Genomes and Epigenomes
The impact of a microsatellite or a transposable element and its associated variability is dependent on its genomic location. The most direct phenotypic impact of microsatellites and retrotransposons is manifested as disease causing mutations. Microsatellite instability has been implicated in a variety of human diseases with over 40 neurological, neurodegenerative and neuromuscular disorders associated with trinucleotide repeat instability., Well known examples include Huntington’s disease, Fragile X syndrome and Friedreich’s Ataxia. Unlike static mutations, which can be stably passed from one generation to the other, alleles with expanded microsatellites may continue to expand over the course of the individual’s life or through germline transmission. Depending on the genomic neighborhood, variation in microsatellite length can induce phenotypic changes through multiple pathways. These mechanisms have been best characterized in trinucleotide repeat expansion disorders and typically consist of one (or the combination) of three following three pathways: the loss of function at the protein level, the gain of function at the protein level, or the gain of function at the RNA level. The latter is gaining increased attention due to the recognized role of non-coding RNA in epigenetic pathways. A noticeable absence among these molecular mechanisms is a loss of function at the RNA level, a potential mechanism that should be investigated in future studies.
Likewise, de novo retrotransposition has been implicated in a variety of sporadic human diseases, including hemophilia, Apert syndrome, neurofibromatosis type 1 and congenital muscular dystrophy., A multitude of molecular mechanisms have been proposed to account for the mutagenesis effect for intronic or exonic insertions. A less emphasized impact is post-insertional genome instability mediated by non-LTR retrotransposons, which likely poses even greater hazard to the human genome., For example, nonallelic homologous recombination (NAHR) between two separate copies of Alu repeats may generate structural variations, such as indel, inversion and translocation mutations., In addition, a hallmark of non-LTR retrotransposition is the formation of a pair of target site duplications (TSDs) that flank the new insertion. These TSDs represent pre-insertion L1 endonuclease cleavage sites and may be targeted for subsequent cleavage by other L1 endonucleases. It is proposed that new double strand breaks in TSDs may trigger rearrangement events due to their adjacency to non-LTR retrotransposons with abundant homologous copies. These mechanisms highlight the dynamic nature of L1 and Alu sequences in the human genome. Furthermore, as has been discussed earlier, non-LTR retrotransposons are not static entities after insertion; they are a significant source for microsatellite genesis.
Both microsatellites and non-LTR retrotransposons have been associated with tumorigenesis and cancer. High-frequency microsatellite instability (MSI-H) is a molecular feature of tumors associated with hereditary nonpolyposis colorectal cancers (HNPCC; Lynch syndrome) and has been seen in other sporadic cancers, usually of the gastrointestinal system., These tumors generally arise from defects in the mismatch repair system, leaving DNA vulnerable to a high degree of mutation within microsatellites.- Detection of MSI-H tumors is often achieved with a reference panel of five microsatellites, including two mononucleotide and three dinucleotide markers. The two mononucleotide markers, BAT26 and BAT25, contain (A)26 and (A)25 repeats, respectively; their length changes can be easily discerned even in the absence of normal tissue. It is thought that MSI-H contributes to tumorigenesis mainly through mutating genes with coding microsatellites. The role of microsatellite instability in non-coding regions remains to be characterized. A recent genome-wide survey yields the initial glimpse of microsatellite mutational landscapes in MSI-H gastric cancers. There are four to 6-fold higher microsatellite mutations in genes in MSI-H gastric cancers than microsatellite stable (MSS) cancers. Only a very small fraction of microsatellite mutations (0.1%) are located in protein-coding exonic regions; the remaining are found in 5′UTR, 3′UTR, intronic and intergenic regions of the genome. Importantly, genes with significant changes in mRNA expression between MSI-H and MSS samples are enriched for microsatellite mutations in their UTRs, indicating a role of non-coding microsatellite instability in regulating gene expression.
A role of non-LTR retrotransposons in tumorigenesis has gained increasing attention in recent years.- Several studies have used NextGen sequencing approaches to detect genome-wide structural variations or specifically new retrotransposon insertions in tumor samples.,,- The first study identified 9 tumor-specific (i.e., somatic) L1 insertions in 6 of the 20 lung tumors but none in 10 brain tumors by targeted sequencing. A subsequent study characterized retrotransposon insertions in 43 human tumors of different origins by whole-genome sequencing. Tumor-specific somatic insertions were found in all tumors of epithelial origin (colorectal, prostate and ovarian), but not in blood or brain cancers. Among epithelial cancers, colorectal cancers had the highest incidence of L1 insertions at an average of 9 insertions per tumor among 4 colorectal samples; the fifth colorectal tumor sample alone had an astonishingly high number of 102 L1 insertions. Interestingly, this tumor was the only colorectal sample that manifested MSI-H with altered DNA repair pathways, suggesting a common origin to both sources of genomic instability. Microsatellite instability may be accompanied by increased retrotransposon activity. An independent study mapped tumor-specific L1 insertions in 16 colorectal tumors by targeted L1 sequencing. An average of 4.4 somatic L1 insertions was found. Intriguingly, although two samples with the highest number of somatic insertions showed microsatellite instability, no correlation was found between the number of insertions and the microsatellite instability status. Future studies with higher coverage and/or larger sample size may be needed to establish a connection between microsatellite instability and retrotransposon mutagenesis. It should be emphasized that, in addition to somatic insertions, germline insertions may also be an important etiological factor in predisposing the carriers to cancers, as has been recently shown for hepatocellular carcinoma patients. The precise role of retrotransposon insertions in cancer remains to be determined but they can have a significant impact on gene function. Consistent with sense-oriented L1 insertions being more disruptive to gene expression, genes targeted by sense insertions, but not those targeted by antisense insertions, showed statistically significant decrease in transcription.
Although usually highlighted for their negative roles, microsatellites and retrotransposons have evolutionary importance due to their ability to provide genetic novelties in human populations.,, A salient example is sequence shuffling through 3′ transduction.,, Due to the weak nature of the L1 poly(A) signal, it is possible for L1 to bypass the signal during transcription and use an alternative poly(A) signal downstream. This results in the mobilization of the 3′ non-L1 sequence. Surveys of genomic L1 copies indicate up to 23% of them underwent 3′ transduction., Some 3′ transductions may remain undetected due to so-called orphan 3′ transduction. The latter involves a 3′ transduction event that is severely 5′ truncated, which leaves behind no L1 sequence to be associated with the mobilized sequence. Such an event was responsible for Duchene muscular dystrophy in a Japanese patient. Thus, 3′ transduction has the potential to generate new genes and bring new regulatory features to the vicinity of other genes. Although no cases have been reported, it is conceivable that L1s could also mobilize microsatellites by the same mechanism. Another way of introducing evolutionary novelties is through trans-mobilization of cellular RNAs, which form processed pseudogenes. There are approximately 10,000 copies of processed pseudogenes in the human genome., Some of them have acquired novel functions and are thus classified as retrogenes. Lastly, since the exciting discovery of somatic retrotransposition in the brain, it has been postulated that L1-induced neuronal diversity may be an important source for behavioral phenotypes and confer evolutionary advantage.-
Beyond genome sequence changes, the importance of other epigenetic mechanism, including DNA methylation, histone modification and nucleosome occupancy, is coming to light. Retrotransposons are known to be associated with epigenetic changes. Therefore, the insertion of an L1 into a genomic region may alter the previously existing epigenetic state (, and our unpublished data). Due to their unique sequence composition, microsatellites can change the physical forms of DNA where they occur,, which can have implications for gene expression and genome stability. Though not all microsatellites meet the criteria to do so, many repeats form non-B DNA structures. These non-B forms of DNA are often associated with locations of chromosomal breakage. An inverse correlation exists between a repeat’s ability to form non-B form DNA and its abundance in the genome, suggesting that these unstable structures are selected against due to reduced evolutionary fitness. The presence of poly(A) tracts within promoter sequences also has implications for nucleosome organization. Poly(A) repeats form shorter and more rigid helix structures and have a narrower minor groove, altering nucleosome positioning in manners which increase the binding ability of transcription factors., Thus, the mobilization of retroelements and their associated poly(A) tails present a direct manner for L1, Alu and SVA elements to affect the genome compaction and expression dynamics of the local regions where they land.
In summary, microsatellites and non-LTR retrotransposons are dynamic DNA sequences that rapidly change from generation to generation. This makes them independent drivers of genomic variation and evolution. Expansion and mutation of microsatellites within L1 and Alu sequences may change the size and sequence identity of these non-LTR retrotransposons, perhaps modulating their mobility. Future studies to compare the sequence of full length L1s may reveal how the dynamics of microsatellites affect the retrotransposons from which they are born.
Acknowledgments
The authors wish to thank James Rosser and Simon Newkirk as well as an anonymous reviewer for critical reading and helpful suggestions in the composition of this review.
Disclosure of Potential Conflicts of Interest
No potential conflicts of interest were disclosed.
Footnotes
Previously published online: www.landesbioscience.com/journals/mge/article/25674