Documentos de Académico
Documentos de Profesional
Documentos de Cultura
Introduction
Current knowledge indicates that DNA is
a stable information storage medium. As proposed in an article by Vetsigian, Woese, and
Goldenfeld, it serves as an evolutionary protocol for evolutionary novelties and selected
properties. A wide variety of small RNAs
regulate key cellular processes of replication
as well as genetic arrangements, rearrangements, recombination and repair, and even
inventions. In most cases they act after being transcribed from the stable DNA storage
medium into a pretranscriptional or transcriptional modus, with the advantage of being active prior to all translated proteins. Elements
such as micro-RNAs, small nuclear and small
nucleolar RNAs, tRNA, and rRNA as well as
the assemblies of the spliceosome and ribosome
are vital to all life processes. In this respect I re-
Natural Genetic Engineering and Natural Genome Editing: Ann. N.Y. Acad. Sci. 1178: 244267 (2009).
c 2009 New York Academy of Sciences.
doi: 10.1111/j.1749-6632.2009.04989.x
244
245
246
247
248
replication, tissue growth, or similar developmental contexts. Should this suppressor function become unbalanced, then the normally
downregulated part may become lytic with potentially lethal consequences, as documented
for symbiodinium and its major role in coral
bleaching.25
Retroviral Competences in a Persistent
Symbiotic Lifestyle: Endogenization
Endogenous retroviral competences in the
persistent status are often characterized by features expressed only in the strict time window of a developmental process, such as axis
formation, trophectoplast formation, or the S
phase of the cell cycle. In these highly specialized contexts they are replicated through signalling, which blocks the suppression of the
replication process. After the function is fulfilled, a signal once again initiates suppressor
function. Retroelements, with their (i) higherorder regulatory functions, (ii) capability for
genetic creativity, and (iii) capacity for innovation of new regulatory patterns and combinations descended from retroviruses, which
can easily be identified by their three essential
parts gag, pol, and env.2628 Most endogenous
retroviruses have been degraded into formerly
connected domains, but they can still be recognized by retroposons or one of these three
genes,21,2931 which means their formerly connected genomic content may be used by host organisms as single or networking modular tools
for a variety of new regulatory functions. The
gag gene encodes structural proteins, pol encodes enzymes such as reverse transcriptase,
protease, ribonuclease and integrase functions,
and env encodes envelope proteins, surface and
transmembrane proteins and proteins causing
host cell fusion and immunosuppression.22
Reconstructing the Highways that also
Play Important Roles in Persistence
Interestingly we also find small DNA
viruses as genetically stabilized and co-evolved
249
250
complexes. This is an important behavioral motif in relation to noncoding RNA transcript processing into small noncoding RNA species, as
discussed later.
These smaller parts move on the highway
of the actin-based cytoskeleton and its microtubules in the direction of the nucleus35 by using the kinesin-motor protein superfamily.36 After reaching the nuclear membrane these viral
parts pass through the membrane through nuclear pores by using importin proteins (which
consists of two subunits with complementary
functions: an indicator for an ancient addiction
module itself) which bind to a special recognition sequence. Afterwards, integrase, which
is also produced and used by DNA viruses for
the same purpose, integrates these viral parts
into the genome. The integration process is
not random but involves strict coherence to
the syntax of the nucleotide sequences of the
host.
If the host cell replicates, these RNA viral
parts are transcribed into DNA. As DNA sequences they pass through the membrane of the
nucleus again to move towards the cell membrane. Again they use the highway of microtubules, but unlike earlier, they use dynein as
the motor protein.36 Interestingly, the change
in direction of the kinesin/dynein transporter
proteins depends on suppression of antagonists
of the dynein or kinesin. This could be an indicator that both are part of an addiction module
of former competing genetic parasites. Prior to
becoming an internal part of cellular transport
it could have been an external system for movement in and out of cells. And indeed there is a
connection between these two motor proteins
and retroviral (gag)-parts.28,37
When they reach the cell membrane the viral parts form a patch, via protein assembly
on internal membranes, that is divided from
the cell by exocytosis and produces its own
capsule in which the RNA genome matures.
In other contexts, similar proteins are involved
in cytokinesis,38 and most interestingly, in neural and immunological synaptic communication.39,40 Other retroviruses build their cap-
251
252
introns, are spliced out; later the remaining exons that code for proteins are combined into
a coherent protein-coding content ready for
translation.
As opposed to persistent endogenous agents
of natural genome editing in eukaryotes, we
find persistent exogenous agents in prokaryotes
that are competent in natural genome editing
of the prokaryotic gene pool. This process has
long been visualized as horizontal gene transfer
and is now recognized as occurring via plasmids, phages, and transposons, all with viral
ancestors.84
It is difficult to perceive mere molecules or
molecule buildings as being competent to
process the sophisticated DNA language. It is
less difficult to think of viruses as these subjectlike agents.
Large Noncoding RNAs: Competent
Regulators of Gene Expression
Noncoding RNAs that function in gene regulation coordinate and organize various actions,
such as chromatin modification and epigenetic
memory, transcriptional regulation, control of
alternative splicing, RNA modification and
RNA editing, control of mRNA turnover, control of translation, and signal transduction.85
In contrast to former opinions about the expression levels of genomes it is now increasingly
clear that most eukaryotic genomes are highly
expressed and a great abundance of noncoding RNAs with regulatory functions are transcribed. Most of these noncoding RNAs are
alternatively spliced and divided into smaller
RNAs that are integral parts of ribonucleoprotein (RNP) complexes. They regulate nearly all
aspects of gene regulation. Small RNA species
include micro-RNAs, small interfering RNAs,
small nuclear RNAs, and small nucleolar and
transfer RNAs.86
Although recent research has tried to evaluate the enormous regulatory networks of small
RNAs, the role of thousands of longer transcripts is not yet clear. We know that they
play important roles in histone modification,
methylation, that is, epigenetic control of developmental processes such as the mammalian
HOX clusters,85 and also transcriptional interference, promoter inactivation, and effects
on enzymatic pathways. Interestingly, these
large noncoding RNAs are found as interlacing and overlapping sense and antisense transcripts derived from introns or intergenic regions, which means that they stem from the
preferred life habitat of persistent viral settlers. That they may be of viral descent is
indicated by their developmental stage- and
tissue-specific expression patterns, which are
typical habits of persistent viral settlers. Similar to their smaller relatives they are involved
in the formation of ribonucleoprotein (RNP)
complexes.
Micro-RNAs and Their Associated
Proteins are RNPs
Small noncoding RNAs also share a special competence for epigenetic regulation of
gene expression87 and are derived from repetitive genomic sequences.88 Repetitive genomic
sequences indicate descent from retroviral infection events.89 The capacity for epigenetic
regulation of gene expression includes the
recognition (identification) of specific sequences in other nucleic acids and is common to RNAs,83 especially small nuclear RNAs
and tRNAs that (i) identify splice junctions in
both pre-mRNAs and codons, and (ii) process
both the subunits of the spliceosome and the
ribosome.17 This implicates their capacity for
self/nonself distinction as well as for identifying the molecular syntax. If one of these does
not function, that is, error or damage occurs,
the regulation or structural features of small
noncoding RNAs do not function.
Maybe a key feature of noncoding RNAs
is that they share an analog/digital languagecompetence,90 such that in their secondary
molecular structure they can act as molecular
adaptors to the protein world, whereas their
nucleotide word order seems to be digitally
structured information. Their ability to edit
253
regions of long inverted repeats and can inhibit the expression of nearly any target gene
in response to double-stranded RNA and have
a very efficient and ancient immune function
against genetic parasites.98,99 This functions by
identifying foreign RNA sequences and inhibiting their replication. The ancient RNAi immune function is based on self/nonself identification competence.98 Many of these elements
are retroposons or transposons and are encoded
in the repetitive sequences of the genome99 and
therefore are clearly of viral origin.
Micro-RNAs are single-stranded RNAs of
1925 nucleotides in length and are generated
from endogenous hairpin transcripts of 70 nucleotide precursor miRNAs.100 The transcription of this pre-miRNA is processed by RNA
polymerases pol II and pol III. Whereas pol
II produces the messenger RNA, small nucleolar and small nuclear RNAs of the spliceosome
pol III produce shorter noncoding RNAs, such
as tRNAs, some rRNAs, and a nuclear RNA
that is part of the spliceosome.18 They control
not only developmental timing, hematopoiesis,
organogenesis, apoptosis, and cell proliferation
but also fat metabolism in flies, neuronal patterning in nematodes, and control of leaf and
flower development in plants.18 Most of them
are processed out of introns!
It is predicted that every metazoan cell type
at each developmental stage has a distinct
miRNA expression profile.18 The most characteristic differences of miRNAs acting in plants
and animals are found in the stem loop.
Micro-RNAs, as well as small interfering
RNAs, seem to have descended from transposable elements with an inherent regulatory ratio
on gene regulation that is fulfilled by a variety of small interfering RNAs or micro-RNAs
that act in a coordinated manner in that they
share a division of labor in hierarchical steps
of suppression and amplification. This is indicated in transposable elements that encode
both siRNAs and miRNAs.101 They can be
found in intronic regions and build stem loop
structures (hairpins) as a common feature of
active RNA species, such as ribozymes. The
254
gene-gene interactions as well as the integration and regulation of most of the gene activities on different levels, such as chromatin structure, DNA-methylation, transcription, RNA
splicing, RNA translation, RNA stability, and
RNA signalling. This means that most of the
functions of gene control that we currently
know17,66,67,73 descended from retro-elements.
One important feature is that noncoding RNAs
function as RNPs and not as naked RNAs.106
Some of them are small nuclear and small
nucleolar RNAs (see below). Interestingly, the
nucleolus plays an important role with similar performance to other endogenous retroelements during a small time window in the
cell cycle. Nucleoli appear during the interphase of mammalian cells and are the locus of
ribosome processing. Nucleoli also control regulation of the cell cycle.107 In general, noncoding RNAs play important roles as intermediate
products in the function and structure of the
host genome, for example, in the reproductive
cycle of host cells from the start to the end of
the S-phase. This is a common feature of persistent endogenous retroviruses that adapted to
the host genome as an addiction module. They
now represent a phenotypic function of the
host organism as neutralized (balanced) former
competing viral agents. If one of these agents
is damaged or its regulation becomes unstable
the counterpart may still become virulent with
potential disease-causing consequences for the
host organism.89
Regulatory functions are restricted to specific phases of the cell cycle. One antagonist is suppressed and therefore the other can
be produced at increasing rates according to
the phase of the cell cycle, for example, production of tubulin and/or nanotubulin structures108 or production of proteins with a variety of functions in cell division. At the end
of this specific phase of the cell cycle the suppression of the antagonist ceases, mediated by
specific signalling processes, and the antagonist
then suppresses the increased production of its
counterpart. This is the precondition for terminating its function in this phase of the cell cycle.
255
highly lineage specific and form a separate family of mobile genetic elements.111
Genes coding for snoRNAs switch on host
genes by retroposition. They also play important roles in excision and integration of specific sequences. They are as important as Alu,
SINEs, and LINEs.111 Interestingly, the bestknown mobile genetic elements are integrated
into snoRetroposons. The insertion is characterized by their individually different target site duplication-repeats, for example, fulllength LINEs, that is, ALUs in primates. snoReverse Transcriptases (snoRTs) are also inserted
into other mobile elements as well as into
DNA transposons.111 Retro-elements seem to
be compatible with each other and are integrated into small nucleolar reverse transcriptase sequences, each of them with its individual target-site duplication repeat.111 This
could be an indicator of identification processes
and identity modules. Many of the snoRNA
genes are retroposons with retroviral ancestry.111 snoRTs that are part of the introns of
host genes can be used as RNAs with novel
functions. After a retroposition event they can
silence the snoRNA copy of their parents and
lead to a new function of the new snoRNA.111
This seems to be a common feature of endogenous retroposons and transposons in that
these genetic invaders are driving forces of genomic creativity: the same competences in different contexts may lead to new functions and
phenotypes.
Transport of small nucleolar RNPs is
characerized by both nuclear and cytoplasmic
cycles. Cajal bodies play important roles in intranuclear transport as centers for RNP assembly, transport, modification, and editing.
Their production is a complex highly coordinated and conserved process. When the small
nuclear RNPs arrive in the nucleoplasm they
spread over the whole interchromatin space.
Newly generated RNPs are assembled in the
Cajal bodies before they are integrated into the
nucleolar subdomains fibrallin and interchromatin clusters. This seems to be an indicator
that they are processed and modified within
256
257
258
259
260
which became active may even fail. These abilities are still evident in the t-loop structure of tRNAs, a variety of ribozymes, and self-enforcing
RNAi loops, which couple heterochromatin assembly to siRNA production.138
The high density of early RNA life led to
competing situations in which it was an advantage to escape into DNA informational storage
and protein-based cellular life as outlined in
another article.143 Protein and DNA invention
were a prerequisite for the evolution of divergence and the variety of life because all evolutionary inventions could be stored as evolutionary protocols77 in this stable storage medium.
The information content of the human genome
is comparable to an archive of 5000 books with
300 pages each.
It appears that all the detailed steps of evolution stored in DNA that are read, transcribed,
and translated in every developmental and
growth process of each individual cell depend
on RNA-mediated processes, in most cases interconnected with other RNAs and their associated protein complexes and functions in a
strict hierarchy of temporal and spatial steps.
It is clear that this regulatory order could not
evolve by chance or that it represents solely a
randomly derived mixture of nucleotides, but
that it is composed of individual functions and
integration into one developmental target, in
strict coherence to the syntax of the nucleic
acid language.
Today we are beginning to realize the degree of abundance and variety of RNA species
with their different, sometimes complementary
and competing roles in all key processes of life.
RNAs play complementary roles in information processing and regulation.144 In most cases
they have an inheritable status, being integrated
in the genome of organisms, and are termed endogenous. In other cases they are ancient individuals living in the cytoplasm of cells as persistent nonlytic parasites similar to DNA settlers,
with important endosymbiotic roles. Their relation to viruses is close and some virologists consider an evolutionary tree of RNA species and
RNA viruses. Interestingly, some DNA viruses
have other features (linear chromosome, telomere ends, intron-like structures) that indicate a
different origin with ancient roots comparable
to RNA viruses. These features connect the evolutionary roots of archaea and eukaryotes, because ancient dsDNA viruses have similar features in viruses of archaea and the eukaryotic
nucleus. A special feature is the RNA proofreading and repair ability of RNA polymerases,
which would be the precondition of an RNA
genome in the early RNA world because of the
relatively unstable RNA structures.145147 On
the other hand, this instability is a necessary
precondition for the high productivity of different RNA sequences with the rapidly adapting
features necessary in the early RNA world, that
is, a large variety of different RNA identities.
In particular, the mode of replication of eukaryotes shows a wide variety of hierachical
ordered processes that each depend on signalling processes to indicate the successful termination of the preceding process. A temporal order exists, with time windows processed
by different process design connected with the
cell cycle, along with transport systems of signals and complex messages, agents, co-agents
and helpers, such as ancillary proteins and
a network of interwoven regulatory elements
which suppress or amplify the start and stops of
semioses, production, regulation, and the whole
toolbox of natural genetic engineering. We now
know that all these processes involve RNAs,
which become active via transcription out of
the DNA storage medium prior to translation
into proteins.
During translation from digital DNA storage into the analog code of protein language
it is interesting that DNA information contains
multiple RNA- and protein meanings, that is,
from the same genetic data set it is possible to
transcribe multiple RNA species or translate a
variety of proteins according to the higher order regulations inherent in epigenetic control
and/or transcriptional, pre- or posttranscriptional modification targets. Additionally, in eukaryotes we find some noncoding RNA species,
such as small nuclear, small nucleolar, small
261
262
263
Gag proteins of the two Drosophila telomeric retrotransposons are targeted to chromosome ends. J.
Cell Biol. 159: 397402.
Peterson-Burch, B.D. & D.F. Voytas. 2002. Genes
of the Pseudoviridae (Ty1/copia retrotransposons).
Mol. Biol. Evol. 19: 18321845.
Tang, Y., U. Winkler, E.O. Fredd, et al. 1999. Cellular motor protein KIF-4 associates with retroviral
gag. J. Virol. 73: 1050810513.
Gao, X., E.R. Havecker, P.V. Baranov, et al. 2003.
Translational recoding signals between gag and pol
in diverse LTR retrotransposons. RNA 9: 1422
1430.
Sfakianos, J.N. & E. Hunter. 2003. M-PMV capsid
transport is mediated by env/gag interactions at the
pericentriolar recycling endosome. Traffic 4: 671
680.
Ivanyi-Nagy, R. & J.L. Darlix. 2008. Ancient retrotransposons as possible remnants of the primitive RNP World. In Retrotransposition, Diversity
and the Brain. F.H. Gage & Y. Christen, Eds.
Springer. Berlin/Heidelberg, doi: 10.1007/978-3540-74966-0_4.
Shadan, F.F. & L.P. Villarreal. 1996. The evolution
of small DNA viruses of eukaryotes: Past and present
considerations. Vir. Gen. 11: 239257.
Nisole, S. & A. Saib. 2004. Early Steps of retrovirus
replicative cycle. Retrovir. 1: 9.
Greber, U.F. 2002. Signalling in viral entry. Cell Mol.
Life Sci. 59: 608626.
Greber, U.F. 2005. Viral trafficking violations in axons: The herpesvirus case. Proc. Natl. Acad. Sci. USA
102: 56395640.
Vale, R.D. 2003. The molecular motor toolbox for
intracellular transport. Cell 112: 467480.
Cochrane, A.W., M.T. McNally & A.J. Mouland.
2006. The retrovirus RNA trafficking granule: From
birth to maturity. Retrovir. 3: 18.
Carlton, J.G. & J. Martin-Serrano. 2007. Parallels between cytokinesis and retroviral budding: A
role for the ESCRT machinery. Science 316: 1908
1912.
Dustin, M.L. & D.R. Colman. 2002. Neural and
immunological synaptic relations. Science 298: 785.
Piguet, V. & Q. Sattenaut. 2004. Dangerous liaisons
at the virological synapse. J. Clin. Invest. 114: 605.
Shav-Tal, Y. & R.H. Singer. 2005. RNA localisation.
J. Cell Sci. 118: 40774081.
Mingle, L.A., N.N. Okuhama, J. Shi, et al. 2005.
Localization of all seven messenger RNAs for the
actin-polymerazitation nucleator arp2/3 complex
in the protrusions of fibroblasts. J. Cell Sci. 118:
24252433.
Yedavalli, V.S., C. Neuveut, Y.H. Chi, et al. 2004.
Requirement of DDX3 dead box RNA helicase for
9.
10.
11.
12.
13.
14.
15.
16.
17.
18.
19.
20.
21.
22.
23.
24.
25.
26.
27.
28.
29.
30.
31.
32.
33.
34.
35.
36.
37.
38.
39.
40.
41.
42.
43.
264
44.
45.
46.
47.
48.
49.
50.
51.
52.
53.
54.
55.
56.
57.
58.
59.
60.
61.
62.
63.
64.
65.
66.
67.
68.
69.
70.
71.
72.
73.
74.
75.
76.
77.
265
266
111. Weber, M.J. 2006. Mammalian small nucleolar
RNAs are mobile genetic elements. PLoS Genet. 2:
e205.
112. Platini, M., I.Goldberg, A.I. Lamond, et al. 2002.
Cajal Body dynamics and association with chromatin are ATP dependent. Nat. Cell. Biol. 4: 502
508.
113. Tomlinson, R.L., T.A. Ziegler, T. Supakorndej, et
al. 2006. Cell cycle-regulated trafficking of human
telomerase to telomeres. Mol. Biol. Cell. 17: 955
965.
114. Kiss, T. 2002. Small nucleolar RNAs: An abundant group of noncoding RNAs with diverse cellular
functions. Cell 109: 145148.
115. Kiss, A.M., B.E. Jady, X. Darzacq, et al. 2002. A
Cajal body-specific pseudouridylation guide RNA
is composed of two box H/ACA snoRNA-like domains. Nuc. Ac. Res. 30: 46434649.
116. Clouet dOrval, B., M.L. Bortolin, C. Gaspin, et al.
2001. Box C/D RNA guides for the ribose methylation of archaeal tRNAs. The tRNATrp intron guides
the formation of two ribose-methylated nucleosides
in the mature tRNATrp. Nuc. Ac. Res. 29: 4518
4529.
117. Wang, C. & U.T. Meier. 2004. Architecture and
assembly of mammalian H/ACA small nucleolar
and telomerase ribonucleoproteins. EMBO J. 23:
18571867.
118. Witzany, G. 2008. The viral origins of telomeres,
telomerases and their important role in eukaryogenesis and genome maintenance. Biosem. 2: 191206.
119. Jady, B.E., E. Bertrand & T. Kiss. 2004. Human
telomerase RNA and box H/ACA scaRNAs share
a common Cajal body-specific localization signal. J.
Cell. Biol. 164: 647652.
120. Fu, D. & K. Collins. 2006. Human telomerase and
Cajal body ribonucleoproteins share a unique specificity of Sm protein association. Gen. Dev. 20: 531
536.
121. Darzacq, X., B.E. Jady, C. Verheggen, et al. 2002.
Cajal body-specific small nuclear RNAs: A novel
class of 2 -O-methylation and pseudouridylation
guide RNAs. EMBO J. 21: 27462756.
122. Richard, P., X. Darzacq, E. Bertrand, et al. 2003.
A common sequence motif determines the Cajal
body-specific localization of box H/ACA scaRNAs.
EMBO J. 22: 42834293.
123. Eickbush, T.H. & D.G. Eickbush. 2007. Finely orchestrated movements: Evolution of the ribosomal
RNA genes. Genet. 175: 477485.
124. Narayanan, A., A. Lukowiak, B.E. Jady, et al. 1999.
Nucleolar localization signals of box H/ACA small
nucleolar RNAs. EMBO J. 18: 51205130.
125. Brosius, J. 2005. Echoes from the past are we still
in an RNP world? Cyt. Gen. Res. 110: 14.
267