We study the distribution of new classes of motifs in genes, a research field that has not been investigated to date. A single-frame motif
SF has no trinucleotide in reading frame (frame 0) that occurs in a shifted frame (frame 1 or 2),
[...] Read more.
We study the distribution of new classes of motifs in genes, a research field that has not been investigated to date. A single-frame motif
SF has no trinucleotide in reading frame (frame 0) that occurs in a shifted frame (frame 1 or 2), e.g., the dicodon
AAACAA is
as the trinucleotides
AAA and
CAA do not occur in a shifted frame. A motif which is not single-frame
is multiple-frame
. Several classes of
motifs are defined and analysed. The distributions of single-frame
motifs (associated with an unambiguous trinucleotide decoding in the two
and
directions) and 5′ unambiguous motifs
(associated with an unambiguous trinucleotide decoding in the
direction only) are analysed without and with constraints. The constraints studied are: initiation and stop codons, periodic codons
, antiparallel complementarity and parallel complementarity. Taken together, these results suggest that the complementarity property involved in the antiparallel (DNA double helix, RNA stem) and parallel sequences could also be fundamental for coding genes with an unambiguous trinucleotide decoding in the two
and
directions or the
direction only. Furthermore, the single-frame motifs
with a property of trinucleotide decoding and the framing motifs
(also called circular code motifs; first introduced by Michel (2012)) with a property of reading frame decoding may have been involved in the early life genes to build the modern genetic code and the extant genes. They could have been involved in the stage without anticodon-amino acid interactions or in the Implicated Site Nucleotides (ISN) of RNA interacting with the amino acids. Finally, the
and
dipeptides associated with the
and
dicodons, respectively, are studied and their importance for biology and the origin of life discussed.
Full article