Nonlinear Dynamics, Psychology, and Life Sciences, Vol. 26, Iss. 4, Oct, 2022, pp. 371-387
@2022 Society for Chaos Theory in Psychology & Life Sciences

 
Discovering y-Narratives Hidden in Non-Coding Human Genome: y-Text-Finder and Genomic Multilayer Store

Havard R. Glattre, Norwegian School of Veterinary Science, Oslo, Norway
Eystein Glattre, Norwegian School of Veterinary Science, Oslo, Norway
Lars Moe, Norwegian School of Veterinary Science, Oslo, Norway

Abstract: This article is about the discovery of thousands of narrative-like structures, like human corpus-texts, but written by y-words which are nucleotide strings delimited by stop codons in the non-coding part of the human genome. In a previous article these strings were shown to behave like human words. We use a text-finder to search for texts composed of the y-words, forming what we call y-narratives, and demonstrate that the non-coding human genome behaves like a multilayer structure due to the way the text-finder works. Tables are presented which show that y-narratives of increasing y-word-length are found in increasingly superior layers of the multilayer structure, although the uppermost layers of many chromosomes may lack a specific y-narrative. We discriminate between two types of y-narratives, one more like human language than the other, and demonstrate some of their linguistic characteristics. Some of the seemingly important consequences of the Astonishing Conjecture are briefly discussed. The overall objective of the paper is to establish an understanding of the corpus-narrative properties of the non-protein-coding genome and enable future study of the informational structure therein.

Keywords: sequence analysis, CCIC, long-range correlation, text mining, datamining, natural language processing, network analysis, bioinformatics