Non-coding RNA (ncRNA) classes perform important housekeeping and regulatory functions and are quite heterogeneous in length, sequence conservation and secondary structure. High-throughput sequencing has shown that novel expressed ncRNAs and their classification are important for understanding cell regulation and for identifying potential diagnostic and therapeutic biomarkers. To improve the classification of ncRNAs, we investigated different approaches that use primary sequences and secondary structures, as well as the late integration of both, with machine learning models including different neural network architectures. As input, we used the newest version of RNAcentral, focusing on six ncRNA classes: lncRNA, rRNA, tRNA, miRNA, snRNA and snoRNA. The late integration of graph-encoded structural features and primary sequences in our MncR classifier achieved an overall accuracy of >97%, which could not be increased by more fine-grained subclassification. Compared with the currently best-performing tool, ncRDense, MncR achieved a minimal increase of 0.5% across all four overlapping ncRNA classes on a similar test set of sequences. In summary, MncR is not only more accurate than current ncRNA prediction tools but also allows the prediction of long ncRNA classes (lncRNAs, certain rRNAs) of up to 12,000 nt and is trained on a more diverse ncRNA dataset retrieved from RNAcentral.
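The idea of late integration can be illustrated with a toy sketch: features are derived separately from the primary sequence and from the secondary structure, and only then merged into one vector for a downstream classifier. This is not the MncR architecture. The function names, the nucleotide-frequency features and the dot-bracket summary features are all assumptions chosen for illustration, standing in for the neural network branches and graph encoding described in the abstract.

```python
from typing import List

NUCLEOTIDES = "ACGU"  # assumed alphabet; T is mapped to U below


def sequence_features(seq: str) -> List[float]:
    """Toy sequence branch: relative frequency of each nucleotide."""
    seq = seq.upper().replace("T", "U")
    n = len(seq) or 1  # avoid division by zero for empty input
    return [seq.count(base) / n for base in NUCLEOTIDES]


def structure_features(dot_bracket: str) -> List[float]:
    """Toy structure branch: paired fraction and length-normalised nesting depth.

    Expects a secondary structure in dot-bracket notation, e.g. "((..))".
    """
    n = len(dot_bracket) or 1
    paired_fraction = sum(ch in "()" for ch in dot_bracket) / n
    depth = 0
    max_depth = 0
    for ch in dot_bracket:
        if ch == "(":
            depth += 1
            max_depth = max(max_depth, depth)
        elif ch == ")":
            depth -= 1
    return [paired_fraction, max_depth / n]


def late_integration(seq: str, dot_bracket: str) -> List[float]:
    """Merge the outputs of both branches into one feature vector.

    Each branch is computed independently; they are combined only at the end.
    """
    return sequence_features(seq) + structure_features(dot_bracket)


if __name__ == "__main__":
    # Hypothetical hairpin: four base pairs closing a four-nucleotide loop.
    print(late_integration("GCGCAAAAGCGC", "((((....))))"))
```

In a real classifier each branch would be a trained model, and the merged vector would feed further layers that output one of the six ncRNA classes.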
Identification and Regulation of Tomato Serine/Arginine-Rich Proteins Under High Temperatures
(2021)
Alternative splicing is an important mechanism for the regulation of gene expression in eukaryotes during development, cell differentiation and stress response. Alterations in the splicing profiles of genes under high temperatures that cause heat stress (HS) can impact the maintenance of cellular homeostasis and thermotolerance. Consequently, information on the factors involved in HS-sensitive alternative splicing is required to formulate the principles of the HS response. Serine/arginine-rich (SR) proteins have a central role in alternative splicing. We aimed to identify and characterize SR-coding genes in tomato (Solanum lycopersicum), a plant extensively used in HS studies. We identified 17 canonical SR genes and two SR-like genes. Several SR-coding genes show differential expression and altered splicing profiles in different organs as well as in response to HS. The transcriptional induction of five SR genes and one SR-like gene is partially dependent on the master regulator of the HS response, the HS transcription factor HsfA1a. Cis-elements predicted in the promoters of these SR genes can putatively be recognized by HS-induced transcription factors. Further, transiently expressed SRs show reduced or steady-state protein levels in response to HS. Thus, the levels of SRs under HS are regulated by changes in transcription, alternative splicing and protein stability. We propose that the accumulation or reduction of SRs under HS can impact temperature-sensitive alternative splicing.