Refine
Document Type
- Article (6)
Language
- English (6)
Has Fulltext
- yes (6)
Is part of the Bibliography
- no (6)
Keywords
- - (3)
- AUGUSTUS (1)
- Alignment (1)
- Alignment anchor (1)
- Gene prediction (1)
- Genome alignment (1)
- Geometric hashing (1)
- Miniprot (1)
- Protein coding gene (1)
- Spaced seeds (1)
- YOLOv4 (1)
- animal activity (1)
- animal monitoring (1)
- bacterial-fungal interaction (1)
- computer vision (1)
- fluorescent pseudomonads (1)
- fungal growth inhibition (1)
- genome comparison (1)
- hybridization (1)
- lineage‐specific region (1)
- machine learning (1)
- microfluidic device (1)
- multiple sequence alignment (1)
- pathogenicity (1)
- plant pathogen (1)
- profile hidden Markov model (1)
- red foxes (1)
Institute
Publisher
- BioMed Central (BMC) (2)
- Frontiers Media S.A. (1)
- MDPI (1)
- Oxford University Press (1)
Abstract
Amphidiploid fungal Verticillium longisporum strains Vl43 and Vl32 colonize the plant host Brassica napus but differ in their ability to cause disease symptoms. These strains represent two V. longisporum lineages derived from different hybridization events of haploid parental Verticillium strains. Vl32 and Vl43 carry same‐sex mating‐type genes derived from both parental lineages. Vl32 and Vl43 similarly colonize and penetrate plant roots, but asymptomatic Vl32 proliferation in planta is lower than virulent Vl43. The highly conserved Vl43 and Vl32 genomes include less than 1% unique genes, and the karyotypes of 15 or 16 chromosomes display changed genetic synteny due to substantial genomic reshuffling. A 20 kb Vl43 lineage‐specific (LS) region apparently originating from the Verticillium dahliae‐related ancestor is specific for symptomatic Vl43 and encodes seven genes, including two putative transcription factors. Either partial or complete deletion of this LS region in Vl43 did not reduce virulence but led to induction of even more severe disease symptoms in rapeseed. This suggests that the LS insertion in the genome of symptomatic V. longisporum Vl43 mediates virulence‐reducing functions, limits damage on the host plant, and therefore tames Vl43 from being even more virulent.
Phytopathogenic Verticillia cause Verticillium wilt on numerous economically important crops. Plant infection begins at the roots, where the fungus is confronted with rhizosphere inhabiting bacteria. The effects of different fluorescent pseudomonads, including some known biocontrol agents of other plant pathogens, on fungal growth of the haploid Verticillium dahliae and/or the amphidiploid Verticillium longisporum were compared on pectin-rich medium, in microfluidic interaction channels, allowing visualization of single hyphae, or on Arabidopsis thaliana roots. We found that the potential for formation of bacterial lipopeptide syringomycin resulted in stronger growth reduction effects on saprophytic Aspergillus nidulans compared to Verticillium spp. A more detailed analyses on bacterial-fungal co-cultivation in narrow interaction channels of microfluidic devices revealed that the strongest inhibitory potential was found for Pseudomonas protegens CHA0, with its inhibitory potential depending on the presence of the GacS/GacA system controlling several bacterial metabolites. Hyphal tip polarity was altered when V. longisporum was confronted with pseudomonads in narrow interaction channels, resulting in a curly morphology instead of straight hyphal tip growth. These results support the hypothesis that the fungus attempts to evade the bacterial confrontation. Alterations due to co-cultivation with bacteria could not only be observed in fungal morphology but also in fungal transcriptome. P. protegens CHA0 alters transcriptional profiles of V. longisporum during 2 h liquid media co-cultivation in pectin-rich medium. Genes required for degradation of and growth on the carbon source pectin were down-regulated, whereas transcripts involved in redox processes were up-regulated. Thus, the secondary metabolite mediated effect of Pseudomonas isolates on Verticillium species results in a complex transcriptional response, leading to decreased growth with precautions for self-protection combined with the initiation of a change in fungal growth direction. This interplay of bacterial effects on the pathogen can be beneficial to protect plants from infection, as shown with A. thaliana root experiments. Treatment of the roots with bacteria prior to infection with V. dahliae resulted in a significant reduction of fungal root colonization. Taken together we demonstrate how pseudomonads interfere with the growth of Verticillium spp. and show that these bacteria could serve in plant protection.
Background
An important initial phase of arguably most homology search and alignment methods such as required for genome alignments is seed finding. The seed finding step is crucial to curb the runtime as potential alignments are restricted to and anchored at the sequence position pairs that constitute the seed. To identify seeds, it is good practice to use sets of spaced seed patterns, a method that locally compares two sequences and requires exact matches at certain positions only.
Results
We introduce a new method for filtering alignment seeds that we call geometric hashing. Geometric hashing achieves a high specificity by combining non-local information from different seeds using a simple hash function that only requires a constant and small amount of additional time per spaced seed. Geometric hashing was tested on the task of finding homologous positions in the coding regions of human and mouse genome sequences. Thereby, the number of false positives was decreased about million-fold over sets of spaced seeds while maintaining a very high sensitivity.
Conclusions
An additional geometric hashing filtering phase could improve the run-time, accuracy or both of programs for various homology-search-and-align tasks.
Background
The Earth Biogenome Project has rapidly increased the number of available eukaryotic genomes, but most released genomes continue to lack annotation of protein-coding genes. In addition, no transcriptome data is available for some genomes.
Results
Various gene annotation tools have been developed but each has its limitations. Here, we introduce GALBA, a fully automated pipeline that utilizes miniprot, a rapid protein-to-genome aligner, in combination with AUGUSTUS to predict genes with high accuracy. Accuracy results indicate that GALBA is particularly strong in the annotation of large vertebrate genomes. We also present use cases in insects, vertebrates, and a land plant. GALBA is fully open source and available as a docker image for easy execution with Singularity in high-performance computing environments.
Conclusions
Our pipeline addresses the critical need for accurate gene annotation in newly sequenced genomes, and we believe that GALBA will greatly facilitate genome annotation for diverse organisms.
Background
The alignment of large numbers of protein sequences is a challenging task and its importance grows rapidly along with the size of biological datasets. State-of-the-art algorithms have a tendency to produce less accurate alignments with an increasing number of sequences. This is a fundamental problem since many downstream tasks rely on accurate alignments.
Results
We present learnMSA, a novel statistical learning approach of profile hidden Markov models (pHMMs) based on batch gradient descent. Fundamentally different from popular aligners, we fit a custom recurrent neural network architecture for (p)HMMs to potentially millions of sequences with respect to a maximum a posteriori objective and decode an alignment. We rely on automatic differentiation of the log-likelihood, and thus, our approach is different from existing HMM training algorithms like Baum–Welch. Our method does not involve progressive, regressive, or divide-and-conquer heuristics. We use uniform batch sampling to adapt to large datasets in linear time without the requirement of a tree. When tested on ultra-large protein families with up to 3.5 million sequences, learnMSA is both more accurate and faster than state-of-the-art tools. On the established benchmarks HomFam and BaliFam with smaller sequence sets, it matches state-of-the-art performance. All experiments were done on a standard workstation with a GPU.
Conclusions
Our results show that learnMSA does not share the counterintuitive drawback of many popular heuristic aligners, which can substantially lose accuracy when many additional homologs are input. LearnMSA is a future-proof framework for large alignments with many opportunities for further improvements.