Refine
Year of publication
Document Type
- Doctoral Thesis (52)
- Article (15)
Has Fulltext
- yes (67)
Is part of the Bibliography
- no (67)
Keywords
- - (15)
- Statistik (5)
- Numerische Mathematik (4)
- Optimale Kontrolle (4)
- permutation entropy (4)
- Bioinformatik (3)
- Fraktal (3)
- Optimale Steuerung (3)
- Selbstähnlichkeit (3)
- fractal (3)
- Algebra (2)
- Algorithmus (2)
- Bundle Gerbes (2)
- Fock-Raum (2)
- Funktionalanalysis (2)
- Hierarchie (2)
- Kategorientheorie (2)
- Lévy-Prozess (2)
- Mathematical Physics (2)
- Netzwerk (2)
- Robuste Statistik (2)
- autocorrelation (2)
- microarray (2)
- order pattern (2)
- self-similarity (2)
- signal processing (2)
- statistics (2)
- time series (2)
- (generalized) linear mixed model (1)
- (verallgemeinertes) lineares gemischtes Modell (1)
- 1H-NMR (1)
- 33C10 (1)
- 60B15 (1)
- 60H05 (primary) (1)
- <i>A. thaliana</i> (1)
- <i>Mycobacterium avium</i> ssp. <i>paratuberculosis</i> (MAP) (1)
- Abweichung (1)
- Akute lymphatische Leukämie (1)
- Algebra Bundles (1)
- Algorithmic Cheminformatics (1)
- Alignment <Biochemie> (1)
- Approximationsalgorithmus (1)
- Assemblierung (1)
- Ausreißer (1)
- BAYES (1)
- BLAST (1)
- Bayes-Netz (1)
- Beobachtungsprozess (1)
- Bernoulli convolution (1)
- Bernoulli convolutions (1)
- Bernoulli-Faltungen (1)
- Bialgebra (1)
- Bioinformatics (1)
- Biomathematik , Bioinformatik (1)
- Blutdruck (1)
- Box-Cox-Transformation (1)
- Box-Cox-transformation (1)
- Brustkrebs (1)
- Bundle gerbe (1)
- Buscher rules (1)
- Bündelgerbe (1)
- CRF (1)
- Call Center (1)
- Characteristic Attribute Organization System (1)
- Chromosomenaberration (1)
- Chromosomenanalyse (1)
- Clade Annotation (1)
- Classification (1)
- Cluster (1)
- Clusteranalyse (1)
- Comparative Gene Finding (1)
- Comparative Genomics (1)
- Convergence of Markov chains (1)
- Covid-19 (1)
- DNA taxonomy (1)
- Darbellay-Vajda Partitionsregel (1)
- Darstellung (1)
- Data Science (1)
- Diagnostik (1)
- Dichteschätzung (1)
- Differentialgeometrie (1)
- Differentialgleichung (1)
- Differentialspiele (1)
- Dimensionsreduktion (1)
- Diskrete Optimierung (1)
- Dual Decomposition (1)
- Dual Fitting (1)
- Dualität (1)
- Dynamical System (1)
- Dynamische Systeme (1)
- Elastizitätstheorie (1)
- Elektrodiagnostik (1)
- Elektroencephalogramm (1)
- Elektrokardiogramm (1)
- Entropie (1)
- Enzym (1)
- Eulerian numbers (1)
- Evolution (1)
- Evolution der Genregulation (1)
- Evolution of Gene Regulation (1)
- Expressionsdaten (1)
- Fehler in Variablen Modellen (1)
- Fettleber (1)
- Freies Produkt (1)
- GNS-Konstruktion (1)
- Gammaverteilung (1)
- Ganzzahlige lineare Optimierung (1)
- Gene Structure Prediction (1)
- Genetic Evolution (1)
- Genetische Netzwerke (1)
- Genome Annotation (1)
- Genregulation (1)
- Geometrie (1)
- Geometry (1)
- Germany (1)
- Gewöhnliche Differentialgleichungen (1)
- Graph (1)
- Graph Theory (1)
- Graphentheorie (1)
- Graphenzeichnen (1)
- Grenzwertsatz (1)
- Harn (1)
- Hausdorff-Dimension (1)
- Heart Beat Detection (1)
- Herzfrequenzvariabilität (1)
- Herzschlagerkennung (1)
- High-Dimensional Data (1)
- Highest density regions (1)
- Histogramm (1)
- Histogramm-Konstruktion (1)
- Histogramme (1)
- Hoch-dimensionale Daten (1)
- Hochdurchsatz (1)
- Homotopie (1)
- Homotopy methods (1)
- Hopf bundle (1)
- Hopf-Algebra (1)
- Hopfbündel (1)
- Hutchinson Operator (1)
- Hutchinson-Operator (1)
- Hybride stochastische Modelle (1)
- Hybridisierung (1)
- IL-10 (1)
- Infinite Dimensional Geometry (1)
- Informationsabhängigkeit (1)
- Informationsvisualisierung (1)
- Innerwurfvarianz (1)
- Integer Linear Program (1)
- Interaktionsfläche (1)
- Interpretability (1)
- Interpretable Machine Learning (1)
- Iterated Function System (1)
- Iteriertes Funktionensystem (1)
- Kernspintomografie (1)
- Klassifikation (1)
- Klimaschwankungen (1)
- Kotensorfunktor (1)
- L1-Regression (1)
- Labelingproblem (1)
- Lagrangian Relaxation (1)
- Latent Structure (1)
- Leukämie (1)
- Level-Set-Methode (1)
- Lie-Algebra (1)
- Likelihood-Quotienten-Test (1)
- Linear Elasticity (1)
- Lineare Elastizitätstheorie (1)
- Lévy processes (1)
- Lévy-prozess (1)
- MATLAB (1)
- MCMC (1)
- MRI (1)
- Mahalanobisdistanz (1)
- Mapping (1)
- Markierungswiederherstellung (1)
- Markov-Kette (1)
- Maschinelles Lernen (1)
- Mathematical Phylogenetic (1)
- Mathematical Phylogenetics (1)
- Mathematik (1)
- Mathematische Modellierung (1)
- Mathematische Phylogenetik (1)
- Maximumprinzip (1)
- Maßanalyse (1)
- Medizin (1)
- Metagenomanalyse (1)
- Microarray (1)
- Minkowski-Metrik (1)
- Model selection (1)
- Modellierung (1)
- Monoidale Kategorie (1)
- Mustererkennung (1)
- NGS (1)
- NOD1/2 (1)
- NOD‐like receptors (1)
- NP-completeness (1)
- NP-vollständiges Problem (1)
- Network (1)
- Next Generation Sequencing (1)
- Nichtkommutative Wahrscheinlichkeit (1)
- Nichtlineare Optimierung (1)
- Nichtlineares dynamisches System (1)
- Niere (1)
- Nilpotente Lie-Algebra (1)
- Normalverteilung (1)
- One Health (1)
- Optimal Control (1)
- Optimal control (1)
- Optimalsteuerung (1)
- Optimierung (1)
- PD-L1 (1)
- PDE (1)
- Paläoklimarekonstruktion (1)
- Parameter <Mathematik> (1)
- Permutationsentropie (1)
- Perron-Frobenius theorem (1)
- Photoplethysmogram (1)
- Phylogenetics (1)
- Physiologie (1)
- Pisot-Zahl (1)
- Plethysmogramm (1)
- Podocyte (1)
- Position Specific Scoring Matrix (1)
- Protein (1)
- Proteine (1)
- Proteinwechselwirkungsstelle (1)
- Protonen-NMR-Spektroskopie (1)
- Pseudo-Riemannscher Raum (1)
- Quanten-Lévy-Prozess (1)
- Quantengruppe (1)
- Quantenwahrscheinlichkeitstheorie (1)
- Quantifizierung (1)
- Quasi-Pseudo-Metrik (1)
- RIEMS (1)
- RR Intervals (1)
- RR-Abstände (1)
- Random matrices (1)
- Randomisierung (1)
- Rechenmethoden (1)
- SPecies IDentity and Evolution in R (1)
- Schale (1)
- Schalentheorie (1)
- Scheduling (1)
- Segmentierung (1)
- Self-assembling protein design (1)
- Sequentielle Optimierung (1)
- Shells (1)
- Signal (1)
- Signalanalyse (1)
- Simulation (1)
- Slice (1)
- Spannender Baum (1)
- Spektrum (1)
- Spike and Slab (1)
- Sprungbestrafte Schätzer (1)
- Statistics (1)
- Statistische Analyse (1)
- Statistischer Test (1)
- Stetiger Markov-Prozess (1)
- Stoffwechselkrankheit (1)
- Stoffwechselweg (1)
- String Geometry (1)
- Stückweise deterministischer Markov Prozess (1)
- Symplektischer Raum (1)
- System von gewöhnlichen (1)
- T-Duality (1)
- T-Dualität (1)
- Taxonomie (1)
- Tensoralgebra (1)
- Test von zufälligen Effekten (1)
- Tollwut (1)
- Transcription Factor Binding Site (1)
- Transfer Operator (1)
- Transfer-Operator (1)
- Transkriptionsfaktorbindestellen (1)
- Translokation (1)
- Treemap (1)
- Twisted K-Theory (1)
- Unabhängigkeitanalyse (1)
- Ungewissheit (1)
- Variationsrechnung (1)
- Visualisierung (1)
- Visualization (1)
- Vorhersage (1)
- YOLOv4 (1)
- Zufallsmatrizen (1)
- Zylinderschale (1)
- aging (1)
- animal activity (1)
- animal monitoring (1)
- bacterial-fungal interaction (1)
- beta-representation (1)
- bioinformatics (1)
- biologischer Quellen (1)
- cattle trade (1)
- cell fractionation (1)
- cellular homeostasis (1)
- classification models (1)
- clustering (1)
- computer vision (1)
- concentration coefficient (1)
- conditional random field (1)
- dairy cows (1)
- diagnostic (genetic) characters (1)
- drug (1)
- duale Halbgruppe (1)
- emerging diseases (1)
- epidemic model (1)
- exhaled breath (1)
- fecal headspace (1)
- fluorescent pseudomonads (1)
- free probability (1)
- fungal growth inhibition (1)
- gamma distribution (1)
- granuloma (1)
- histogram construction (1)
- histograms (1)
- hybridization (1)
- immunology (1)
- inborn errors of metabolism (1)
- infectious diseases (1)
- inflammation (1)
- integrative taxonomy (1)
- invariant measure (1)
- invariantes Maß (1)
- konsistente Schätzung (1)
- kw: Graph Theory (1)
- mRNA (1)
- mathematical modeling (1)
- mathematical statistics (1)
- mathematische Statistik (1)
- microfluidic device (1)
- migratory connectivity (1)
- mixed binomial point process (1)
- mixture proposals (1)
- modeling (1)
- modularity (1)
- multinomial distribution (1)
- myeloid-derived suppressor cells (1)
- neighbor map (1)
- network analysis (1)
- nichtparametrische Dichteschätzung (1)
- non-commutative independences (1)
- nonparametric density estimation (1)
- normal distribution (1)
- observation process (1)
- oocyte (1)
- ordinal (1)
- ordinal time series (1)
- ordinale Muster (1)
- outlier detection (1)
- ovine anaplasmosis (1)
- parametric bootstrap (1)
- parametrisches Bootstrap (1)
- paratuberculosis (1)
- pattern (1)
- phylogenetic (1)
- plant pathogen (1)
- pn: Nikolai Nøjgaard (1)
- probabilistic interface labeling problem (1)
- protein (1)
- proteome (1)
- quantum groups (1)
- quantum probability (1)
- rabies (1)
- random forest (1)
- randomization (1)
- red foxes (1)
- relative entropy (1)
- reprogramming (1)
- restricted M-splines (1)
- räumliches Überleben (1)
- sheep (1)
- sleep stages (1)
- small molecule (1)
- snoRNAs (1)
- sparsame Darstellung von Daten (1)
- sparse representation of data (1)
- spatial survival (1)
- spectrum (1)
- sphingolipid metabolism (1)
- stable air (1)
- stichprobenabhängige Partitionsregel (1)
- stochastic block model (1)
- systems biology (1)
- test of random effects (1)
- tick-borne fever (1)
- tiling, self-similarity, fractal, aperiodic, iterated function system (1)
- tissue specificity (1)
- trans dimensional sampling (1)
- tuberculosis (1)
- universelles Produkt (1)
- virology (1)
- volatile organic compound (VOC) (1)
- within-litter variation (1)
Institute
- Institut für Mathematik und Informatik (67) (remove)
Publisher
- MDPI (8)
- Frontiers Media S.A. (4)
- Wiley (1)
Geometric T-Duality
(2022)
From a physicists point of view T-duality is a relation connecting string
theories on different spacetimes. Mathematically speaking, T-duality should be a symmetric relation on
the space of toroidal string backgrounds. Such a background consists of: a smooth manifold M; a torus bundle E over M - the total space modelling spacetime; a Riemannian metric g on E - modelling the field of gravity; a U(1)-bundle gerbe G with connection over E - modelling the Kalb-
Ramond field.
But as of now no complete model for T-duality exists. The three most notable
approaches for T-duality are given by the differential approaches by Buscher in the form of the Buscher rules and by Bouwknegt, Evslin and Mathai in the form of T-duality with H-flux on the one hand, and by the topological approach given by Bunke, Rumpf and Schick which is known as topological T-duality. In this thesis we combine these different approaches to form the first model for T-duality over complete geometric toroidal string backgrounds and we will introduce an example for this geometric T-duality inspired by the Hopf bundle.
Discovering Latent Structure in High-Dimensional Healthcare Data: Toward Improved Interpretability
(2022)
This cumulative thesis describes contributions to the field of interpretable machine learning in the healthcare domain. Three research articles are presented that lie at the intersection of biomedical and machine learning research. They illustrate how incorporating latent structure can provide a valuable compression of the information hidden in complex healthcare data.
Methodologically, this thesis gives an overview of interpretable machine learning and the discovery of latent structure, including clusters, latent factors, graph structure, and hierarchical structure. Different workflows are developed and applied to two main types of complex healthcare data (cohort study data and time-resolved molecular data). The core result builds on Bayesian networks, a type of probabilistic graphical model. On the application side, we provide accurate predictive or discriminative models focusing on relevant medical conditions, related biomarkers, and their interactions.
Spatial variation in survival has individual fitness consequences and influences population dynamics. It proximately and ultimately impacts space use including migratory connectivity. Therefore, knowing spatial patterns of survival is crucial to understand demography of migrating animals. Extracting information on survival and space use from observation data, in particular dead recovery data, requires explicitly identifying the observation process. The main aim of this work is to establish a modeling framework which allows estimating spatial variation in survival, migratory connectivity and observation probability using dead recovery data. We provide some biological background on survival and migration and a short methodological overview of how similar situations are modeled in literature.
Afterwards, we provide REML-like estimators for discrete space and show identifiability of all three parameters using the characteristics of the multinomial distribution. Moreover, we formulate a model in continuous space using mixed binomial point processes. The continuous model assumes a constant recovery probability over space. To drop this strict assumption, we develop an optimization procedure combining the discrete and continuous space model. Therefore, we use penalized M-splines. In simulation studies we demonstrate the performance of the estimators for all three model approaches. Furthermore, we apply the models to real-world data sets of European robins \textit{Erithacus rubecula} and ospreys \textit{Pandion haliaetus}.
We discuss how this study can be embedded in the framework of animal movement and the capture mark recapture/recovery methodology. It can be seen as a contribution and an extension to distance sampling, local stationary everyday movement and dispersal. We emphasize the importance of having a mathematically clearly formulated modeling framework for applied methods. Moreover, we comment on model assumptions and their limits. In the future, it would be appealing to extend this framework to the full annual cycle and carry-over effects.
Anaplasma phagocytophilum and Anaplasma ovis–Emerging Pathogens in the German Sheep Population
(2021)
Knowledge on the occurrence of pathogenic tick-borne bacteria Anaplasma phagocytophilum and Anaplasma ovis is scarce in sheep from Germany. In 2020, owners from five flocks reported ill thrift lambs and ewes with tick infestation. Out of 67 affected sheep, 55 animals were clinically examined and hematological values, blood chemistry and fecal examinations were performed to investigate the underlying disease causes. Serological tests (cELISA, IFAT) and qPCR were applied to all affected sheep to rule out A. phagocytophilum and A. ovis as a differential diagnosis. Ticks were collected from selected pastures and tested by qPCR. Most animals (n = 43) suffered from selenium deficiency and endoparasites were detected in each flock. Anaplasma spp. antibodies were determined in 59% of examined sheep. Seventeen animals tested positive for A. phagocytophilum by qPCR from all flocks and A. phagocytophilum was also detected in eight pools of Ixodes ricinus. Anaplasma phagocytophilum isolates from sheep and ticks were genotyped using three genes (16S rRNA, msp4 and groEL). Anaplasma ovis DNA was identified in six animals from one flock. Clinical, hematological and biochemical changes were not significantly associated with Anaplasma spp. infection. The 16S rRNA analysis revealed known variants of A. phagocytophilum, whereas the msp4 and groEL showed new genotypes. Further investigations are necessary to evaluate the dissemination and health impact of both pathogens in the German sheep population particularly in case of comorbidities.
Abstract
Cellular stress has been associated with inflammation, yet precise underlying mechanisms remain elusive. In this study, various unrelated stress inducers were employed to screen for sensors linking altered cellular homeostasis and inflammation. We identified the intracellular pattern recognition receptors NOD1/2, which sense bacterial peptidoglycans, as general stress sensors detecting perturbations of cellular homeostasis. NOD1/2 activation upon such perturbations required generation of the endogenous metabolite sphingosine‐1‐phosphate (S1P). Unlike peptidoglycan sensing via the leucine‐rich repeats domain, cytosolic S1P directly bound to the nucleotide binding domains of NOD1/2, triggering NF‐κB activation and inflammatory responses. In sum, we unveiled a hitherto unknown role of NOD1/2 in surveillance of cellular homeostasis through sensing of the cytosolic metabolite S1P. We propose S1P, an endogenous metabolite, as a novel NOD1/2 activator and NOD1/2 as molecular hubs integrating bacterial and metabolic cues.
Phytopathogenic Verticillia cause Verticillium wilt on numerous economically important crops. Plant infection begins at the roots, where the fungus is confronted with rhizosphere inhabiting bacteria. The effects of different fluorescent pseudomonads, including some known biocontrol agents of other plant pathogens, on fungal growth of the haploid Verticillium dahliae and/or the amphidiploid Verticillium longisporum were compared on pectin-rich medium, in microfluidic interaction channels, allowing visualization of single hyphae, or on Arabidopsis thaliana roots. We found that the potential for formation of bacterial lipopeptide syringomycin resulted in stronger growth reduction effects on saprophytic Aspergillus nidulans compared to Verticillium spp. A more detailed analyses on bacterial-fungal co-cultivation in narrow interaction channels of microfluidic devices revealed that the strongest inhibitory potential was found for Pseudomonas protegens CHA0, with its inhibitory potential depending on the presence of the GacS/GacA system controlling several bacterial metabolites. Hyphal tip polarity was altered when V. longisporum was confronted with pseudomonads in narrow interaction channels, resulting in a curly morphology instead of straight hyphal tip growth. These results support the hypothesis that the fungus attempts to evade the bacterial confrontation. Alterations due to co-cultivation with bacteria could not only be observed in fungal morphology but also in fungal transcriptome. P. protegens CHA0 alters transcriptional profiles of V. longisporum during 2 h liquid media co-cultivation in pectin-rich medium. Genes required for degradation of and growth on the carbon source pectin were down-regulated, whereas transcripts involved in redox processes were up-regulated. Thus, the secondary metabolite mediated effect of Pseudomonas isolates on Verticillium species results in a complex transcriptional response, leading to decreased growth with precautions for self-protection combined with the initiation of a change in fungal growth direction. This interplay of bacterial effects on the pathogen can be beneficial to protect plants from infection, as shown with A. thaliana root experiments. Treatment of the roots with bacteria prior to infection with V. dahliae resulted in a significant reduction of fungal root colonization. Taken together we demonstrate how pseudomonads interfere with the growth of Verticillium spp. and show that these bacteria could serve in plant protection.
Diese Dissertation untersucht Zusammenhänge der spieltheoretischen Begriffe des Nash- und Stackelberg-Gleichgewichts in Differenialspielen im N-Spieler-Fall. Weiterhin werden drei verschiedene Lösungskonzepte für das Finden von Gleichgewichten in 2-Spieler-Differentialspielen vorgestellt. Direkte Methoden aus der nichtlinearen Optimierung, der globalen Optimierung und der optimalen Steuerung werden verwendet, um Nash- und Stackelberg-Gleichgewichte in 2-Spieler-Differentialspielen zu finden. Anhand von Anwendungsbeispielen werden die Methoden getestet, analysiert und ausgewertet. Eine Erweiterung des Verfolgungsspiels von Isaacs auf Beschleunigungskomponenten wird betrachtet. Ein bisher unbekanntes Stackelberg-Gleichgewicht wird im Kapitalismusspiel nach Lancaster numerisch berechnet. Zuletzt wird ein Problem aus der Fischerei modelliert und anhand der eingeführten Methoden gelöst.
Abstract
The expected signature is an analogue of the Laplace transform for probability measures on rough paths. A key question in the area has been to identify a general condition to ensure that the expected signature uniquely determines the measures. A sufficient condition has recently been given by Chevyrev and Lyons and requires a strong upper bound on the expected signature. While the upper bound was verified for many well‐known processes up to a deterministic time, it was not known whether the required bound holds for random time. In fact, even the simplest case of Brownian motion up to the exit time of a planar disc was open. For this particular case we answer this question using a suitable hyperbolic projection of the expected signature. The projection satisfies a three‐dimensional system of linear PDEs, which (surprisingly) can be solved explicitly, and which allows us to show that the upper bound on the expected signature is not satisfied.
Entropy Ratio and Entropy Concentration Coefficient, with Application to the COVID-19 Pandemic
(2020)
Abstract
With the advent of molecular genetic methods, an increasing number of morphologically cryptic taxa has been discovered. The majority of them, however, remains formally undescribed and without a proper name although their importance in ecology and evolution is increasingly being acknowledged. Despite suggestions to complement traditional descriptions with genetic characters, the taxonomic community appears to be reluctant to adopt this proposition. As an incentive, we introduce QUIDDICH, a tool for the QUick IDentification of DIgnostic CHaracters, which automatically scans a DNA or amino acid alignment for those columns that allow to distinguish taxa and classifies them into four different types of diagnostic characters. QUIDDICH is a system‐independent, fast and user‐friendly tool that requires few manual steps and provides a comprehensive output, which can be included in formal taxonomic descriptions. Thus, cryptic taxa do not have to remain in taxonomic crypsis and, bearing a proper name, can readily be included in biodiversity assessments and ecological and evolutionary analyses. QUIDDICH can be obtained from the comprehensive R archive network (CRAN, https://cran.r-project.org/package=quiddich).
Twisted topological K-theory is a twisted version of topological K-theory in the sense of twisted generalized cohomology theories. It was pioneered by Donavan and Karoubi in 1970 where they used bundles of central simple graded algebras to model twists of K-theory. By the end of the last century physicists realised that D-brane charges in the field of string theory may be studied in terms of twisted K-theory. This rekindled interest in the topic lead to a wave of new models for the twists and new ways to realize the respective twisted K-theory groups. The state-of-the-art models today use bundles of projective unitary operators on separable Hilbert spaces as twists and K-groups are modeled by homotopy classes of sections of certain bundles of Fredholm operators. From a physics perspective these treatments are not optimal yet: they are intrinsically infinite-dimensional and these models do not immediately allow the inclusion of differential data like forms and connections.
In this thesis we introduce the 2-stack of k-algebra gerbes. Objects, 1-morphisms and 2-morphisms consist of finite-dimensional geometric data simultaneously generalizing bundle gerbes and bundles of central simple graded k-algebras for k either the field of real numbers or the field of complex numbers. We construct an explicit isomorphism from equivalence classes of k-algebra gerbes over a space X to the full set of twists of real K-theory and complex K-theory respectively. Further, we model relative twisted K-groups for compact spaces X and closed subspaces Y twisted by algebra gerbes. These groups are modeled directly in terms of 1-morphisms and 2-morphisms of algebra gerbes over X. We exhibit a relation to the K-groups introduced by Donavan and Karoubi and we translate their fundamental isomorphism -- an isomorphism relating K-groups over Thom spaces with K-groups twisted by Clifford algebra bundles -- to the new setting. With the help of this fundamental isomorphism we construct an explicit Thom isomorphism and explicit pushforward homomorphisms for smooth maps between compact manifolds, without requiring these maps to be K-oriented. Further -- in order to treat K-groups for non-torsion twists -- we implement a geometric cocycle model, inspired by a related geometric cycle model developed by Baum and Douglas for K-homology in 1982, and construct an assembly map for this model.
Trade of cattle between farms forms a complex trade network. We investigate partitions of this network for cattle trade in Germany. These partitions are groups of farms with similar properties and they are inferred directly from the trade pattern between farms. We make use of a rather new method known as stochastic block modeling (SBM) in order to divide the network into smaller units. SBM turns out to outperform the more established community detection method in the context of disease control in terms of trade restriction. Moreover, SBM is also superior to geographical based trade restrictions and could be a promising approach for disease control.
A common task in natural sciences is to
describe, characterize, and infer relations between discrete
objects. A set of relations E on a set of objects V can
naturally be expressed as a graph G = (V, E). It is
therefore often convenient to formalize problems in natural
sciences as graph theoretical problems.
In this thesis we will examine a number of problems found in
life sciences in particular, and show how to use graph theoretical
concepts to formalize and solve the presented problems. The
content of the thesis is a collection of papers all
solving separate problems that are relevant to biology
or biochemistry.
The first paper examines problems found in self-assembling
protein design. Designing polypeptides, composed of concatenated
coiled coil units, to fold into polyhedra turns out
to be intimately related to the concept of 1-face embeddings in
graph topology. We show that 1-face embeddings can be
canonicalized in linear time and present algorithms to enumerate
pairwise non-isomorphic 1-face embeddings in orientable surfaces.
The second and third paper examine problems found in evolutionary
biology. In particular, they focus on
inferring gene and species trees directly from sequence data
without any a priori knowledge of the trees topology. The second
paper characterize when gene trees can be inferred from
estimates of orthology, paralogy and xenology relations when only
partial information is available. Using this characterization an
algorithm is presented that constructs a gene tree consistent
with the estimates in polynomial time, if one exists. The
shown algorithm is used to experimentally show that gene trees
can be accurately inferred even in the case that only 20$\%$ of
the relations are known. The third paper explores how to
reconcile a gene tree with a species tree in a biologically
feasible way, when the events of the gene tree are known.
Biologically feasible reconciliations are characterized using
only the topology of the gene and species tree. Using this
characterization an algorithm is shown that constructs a
biologically feasible reconciliation in polynomial time, if one
exists.
The fourth and fifth paper are concerned with with the analysis
of automatically generated reaction networks. The fourth paper
introduces an algorithm to predict thermodynamic properties of
compounds in a chemistry. The algorithm is based on
the well known group contribution methods and will automatically
infer functional groups based on common structural motifs found
in a set of sampled compounds. It is shown experimentally that
the algorithm can be used to accurately
predict a variety of molecular properties such as normal boiling
point, Gibbs free energy, and the minimum free energy of RNA
secondary structures. The fifth and final paper presents a
framework to track atoms through reaction networks generated by a
graph grammar. Using concepts found in semigroup theory, the
paper defines the characteristic monoid of a reaction network. It
goes on to show how natural subsystems of a reaction network organically
emerge from the right Cayley graph of said monoid. The
applicability of the framework is proven by applying it to the
design of isotopic labeling experiments as well as to the
analysis of the TCA cycle.
Mathematical phylogenetics provides the theoretical framework for the reconstruction and analysis of phylogenetic trees and networks. The underlying theory is based on various mathematical disciplines, ranging from graph theory to probability theory.
In this thesis, we take a mostly combinatorial and graph-theoretical position and study different problems concerning phylogenetic trees and networks.
We start by considering phylogenetic diversity indices that rank species for conservation. Two such indices for rooted trees are the Fair Proportion index and the Equal Splits index, and we analyze how different they can be from each other and under which circumstances they coincide. Moreover, we define and investigate analogues of these indices for unrooted trees.
Subsequently, we study the Shapley value of unrooted trees, another popular phylogenetic diversity index. We show that it may fail as a prioritization criterion in biodiversity conservation and is outcompeted by an existing greedy approach. Afterwards, we leave the biodiversity setting and consider the Shapley value as a tree reconstruction tool. Here, we show that non-isomorphic trees may have permutation-equivalent Shapley transformation matrices and identical Shapley values, implying that the Shapley value cannot reliably be employed in tree reconstruction.
In addition to phylogenetic diversity indices, another class of indices frequently discussed in mathematical phylogenetics, is the class of balance indices. In this thesis, we study one of the oldest and most popular of them, namely the Colless index for rooted binary trees. We focus on its extremal values and analyze both its maximum and minimum values as well as the trees that achieve them.
Having analyzed various questions regarding phylogenetic trees, we finally turn to phylogenetic networks. We focus on a certain class of phylogenetic networks, namely tree-based networks, and consider this class both in a rooted and in an unrooted setting.
First, we prove the existence of a rooted non-binary universal tree-based network with n leaves for all positive integers n, that is, we show that there exists a rooted non-binary tree-based network with $n$ leaves that has every non-binary phylogenetic tree on the same leaf set as a base tree.
Finally, we study unrooted tree-based networks and introduce a class of networks that are necessarily tree-based, namely edge-based networks. We show that edge-based networks are closely related to a family of graphs in classical graph theory, so-called generalized series-parallel graphs, and explore this relationship in full detail.
In summary, we add new insights into existing concepts in mathematical phylogenetics, answer open questions in the literature, and introduce new concepts and approaches. In doing so, we make a small but relevant contribution to current research in mathematical phylogenetics.
In this thesis, we elaborate upon Bayesian changepoint analysis, whereby our focus is on three big topics: approximate sampling via MCMC, exact inference and uncertainty quantification. Besides, modeling matters are discussed in an ongoing fashion. Our findings are underpinned through several changepoint examples with a focus on a well-log drilling data.