Institut für Mathematik und Informatik
Refine
Year of publication
Document Type
- Doctoral Thesis (57)
- Article (30)
- Final Thesis (1)
Has Fulltext
- yes (88)
Is part of the Bibliography
- no (88)
Keywords
- - (21)
- Statistik (5)
- Numerische Mathematik (4)
- Optimale Kontrolle (4)
- fractal (4)
- permutation entropy (4)
- Bioinformatik (3)
- Fraktal (3)
- Optimale Steuerung (3)
- Selbstähnlichkeit (3)
Institute
Publisher
- MDPI (14)
- Frontiers Media S.A. (6)
- Springer Nature (4)
- BioMed Central (BMC) (2)
- Oxford University Press (1)
- Wiley (1)
In this thesis, we elaborate upon Bayesian changepoint analysis, whereby our focus is on three big topics: approximate sampling via MCMC, exact inference and uncertainty quantification. Besides, modeling matters are discussed in an ongoing fashion. Our findings are underpinned through several changepoint examples with a focus on a well-log drilling data.
Tuberculosis (TB) has tremendous public health relevance. It most frequently affects the lung and is characterized by the development of unique tissue lesions, termed granulomas. These lesions encompass various immune populations, with macrophages being most extensively investigated. Myeloid derived suppressor cells (MDSCs) have been recently identified in TB patients, both in the circulation and at the site of infection, however their interactions with Mycobacterium tuberculosis (Mtb) and their impact on granulomas remain undefined. We generated human monocytic MDSCs and observed that their suppressive capacities are retained upon Mtb infection. We employed an in vitro granuloma model, which mimics human TB lesions to some extent, with the aim of analyzing the roles of MDSCs within granulomas. MDSCs altered the structure of and affected bacterial containment within granuloma-like structures. These effects were partly controlled through highly abundant secreted IL-10. Compared to macrophages, MDSCs activated primarily the NF-κB and MAPK pathways and the latter largely contributed to the release of IL-10 and replication of bacteria within in vitro generated granulomas. Moreover, MDSCs upregulated PD-L1 and suppressed proliferation of lymphocytes, albeit with negligible effects on Mtb replication. Further comprehensive characterization of MDSCs in TB will contribute to a better understanding of disease pathogenesis and facilitate the design of novel immune-based interventions for this deadly infection.
A New Kind of Permutation Entropy Used to Classify Sleep Stages from Invisible EEG Microstructure
(2017)
Given a manifold with a string structure, we construct a spinor bundle on its loop space. Our construction is in analogy with the usual construction of a spinor bundle on a spin manifold, but necessarily makes use of tools from infinite dimensional geometry. We equip this spinor bundle on loop space with an action of a bundle of Clifford algebras. Given two smooth loops in our string manifold that share a segment, we can construct a third loop by deleting this segment. If this third loop is smooth, then we say that the original pair of loops is a pair of compatible loops. It is well-known that this operation of fusing compatible loops is important if one wants to understand the geometry of a manifold through its loop space. In this work, we explain in detail how the spinor bundle on loop space behaves with respect to fusion of compatible loops. To wit, we construct a family of fusion isomorphisms indexed by pairs of compatible loops in our string manifold. Each of these fusion isomorphisms is an isomorphism from the relative tensor product of the fibres of the spinor bundle over its index pair of compatible loops to the fibre over the loop that is the result of fusing the index pair. The construction of a spinor bundle on loop space equipped with a fusion product as above was proposed by Stolz and Teichner with the goal of studying the Dirac operator on loop space". Our construction combines facets of the theory of bimodules for von Neumann algebras, infinite dimensional manifolds, and Lie groups and their representations. We moreover place our spinor bundle on loop space in the context of bundle gerbes and bundle gerbe modules.
In phylogenetics, evolutionary relationships of different species are represented by phylogenetic trees.
In this thesis, we are mainly concerned with the reconstruction of ancestral sequences and the accuracy of this reconstruction given a rooted binary phylogenetic tree.
For example, we wish to estimate the DNA sequences of the ancestors given the observed DNA sequences of today living species.
In particular, we are interested in reconstructing the DNA sequence of the last common ancestor of all species under consideration. Note that this last common ancestor corresponds to the root of the tree.
There exist various methods for the reconstruction of ancestral sequences.
A widely used principle for ancestral sequence reconstruction is the principle of parsimony (Maximum Parsimony).
This principle means that the simplest explanation it the best.
Applied to the reconstruction of ancestral sequences this means that a sequence which requires the fewest evolutionary changes along the tree is reconstructed.
Thus, the number of changes is minimized, which explains the name of Maximum Parsimony.
Instead of estimating a whole DNA sequence, Maximum Parsimony considers each position in the sequence separately. Thus in the following, each sequence position is regarded separately, and we call a single position in a sequence state.
It can happen that the state of the last common ancestor is reconstructed unambiguously, for example as A. On the other hand, Maximum Parsimony might be indecisive between two DNA nucleotides, say for example A and C.
In this case, the last common ancestor will be reconstructed as {A,C}.
Therefore we consider, after an introduction and some preliminary definitions, the following question in Section 3: how many present-day species need to be in a certain state, for example A, such that the Maximum Parsimony estimate of the last common ancestor is also {A}?
The answer of this question depends on the tree topology as well as on the number of different states.
In Section 4, we provide a sufficient condition for Maximum Parsimony to recover the ancestral state at the root correctly from the observed states at the leaves.
The so-called reconstruction accuracy for the reconstruction of ancestral states is introduced in Section 5. The reconstruction accuracy is the probability that the true root state is indeed reconstructed and always takes two processes into account: on the one hand the approach to reconstruct ancestral states, and on the other hand the way how the states evolve along the edges of the tree. The latter is given by an evolutionary model.
In the present thesis, we focus on a simple symmetric model, the Neyman model.
The symmetry of the model means for example that a change from A to C is equally likely than a change from C to A.
Intuitively, one could expect that the reconstruction accuracy it the highest when all present-day species are taken into account. However, it has long been known that the reconstruction accuracy improves when some taxa are disregarded for the estimation.
Therefore, the question if there exits at least a lower bound for the reconstruction accuracy arises, i.e. if it is best to consider all today living species instead of just one for the reconstruction.
This is bad news for Maximum Parsimony as a criterion for ancestral state reconstruction, and therefore the question if there exists at least a lower bound for the reconstruction accuracy arises.
In Section 5, we start with considering ultrametric trees, which are trees where the expected number of substitutions from the root to each leaf is the same.
For such trees, we investigate a lower bound for the reconstruction accuracy, when the number of different states at the leaves of the tree is 3 or 4.
Subsequently in Section 6, in order to generalize this result, we introduce a new method for ancestral state reconstruction: the coin-toss method.
We obtain new results for the reconstruction accuracy of Maximum Parsimony by relating Maximum Parsimony to the coin-toss method.
Some of these results do not require the underlying tree to be ultrametric.
Then, in Section 7 we investigate the influence of specific tree topologies on the reconstruction accuracy of Maximum Parsimony. In particular, we consider balanced and imbalanced trees as the balance of a tree may have an influence on the reconstruction accuracy.
We end by introducing the Colless index in Section 8, an index which measures the degree of balance a rooted binary tree can have, and analyze its extremal properties.
Self-affine tiles and fractals are known as examples in analysis and topology, as models of quasicrystals and biological growth, as unit intervals of generalized number systems, and as attractors of dynamical systems. The author has implemented a software which can find new examples and handle big databases of self-affine fractals. This thesis establishes the algebraic foundation of the algorithms of the IFStile package. Lifting and projection of algebraic and rational iterated function systems and many properties of the resulting attractors are discussed.
Objektive Eingruppierung sequenzierter Tollwutisolate mithilfe des Affinity Propagation Clusterings.
(2018)
Das International Committee on Taxonomy of Viruses (ICTV) reguliert die Nomenklatur von Viren sowie die Entstehung neuer Taxa (dazu gehören: Ordnung, Familie, Unterfamilie, Gattung und Art/Spezies). Dank dieser Anstrengungen ist die Einteilung für verschiedenste Viren in diese Kategorien klar und transparent nachvollziehbar. In den vergangenen Jahrzehnten sind insgesamt mehr als 21.000 Datensätze der Spezies „rabies lyssavirus“ (RABV) sequenziert worden. Eine weiterführende Unterteilung der sequenzierten Virusisolate dieser Spezies ist bislang jedoch nicht einheitlich vorgeschlagen. Die große Anzahl an sequenzierten Isolaten führte auf Basis von phylogenetischen Bäumen zu uneindeutigen Ergebnissen bei der Einteilung in Cluster. Inhalt meiner Dissertation ist daher ein Vorschlag, diese Problematik mit der Anwendung einer partitionierenden Clusteringmethode zu lösen. Dazu habe ich erstmals die Methodik des affinity propagation clustering (AP) für solche Fragestellungen eingesetzt. Als Datensatz wurden alle verfügbaren sequenzierten Vollgenomisolate der Spezies RABV analysiert. Die Analysen des Datensatzes ergaben vier Hauptcluster, die sich geographisch separieren ließen und entsprechend als „Arctic“, „Cosmopolitain“, „Asian“ und „New World“ bezeichnet wurden. Weiterführende Analysen erlaubten auch eine weitere Aufteilung dieser Hauptcluster in 12-13 Untercluster. Zusätzlich konnte ich einen Workflow generieren, der die Möglichkeit bietet, die mittels AP definierten Cluster mit den Ergebnissen der phylogenetischen Auswertungen zu kombinieren. Somit lassen sich sowohl Verwandtschaftsverhältnisse erkennen als auch eine objektive Clustereinteilung vornehmen. Dies könnte auch ein möglicher Analyseweg für weitere Virusspezies oder andere vergleichende Sequenzanalysen sein.
As the tree of life is populated with sequenced genomes ever more densely, the new challenge is the accurate and consistent annotation of entire clades of genomes. In my dissertation, I address this problem with a new approach to comparative gene finding that takes a multiple genome alignment of closely related species and simultaneously predicts the location and structure of protein-coding genes in all input genomes, thereby exploiting negative selection and sequence conservation. The model prefers potential gene structures in the different genomes that are in agreement with each other, or—if not—where the exon gains and losses are plausible given the species tree. The multi-species gene finding problem is formulated as a binary labeling problem on a graph. The resulting optimization problem is NP hard, but can be efficiently approximated using a subgradient-based dual decomposition approach.
I tested the novel approach on whole-genome alignments of 12 vertebrate and 12 Drosophila species. The accuracy was evaluated for human, mouse and Drosophila melanogaster and compared to competing methods. Results suggest that the new method is well-suited for annotation of a large number of genomes of closely related species within a clade, in particular, when RNA-Seq data are available for many of the genomes. The transfer of existing annotations from one genome to another via the genome alignment is more accurate than previous approaches that are based on protein-spliced alignments, when the genomes are at close to medium distances. The method is implemented in C++ as part of the gene finder AUGUSTUS.
Die vorliegende Arbeit ist im Bereich der parameterfreien Statistik anzusiedeln und beschäftigt sich mit der Anwendung von ordinalen Verfahren auf Zeitreihen und Bilddaten. Die Basis bilden dabei die sogenannten ordinalen Muster in ein bzw. zwei Dimensionen. Der erste Hauptteil der Arbeit gibt einen Überblick über die breiten Einsatzmöglichkeiten ordinaler Muster in der Zeitreihenanalyse. Mit ihrer Hilfe wird bei simulierten gebrochenen Brownschen Bewegungen der Hurst-Exponenten geschätzt und anhand von EEG-Daten eine Klassifikationsaufgabe gelöst. Des Weiteren wird die auf der Verteilung der ordinalen Muster beruhende Permutationsentropie eingesetzt, um in Magnetresonanztomographie (MRT)-Ruhedaten Kopfbewegungen der Probanden zu detektieren. Der zweite Hauptteil der Arbeit befasst sich mit der Erweiterung der ordinalen Muster auf zwei Dimensionen, um sie für Bilddaten nutzbar zu machen. Nach einigen Betrachtungen an fraktalen Oberflächen steht eine automatisierte und robuste Einschätzung der Qualität struktureller MRT-Daten im Vordergrund.