Now showing 1 - 3 of 3
  • 2014Journal Article
    [["dc.bibliographiccitation.firstpage","209"],["dc.bibliographiccitation.issue","2"],["dc.bibliographiccitation.journal","RAIRO - Theoretical Informatics and Applications"],["dc.bibliographiccitation.lastpage","245"],["dc.bibliographiccitation.volume","48"],["dc.contributor.author","Brodag, Thomas"],["dc.contributor.author","Herbold, Steffen"],["dc.contributor.author","Waack, Stephan"],["dc.date.accessioned","2018-11-07T09:41:31Z"],["dc.date.available","2018-11-07T09:41:31Z"],["dc.date.issued","2014"],["dc.description.abstract","We combine a new data model, where the random classification is subjected to rather weak restrictions which in turn are based on the Mammen-Tsybakov [E. Mammen and A. B. Tsybakov, Ann. Statis. 27 (1999) 1808-1829; A. B. Tsybakov, Ann. Statis. 32 (2004) 135-166.] small margin conditions, and the statistical query (SQ) model due to Kearns [M. J. Kearns, J. ACM 45 (1998) 983-1006] to what we refer to as PAC + SQ model. We generalize the class conditional constant noise (CCCN) model introduced by Decatur [S. E. Decatur, in ICML '97: Proc. of the Fourteenth Int. Conf. on Machine Learn. Morgan Kaufmann Publishers Inc. San Francisco, CA, USA (1997) 83-91] to the noise model orthogonal to a set of query functions. We show that every polynomial time PAC + SQ learning algorithm can be efficiently simulated provided that the random noise rate is orthogonal to the query functions used by the algorithm given the target concept. Furthermore, we extend the constant-partition classification noise (CPCN) model due to Decatur [S. E. Decatur, in ICML '97: Proc. of the Fourteenth Int. Conf. on Machine Learn. Morgan Kaufmann Publishers Inc. San Francisco, CA, USA (1997) 83-91] to what we call the constant-partition piecewise orthogonal (CPPO) noise model. We show how statistical queries can be simulated in the CPPO scenario, given the partition is known to the learner. We show how to practically use PAC + SQ simulators in the noise model orthogonal to the query space by presenting two examples from bioinformatics and software engineering. This way, we demonstrate that our new noise model is realistic."],["dc.identifier.doi","10.1051/ita/2014005"],["dc.identifier.isi","000339168300004"],["dc.identifier.uri","https://resolver.sub.uni-goettingen.de/purl?gro-2/33750"],["dc.language.iso","en"],["dc.notes.status","zu prüfen"],["dc.notes.submitter","Najko"],["dc.relation.issn","1290-385X"],["dc.relation.issn","0988-3754"],["dc.title","A Generalized Model of Pac Learning and its Applicability"],["dc.type","journal_article"],["dc.type.internalPublication","yes"],["dc.type.peerReviewed","yes"],["dspace.entity.type","Publication"]]
    Details DOI WOS
  • 2006Journal Article
    [["dc.bibliographiccitation.artnumber","142"],["dc.bibliographiccitation.journal","BMC Bioinformatics"],["dc.bibliographiccitation.volume","7"],["dc.contributor.author","Waack, Stephan"],["dc.contributor.author","Keller, Oliver"],["dc.contributor.author","Asper, Roman"],["dc.contributor.author","Brodag, Thomas"],["dc.contributor.author","Damm, Carsten"],["dc.contributor.author","Fricke, Wolfgang Florian"],["dc.contributor.author","Surovcik, Katharina"],["dc.contributor.author","Meinicke, Peter"],["dc.contributor.author","Merkl, Rainer"],["dc.date.accessioned","2018-11-07T10:05:47Z"],["dc.date.available","2018-11-07T10:05:47Z"],["dc.date.issued","2006"],["dc.description.abstract","Background: Horizontal gene transfer (HGT) is considered a strong evolutionary force shaping the content of microbial genomes in a substantial manner. It is the difference in speed enabling the rapid adaptation to changing environmental demands that distinguishes HGT from gene genesis, duplications or mutations. For a precise characterization, algorithms are needed that identify transfer events with high reliability. Frequently, the transferred pieces of DNA have a considerable length, comprise several genes and are called genomic islands (GIs) or more specifically pathogenicity or symbiotic islands. Results: We have implemented the program SIGI-HMM that predicts GIs and the putative donor of each individual alien gene. It is based on the analysis of codon usage (CU) of each individual gene of a genome under study. CU of each gene is compared against a carefully selected set of CU tables representing microbial donors or highly expressed genes. Multiple tests are used to identify putatively alien genes, to predict putative donors and to mask putatively highly expressed genes. Thus, we determine the states and emission probabilities of an inhomogeneous hidden Markov model working on gene level. For the transition probabilities, we draw upon classical test theory with the intention of integrating a sensitivity controller in a consistent manner. SIGI-HMM was written in JAVA and is publicly available. It accepts as input any file created according to the EMBL-format. It generates output in the common GFF format readable for genome browsers. Benchmark tests showed that the output of SIGI-HMM is in agreement with known findings. Its predictions were both consistent with annotated GIs and with predictions generated by different methods. Conclusion: SIGI-HMM is a sensitive tool for the identification of GIs in microbial genomes. It allows to interactively analyze genomes in detail and to generate or to test hypotheses about the origin of acquired genes."],["dc.identifier.doi","10.1186/1471-2105-7-142"],["dc.identifier.isi","000238903700001"],["dc.identifier.pmid","16542435"],["dc.identifier.purl","https://resolver.sub.uni-goettingen.de/purl?gs-1/12509"],["dc.identifier.uri","https://resolver.sub.uni-goettingen.de/purl?gro-2/38969"],["dc.language.iso","en"],["dc.notes.intern","Merged from goescholar"],["dc.notes.status","final"],["dc.notes.submitter","Najko"],["dc.relation.issn","1471-2105"],["dc.rights","CC BY 2.0"],["dc.rights.uri","https://creativecommons.org/licenses/by/2.0/"],["dc.title","Score-based prediction of genomic islands in prokaryotic genomes using hidden Markov models"],["dc.type","journal_article"],["dc.type.internalPublication","yes"],["dc.type.peerReviewed","yes"],["dc.type.version","published_version"],["dspace.entity.type","Publication"]]
    Details DOI PMID PMC WOS
  • 2006Journal Article
    [["dc.bibliographiccitation.artnumber","10"],["dc.bibliographiccitation.journal","Algorithms for Molecular Biology"],["dc.bibliographiccitation.volume","1"],["dc.contributor.author","Meinicke, Peter"],["dc.contributor.author","Brodag, Thomas"],["dc.contributor.author","Fricke, Wolfgang Florian"],["dc.contributor.author","Waack, Stephan"],["dc.date.accessioned","2018-11-07T10:28:14Z"],["dc.date.available","2018-11-07T10:28:14Z"],["dc.date.issued","2006"],["dc.description.abstract","Two important and not yet solved problems in bacterial genome research are the identification of horizontally transferred genes and the prediction of gene expression levels. Both problems can be addressed by multivariate analysis of codon usage data. In particular dimensionality reduction methods for visualization of multivariate data have shown to be effective tools for codon usage analysis. We here propose a multidimensional scaling approach using a novel similarity measure for codon usage tables. Our probabilistic similarity measure is based on P-values derived from the well-known chi-square test for comparison of two distributions. Experimental results on four microbial genomes indicate that the new method is well-suited for the analysis of horizontal gene transfer and translational selection. As compared with the widely-used correspondence analysis, our method did not suffer from outlier sensitivity and showed a better clustering of putative alien genes in most cases."],["dc.identifier.doi","10.1186/1748-7188-1-10"],["dc.identifier.isi","000252697600016"],["dc.identifier.pmid","16808834"],["dc.identifier.purl","https://resolver.sub.uni-goettingen.de/purl?gs-1/4405"],["dc.identifier.uri","https://resolver.sub.uni-goettingen.de/purl?gro-2/43380"],["dc.notes.intern","Merged from goescholar"],["dc.notes.status","zu prüfen"],["dc.notes.submitter","Najko"],["dc.publisher","Biomed Central Ltd"],["dc.relation.issn","1748-7188"],["dc.rights","Goescholar"],["dc.rights.uri","https://goescholar.uni-goettingen.de/licenses"],["dc.title","P-value based visualization of codon usage data"],["dc.type","journal_article"],["dc.type.internalPublication","yes"],["dc.type.peerReviewed","yes"],["dc.type.status","published"],["dc.type.version","published_version"],["dspace.entity.type","Publication"]]
    Details DOI PMID PMC WOS