DSHMM: Difference between revisions

From Jstacs
Jump to navigationJump to search
(New page: __NOTOC__ by Michael Seifert, Marc Strickert, Alexander Schliep, and Ivo Grosse == Description == === Motivation === Changes in gene expression levels play a central role in tumors. Infor...)
 
 
(20 intermediate revisions by 5 users not shown)
Line 4: Line 4:
== Description ==
== Description ==
=== Motivation ===
=== Motivation ===
Changes in gene expression levels play a central role in tumors. Information about the distribution of gene expression levels and distances between adjacent genes on chromosomes should be integrated into the analysis of tumor expression profiles.
Changes in gene expression levels play a central role in tumors. Additional information about the distribution of gene expression levels and distances between adjacent genes on chromosomes should be integrated into the analysis of tumor expression profiles.


=== Results ===
=== Results ===
We use a Hidden Markov Model with scaled transition matrices (SHMM) to incorporate chromosomal distances of adjacent genes on chromosomes into the identification of differentially expressed genes in breast cancer. We train the SHMM by integrating information about potential distributions of expression levels of differentially expressed and unchanged genes in tumor. We find that the usage of these information and the modeling of distances between adjacent genes lead to a substantial improvement of the identification of differentially expressed genes. In comparison to existing methods, we find that the SHMM identifies differentially expressed genes with higher accuracy than related methods for analyzing comparative genomic hybridization data. The performance benefit is further supported by the observation that the SHMM predicts genes well-known to be associated with breast cancer. This suggests applications of SHMMs for screening of other tumor expression profiles.
We use a Hidden Markov Model with distance-scaled transition matrices (DSHMM) to incorporate chromosomal distances of adjacent genes on chromosomes into the identification of differentially expressed genes in breast cancer. We train the DSHMM by integrating prior knowledge about potential distributions of expression levels of differentially expressed and unchanged genes in tumor. We find that especially the combination of this data and to a lesser extent the modeling of distances between adjacent genes contribute to a substantial improvement of the identification of differentially expressed genes in comparison to other existing methods. This performance benefit is also supported by the identification of genes well-known to be associated with breast cancer. That suggests applications of DSHMMs for screening of other tumor expression profiles.


== Paper ==
== Paper ==
The paper '''''Exploiting prior knowledge and gene distances in the analysis of tumor expression profiles by extended Hidden Markov Models''''' has been submitted to [http://bioinformatics.oxfordjournals.org/ Bioinformatics].
The paper [http://bioinformatics.oxfordjournals.org/content/27/12/1645 '''''Exploiting prior knowledge and gene distances in the analysis of tumor expression profiles with extended Hidden Markov Models'''''] has been published in [http://bioinformatics.oxfordjournals.org/ Bioinformatics].


== Download ==
== Download ==
* SHMMs will be availaible soon in Jstacs.
* [http://www.jstacs.de/downloads/supplementary_data.zip Supplementary data]: Containing the breast cancer gene expression data set and the breast cancer gene copy number data set (Pollack et al. (2002)) analyzed in the manuscript, and the predictions and scores of the compared methods.
* The data sets will be availaible soon.
* [http://www.jstacs.de/downloads/ModelTrainer.zip Model trainer]: JAR file for analyzing a data set by Gaussian mixture models, HMMs, SHMMs, and DSHMMs.
 
== Related Projects ==
* [[ARHMM]]: integrating local chromosomal dependencies into the analysis of tumor expression profiles
* [[PHHMM]]: improved analysis of Array-CGH data
* [[SHMM]]: utilizing gene-pair orientations for improved analysis of ChIP-chip promoter array data
* [[MeDIP-HMM]]: HMM-based analysis of DNA methylation profiles
* [https://sites.google.com/site/mseifertweb/hmm-book HMM Book]: Hidden Markov Models with Applications in Computational Biology
 
== Follow Me ==
* [https://sites.google.com/site/mseifertweb/home Personal Homepage]

Latest revision as of 09:23, 16 October 2016

by Michael Seifert, Marc Strickert, Alexander Schliep, and Ivo Grosse

Description

Motivation

Changes in gene expression levels play a central role in tumors. Additional information about the distribution of gene expression levels and distances between adjacent genes on chromosomes should be integrated into the analysis of tumor expression profiles.

Results

We use a Hidden Markov Model with distance-scaled transition matrices (DSHMM) to incorporate chromosomal distances of adjacent genes on chromosomes into the identification of differentially expressed genes in breast cancer. We train the DSHMM by integrating prior knowledge about potential distributions of expression levels of differentially expressed and unchanged genes in tumor. We find that especially the combination of this data and to a lesser extent the modeling of distances between adjacent genes contribute to a substantial improvement of the identification of differentially expressed genes in comparison to other existing methods. This performance benefit is also supported by the identification of genes well-known to be associated with breast cancer. That suggests applications of DSHMMs for screening of other tumor expression profiles.

Paper

The paper Exploiting prior knowledge and gene distances in the analysis of tumor expression profiles with extended Hidden Markov Models has been published in Bioinformatics.

Download

  • Supplementary data: Containing the breast cancer gene expression data set and the breast cancer gene copy number data set (Pollack et al. (2002)) analyzed in the manuscript, and the predictions and scores of the compared methods.
  • Model trainer: JAR file for analyzing a data set by Gaussian mixture models, HMMs, SHMMs, and DSHMMs.

Related Projects

  • ARHMM: integrating local chromosomal dependencies into the analysis of tumor expression profiles
  • PHHMM: improved analysis of Array-CGH data
  • SHMM: utilizing gene-pair orientations for improved analysis of ChIP-chip promoter array data
  • MeDIP-HMM: HMM-based analysis of DNA methylation profiles
  • HMM Book: Hidden Markov Models with Applications in Computational Biology

Follow Me