Robust detection and genotyping of single feature polymorphisms from gene expression data

Minghui Wang, Xiaohua Hu, Gang Li, Lindsey J. Leach, Elena Potokina, Arnis Druka, Robbie Waugh, Michael J. Kearsey, Zewei Luo

    Research output: Contribution to journalArticle

    9 Citations (Scopus)
    3 Downloads (Pure)

    Abstract

    It is well known that Affymetrix microarrays are widely used to predict genome-wide gene expression and genome-wide genetic polymorphisms from RNA and genomic DNA hybridization experiments, respectively. It has recently been proposed to integrate the two predictions by use of RNA microarray data only. Although the ability to detect single feature polymorphisms (SFPs) from RNA microarray data has many practical implications for genome study in both sequenced and unsequenced species, it raises enormous challenges for statistical modelling and analysis of microarray gene expression data for this objective. Several methods are proposed to predict SFPs from the gene expression profile. However, their performance is highly vulnerable to differential expression of genes. The SFPs thus predicted are eventually a reflection of differentially expressed genes rather than genuine sequence polymorphisms. To address the problem, we developed a novel statistical method to separate the binding affinity between a transcript and its targeting probe and the parameter measuring transcript abundance from perfect-match hybridization values of Affymetrix gene expression data. We implemented a Bayesian approach to detect SFPs and to genotype a segregating population at the detected SFPs. Based on analysis of three Affymetrix microarray datasets, we demonstrated that the present method confers a significantly improved robustness and accuracy in detecting the SFPs that carry genuine sequence polymorphisms when compared to its rivals in the literature. The method developed in this paper will provide experimental genomicists with advanced analytical tools for appropriate and efficient analysis of their microarray experiments and biostatisticians with insightful interpretation of Affymetrix microarray data.

    Original languageEnglish
    JournalPLoS Computational Biology
    Volume5
    Issue number3
    DOIs
    Publication statusPublished - 1 Mar 2009

    Fingerprint

    Polymorphism
    Gene Expression Data
    Gene expression
    genotyping
    gene expression
    Microarrays
    polymorphism
    genetic polymorphism
    Gene Expression
    Genome
    RNA
    Microarray Analysis
    Microarray Data
    Genes
    Nucleic Acid Hybridization
    Microarray
    Bayes Theorem
    Genetic Polymorphisms
    genome
    Transcriptome

    Cite this

    Wang, Minghui ; Hu, Xiaohua ; Li, Gang ; Leach, Lindsey J. ; Potokina, Elena ; Druka, Arnis ; Waugh, Robbie ; Kearsey, Michael J. ; Luo, Zewei. / Robust detection and genotyping of single feature polymorphisms from gene expression data. In: PLoS Computational Biology. 2009 ; Vol. 5, No. 3.
    @article{b9b33d3704c64322a5c3d7f101ca8458,
    title = "Robust detection and genotyping of single feature polymorphisms from gene expression data",
    abstract = "It is well known that Affymetrix microarrays are widely used to predict genome-wide gene expression and genome-wide genetic polymorphisms from RNA and genomic DNA hybridization experiments, respectively. It has recently been proposed to integrate the two predictions by use of RNA microarray data only. Although the ability to detect single feature polymorphisms (SFPs) from RNA microarray data has many practical implications for genome study in both sequenced and unsequenced species, it raises enormous challenges for statistical modelling and analysis of microarray gene expression data for this objective. Several methods are proposed to predict SFPs from the gene expression profile. However, their performance is highly vulnerable to differential expression of genes. The SFPs thus predicted are eventually a reflection of differentially expressed genes rather than genuine sequence polymorphisms. To address the problem, we developed a novel statistical method to separate the binding affinity between a transcript and its targeting probe and the parameter measuring transcript abundance from perfect-match hybridization values of Affymetrix gene expression data. We implemented a Bayesian approach to detect SFPs and to genotype a segregating population at the detected SFPs. Based on analysis of three Affymetrix microarray datasets, we demonstrated that the present method confers a significantly improved robustness and accuracy in detecting the SFPs that carry genuine sequence polymorphisms when compared to its rivals in the literature. The method developed in this paper will provide experimental genomicists with advanced analytical tools for appropriate and efficient analysis of their microarray experiments and biostatisticians with insightful interpretation of Affymetrix microarray data.",
    author = "Minghui Wang and Xiaohua Hu and Gang Li and Leach, {Lindsey J.} and Elena Potokina and Arnis Druka and Robbie Waugh and Kearsey, {Michael J.} and Zewei Luo",
    year = "2009",
    month = "3",
    day = "1",
    doi = "10.1371/journal.pcbi.1000317",
    language = "English",
    volume = "5",
    journal = "PLoS Computational Biology",
    issn = "1553-734X",
    publisher = "Public Library of Science",
    number = "3",

    }

    Robust detection and genotyping of single feature polymorphisms from gene expression data. / Wang, Minghui; Hu, Xiaohua; Li, Gang; Leach, Lindsey J.; Potokina, Elena; Druka, Arnis; Waugh, Robbie; Kearsey, Michael J.; Luo, Zewei.

    In: PLoS Computational Biology, Vol. 5, No. 3, 01.03.2009.

    Research output: Contribution to journalArticle

    TY - JOUR

    T1 - Robust detection and genotyping of single feature polymorphisms from gene expression data

    AU - Wang, Minghui

    AU - Hu, Xiaohua

    AU - Li, Gang

    AU - Leach, Lindsey J.

    AU - Potokina, Elena

    AU - Druka, Arnis

    AU - Waugh, Robbie

    AU - Kearsey, Michael J.

    AU - Luo, Zewei

    PY - 2009/3/1

    Y1 - 2009/3/1

    N2 - It is well known that Affymetrix microarrays are widely used to predict genome-wide gene expression and genome-wide genetic polymorphisms from RNA and genomic DNA hybridization experiments, respectively. It has recently been proposed to integrate the two predictions by use of RNA microarray data only. Although the ability to detect single feature polymorphisms (SFPs) from RNA microarray data has many practical implications for genome study in both sequenced and unsequenced species, it raises enormous challenges for statistical modelling and analysis of microarray gene expression data for this objective. Several methods are proposed to predict SFPs from the gene expression profile. However, their performance is highly vulnerable to differential expression of genes. The SFPs thus predicted are eventually a reflection of differentially expressed genes rather than genuine sequence polymorphisms. To address the problem, we developed a novel statistical method to separate the binding affinity between a transcript and its targeting probe and the parameter measuring transcript abundance from perfect-match hybridization values of Affymetrix gene expression data. We implemented a Bayesian approach to detect SFPs and to genotype a segregating population at the detected SFPs. Based on analysis of three Affymetrix microarray datasets, we demonstrated that the present method confers a significantly improved robustness and accuracy in detecting the SFPs that carry genuine sequence polymorphisms when compared to its rivals in the literature. The method developed in this paper will provide experimental genomicists with advanced analytical tools for appropriate and efficient analysis of their microarray experiments and biostatisticians with insightful interpretation of Affymetrix microarray data.

    AB - It is well known that Affymetrix microarrays are widely used to predict genome-wide gene expression and genome-wide genetic polymorphisms from RNA and genomic DNA hybridization experiments, respectively. It has recently been proposed to integrate the two predictions by use of RNA microarray data only. Although the ability to detect single feature polymorphisms (SFPs) from RNA microarray data has many practical implications for genome study in both sequenced and unsequenced species, it raises enormous challenges for statistical modelling and analysis of microarray gene expression data for this objective. Several methods are proposed to predict SFPs from the gene expression profile. However, their performance is highly vulnerable to differential expression of genes. The SFPs thus predicted are eventually a reflection of differentially expressed genes rather than genuine sequence polymorphisms. To address the problem, we developed a novel statistical method to separate the binding affinity between a transcript and its targeting probe and the parameter measuring transcript abundance from perfect-match hybridization values of Affymetrix gene expression data. We implemented a Bayesian approach to detect SFPs and to genotype a segregating population at the detected SFPs. Based on analysis of three Affymetrix microarray datasets, we demonstrated that the present method confers a significantly improved robustness and accuracy in detecting the SFPs that carry genuine sequence polymorphisms when compared to its rivals in the literature. The method developed in this paper will provide experimental genomicists with advanced analytical tools for appropriate and efficient analysis of their microarray experiments and biostatisticians with insightful interpretation of Affymetrix microarray data.

    UR - http://www.scopus.com/inward/record.url?scp=63549122013&partnerID=8YFLogxK

    U2 - 10.1371/journal.pcbi.1000317

    DO - 10.1371/journal.pcbi.1000317

    M3 - Article

    C2 - 19282978

    AN - SCOPUS:63549122013

    VL - 5

    JO - PLoS Computational Biology

    JF - PLoS Computational Biology

    SN - 1553-734X

    IS - 3

    ER -