view README.rst @ 4:f294fd77b143 draft

planemo upload commit 08f1831e097df5d74bf60ff5955e7e9c8e524cc8-dirty
author proteore
date Wed, 14 Mar 2018 12:22:51 -0400
parents d7f909ae24d9
children 2c0bab71a436
line wrap: on
line source

Wrapper for Get expression data by tissue Tool
=================================================

**Authors**

T.P. Lien Nguyen, Florence Combes, Yves Vandenbrouck CEA, INSERM, CNRS, Grenoble-Alpes University, BIG Institute, FR

Sandra Dérozier, Olivier Rué, Christophe Caron, Valentin Loux INRA, Paris-Saclay University, MAIAGE Unit, Migale Bioinformatics platform

This work has been partially funded through the French National Agency for Research (ANR) IFB project.

Contact support@proteore.org for any questions or concerns about the Galaxy implementation of this tool.

-------------------------------------------------

This tool retrieve information from Human Protein Atlas (https://www.proteinatlas.org/) 
regarding the expression profiles of human genes both on the mRNA and protein level. 

A list of ENSG (Ensembl gene) IDs must be entered (either via a copy/paste or by choosing a file), 
if it's not the case, please use the ID_Convert tool from ProteoRE.

The resources from Human Protein Atlas that can be queried are the following: 

* **Human normal tissue data**: expression profiles for proteins in human tissues based on immunohistochemisty using tissue micro arrays.

  The tab-separated file includes Ensembl gene identifier ("Gene"), tissue name ("Tissue"), annotated cell type ("Cell type"), expression value ("Level"), and the gene reliability of the expression value ("Reliability"). 

  The data is based on The Human Protein Atlas version 18 and Ensembl version 88.38.

* **Human tumor tissue data**: staining profiles for proteins in human tumor tissue based on immunohistochemisty using tissue micro arrays and log-rank P value for Kaplan-Meier analysis of correlation between mRNA expression level and patient survival. 

  The tab-separated file includes Ensembl gene identifier ("Gene"), gene name ("Gene name"), tumor name ("Cancer"), the number of patients annotated for different staining levels ("High", "Medium", "Low" & "Not detected") and log-rank p values for patient survival and mRNA correlation ("prognostic - favourable", "unprognostic - favourable", "prognostic - unfavourable", "unprognostic - unfavourable").

  The data is based on The Human Protein Atlas version 18 and Ensembl version 88.38.

-----

**Reliability score**

Reliability score is divided into Enhanced, Supported, Approved, or Uncertain with respect 
to the definitions from HPA:

Enhanced - One or several antibodies with non-overlapping epitopes targeting the same gene 
have obtained enhanced validation based on orthogonal or independent antibody validation method.

Supported - Consistency with RNA-seq and/or protein/gene characterization data, 
in combination with similar staining pattern if independent antibodies are available.

Approved - Consistency with RNA-seq data in combination with inconsistency with, or lack of, 
protein/gene characterization data. Alternatively, consistency with protein/gene characterization data 
in combination with inconsistency with RNA-seq data. If independent antibodies are available, 
the staining pattern is partly similar or dissimilar.

Uncertain - Inconsistency with, or lack of, RNA-seq and/or protein/gene characterization data, 
in combination with dissimilar staining pattern if independent antibodies are available.