The HipSci project includes mass spectrometry (MS)-based measurements of protein expression in many of the HipSci cell lines, from both normal and disease groups. These comprise both label-free and tandem mass tag (TMT) measurements of protein abundance. Work is ongoing to closely integrate the quantitative measurements of protein expression levels with parallel data generated within HipSci that document genome sequences and methylation patterns, mRNA expression and cell phenotypes, measured on the same cell lines.
Our proteomics data set currently covers ~17,000 distinct proteins detected in human iPS cells, together with >480,000 peptides and ~28 million peptide-spectrum matches. The protein data from normal cell lines are being used to analyse human genetics and to map protein quantitative trait loci (pQTLs) in iPS cells.
Future plans include extending the analysis to phosphorylation sites and other post-translational modifications (PTMs) in the HipSci cell lines, and analysing the changes in the proteomic landscape that occur when the iPS cells are differentiated into different lineages.
All the resulting HipSci raw MS data are available for download from the PRIDE repository. In addition, the corresponding processed MS data can be accessed via the Encyclopedia of Proteome Dynamics (EPD), which has a section dedicated to HipSci data. This is a rapidly developing resource and is regularly updated with new data, new interactive features and further integration of the proteome data with other HipSci data sets from both normal and disease cell lines.
These data can be accessed as follows:
Additionally, the EPD allows users to search for specific proteins of interest, based on UniProt protein IDs; the results highlight data for the selected protein in the HipSci cell lines. The navigation pane has a text box labelled ‘Protein Search’, as displayed below.
You can search for proteins by description, gene name or UniProt accession. Once you have selected a protein of interest, the output of the interactive plots changes as follows:
* The bubble plots adapt to show the abundance of the selected protein across all HipSci cell lines in which that protein has been detected.
* The histogram highlights in red the abundance bin in which the selected protein is located.
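To illustrate what locating a protein in an abundance bin means, the following is a minimal sketch and not the EPD's actual implementation. It assumes equal-width bins on a log10 abundance scale; the function name `abundance_bin`, the protein labels and the abundance values are all hypothetical and are not taken from HipSci data.

```python
import math


def abundance_bin(abundances, target, n_bins=10):
    """Return (index, lower, upper) of the log10 bin that contains `target`.

    Assumption: bins are equal-width on a log10 scale and span the
    observed range of abundances. The EPD may bin its data differently.
    """
    logs = [math.log10(a) for a in abundances]
    lo, hi = min(logs), max(logs)
    t = math.log10(target)
    if not lo <= t <= hi:
        raise ValueError("target abundance lies outside the histogram range")
    width = (hi - lo) / n_bins
    if width == 0:
        # All abundances are identical, so there is only one meaningful bin.
        return 0, lo, hi
    # The maximum value sits on the upper edge; clamp it into the last bin.
    index = min(int((t - lo) / width), n_bins - 1)
    return index, lo + index * width, lo + (index + 1) * width


# Hypothetical abundances in arbitrary units, keyed by made-up protein labels.
abundances = {"PROT_A": 1e2, "PROT_B": 1e3, "PROT_C": 3e4, "PROT_D": 1e5, "PROT_E": 1e6}

# Find the bin a plot would highlight for the selected protein.
index, lower, upper = abundance_bin(abundances.values(), abundances["PROT_C"], n_bins=4)
print(f"highlight bin {index}: 10^{lower:.1f} to 10^{upper:.1f}")
```

A log scale is used here because protein abundances typically span several orders of magnitude; with linear bins most proteins would fall into the first bin.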