Discovering the Complexity of the Human Proteome

I should preface this post by stating that I am a nucleic acids gal. My years in the lab were spent with tubes of DNA and RNA. In fact, my one and only tentative foray into protein work resulted in a Western blot so ugly that those who witnessed it have been sworn to secrecy. Given all of this, the mapping of the human proteome might seem like an odd topic for me to write about. Except that it isn’t really, because mapping the proteome offers answers to some of the questions that sequencing the genome didn’t.

First, let’s start with what a proteome is: A proteome is all the proteins expressed at a certain time point. It can be as limited as the proteome of a single cell or as all-encompassing as the proteome of an entire organism. However, unlike the genome, which is the genetic information encoded in an organism’s DNA or RNA, the makeup of a proteome can vary dramatically as a result of expression patterns, alternative splicing events and post-translational modifications.

The genome is a constant: what you see today is what will still be there tomorrow. The proteome, on the other hand, is a constantly changing landscape. Upregulation or downregulation of a gene can mean more or less protein is present. Alternative splicing and post-translational modifications can result in fundamental changes to the protein itself.

In other words, if the genome is a beautiful, pristine Ansel Adams print, then the proteome is that same scene as interpreted by Andy Warhol—in Technicolor and 3D.

Earlier this year, two independent teams published first drafts of the human proteome. The teams took different approaches. One group, led by Akhilesh Pandey from Johns Hopkins University, isolated protein from 30 different tissue types from a single source. The team was able to catalog proteins encoded by about 84% of human genes predicted to code for proteins and determined the relative abundance of each protein using mass spectrometry (1).

The second group, led by German researcher Bernhard Küster from the Technische Universität München, used a different but complementary approach. Küster’s team created a searchable public database, ProteomicsDB, using existing data from the proteomics community. To fill gaps in the public data, the team generated its own data using over 60 human tissues, 13 body fluids and 147 cancer cell lines. In total, ProteomicsDB catalogs about 92% of human proteins (estimated to number 19,629) (2).

Scientists are still in the early stages of analyzing the results from these two studies, but already some interesting information has come to light. For one, some parts of the genome that we thought were non-coding aren’t, as evidenced by the identification of new proteins from some of these regions. In total, more than 400 translated long intergenic non-coding RNAs (lincRNAs) were identified, as well as 193 new proteins.

Another, possibly paradigm-shifting, result involved the translation rate of mRNA. The Küster group compared the expression profiles of mRNA and proteins and found that although the levels of mRNA and protein vary dramatically between tissue types, the ratio of mRNA to protein was surprisingly conserved for a given protein. This suggests that, at least at steady state, once the ratio for an mRNA/protein pair has been calculated, protein levels can be determined just from specific mRNA levels. That would mean the translation rate of a particular mRNA is somehow coded into that transcript. If this holds true, it could mean that protein expression levels in a cell are largely controlled by regulating mRNA levels.
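
For the computationally inclined, here is a minimal sketch of what that calculation might look like. It is not the Küster group's actual method: the function names, the use of the median as the per-gene ratio, and every number below are assumptions made up for illustration. The ratio is expressed here as protein per unit of mRNA.

```python
from statistics import median


def fit_ratio(mrna_by_tissue, protein_by_tissue):
    """Estimate one protein-per-mRNA ratio for a gene.

    Assumption: the ratio is taken as the median across tissues in
    which both mRNA and protein were measured.
    """
    ratios = [
        protein_by_tissue[tissue] / mrna_by_tissue[tissue]
        for tissue in mrna_by_tissue
        if tissue in protein_by_tissue and mrna_by_tissue[tissue] > 0
    ]
    if not ratios:
        raise ValueError("no tissue with both measurements")
    return median(ratios)


def predict_protein(mrna_level, ratio):
    """Predict a steady-state protein level from an mRNA level."""
    return mrna_level * ratio


# Hypothetical measurements for a single gene (arbitrary units).
mrna = {"liver": 10.0, "heart": 40.0, "brain": 5.0}
protein = {"liver": 2000.0, "heart": 8400.0, "brain": 950.0}

ratio = fit_ratio(mrna, protein)         # per-tissue ratios: 200, 210, 190
estimate = predict_protein(20.0, ratio)  # new tissue with mRNA level 20
print(f"ratio = {ratio}, predicted protein = {estimate}")
```

In this toy example the mRNA and protein levels differ several-fold between tissues while the per-tissue ratios stay close to one another, which is the pattern the paragraph above describes.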

All of this new data and the promise of the answers it holds is almost enough to make me want to go back into the lab and try my hand at the protein side of things again. Almost.

References

1. Kim, M-S. et al. (2014) A draft of the human proteome. Nature 509, 575–81.
2. Wilhelm, M. et al. (2014) Mass-spectrometry-based draft of the human proteome. Nature 509, 582–7.

Kelly Grooms

Scientific Communications Specialist at Promega Corporation

Kelly earned her B.S. in Genetics from Iowa State University in Ames, IA. Prior to coming to Promega, she worked for biotech companies in San Diego and Madison. Kelly lives just outside Madison with her husband, son and daughter. Kelly collects hobbies including jewelry artistry, reading, writing and knitting. A black belt, she enjoys practicing karate with her daughter as well as hiking, biking and camping.

Promega Connections

Thoughts, tech tips and news about science