The proteome of the cave bear
As a rule, it takes a genome to interpret a proteome.
A genome database gives the range of possible proteins that a sample is expected to contain, allowing a computer program to match short peptide fragments from the raw data to the full-length proteins they came from. The genome is like a picture showing how a jigsaw puzzle will look when it’s finished — and each peptide is a single tiny piece of the puzzle.

Richard Johnson, a staff scientist at the University of Washington’s department of , has spent nearly three decades working with no picture. Before genomes were assembled and available, he became an expert in de novo peptide sequencing, piecing together the overlapping puzzle pieces from mass spectra to determine the amino acid sequence of proteins.
That ability has been coming in handy recently since Johnson started seeing more requests for environmental proteomics and other exotic analyses.
“I sit next to an oceanographer, and she does these proteomics analyses on strange samples, like glacial meltwater and seawater,” he said. “Those are cases where it’s really difficult to decide what database to even search.”
To annotate a sample from a human, a researcher can use a human genome database. But a tablespoon of ocean water or glacial runoff is likely to contain a complex community of microbes. So which genome databases should the researcher survey? Usually, researchers solve this problem by sequencing as much DNA as they can from a sample and using the result, a metagenome, to guide protein identification.
But even with a metagenome, sometimes the proteins observed in a proteomics experiment just don’t match the given reference database. “I came up with a metric that can tell you whether the protein sequence database is any good for interpreting your mass spectrometry data,” Johnson said.
The technique, which Johnson and colleagues in 麻豆传媒色情片 & Cellular Proteomics, can be used to solve related problems, such as proteomic analysis of an animal whose genome has not been sequenced. “You typically use a sequence database from a closely related species and hope that the sequences did not diverge too much,” Johnson said. “Sometimes that hope is warranted, and other times it’s not.”
Johnson has used this approach to study the makeup of electrosensory organs in electric fish.
A third potential application is for analysis of very old but not fossilized tissues — those that come from extinct species, such as a vial of powdered cave bear bone that Johnson’s team obtained. Extinct species very rarely have a genome assembled, and the close-cousin conundrum is compounded by slow biochemical changes to proteins that happen over thousands of years.
But the approach doesn’t solve every problem. Johnson said, “Using this quality metric tells you how good or bad a sequence database is. But it won’t tell you what to do about it if it’s bad.”
Enjoy reading ASBMB Today?
Become a member to receive the print edition four times a year and the digital edition monthly.
Learn moreGet the latest from ASBMB Today
Enter your email address, and we鈥檒l send you a weekly email with recent articles, interviews and more.
Latest in Science
Science highlights or most popular articles

How scientists identified a new neuromuscular disease
NIH researchers discover Morimoto鈥揜yu鈥揗alicdan syndrome, after finding shared symptoms and RFC4 gene variants in nine patients, offering hope for faster diagnosis and future treatments.

Unraveling cancer鈥檚 spaghetti proteins
MOSAIC scholar Katie Dunleavy investigates how Aurora kinase A shields oncogene c-MYC from degradation, using cutting-edge techniques to uncover new strategies targeting 鈥渦ndruggable鈥 molecules.

How HCMV hijacks host cells 鈥 and beyond
Ileana Cristea, an ASBMB Breakthroughs webinar speaker, presented her research on how viruses reprogram cell structure and metabolism to enhance infection and how these mechanisms might link viral infections to cancer and other diseases.

Understanding the lipid link to gene expression in the nucleus
Ray Blind, an ASBMB Breakthroughs speaker, presented his research on how lipids and sugars in the cell nucleus are involved in signaling and gene expression and how these pathways could be targeted to identify therapeutics for diseases like cancer.

Receptor antagonist reduces age-related bone loss in mice
Receptor antagonist reduces bone loss and promotes osteoblast activity in aging mice, highlighting its potential to treat osteoporosis. Read more about this recent JBC paper.

Engineered fusion protein targets kiwifruit pathogen
Synthetic protein selectively kills kiwifruit pathogen, offering a promising biocontrol strategy for agriculture. Read more about this recent JBC paper.