System Wide Analyses have Underestimated Protein Abundances and the Importance of Transcription in Mammals
Large scale surveys in mammalian tissue culture cells suggest that the protein expressed at the median abundance is present at 8,000 - 16,000 molecules per cell and that differences in mRNA expression between genes explain only 10-40% of the differences in protein levels. We find, however, that these surveys have significantly underestimated protein abundances and the relative importance of transcription. Using individual measurements for 61 housekeeping proteins to rescale whole proteome data from Schwanhausser et al., we find that the median protein detected is expressed at 170,000 molecules per cell and that our corrected protein abundance estimates show a higher correlation with mRNA abundances than do the uncorrected protein data. In addition, we estimated the impact of further errors in mRNA and protein abundances, showing that mRNA levels explain at least 56% of the differences in protein abundance for the genes detected by Schwanhausser et al., though because one major source of error could not be estimated the true percent contribution could be higher. We also employed a second, independent strategy to determine the contribution of mRNA levels to protein expression. We show that the variance in translation rates directly measured by ribosome profiling is only 12% of that inferred by Schwanhausser et al. and that the measured and inferred translation rates correlate only poorly (R2=0.13). Based on this, our second strategy suggests that mRNA levels explain ~81% of the variance in protein levels. We also determined the percent contributions of transcription, RNA degradation, translation and protein degradation to the variance in protein abundances using both of our strategies. While the magnitudes of the two estimates vary, they both suggest that transcription plays a more important role than the earlier studies implied and translation a much smaller role.
💡 Research Summary
The paper re‑examines the quantitative relationship between mRNA and protein abundances in mammalian cells, challenging the conclusions of earlier large‑scale proteomics surveys such as that of Schwanhausser et al. (2011). Those surveys, which relied on mass‑spectrometry‑derived spectral counts, reported a median protein copy number of only 8 000–16 000 molecules per cell and concluded that mRNA levels explain merely 10–40 % of the variance in protein abundance, implying a dominant role for translational regulation.
The authors argue that systematic biases in the original data—particularly limited detection sensitivity for low‑abundance proteins, saturation effects for highly abundant proteins, and an inadequate scaling standard—lead to a severe under‑estimation of absolute protein quantities and an inflated estimate of translational influence. To correct this, they measured 61 housekeeping proteins using orthogonal, absolute quantification methods (e.g., SILAC, AQUA peptides, immunoblotting). Because these proteins span a wide dynamic range and are presumed to be constitutively expressed, they serve as reliable internal standards. By fitting a linear rescaling model to the Schwanhausser dataset, the authors recalibrated the entire proteome. After correction, the median protein is present at roughly 1.7 × 10⁵ copies per cell—an order of magnitude higher than previously reported.
When the rescaled protein abundances are correlated with matched mRNA levels, the coefficient of determination (R²) rises from ~0.4 to ~0.56. This indicates that transcriptional output alone accounts for at least 56 % of the observed protein variability. The authors further performed an error‑propagation analysis, incorporating known measurement uncertainties for both mRNA (≈15 %) and protein (≈30 %). Because an unquantified systematic bias remains (e.g., sample loss, peptide selection bias), the true contribution of mRNA could be even larger.
To assess the role of translation independently, the study leveraged ribosome‑profiling data, which directly measures ribosome occupancy and thus translation rates. Comparing these empirically derived rates with the translation efficiencies inferred by Schwanhausser et al. revealed a poor correlation (R² = 0.13) and showed that the actual variance in translation rates explains only about 12 % of the total protein variance. Using this ribosome‑profiling‑based estimate, the authors calculate that mRNA levels could explain roughly 81 % of the protein variance, dramatically reducing the inferred impact of translational control.
Finally, the authors decompose the total protein variance into contributions from four processes: transcription, RNA degradation, translation, and protein degradation. Both the spectral‑count‑rescaling approach and the ribosome‑profiling approach converge on a picture in which transcription dominates (≈50–60 % of variance), while translation contributes modestly (≈10–15 %). RNA and protein turnover together account for the remaining 25–40 %.
In summary, this work demonstrates that previous large‑scale proteomic surveys have substantially underestimated absolute protein abundances and overestimated the importance of translational regulation. By applying a rigorous, experimentally validated rescaling using housekeeping proteins and by cross‑validating with ribosome profiling, the authors provide a more accurate quantification of the relative contributions of each step in gene expression. The findings underscore the necessity of absolute quantification standards in systems‑level studies and suggest that transcriptional control is the primary driver of protein abundance differences in mammalian cells, with translation playing a secondary, though still biologically relevant, role. Future investigations should expand the set of internal standards, integrate protein half‑life measurements, and refine models that jointly consider transcription, RNA stability, translation, and protein degradation to achieve a truly comprehensive understanding of proteome regulation.
Comments & Academic Discussion
Loading comments...
Leave a Comment