Quantitative Proteomics - Oral Questions
A comprehensive collection of oral exam questions covering quantitative proteomics methods: SILAC, ICAT, iTRAQ, TMT, and Label-Free approaches.
Key Workflow Overview
When does labeling occur?
Stage Method Metabolic (in vivo) SILAC, SILAM Spiking (after lysis) AQUA, QconCAT, Super-SILAC Enzymatic (digestion) ¹⁸O Labeling Chemical (before HPLC) iTRAQ, TMT, Dimethylation No labeling Spectral Counting, MRM, SWATH, XIC
1. Introduction to Quantitative Proteomics
Quantitative Proteomics: An analytical field focused on measuring the relative expression levels of proteins and characterizing their Post-Translational Modifications (PTMs).
Primary goal: Evaluate how protein expression shifts between different states/conditions.
Main applications:
- Tissue Comparison: Understanding molecular differences between tissue types
- Biomarker Discovery: Identifying proteins that differentiate healthy vs. diseased states
- Drug & Pathogen Response: Monitoring cellular reactions to treatments and infections
- Stress Analysis: Studying adaptation to environmental or physiological stress
Key distinction:
- Qualitative: What proteins are present? (identification)
- Quantitative: How much of each protein? (abundance)
Longitudinal Profiling: Monitoring a person's molecular profile over long time frames, comparing current data against their own previous measurements (rather than just population averages).
Why it's important:
- More meaningful: Individual baseline is more informative than population average
- Early detection: Identifies risks before symptoms appear
- High sensitivity: Catches subtle molecular changes unique to the individual
- Prevention: Enables proactive interventions to stop disease progression
Example: Athlete Biological Passport (ABP)
- Monitors biological variables in athletes over time
- Doesn't detect specific substances
- Looks for fluctuations that indirectly reveal doping effects
- Consistent monitoring makes it harder to bypass anti-doping rules
2. Plasma Proteomics & Biomarkers
Plasma vs Serum:
| Plasma | Serum |
|---|---|
| Blood + anticoagulant | Blood allowed to clot |
| Contains clotting factors | Devoid of clotting factors |
The Composition Challenge:
- Unbalanced distribution of protein mass
- In cells: >2,300 proteins = 75% of mass
- In plasma: Only 20 proteins = ~90% of mass (albumin, immunoglobulins)
The masking problem:
- Dominant proteins mask low-abundance proteins
- Disease biomarkers often hidden in the "low-abundance" fraction
Solutions:
- Depletion: Remove abundant proteins (albumin, IgG)
- Enrichment: Increase concentration of rare proteins
Leakage Proteins: Intracellular proteins that are abnormally released into the bloodstream (or other body fluids) as a result of damage, stress, or death of a specific tissue or organ.
Why they're important:
- Serve as biomarkers for tissue damage
- Indicate which organ/tissue is affected
- Used in clinical diagnostics
Primary example: Cardiac Troponin
- Normally found inside heart muscle cells
- Released into blood when heart muscle is damaged
- Gold standard biomarker for heart attack (myocardial infarction)
- Very specific to cardiac tissue
Other examples:
- AST/ALT → liver damage
- Creatine kinase → muscle damage
- Amylase/Lipase → pancreatic damage
| Feature | Quantitative (Discovery) | Targeted |
|---|---|---|
| Goal | Comprehensive proteome view | Measure specific proteins |
| Proteins measured | 2,000-6,000 | 10-100 |
| Selection | Untargeted (find what's there) | Pre-selected before analysis |
| Sensitivity | Lower | Higher |
| Accuracy | Lower | Higher |
| Methods | SILAC, iTRAQ, Label-free | MRM, SRM, PRM |
| Use | Find candidates | Validate candidates |
The logical workflow:
- Step 1 (Discovery): Use quantitative proteomics to explore the landscape and find potential biomarker candidates
- Step 2 (Validation): Use targeted proteomics to zoom in on specific candidates with high sensitivity to confirm clinical relevance
3. Label-Based vs Label-Free Strategies
Label-Free Approach:
- Direct analysis without external tags
- Less expensive and less invasive
- Samples analyzed separately in parallel workflows
- Used for initial screening or natural samples
- May be less accurate with complex samples
- Methods: Spectral counting, AUC/XIC, MRM, SWATH
Label-Based Approach:
- Uses tracer/label to monitor proteins
- Labels have high signal-to-mass ratio
- Samples can be mixed and analyzed together
- Label identifies origin of each protein
- More accurate for relative quantification
When labeling occurs:
| Stage | Method | Type |
|---|---|---|
| In vivo (metabolic) | SILAC, SILAM | Living cells |
| After lysis (spiking) | AQUA, QconCAT | Isolated proteins |
| During digestion | ¹⁸O Labeling | Enzymatic |
| Before HPLC | iTRAQ, TMT, ICAT | Chemical |
4. SILAC (Stable Isotope Labeling by Amino Acids in Cell Culture)
SILAC = Stable Isotope Labeling by Amino Acids in Cell Culture
An in vivo metabolic labeling technique for quantitative proteomics.
Core principle:
- Uses stable isotopes (¹³C, ¹⁵N) — NOT radioactive
- Same chemical-physical properties as natural isotopes
- Isotopes incorporated into "heavy" amino acids
- Cells incorporate labeled amino acids during translation
- Label encoded directly into the proteome
Why Arginine and Lysine?
- Essential/semi-essential: Cells must obtain them from media
- Trypsin cleavage sites: Trypsin cleaves after K and R
- Every tryptic peptide (except C-terminal) contains at least one K or R
- Ensures all peptides are labeled
Also used: Leucine (present in ~70% of tryptic peptides)
- Cell Cultures:
- Two populations grown separately
- One in "light" medium (normal amino acids)
- One in "heavy" medium (¹³C/¹⁵N-labeled amino acids)
- Protein Integration:
- Cells incorporate amino acids during translation
- Multiple cell divisions for complete labeling
- Treatment:
- Apply experimental condition (e.g., drug, stimulus)
- Harvest & Mixing:
- Samples mixed early (at cell level)
- Minimizes experimental error
- Lysis & Separation:
- Cells lysed, proteins separated (SDS-PAGE or 2D-PAGE)
- Digestion:
- Trypsin digestion → peptides
- MS Analysis:
- Light and heavy peptides co-elute from LC
- Two peak families in spectrum
- Ratio of peak intensities = relative abundance
SILAC spectrum interpretation:
- Two families of peaks: "light" and "heavy"
- Heavy peaks shifted to the right (higher m/z)
- Peak intensity ratio = relative protein abundance
Calculation example:
- ¹³C₆-Lysine adds 6 Da mass difference
- With +2 charge state:
- m/z shift = Mass difference ÷ Charge
- m/z shift = 6 ÷ 2 = 3 m/z units
General formula:
Δm/z = ΔMass / z
Quantification:
- Compare peak heights or areas
- Heavy/Light ratio indicates fold change
- SILAC provides relative (not absolute) quantification
SILAC Limitations:
- Requires living cells:
- Cells must grow in culture
- Must incorporate labeled amino acids
- Time-consuming:
- Multiple cell divisions needed for complete labeling
- Typically 5-6 doublings
- Limited multiplexing:
- Maximum 2-3 samples (light, medium, heavy)
- Arginine-to-Proline conversion:
- Some cells convert Arg to Pro
- Can cause labeling artifacts
Samples that CANNOT be used:
- Cell-free biological fluids:
- Plasma/serum
- Urine
- Saliva
- CSF
- Reason: No living cells to incorporate labels!
Samples that CAN be used:
- Cell lines
- Blood-derived leukocytes (if cultured)
- Biopsy-obtained cancer cells (if cultured)
SILAC Advantages:
- Early mixing:
- Samples mixed at cell level (earliest possible point)
- Minimizes experimental error during sample preparation
- Complete labeling:
- Nearly 100% incorporation after sufficient doublings
- No chemical modification:
- Label is natural amino acid (just different isotope)
- No affinity purification needed
- High proteome coverage:
- ~70% of peptides contain Leucine
- All tryptic peptides contain K or R
- Accurate quantification:
- Light and heavy peptides co-elute
- Analyzed simultaneously = same ionization conditions
5. ICAT (Isotope-Coded Affinity Tag)
ICAT = Isotope-Coded Affinity Tag
An in vitro chemical labeling technique targeting Cysteine residues.
Three functional components:
- Reactive Group (Iodoacetamide):
- Specifically binds to cysteine thiol groups (-SH)
- Highly specific reaction
- Isotope-Coded Linker (PEG):
- Polyethylene glycol bridge
- Light version: Normal hydrogen atoms
- Heavy version: 8 hydrogens replaced with deuterium
- Mass difference: 8 Da
- Biotin Tag:
- Affinity tag for purification
- Strong binding to streptavidin/avidin
- Enables selective isolation of labeled peptides
Structure: [Iodoacetamide]—[PEG linker]—[Biotin]
- Denaturation & Reduction:
- Unfold proteins
- Break disulfide bonds to expose cysteines
- Labeling:
- Sample 1 → Light ICAT reagent
- Sample 2 → Heavy ICAT reagent
- Iodoacetamide reacts with Cys thiols
- Mixing & Digestion:
- Combine labeled samples
- Trypsin digestion → peptides
- Affinity Chromatography:
- Add streptavidin-coated beads
- Biotin-tagged peptides bind
- Non-Cys peptides washed away
- Reduces complexity!
- Nano-HPLC & MS:
- Separate and analyze peptides
- Light/Heavy peaks separated by 8 Da
- MS/MS:
- Fragment for sequence identification
- Database search (MASCOT)
Advantages:
- Reduced complexity: Only Cys-containing peptides selected → cleaner spectra
- Accuracy: ~10% accuracy in relative quantification
- Flexibility: Works on complex protein mixtures
- Clinical samples: Can use tissues, biopsies, fluids (unlike SILAC)
Disadvantages:
- Cysteine dependency:
- Only ~25% of peptides contain Cys
- Proteins without Cys cannot be identified!
- Accessibility issues:
- Some Cys buried in protein structure
- Cannot be labeled
- Limited multiplexing:
- Only 2 samples (light vs heavy)
- Cost: Expensive reagents
- Yield concerns: Non-specific binding and incomplete labeling
6. SILAC vs ICAT Comparison
| Feature | SILAC | ICAT |
|---|---|---|
| Type | In vivo (metabolic) | In vitro (chemical) |
| Target | Lys, Arg (all tryptic peptides) | Cysteine only |
| Proteome coverage | ~70% (Leu-containing) | ~25% (Cys-containing) |
| Sample mixing | Very early (cells) | After labeling |
| Multiplexing | 2-3 samples | 2 samples |
| Sample type | Living cells only | Any protein mixture |
| Clinical samples | ❌ Cannot use fluids | ✅ Can use biopsies/fluids |
| Complexity | Full (many peptides) | Reduced (Cys-only) |
When to use SILAC:
- Cell culture experiments
- Need high proteome coverage
- Can afford time for labeling
When to use ICAT:
- Clinical samples (plasma, tissue)
- Complex mixtures needing simplification
- Cannot grow cells in culture
7. iTRAQ (Isobaric Tags for Relative and Absolute Quantitation)
Isobaric = Same total mass
All iTRAQ reagents have identical total mass (e.g., 145 Da for 4-plex).
Why this matters:
- Identical peptides from different samples appear as ONE peak in MS1
- Keeps spectrum simple and clean
- No peak splitting like in SILAC
How it works:
- Different isotope distribution within the reagent
- Reporter group + Balance group = constant mass
- When reporter is heavier → balancer is lighter
Example (4-plex):
| Reagent | Reporter | Balance | Total |
|---|---|---|---|
| 1 | 114 Da | 31 Da | 145 Da |
| 2 | 115 Da | 30 Da | 145 Da |
| 3 | 116 Da | 29 Da | 145 Da |
| 4 | 117 Da | 28 Da | 145 Da |
iTRAQ reagent has three parts:
- Reporter Group:
- Unique "ID" for each sample
- 4-plex: 114, 115, 116, 117 Da
- 8-plex: 113-121 Da
- Released during MS/MS fragmentation
- Used for quantification!
- Balance Group:
- Compensates for reporter mass
- Ensures total mass is constant
- Lost during fragmentation
- Reactive Group (NHS ester):
- Binds to N-terminus and Lysine side chains
- Labels all peptides (not just Cys like ICAT)
Structure: [Reporter]—[Balance]—[NHS-ester]
iTRAQ Workflow:
- Extraction & Preparation: Purify, denature, reduce proteins
- Digestion: Trypsin → peptides BEFORE labeling
- Labeling: Each sample labeled with specific iTRAQ reagent
- Pooling: Combine all labeled samples into one
- HPLC Separation: Treat as single sample
- MS1: Single peak per peptide (isobaric!)
- MS/MS (CID): Fragmentation breaks Reporter-Balance bond
- Reporter ions released: 114-117 region shows intensities
Quantification occurs at MS/MS (MS2) level!
| Method | Quantification Stage |
|---|---|
| SILAC | MS1 (peak ratios) |
| ICAT | MS1 (peak ratios) |
| iTRAQ | MS2 (reporter ions) |
Ratio Compression Effect: Measured differences in protein abundance appear smaller than actual biological values, compressing observed ratios toward 1:1.
Cause: Co-Isolation Challenge
- During MS2, mass spectrometer isolates precursor ion for fragmentation
- Peptides with similar m/z that co-elute are co-isolated
- These "contaminating" peptides also fragment
- Their reporter ions merge with target signal
- Background peptides at different concentrations → dilute the true signal
- Result: Systematic underestimation of fold-change
Mitigation strategies:
- Better chromatography: Reduce co-elution
- MS3 analysis: Additional fragmentation stage (gold standard)
- Narrower isolation windows: Reduce co-isolated species
Advantages:
- High multiplexing: Up to 8 samples (4-plex or 8-plex)
- Statistical power: More samples = better p-values, less noise
- Clean MS1 spectra: Isobaric tags → single peaks
- High coverage: Labels N-terminus + Lys (most peptides)
- Relative & absolute: Can include standards
Limitations:
- Ratio compression: Background interference underestimates differences
- Expensive reagents: High cost compared to label-free
- High sample concentration needed:
- Complex preparation: Risk of sample loss, incomplete labeling
- Sophisticated software needed: ProQuant, etc.
8. Method Comparison: SILAC vs ICAT vs iTRAQ
| Feature | SILAC | ICAT | iTRAQ |
|---|---|---|---|
| Type | In vivo (metabolic) | In vitro (chemical) | In vitro (chemical) |
| Labeling stage | Cell culture | After lysis | After digestion |
| Target | Lys, Arg, Leu | Cysteine only | N-terminus + Lys |
| Multiplexing | 2-3 samples | 2 samples | 4-8 samples |
| Quantification | MS1 | MS1 | MS2 |
| Coverage | High (~70%) | Low (~25%) | Very high |
| Sample type | Cells only | Any mixture | Any mixture |
| Clinical samples | ❌ No | ✅ Yes | ✅ Yes |
| Main limitation | Needs living cells | Cys dependency | Ratio compression |
9. Label-Free Quantification
Label-Free Quantification: Quantitative proteomics without isotope labels or chemical tags.
Key characteristics:
- Direct comparison of individual LC-MS/MS runs
- No expensive reagents needed
- Samples never mixed — analyzed separately
- Requires strict experimental standardization
Comparison to label-based:
| Feature | Label-Based | Label-Free |
|---|---|---|
| Sample mixing | Combined before MS | Analyzed separately |
| Cost | Higher (reagents) | Lower |
| Multiplexing | Limited by reagents | Unlimited samples |
| Variability | Lower (same run) | Higher (run-to-run) |
| Complexity | Sample prep | Data analysis |
1. Spectral Counting:
- Principle: More protein → more peptides → more MS/MS spectra
- Data level: MS2
- Measures: Number of spectra, unique peptides, sequence coverage
- Advantages: Easy to implement, no special algorithms
- Best for: High-abundance proteins
2. Precursor Signal Intensity (AUC):
- Principle: Measure Area Under the Curve of chromatographic peaks
- Data level: MS1
- Measures: Peak intensity/height as peptides elute
- Advantages: More accurate for subtle changes
- Best for: Low-abundance proteins
| Feature | Spectral Counting | AUC |
|---|---|---|
| Data Level | MS2 | MS1 |
| Complexity | Low | High (needs alignment) |
| Sensitivity | Better for abundant | Better for low-abundance |
Technical challenges:
- Experimental Drift:
- Fluctuations in retention time (RT) between runs
- m/z drift over time
- Hard to align same peptide across samples
- Solution: Alignment algorithms that "stretch/shrink" chromatograms
- Run-to-Run Variability:
- Even identical samples show intensity differences
- ESI efficiency fluctuations
- Column performance variation
- Solution: Internal standards, global normalization
- Data Complexity:
- Massive data volume from separate runs
- Requires sophisticated bioinformatics pipelines
- Automated alignment, normalization, statistics
- No internal standard:
- Unlike labeled methods, no built-in reference
Label-Free Advantages:
- Cost-effective: No expensive reagents
- Simple sample prep: No labeling steps
- Unlimited multiplexing: Compare any number of samples
- Works with any sample: Tissues, fluids, cells
- Lower sample amount: No sample loss during labeling
- Dynamic range: Can detect wider range of changes
- No ratio compression: Unlike iTRAQ
Best applications:
- Large-scale studies (many samples)
- Clinical cohorts
- When sample is limited
- Initial screening studies
10. Quick Review Questions
Test yourself with these rapid-fire questions:
SILAC is an ❓ vivo or in vitro method? In vivo (metabolic labeling)
iTRAQ can compare up to ❓ samples simultaneously 8 samples (8-plex)
ICAT specifically targets ❓ amino acid Cysteine
iTRAQ quantification occurs at ❓ level MS/MS (MS2) level — reporter ions
SILAC quantification occurs at ❓ level MS1 level — peak ratios
"Isobaric" means ❓ Same total mass
SILAC cannot be used on ❓ Cell-free fluids (plasma, urine, saliva) — no living cells
The ICAT mass difference between light and heavy is ❓ Da 8 Da (8 deuteriums)
Ratio compression in iTRAQ is caused by ❓ Co-isolation of background peptides during MS2
Spectral counting uses ❓ data level MS2 (number of spectra)
AUC (Area Under Curve) uses ❓ data level MS1 (peak intensity)
In plasma, only ❓ proteins constitute ~90% of the mass 20 proteins (albumin, immunoglobulins)
Cardiac troponin is an example of a ❓ protein Leakage protein (biomarker for heart damage)
ABP (Athlete Biological Passport) uses ❓ profiling Longitudinal profiling (individual over time)
Discovery proteomics measures ❓ proteins, targeted measures ❓ 2,000-6,000 proteins (discovery) vs 10-100 proteins (targeted)
ICAT biotin tag binds to ❓ for affinity purification Streptavidin/avidin beads
Label-free main challenge is ❓ Run-to-run variability / alignment between runs
iTRAQ reporter ions appear in the ❓ region of MS/MS spectrum Low-mass region (114-117 for 4-plex)