Analyzing Label-Free Quantitative Proteomics Data

Label-free Quantitative Proteomics Introduction

Multiple algorithms and software implementations have been developed for quantitation label-free proteomics experiments (LFQ), in particular for extracted ion chromatograms (XIC). For more background information you may look at Wikipedia labell-free Proteomics.

The tools presented here are designed for use with label-free XIC (ie LFQ) data. Several of the programs for extracting initial quantitations also allow getting spectral counting (PSM) data which can also get imported into R, however their use is not further discussed in this vignette. In general it is preferable to use XIC for comparing peptde of protein quantities between different protein extracts/samples.

This package provides support for importing quantitation results from Proteome Discoverer, MaxQuant, Fragpipe, Proline, MassChroQ, DIA-NN, AlphaPept, Wombat-P and OpenMS.

All quantitation import functions offer special features for further separating annotation related information, like species, for later use.

In most common real-world cases people typically analyze data using only one quantitation algorithm/software. Below in this vignette, we’ll use only the quantitation data generated using MaxQuant (AlphaPept, DIA-NN, FragPipe, MassChroQ, OpenMS, ProteomeDiscoverer, Proline and Wombat-P are supported, too). The other vignette to this package (“UPS-1 spike-in Experiments”) shows in detail the import functions available for MaxQuant, ProteomeDiscoverer and Proline and how further comparsions can be performed in bench-mark studies. All these import functions generate an equivalent output format, separating (selected) annotation data ($annot) from normalized log2-quantitation data ($quant) and initial quantitation ($raw).

Normalization (discussed below in more detail) is an important part of ‘preparing’ the data for subsequant analysis. The import functions in this package allow performin an initial normalization step (with choice among multiple algorithims), too. Further information about the proteins identifed can be considered during normalization: For example, it is possible to exclude contaminants like keratins which are frequently found among the higher abundant proteins which may potentially introduce bias at global normalization.

Technical replicates are very frequently produced in proteomics, they allow to assess the variability linked to repeated injection of the same material. Biological replicates, however, make additional information accessible, allowing the interpretation of experiments in a more general way.

Import From Dedicated Quantification Algorithms/Software

MaxQuant: Import Protein Quantification Data

MaxQuant is free software provided by the Max-Planck-Institute, see also Tyanova et al 2016. Typically MaxQuant exports by default quantitation data on level of consensus-proteins as a folder called txt with a file always called ‘proteinGroups.txt’. Data exported from MaxQuant can get imported (and normalized) using readMaxQuantFile(), in a standard case one needs only to provide the path to the file ‘proteinGroups.txt’ which can be found the combined/txt/ folder produced by MaxQuant. gz-compressed files can be read, too (as in the example below the file ‘proteinGroups.txt.gz’). The argument specPref allows giving further details about expected (primary) species, it defaults to working with human proteins. To get started, let’s just set it to NULL for ignoring.

path1 <- system.file("extdata", package="wrProteo")
dataMQ <- readMaxQuantFile(path1, specPref=NULL, normalizeMeth="median")
#> readMaxQuantFile : Note: Found 11 out of 1115 proteins marked as 'REV_' (reverse peptide identification) - Removing
#> readMaxQuantFile : Transform 2845(9.5%) initial '0' values to 'NA'
#> readMaxQuantFile : Could not find peptide counts columns (argument 'pepCountCol') matching to 'Unique peptides MS\.MS\.count '
#> readMaxQuantFile : Found 1 species name(s) appearing inside other ones, assume as truncated (eg  Saccharomyces cerevis)
#> readMaxQuantFile : Note: 5 proteins with unknown species
#>      data by species : Gallus gallus: 1,  Homo sapiens: 49,  Mus musculus: 1,  Saccharomyces cerevisiae: 1047,  Sus scrofa: 1,
#> readMaxQuantFile : Found 70 composite accession-numbers (eg P00761;P00761), truncating
#> readMaxQuantFile : Use column 'Accession' as identifyer (has fewest, ie 0 duplicated entries) as rownames

## number of lines and columns of quantitation data
dim(dataMQ$quant)
#> [1] 1104   27

Adding Meta-Data at Import (Example MaxQuant)

Similarly we can also add directly information about principal species, contaminants, special groups of proteins and add sdrf annotation (if existing) directly when reading the data. Setting customized tags according to species or other search-terms can be done using the argument specPref. In the example below we define a main species (tags are made by comparing to the species information initially given by the fasta) and we define a custom group of proteins by their Uniprot-Accessions (here the UPS1 spike-in). Then, the content of argument specPref will get searched in multiple types of annotation (if available from the initial Fasta).

By setting suplAnnotFile=TRUE the import function will also look for files (by default produced by MaxQuant as ‘summary.txt’ and ‘parameters.txt’) giving more information about experiment and samples and integrate this to the output. (This time let’s do not display the plot of distributions, it’s the same plot as above, see argument plotGraph.)

## The grouping of replicates
grp9 <- rep(1:9, each=3)
head(grp9)
#> [1] 1 1 1 2 2 2

## special group of proteins (we want to differentiate/ highlight lateron)
specPrefMQ <- list(conta="CON_|LYSC_CHICK", mainSpecies="OS=Saccharomyces cerevisiae", 
  spike=getUPS1acc()$ac)

dataMQ <- readMaxQuantFile(path1, specPref=specPrefMQ, suplAnnotFile=TRUE, 
  groupPref=list(lowNumberOfGroups=FALSE), gr=grp9, plotGraph=FALSE)
#> readMaxQuantFile : Note: Found 11 out of 1115 proteins marked as 'REV_' (reverse peptide identification) - Removing
#> readMaxQuantFile : Transform 2845(9.5%) initial '0' values to 'NA'
#> readMaxQuantFile : Could not find peptide counts columns (argument 'pepCountCol') matching to 'Unique peptides MS\.MS\.count '
#> readMaxQuantFile : Found 1 species name(s) appearing inside other ones, assume as truncated (eg  Saccharomyces cerevis)
#> readMaxQuantFile : Note: 5 proteins with unknown species
#>      data by species : Gallus gallus: 1,  Homo sapiens: 49,  Mus musculus: 1,  Saccharomyces cerevisiae: 1047,  Sus scrofa: 1,
#> readMaxQuantFile : Found 70 composite accession-numbers (eg P00761;P00761), truncating
#> readMaxQuantFile : Use column 'Accession' as identifyer (has fewest, ie 0 duplicated entries) as rownames
#> readMaxQuantFile : .readCsvTxt :  Importing table:  nCol= 1, 1 and 52   ie, best import : 52 cols

## the quantifiation data is the same as before
dim(dataMQ$quant)
#> [1] 1104   27

Now we can access special tags in the annotation part of the resulting object the results :

## count of tags based on argument specPref
table(dataMQ$annot[,"SpecType"])
#> 
#> mainSpecies 
#>        1047

This information can be used automatically lateron for assigning different symbols and/or colors when drawing Volcano-plots or PCA.

Adding Experimental Setup (Sdrf) to Meta-Data at Import (Example MaxQuant)

To further analyze the data from an experiment typically the user also need to know/declare different groups of samples (eg who is replicate of whom). In the simplest case this can be done via the argument gr, as shown above. By the way, if gr is provided it gets priority over other automcatic mining results.

The import-functions from this package try to help you in multiple ways to find out more about the experimental details. Most quantitation software (like MaxQuant and ProteomeDiscoverer) also produce files/documentation about experimental annotation specified by the user. These files may be automatically read and mined via argument suplAnnotFile=TRUE to gather information about groups of samples.

The project Proteomics Sample Metadata Format aims to provide a framework of providing a uniform format for documenting experimental meta-data (sdrf). If sfdr-annotation (see Proteomics Sample Metadata Datasets) exists on Pride, it can be imported, too. The information on the experimental setup will be mined to automatically to design groups of samples (ie levels of covariant factors). If sdrf has not been prepared, the user may also simply provide a data.frame formatted like sfdr from Pride.

Finally, if nothing of the above is available, the column-names from the quantitation columns will be minded to search hints about groups of replicates (in particular when using MaxQuant).

For a bit more complex example of using readMaxQuantFile() or integrating other annotation information, please look at the vignette “UPS1 spike-in Experiments” also available to this package.

The simplest way of adding sdrf annotation consists in addin the project ID from Pride, as shown below. The argument groupPref allows defining further adjustments/choices. The import-function will first check if this a local file, and if not try to download from Pride (if available) and further mine the information.

path1 <- system.file("extdata", package="wrProteo")
specPrefMQ <- list(conta="CON_|LYSC_CHICK", mainSpecies="OS=Saccharomyces cerevisiae", 
  spike=getUPS1acc()$ac)

dataMQ <- readMaxQuantFile(path1, specPref=specPrefMQ, sdrf="PXD001819", suplAnnotFile=TRUE,
  groupPref=list(lowNumberOfGroups=FALSE), plotGraph=FALSE)
#> readMaxQuantFile : Note: Found 11 out of 1115 proteins marked as 'REV_' (reverse peptide identification) - Removing
#> readMaxQuantFile : Transform 2845(9.5%) initial '0' values to 'NA'
#> readMaxQuantFile : Could not find peptide counts columns (argument 'pepCountCol') matching to 'Unique peptides MS\.MS\.count '
#> readMaxQuantFile : Found 1 species name(s) appearing inside other ones, assume as truncated (eg  Saccharomyces cerevis)
#> readMaxQuantFile : Note: 5 proteins with unknown species
#>      data by species : Gallus gallus: 1,  Homo sapiens: 49,  Mus musculus: 1,  Saccharomyces cerevisiae: 1047,  Sus scrofa: 1,
#> readMaxQuantFile : Found 70 composite accession-numbers (eg P00761;P00761), truncating
#> readMaxQuantFile : Use column 'Accession' as identifyer (has fewest, ie 0 duplicated entries) as rownames
#> readMaxQuantFile : .readCsvTxt :  Importing table:  nCol= 1, 1 and 52   ie, best import : 52 cols
#> readMaxQuantFile : readSampleMetaData : readSdrf : Successfully read 27 annotation columns for 27 samples
#> readMaxQuantFile : readSampleMetaData : Note : Some filenames contain '.raw', others do NOT; solved inconsistency ..
#> readMaxQuantFile : readSampleMetaData : Successfully adjusted order of sdrf to content of summary.txt.gzparameters.txt.gz
#> readMaxQuantFile : readSampleMetaData : Unable to find initially designed colnames for mining of sdrf, now using all
#> readMaxQuantFile : readSampleMetaData : Using method 'combNonOrth' for evaluating replicate-structure (ie 9 groups of samples from column(s) 'source.name')

Exporting Experimental Setup from MaxQuant to Draft-Sdrf

As mentioned, the Proteomics Sample Metadata Format - sdrf is an effort for standardizing experimental meta-data. Many of the typically documented ones may already have been entered when lauching MaxQuant and can be exported as a draft Sdrf-file. All main columns for standard experiments are present in the file, though some columns will have to be completed by the user (by any text-editor) for submitting to Pride.

path1 <- system.file("extdata", package="wrProteo")
fiNaMQ <- "proteinGroups.txt.gz"
dataMQ2 <- readMaxQuantFile(path1, file=fiNaMQ, sdrf=FALSE, suplAnnotFile=TRUE)
#> readMaxQuantFile : Note: Found 11 out of 1115 proteins marked as 'REV_' (reverse peptide identification) - Removing
#> readMaxQuantFile : Transform 2845(9.5%) initial '0' values to 'NA'
#> readMaxQuantFile : Could not find peptide counts columns (argument 'pepCountCol') matching to 'Unique peptides MS\.MS\.count '
#> readMaxQuantFile : Found 1 species name(s) appearing inside other ones, assume as truncated (eg  Saccharomyces cerevis)
#> readMaxQuantFile : Note: 5 proteins with unknown species
#>      data by species : Gallus gallus: 1,  Homo sapiens: 49,  Mus musculus: 1,  Saccharomyces cerevisiae: 1047,  Sus scrofa: 1,
#> readMaxQuantFile : Found 70 composite accession-numbers (eg P00761;P00761), truncating
#> readMaxQuantFile : Use column 'Accession' as identifyer (has fewest, ie 0 duplicated entries) as rownames
#> readMaxQuantFile : .readCsvTxt :  Importing table:  nCol= 1, 1 and 52   ie, best import : 52 cols
#> readMaxQuantFile : readSampleMetaData : Note: 'sdrf' looks bizarre (trouble ahead ?), expecting either file, data.frame or complete list
#> readMaxQuantFile : readSampleMetaData : Note : Ignoring 'sdrf'  : it does NOT have the expected number or rows (1 given but 27 expected !)

## Here we'll write simply in the current temporary directory of this R-session
exportSdrfDraft(dataMQ2, file.path(tempdir(),"testSdrf.tsv"))
#> exportSdrfDraft : Successfully exported sdrf-draft to file 'C:\Users\wraff\AppData\Local\Temp\Rtmpwhc9rw/testSdrf.tsv'

MaxQuant : Import Peptide Data

Similarly it is possible to read the file by default called ‘peptides.txt’ for the peptide-data. In the example below we’ll provide a custom file-name (to a tiny example non-representative for biological interpretation). The data get imported to a similar structure like the protein-level data, quantitations on peptide level by default median-normalized, sample-setup from sdrf-files may be added, too.

MQpepFi1 <- "peptides_tinyMQ.txt.gz"
path1 <- system.file("extdata", package="wrProteo")
specPref1 <- c(conta="conta|CON_|LYSC_CHICK", mainSpecies="YEAST", spec2="HUMAN")
dataMQpep <- readMaxQuantPeptides(path1, file=MQpepFi1, specPref=specPref1, tit="Tiny MaxQuant Peptides")
#> readMaxQuantPeptides : Transform 405(19%) initial '0' values to 'NA'

summary(dataMQpep$quant)
#>    12500am.1       12500am.2       12500am.3        125am.1     
#>  Min.   :20.87   Min.   :21.02   Min.   :19.89   Min.   :18.87  
#>  1st Qu.:22.24   1st Qu.:22.24   1st Qu.:22.42   1st Qu.:22.20  
#>  Median :23.08   Median :23.08   Median :23.08   Median :23.08  
#>  Mean   :23.38   Mean   :23.44   Mean   :23.46   Mean   :23.29  
#>  3rd Qu.:24.12   3rd Qu.:24.28   3rd Qu.:24.32   3rd Qu.:24.11  
#>  Max.   :28.65   Max.   :28.66   Max.   :28.86   Max.   :28.17  
#>  NA's   :37      NA's   :26      NA's   :28      NA's   :37     
#>     125am.2         125am.3        25000am.1       25000am.2    
#>  Min.   :20.74   Min.   :20.39   Min.   :20.58   Min.   :19.69  
#>  1st Qu.:22.24   1st Qu.:22.23   1st Qu.:22.26   1st Qu.:22.12  
#>  Median :23.08   Median :23.08   Median :23.08   Median :23.08  
#>  Mean   :23.35   Mean   :23.26   Mean   :23.48   Mean   :23.26  
#>  3rd Qu.:24.22   3rd Qu.:24.02   3rd Qu.:24.34   3rd Qu.:24.09  
#>  Max.   :28.09   Max.   :27.99   Max.   :28.75   Max.   :28.51  
#>  NA's   :35      NA's   :38      NA's   :28      NA's   :37     
#>    25000am.3        2500am.1        2500am.2        2500am.3    
#>  Min.   :18.98   Min.   :20.52   Min.   :20.85   Min.   :20.70  
#>  1st Qu.:22.08   1st Qu.:22.11   1st Qu.:22.33   1st Qu.:22.15  
#>  Median :23.08   Median :23.08   Median :23.08   Median :23.08  
#>  Mean   :23.32   Mean   :23.31   Mean   :23.37   Mean   :23.37  
#>  3rd Qu.:24.25   3rd Qu.:24.13   3rd Qu.:24.16   3rd Qu.:24.20  
#>  Max.   :28.87   Max.   :28.39   Max.   :28.46   Max.   :28.66  
#>  NA's   :24      NA's   :39      NA's   :38      NA's   :38

If the argument suplAnnotFile is set to TRUE, the files ‘summary.txt’ and ‘parameters.txt’ (produced by MaxQuant by default) will be searched in the same directory. If these files are available and seem to correspond to the quantiation date read in the main part of the function, supplemental information about experimental setup will be mined and added to the resulting object.

ProteomeDiscoverer : Import Protein Quantification

Proteome Discoverer is commercial software from ThermoFisher (www.thermofisher.com), see also Orsburn, 2021. Data exported from Proteome Discoverer can get imported (typically the xx_Proteins.txt file) using readProteomeDiscovererFile(), for details please see the vignette “UPS-1 spike-in Experiments” also available with this package. The example below is just a toy data-set, normally one can identify and quantify many more proteins.

path1 <- system.file("extdata", package="wrProteo")
fiNa <- "tinyPD_allProteins.txt.gz"
dataPD <- readProteomeDiscovererFile(file=fiNa, path=path1, suplAnnotFile=FALSE, plotGraph=FALSE)
#> readProteomeDiscovererFile : Adding supl annotation-columns
summary(dataPD$quant)
#>  Abundance.S1rep1 Abundance.S1rep2 Abundance.S1rep3 Abundance.S2rep1
#>  Min.   :17.56    Min.   :17.44    Min.   :18.32    Min.   :17.20   
#>  1st Qu.:20.15    1st Qu.:20.21    1st Qu.:20.01    1st Qu.:20.28   
#>  Median :21.72    Median :21.72    Median :21.72    Median :21.72   
#>  Mean   :21.75    Mean   :21.83    Mean   :21.71    Mean   :21.90   
#>  3rd Qu.:23.08    3rd Qu.:23.21    3rd Qu.:23.04    3rd Qu.:23.27   
#>  Max.   :28.27    Max.   :28.37    Max.   :28.26    Max.   :28.62   
#>  NA's   :1        NA's   :3        NA's   :1        NA's   :2       
#>  Abundance.S2rep2 Abundance.S2rep3
#>  Min.   :17.64    Min.   :17.22   
#>  1st Qu.:20.25    1st Qu.:20.11   
#>  Median :21.72    Median :21.72   
#>  Mean   :21.86    Mean   :21.78   
#>  3rd Qu.:23.23    3rd Qu.:23.10   
#>  Max.   :28.54    Max.   :28.47   
#>                   NA's   :2

Please note, that quantitation data exported from ProteomeDiscoverer frequently have very generic column-names (increasing numbers). When calling the import-function they can be replaced by more meaningful names either using the argument sampNa (thus, much care should be taken on the order when preparing the vector sampleNames !), or from reading the default annotation in the file ‘InputFiles.txt’ (if exported) or, from sdrf-annotation (if available). In this case, supplemental information about experimental setup will be mined and added to the resulting object.

As described with MaxQuant, additional meta-data as sdrf can be imported in the same way. For a more complex example of using readProteomeDiscovererFile() please see the vignette ‘UPS1 spike-in Experiments’ of this package.

ProteomeDiscoverer : Import Peptide Data

Similarly it is possible to read the peptide-data files exported by ProteomeDiscoverer using the function readProtDiscovererPeptides(). The data get imported to a similar structure like the protein-level data, quantitations on peptide level by default median-normalized, sample-setup from sdrf-files may be added, too.

DIA-NN: Import Protein Quantification Data

DIA-NN is free software provided by the by Demichev, Ralser and Lilley labs, see also Demichev et al, 2020. Typically DIA-NN allows exporting quantitation data on level of consensus-proteins as tsv-formatted files. Such data can get imported (and normalized) using readDiaNNFile(). The example below is just a toy data-set, normally one can identify and quantify many more proteins.

diaNNFi1 <- "tinyDiaNN1.tsv.gz"
## This file contains much less identifications than one may usually obtain
path1 <- system.file("extdata", package="wrProteo")
## let's define the main species and allow tagging some contaminants
specPref1 <- c(conta="conta|CON_|LYSC_CHICK", mainSpecies="HUMAN")
dataNN <- readDiaNNFile(path1, file=diaNNFi1, specPref=specPref1, tit="Tiny DIA-NN Data", plotGraph=FALSE)
summary(dataNN$quant)
#>        1               e2              e3       
#>  Min.   :13.39   Min.   :13.02   Min.   :13.82  
#>  1st Qu.:15.79   1st Qu.:15.83   1st Qu.:15.96  
#>  Median :17.30   Median :17.30   Median :17.30  
#>  Mean   :17.23   Mean   :17.38   Mean   :17.43  
#>  3rd Qu.:18.55   3rd Qu.:18.79   3rd Qu.:18.82  
#>  Max.   :22.93   Max.   :23.27   Max.   :23.26  
#>  NA's   :21      NA's   :4

DIA-NN : Import Peptide Data

Similarly data from DIA-NN on peptide level can get imported (and normalized) using readDiaNNPeptides().

Proline : Import Protein Quantification Data

Proline is free software provided by the Profi-consortium, see also Bouyssié et al 2020. Data exported from Proline (xlsx, csv or tsv format) can get imported using readProlineFile(). The example below is just a toy data-set, normally one can identify and quantify many more proteins.

path1 <- system.file("extdata", package="wrProteo")
fiNa <- "exampleProlineABC.csv.gz"                  # gz compressed data can be read, too
dataPL <- readProlineFile(file=fiNa, path=path1, plotGraph=FALSE)
summary(dataPL$quant[,1:8])
#>      A_01_t          A_02_t          A_03_t          A_04_t     
#>  Min.   :14.11   Min.   :14.06   Min.   :14.03   Min.   :14.34  
#>  1st Qu.:19.11   1st Qu.:19.23   1st Qu.:19.15   1st Qu.:19.37  
#>  Median :20.65   Median :20.65   Median :20.65   Median :20.65  
#>  Mean   :20.83   Mean   :20.92   Mean   :20.86   Mean   :20.99  
#>  3rd Qu.:22.35   3rd Qu.:22.43   3rd Qu.:22.47   3rd Qu.:22.47  
#>  Max.   :28.51   Max.   :28.59   Max.   :28.54   Max.   :28.72  
#>  NA's   :29      NA's   :27      NA's   :21      NA's   :27     
#>      B_01_t          B_02_t          B_03_t          B_04_t     
#>  Min.   :16.13   Min.   :12.58   Min.   :15.70   Min.   :13.85  
#>  1st Qu.:19.18   1st Qu.:19.33   1st Qu.:19.16   1st Qu.:19.09  
#>  Median :20.65   Median :20.65   Median :20.65   Median :20.65  
#>  Mean   :20.78   Mean   :20.85   Mean   :20.80   Mean   :20.81  
#>  3rd Qu.:22.37   3rd Qu.:22.51   3rd Qu.:22.38   3rd Qu.:22.42  
#>  Max.   :27.96   Max.   :28.17   Max.   :28.13   Max.   :28.23  
#>  NA's   :77      NA's   :77      NA's   :76      NA's   :73

As described with MaxQuant, additional meta-data as sdrf can be imported in the same way. For a more complex example of using readProlineFile() please see the vignette ‘UPS1 spike-in Experiments’ from this package.

Fragpipe : Import Protein Quantification Data

Fragpipe is a database search tool for peptide identification, open-source developed by the Nesvizhskii lab, see eg Kong et al 2017, da Veiga Leprevost; et al 2020 or other related publications. Data exported from Fragpipe (in tsv format) can get imported using readFragpipeFile(). The example below is just a toy data-set, normally one can identify and quantify many more proteins.

FPproFi1 <- "tinyFragpipe1.tsv.gz"
## let's define the main species and allow tagging some contaminants
specPref1 <- c(conta="conta|CON_|LYSC_CHICK", mainSpecies="MOUSE")
dataFP <- readFragpipeFile(path1, file=FPproFi1, specPref=specPref1, tit="Tiny Fragpipe Example", plotGraph=FALSE)
#> readFragpipeFile : Count by 'specPref' : Bos taurus: 1 ;  Homo sapiens: 5 ;  Mus muscullus: 1 ;  Mus musculus: 92 ;  Sus scrofa: 1 ;
#> readFragpipeFile : Removing 24 lines/proteins removed as NOT passing protein identification filter at 0.99
summary(dataFP$quant)
#>       A_1             A_2             B_1             B_2       
#>  Min.   :15.09   Min.   :13.73   Min.   :14.48   Min.   :15.53  
#>  1st Qu.:17.89   1st Qu.:18.21   1st Qu.:18.78   1st Qu.:18.58  
#>  Median :19.94   Median :19.94   Median :19.94   Median :19.94  
#>  Mean   :19.93   Mean   :20.13   Mean   :20.54   Mean   :20.39  
#>  3rd Qu.:21.43   3rd Qu.:21.68   3rd Qu.:21.92   3rd Qu.:21.78  
#>  Max.   :30.64   Max.   :30.41   Max.   :30.32   Max.   :29.99  
#>  NA's   :11      NA's   :10      NA's   :15      NA's   :14     
#>       B_3             B_4             C_1             C_2       
#>  Min.   :13.31   Min.   :13.96   Min.   :14.77   Min.   :14.74  
#>  1st Qu.:18.57   1st Qu.:18.46   1st Qu.:18.51   1st Qu.:18.73  
#>  Median :19.94   Median :19.94   Median :19.94   Median :19.94  
#>  Mean   :20.38   Mean   :20.40   Mean   :20.29   Mean   :20.66  
#>  3rd Qu.:21.80   3rd Qu.:21.94   3rd Qu.:21.63   3rd Qu.:21.84  
#>  Max.   :30.16   Max.   :30.46   Max.   :30.17   Max.   :30.63  
#>  NA's   :14      NA's   :14      NA's   :12      NA's   :12     
#>       C_3             C_4       
#>  Min.   :13.94   Min.   :14.89  
#>  1st Qu.:18.36   1st Qu.:18.85  
#>  Median :19.94   Median :19.94  
#>  Mean   :20.30   Mean   :20.41  
#>  3rd Qu.:21.78   3rd Qu.:21.70  
#>  Max.   :29.89   Max.   :30.46  
#>  NA's   :10      NA's   :12