--- title: "Creating, updating & maintaining the statquotes data base" output: rmarkdown::html_vignette vignette: > %\VignetteIndexEntry{Creating, updating & maintaining the statquotes data base} %\VignetteEngine{knitr::rmarkdown} %\VignetteEncoding{UTF-8} --- ```{r, include = FALSE} knitr::opts_chunk$set( collapse = TRUE, comment = "#>" ) ``` ```{r setup} library(statquotes) ``` ## Primary format for quotes file The quotes are stored in the `data-raw/quotes_raw.txt` file. The script `data-raw/convert_quotes_to_rda.R` can be used to read these quotes and save them to `data/quotes.rda`, which is the main data file used in the package. The `quotes_raw.txt` file uses the following format for each quotation. Lines beginning with "%" are comments and ignored. Other lines contain a "key:value" pair. The key is used to identify the right column when building the `quotes` data.frame: ``` % Comment quo: This is a quotation. src: Person or persons who said or wrote the quote. cit: Citation for the original quote. url: URL where the quote can be found (such as journal articles). tag: Comma-separated tags to categorize the quote. tex: TeX-formatted citation ``` Here is an example: ``` quo: A judicious man looks at Statistics, not to get knowledge, but to save himself from having ignorance foisted on him. src: Thomas Carlyle cit: Chartism, 1840, Chapter II, Statistics url: https://en.wikisource.org/wiki/Chartism/Chapter_2 tag: data visualization ``` ## LaTeX format for quotes file The `statquotes` package originally arose from a LaTeX file, that Michael Friendly used to collect interesting quotations related to statistics, data visualization, history, software and other topics. This was designed to be a collection that a person could search, then copy/paste an appropriate one into a working LaTeX document. The format of quotes was designed to use the LaTeX `epigraph` package: ``` \epigraph{You can see a lot, just by looking.}{Yogi Berra} \epigraph{Every picture tells a story.}{Rod Stewart, 1971} \epigraph{A picture is worth a thousand words.}{F. Barnard, 1927} ``` Each quote has some `text` and a `source` attribution, and so could be displayed in a document something like > *You can see a lot, just by looking.* > --- Yogi Berra Some of the quotes have manually-added `tags` to classify the quotes into groups. As many tags as desired can be added to a quote.