Data Scientist / Statistical Programmer
Data Scientist (noun): A person who is better at statistics than any software engineer, and better at software engineering than any statistician.
I completed an MSc in Statistics at the University of British Columbia, under the supervision of Jenny Bryan. My thesis (PDF) involved the study of gene-gene and gene-Rx interaction effects, and statistical assessment of these epistatic effects in yeast growth curves.
A set of miscellaneous functions for everything from data subsetting and reshaping functions for
faster and easier file I/O and wrapper functions for easy generation of C++-backed apply and tapply-style
functions. Essentially, this package is a grab-bag of functions that have made my life easier
when programming in R.
I contributed functions for querying SNP information from NCBI's dbSNP, as well
as an easy wrapper to the Broad Institute's SNAP for identifying SNPs in high linkage disequilibrium
with SNPs of interest.
This project exports a function,
makesvg, that can be used to generate tool-tip augmented
SVG graphics from R lattice plots. The tool-tip dialog information is inherited from the data frame
containing the vectors to be plotted, and the tool-tips are dynamically displayed on mouseover
The Rcpp Gallery is a set of user-submitted articles on using Rcpp to write, compile and link C++ code that can operate on R objects. I have contributed a number of articles on the basics of using Rcpp:
PublicationsSynthetic Lethality of Cohesins with PARPs and Replication Fork Mediators (PLOS Genetics)
Jessica L. McLellan, Nigel J. O'Neil, Irene Barrett, Elizabeth Ferree, Derek M. van Pel, Kevin Ushey, Payal Sipahimalani, Jennifer Bryan, Ann M. Rose, Philip Hieter