Data import
read_cross2
- read data for a cross from a set of files
fread_csv
- read a csv file, using a particular set of options
fread_csv_numer
- like read_csv but assuming the contents are strictly
numeric
read_pheno
- read phenotype data from a CSV file, plus (optionally) phenotype
covariate data from a separate CSV file
write_control_file
- write the control file for a set of QTL data
zip_datafiles
- zip a set of data files (in the format read by
read_cross2)
Data subsetting
subset.cross2
- subset a cross2 object by individuals or chromosomes
"[".cross2
- shorthand for subset.cross2, with the form
mycross[ind, chr]
subset.calc_genoprob
subset genotype probabilties by indivduals or chromosomes
"[".calc_genoprob
- shorthand for subset.calc_genoprob, with the form
probs[ind, chr]
subset.scan1
- subset genome scan results by chromosome or column
subset_scan1
- the same as subset.scan1
subset.sim_geno
- subset genotype imputations by individuals or chromosomes
"[".sim_geno
- shorthand for subset.sim_geno, with the form
simg[ind, chr]
subset.viterbi
- subset inferred genotypes
"[".viterbi
- shorthand for subset.viterbi, with the form
geno[ind, chr]
drop_markers
- drop a vector of markers from a "cross2" object
drop_nullmarkers
- drop markers with no genotypes from a "cross2"
object
pull_markers
- drop all except a vector of markers from "cross2"
object
find_dup_markers
- find sets of markers with identical genotype data
subset.phasedgeno
- subset phased genotype objects, as output by
guess_phase()
"[".phasedgeno
- shorthand for subset.phasedgeno with the form
ph[ind, chr]
Combining data
cbind_expand
- Like cbind() but using row names to align the rows and
expanding with missing values as necessary
cbind.calc_genoprob
- combine genotype probabilities for multiple chromosomes but on the
same set of individuals
rbind.calc_genoprob
- combine genotype probabilities for different individuals
cbind.scan1
- combine genome scan results for multiple phenotypes/analyses
rbind.scan1
- combine genome scan results for different chromosomes
c.scan1perm
- combine genome scan permutation results for multiple replicates
cbind.scan1perm
- combine genome scan permutation results for multiple
phenotypes/analyses
rbind.scan1perm
- combine genome scan permutation results for multiple chromosomes
cbind.sim_geno
- combine genotype imputations for multiple chromosomes but on the same
set of individuals
rbind.sim_geno
- combine genotype imputations for different individuals
cbind.viterbi
- combine inferred genotypes for multiple chromosomes but on the same
set of individuals
rbind.viterbi
- combine inferred genotypes for different individuals
cbind.phasedgeno
- combined phased genotypes for different chromosomes but on the same
set of individuals
rbind.phasedgeno
- combined phased genotypes for different individuals but on the same
set of chromosomes
Genotype reconstruction
calc_genoprob
- calculate conditional genotype probabilities given marker data
clean_genoprob
- clean up genotype probabilities, setting small values to 0
genoprob_to_alleleprob
- convert genotype probabilities to allele dosages
genoprob_to_snpprob
- convert genotype probabilities to SNP probabilities
interp_genoprob
- linear interpolation of genotype probabilities, for example to get two
sets onto the same map for comparison purposes
probs_to_grid
- subset genotype probabilities to a grid of pseudomarkers
pull_genoprobpos
- pull out the genotype probabilities for a particular position
pull_genoprobint
- pull out the genotype probabilities for an interval
Genotype imputation
maxmarg
- for each individual at each position, find genotype with maximum
marginal probability
guess_phase
- turn imputed genotypes into phased genotypes along chromosomes
sim_geno
- multiple imputations of underlying genotypes given marker data
viterbi
- find mostly likely sequence of true genotypes given marker data
predict_snpgeno
- predict SNP genotypes in a multiparent population from inferred
genotypes plus founder strains’ SNP alleles.
Kinship matrix calculations
calc_kinship
- calculate genetic similarity among individuals
decomp_kinship
- calculate eigen decomposition of a kinship matrix
scale_kinship
- scale kinship matrix to be like a correlation matrix
Marker maps
est_map
- re-estimate the inter-marker distances in a genetic map
insert_pseudomarkers
- add pseudomarkers into a map of genetic markers
calc_grid
- Calculate indicators of which pseudomarker positions are along a fixed
grid
map_to_grid
- subset a map object to the locations on some grid
interp_map
- Use interpolate to convert from one map to another
reduce_markers
- Reduce marker map to the largest subset that are some distance
apart
smooth_gmap
- Smooth genetic map by mixing it with a bit of constant
recombination
unsmooth_gmap
- Performs the reverse operation of smooth_gmap()
QTL analysis
est_herit
- estimate heritability with linear mixed model
fit1
- fit a single-QTL model at a single position
scan1
- genome scan with a single-QTL model
scan1perm
- permutation test to establish statistical significance in genome
scan
scan1coef
- calculate QTL effects in scan along one chromosome
scan1blup
- like scan1coef, but calculating treating QTL effects as
random and calculating BLUPs
scan1snps
- single-QTL scan over SNPs in a multi-parent population
scan1max
- genome-wide maximum LOD score from genome scan
QTL summaries
maxlod
- calculate genome-wide maximum LOD score in genome scan results
max.scan1
- calculate maximum LOD score in genome scan and the position at which
it occurred
max_scan1
- the same as max.scan1
find_peaks
- find QTL peaks in genome scan results
lod_int
- calculate LOD support intervals from genome scan results
bayes_int
- calculate approximate Bayes intervals for QTL position from genome
scan results
summary.scan1perm
- calculate significance thresholds from genome scan permutation
results
summary_scan1perm
- same as summary.scan1perm
top_snps
- find the top SNPs from a SNP association scan
calc_hotspots
- count QTL in a sliding window, to identify QTL hotspots
Data diagnostics
check_cross2
- check for inconsistencies or errors in a "cross2"
object
calc_entropy
- calculate entropy from genotype probabilities, for each individual and
position
calc_errorlod
- calculate genotyping error LOD scores to help identify potential
genotyping errors and problem markers or individuals
calc_geno_freq
- calculate genotype frequencies, by individual or marker, from genotype
probabilities
calc_het
- Calculate heterozygosities, by individual or marker, from genotype
probabilities
chisq_colpairs
- Perform chi-square test for independence for all pairs of columns of a
matrix
compare_geno
- compare genotypes for all pairs of individuals, to look for possible
sample duplicates
compare_founder_geno
- compare genotypes for all pairs of founders in a multi-parent
population, to assess similarities
compare_genoprob
- compare two sets of genotype probabilities for one individual on a
single chromosome
summary.compare_geno
- summarize the results of compare_geno
summary_compare_geno
- same as summary.compare_geno
max.compare_geno
- from the results of compare_geno, show the pair with most
similar genotypes
max_compare_geno
- same as max.compare_geno
count_xo
- count the number of crossovers in each individual on each chromosome,
from matrices of inferred genotypes
locate_xo
- locate the positions of crossovers in each individual on each
chromosome, from matrices of inferred genotypes.
find_ibd_segments
- in genotypes of a set of inbred lines, find genomic segments that are
identity-by-descent (IBD)
compare_maps
- compare two marker maps, to identify markers present in one but not
the other, or on different chromosomes or in different orders between
the maps.
find_map_gaps
- find large gaps between markers in a genetic map
reduce_map_gaps
- reduce the lengths of gaps in a genetic map
calc_raw_het
- Calculate heterozygosity in the raw SNP genotypes
calc_raw_maf
- Calculate the minor allele frequency in the raw SNP genotypes
calc_raw_geno_freq
- Calculate the genotype frequencies in the raw SNP data
calc_raw_founder_maf
- Calculate the minor allele frequency in the founder strains’ SNP
genotypes
Data summaries
summary.cross2
- summarize a "cross2" object
chr_names
- names of chromosomes in a "cross2" object
marker_names
- names of markers in a "cross2" object
pheno_names
- names of phenotypes in a "cross2" object
phenocovar_names
- names of “phenotype covariates” (metadata about phenotypes) in a
"cross2" object
covar_names
- names of covariates in a "cross2" object
ind_ids
- return IDs for all individuals in a "cross2" object
ind_ids_geno
- return IDs for all individuals in a "cross2" object that
have genotype data
ind_ids_pheno
- return IDs for all individuals in a "cross2" object that
have phenotype data
ind_ids_gnp
- return IDs for all individuals in a "cross2" object that
have both genotype and phenotype data
ind_ids_covar
- return IDs for all individuals in a "cross2" object that
have covariate data
n_chr
- number of chromosomes in a "cross2" object
n_ind
- number of individuals in a "cross2" object
n_ind_geno
- number of individuals in a "cross2" object that have
genotype data
n_ind_pheno
- number of individuals in a "cross2" object that have
phenotype data
n_ind_gnp
- number of individuals in a "cross2" object that have both
genotype and phenotype data
n_ind_covar
- number of individuals in a "cross2" object that have
covariate data
n_mar
- number of markers on each chromosome in a "cross2"
object
tot_mar
- total number of markers in a "cross2" object
n_pheno
- number of phenotypes in a "cross2" object
n_covar
- number of covariates in a "cross2" object
n_phenocovar
- number of “phenotype covariates” (metadata on phenotypes) in a
"cross2" object
chr_lengths
- calculate chromosome lengths for a map object
find_marker
- find marker closest to a particular genomic position
find_markerpos
- find the position of a marker
n_missing
- number of missing genotypes, by individual or marker
n_typed
- number of genotypes, by individual or marker
founders
- names of the founder strains
n_founders
- number of founder strains
QTL plots
plot.scan1
- plot genome scan results
plot_scan1
- same as plot.scan1
xpos_scan1
- determine the x-axis location of a particular genomic position in a
genome scan plot (for adding annotations)
add_threshold
- Add horizontal line at a significance threshold to a genome scan
plot.
plot.scan1coef
- plot QTL effects along a chromosome
plot_coef
- same as plot.scan1coef
plot_coefCC
- like plot_coef but assuming there are 8 effects and using
the standard colors for the Collaborative Cross
(CCcolors)
plot_snpasso
- plot SNP association results
plot_genes
- plot locations of a set of genes
plot_sdp
- plot strain distribution patterns of SNPs in a region
plot_peaks
- plot a summary of QTL positions for multiple phenotypes, using the
results of find_peaks
plot_lodpeaks
- scatterplot of LOD scores vs QTL peak locations (possibly with
intervals) for multiple traits
plot_pxg
- plot phenotype versus QTL genotypes
plot_ci
- Plot a set of confidence intervals (for example, of QTL effects at a
fixed QTL position)
plot_cistrans
- scatterplot of gene location vs QTL location to display eQTL/pQTL
results
plot_scan1_heatmap
- plot LOD curves for multiple traits as a heat map
plot_colorscale
- plot a color scale (for use with plot_scan1_heatmap)
Diagnostic plots
plot.calc_genoprob
- plot the genotype probabilities for one individual on one chromosome,
as a heat map
plot_genoprob
- the same as plot.calc_genoprob
plot_onegeno
- plot one individual’s genome-wide genotypes
plot_genoprobcomp
- plot a comparison of two sets of genotype probabilities for one
individual on one chromosome, as a bivariate heat map
plot_compare_geno
- Plot histogram of the results of compare_geno()
plot.compare_geno
- Same as plot_compare_geno()
plot_geno
- plot genome-wide genotypes for multiple individuals
SNP/gene databases
create_variant_query_func
- create a function to connect to a SQLite database of founder variant
information and return a data frame with variants for a selected
region
create_gene_query_func
- create a function to connect to a SQLite database of gene annotations
and return a data frame with genes in a selected region
calc_sdp
- convert founder SNP genotypes to a numeric code for the strain
distribution pattern
invert_sdp
- the inverse of calc_sdp
index_snps
- partition SNPs into groups that are contained within common marker
intervals and have the same strain distribution pattern, and create an
index to a set of distinct SNPs, one per partition
find_index_snp
- For a particular SNP, find the corresponding indexed SNP.
create_snpinfo
- Create a table of SNP information from a cross2 object.
sdp2char
- convert strain distribution pattern numeric codes to more meaningful
character strings
Utility functions
batch_cols
- identify batches of columns of a matrix that have the same pattern of
missing values
batch_vec
- split a vector into batches, for help in balancing parallel code
get_common_ids
- find IDs that are present in all of the input objects
get_x_covar
- from a "cross2" object, get the matrix of covariates to
be used for the null hypothesis when performing QTL analysis on the X
chromosome
mat2strata
- use the rows of a matrix to define a set of strata for a stratified
permutation test
replace_ids
- Replace the individual IDs in an object
replace_ids.calc_genoprob
- Replace the individual IDs in a "calc_genoprob"
object
replace_ids.cross2
- Replace the individual IDs in a "cross2" object
replace_ids.sim_geno
- Replace the individual IDs in a "sim_geno" object
replace_ids.viterbi
- Replace the individual IDs in a "viterbi" object
replace_ids.data.frame
- Replace the individual IDs (in row names) in a data frame
replace_ids.matrix
- Replace the individual IDs (in row names) in a matrix
align_scan1_map
- aligns the markers/pseudomarkers in a "scan1" object
(output by scan1()) and a marker map.
clean
- clean an object
clean.scan1
- clean a "scan" object (replacing negative values with
NA and removing rows where all values are
NA.
clean_scan1
- the same as clean.scan1.
clean.calc_genoprob
- clean a "calc_genoprob" object (setting small values to
0)
clean_genoprob
- same as clean.calc_genoprob
qtl2version
- print the installed version of R/qtl2
recode_snps
- Recode the SNP genotypes so that 1 is for the major
allele in the founders
convert2cross2
- convert an R/qtl1 "cross" object to the R/qtl2
"cross2" format
dim.calc_genoprob - Get dimensions of
"calc_genoprob" object
dimnames.calc_genoprob - Get dimension names of
"calc_genoprob" object
Newly added functions (in development version)
scan1gen - General genome scan with user-supplied
fitting function