editor_options | ||||
---|---|---|---|---|
|
This is the first released version on github, following a lengthy period of internal development. Many things have changed in a major overhaul of the package.
summariseDMRsByGene
now correctly summarises genes when they have different positions or lengths in different windows.plotCorrelationMatrix
no longer crashes when no annotation is being used.
- Examples added for some functions, and documentation for some related functions merged.
- Added more tests following inspection of
covr
coverage. getPCA()
andgetUMAP()
now return an object of classmesaDimRed
, which wraps the previous list based output. This object now contains a copy of the sampleTable, which may be edited usingmutate
andleft_join
.- A new example qseaSet for a small portion of the mouse genome.
- Added the ability to specifying a biomaRt object directly on a qseaSet for used for finding gene annotation in
plotGeneHeatmap
.
plotPCA()
andplotUMAP()
no longer require passing the qseaSet, as the sampleTable is stored in the object.- Added functions
setMesaGenome
,setMesaTxDb
andsetMesaAnnoDb
to set global defaults for the annotation packages required byannotateWindows
. This has the effect thatannotateWindows
will no longer assume hg38 by default with no arguments. summariseDMRsByGene
function now requiresannotateWindows
to have already been called on the DMRs (previously it called this internally if necessary).- Reduced the number of messages produced when generating tables of data.
poolSamples
now returns data frames instead of matrices in the libraries slot.select
now works when dropping a column.- PCA/UMAP functions now give a more informative error if there are insufficient regions or samples.
downSample
now works correctly again following changes to howtable
andenframe
interact.calculateDMRs
now correctly returns an empty data frame if no DMRs are found.
plotPCA
now works when colouring by a factor variablemutate
now cannot be used to change the sample_name in the sample table of a qseaSet which breaks the object.renameQsetNames
orrenameSamples
must be used for this.
- HMMCopy related functions now take explicit input of the GC and mappability tracks, rather than having hardcoded internal hg38 objects, allowing for different genomes.
- Removed the data objects
gc_hg38_10kb
andmap_hg38_10kb
(GC and mappability tracks for hg38 at 10kb resolution) for package size reasons.
-
Fixed how the relative enrichment (relH/GoGe) calculation is performed (
addMedipsEnrichmentFactors
) when using the qsea default method for reads (#1). -
Filtering a
qseaSet
now works correctly when aqseaSet
has nocnv
data. -
Fixed
pull
to work directly on aqseaSet
to return a column of the sampleTable.
-
Removed
mutateQset
,filterQset
,arrangeQset
,sortQset
,pullQset
,leftJoinQset
as these just work on thedplyr
verbs directly. -
Combined
getBetaMeans
andgetNormalisedReadSum
into new function calledsummariseAcrossWindows
. This allows an arbitrary function to be (e.g. mean or sd) to be applied over all the windows in each sample, and returns a data frame with one result per sample. -
Added a new function
addSummaryAcrossWindows
that performssummariseAcrossWindows
but returns theqseaSet
with the results added to thesampleTable
. -
Added a function
asValidGranges
that checks if an object can be coerced to a GRanges object and does so if possible. -
Added
setMesaGenome
function to globally set which genome to be used for plotting functions. -
Standardised on using "Windows" instead of "Regions". For instance,
filterRegions
is nowfilterWindows
, andplotGRangesHeatmap
is nowplotRegionsHeatmap
. Added a functiongetWindows
to mimicgetRegions
from qsea. -
Complete overhaul of the PCA functionality.
- This is now split into two functions,
getPCA
andplotPCA
, and thegetPCAwithBatch
function has been removed.
- This is now split into two functions,
-
UMAP functionality added, using the same framework as PCAs, with
getUMAP
andgetPCA
. This is still experimental. -
Changed
pivotDMRsLonger
to return columns "group1
" and "group2
" instead of "sample1
" and "sample2
". -
In DMR results, moved the
betaDelta
columns to be after the equivalentadjPval
columns instead of all being at the end. -
Renamed the
contrastsToDo
argument ofmakeDMRs
to justcontrasts
. -
mixSamples
(formerlymixQsetSamples
) no longer setstype
andtumour
in thesampleTable
of the returnedqseaSet
. -
Changes to heatmap plotting functions (
plotRegionsHeatmap
andplotGeneHeatmap
,plotCNVHeatmap
):-
plotGRangesHeatmap
is now renamed asplotRegionsHeatmap
-
Both support a
useGroups
argument, to determine whether to average over the "group" column of the sampleTable or not. -
Argument
signatureGR
ofplotRegionsHeatmap
changed toregionsToOverlap
. -
Changed from
pheatmap
package toComplexHeatmap
package for the plotting. -
The method of adding sample annotations has been overhauled. This is now specified using
sampleAnnotation
(instead ofannotationCol
), and does not support the use of thegetAnnotationDataFrame
function. These options now directly take the name of the features to be plotted, supporting unquoted tidy evaluation, i.e. all of these work:-
sampleAnnotation = group
-
sampleAnnotation = c(group, type)
-
sampleAnnotation = c("group","type")
-
-
Heatmaps can now have windows annotated with the use of
windowAnnotation
. This supports unquoted tidy evaluation as well, and can use anything on the qseaSet regions slot or on theregionsToOverlap
argument. -
The
getAnnotationDataFrame
andgetAnnotationDataFrameIndividual
functions have been removed. -
The colours for the annotations have been overhauled. These now try to choose appropriate colour scales for continuous data based on whether they are strictly positive or not. For discrete annotations, the colours are set globally for the entire annotation set, to try and prevent the occurrance of different annotations using very similar colours.
-
-
getDataTable
function can now keep the suffix (e.g._nrpm
or_beta
) if required. -
removeWindowsOverCutoff
andkeepWindowsOverCutoff
have been replaced by a more general function calledsubsetWindowsBySignal
. This allows for a general function to be specified (for instance mean or max) as well as a threshold, enabling for the filtering of windows based on that function. -
Parallelisation of functions is now controlled explicitly by the
setMesaParallel
function. This must be used to enable multicore evaluation. -
When calculating DMRs, by default the
qseaSet
will now be filtered to only include samples which are present in the contrasts for the calculating of the dispersion estimates in the generalised linear model. This can be controlled by thecalcDispersionAll
option. The pre-filtering of windows based on expression is unchanged - this only uses samples included in the contrasts. -
The difference in mean beta values in the DMR output has been renamed from
betaDelta
todeltaBeta
to reflect the mathematical usage \tex{\Delta \beta}. -
Updated documentation in many functions.
-
Authors updated to include Paddy Harker, Kevin Brennan and Katarzyna Kamieniecka.
-
Removed exporting of internal functions:
dropAvgFragDetails
,getBamCoveragePairedAndUnpairedR1
,mixThreeQsetSamples
,getCGPositions
,fitQseaGLM
,getDMRsData
,subsetWindowsOverBackground
-
Renamed many functions throughout the package:
Previous Function Name New Function Name addQseaNormalisationSteps
addNormalisation
annotateData
annotateWindows
calculateArrayBetas
convertToArrayBetaTable
calculateCpGEnrichment
calculateCGEnrichment
calculateCpGEnrichmentGRanges
calculateCGEnrichmentGRanges
countWindowsOverCutoff
countWindowsAboveCutoff
downsampleQsea
downSample
filterRegions
filterWindows
liftOverhg19
liftOverHg19
mixQsetSamples
mixSamples
plotDMRUpSet
plotDMRUpset
qseaWriteDMRsBeds
writeDMRsToBed
qseaWriteDMRsExcel
writeDMRsToExcel
relabelQset
renameSamples
summariseByGene
summariseDMRsByGene
-
Six functions have been replaced by different functions (detailed above)
+---------------------------+--------------------------+ | Previous Function | Replacement Function(s) | +===========================+==========================+ |
getQseaPCA
|getPCA
/plotPCA
| | | | |getPCAwithBatch
| | +---------------------------+--------------------------+ |getBetaMeans
|summariseAcrossWindows
| | | | |getNormalisedReadSum
| | +---------------------------+--------------------------+ |removeWindowsOverCutoff
|subsetWindowsBySignal
| | | | |keepWindowsOverCutoff
| | +---------------------------+--------------------------+ -
Renamed data objects:
Previous data name New data name encodeBlacklist
ENCODEbadRegions
examplePairedTumourQset
exampleTumourNormal
-
plotGeneHeatmap
now usesbioMaRt
to find the gene details, based on the genome or mart supplied. -
filterByOverlaps
now works when the `qseaSet` regions contains "chr" but the regions to filter by do not or vice versa. -
Removed all references to
PooledControl
sample (previously suggested for normalisation). -
Various R CMD check warning issues fixed.
-
filterByOverlaps
no longer allows the option to take a list of window ID numbers. -
Fixed
addMedipsEnrichmentFactors
function insidemakeQset
when using the"qseaPaired"
coverageMethod. -
Fixed tests for
addMedipsEnrichmentFactors
- Update mixQsetSamples to match the mixArrayWithQset code (and now runs with latest version of R).
- Less stringent checking on the p values being zero bug in qsea, to minimise false positives.
* pull
is not working directly, pullQset
must be used.
Version set for initial release in v2.0.1 of the internal CBC Nextflow pipeline.