BRESEQ :: Summary Statistics

breseq version 0.33.1 revision 8505477f25b3
mutation predictions | marginal predictions | summary statistics | genome diff | command line log

Read File Information

	read file	reads	bases	passed filters	average	longest	mapped
errors	E2C3_S167_R1_001.good.fq	1,652,083	91,175,929	100.0%	55.2 bases	62 bases	58.3%
	total	1,652,083	91,175,929	100.0%	55.2 bases	62 bases	58.3%

Reference Sequence Information

		seq id	length	fit mean	fit dispersion	% mapped reads	description
coverage	distribution	W3110S.gb	4,646,334	[53.4]	[47.6]	100.0%	Escherichia coli str. K-12 substr. W3110 DNA, complete genome.
		total	4,646,334			100.0%

fit dispersion is the ratio of the variance to the mean for the negative binomial fit. It is =1 for Poisson and >1 for over-dispersed data.

Fit failed Negative binomial fit failed for this reference sequence. It may have an unusual coverage depth distribution. JC and MC predictions may be less accurate.

New Junction Evidence

Junction Candidates Tested

option	limit	actual
Number of alignment pairs examined for constructing junction candidates	≤ 100000	5573
Coverage evenness (position-hash) score of junction candidates	≥ 2	≥ 3
Test this many junction candidates (n). May be smaller if not enough passed the coverage evenness threshold	100 ≤ n ≤ 5000	1
Total length of all junction candidates (factor times the reference genome length)	≤ 0.1	0.000

Junction Skew Score Calculation

reference sequence	pr(no read start)
W3110S.gb	NA

pr(no read start) is the probability that there will not be an aligned read whose first base matches a given position on a given strand.

Final Junction Predictions

option	value
Coverage evenness (position-hash) score of predicted junctions must be	≥ 3
Skew score of predicted junction (−log10 probability of unusual coverage evenness) must be	≤ 3
Number of bases that at least one read must overlap each uniquely aligned side of a predicted junction	≥ 1

Read Alignment Evidence

option	value
Mode	Consensus/Mixed Base
Ploidy	1 (haploid)
Consensus mutation E-value cutoff	10
Consensus frequency cutoff	0.75
Consensus minimum variant coverage each strand	OFF
Consensus minimum total coverage each strand	OFF
Consensus minimum variant coverage	OFF
Consensus minimum total coverage	OFF
Polymorphism E-value cutoff	10
Polymorphism frequency cutoff	0.2
Polymorphism minimum variant coverage each strand	OFF
Polymorphism minimum total coverage each strand	OFF
Polymorphism minimum variant coverage	OFF
Polymorphism minimum total coverage	OFF
Polymorphism bias cutoff	OFF
Predict indel polymorphisms	YES
Skip indel polymorphisms in homopolymers runs of	OFF
Skip base substitutions when they create a homopolymer flanked on each side by	OFF

Software Versions

program	version
bowtie2	2.3.4.1
R	3.4.4

Execution Times

step	start	end	elapsed
Read and reference sequence file input	14:24:08 10 Dec 2019	14:24:25 10 Dec 2019	17 seconds
Read alignment to reference genome	14:24:25 10 Dec 2019	14:36:27 10 Dec 2019	12 minutes 2 seconds
Preprocessing alignments for candidate junction identification	14:36:27 10 Dec 2019	14:36:45 10 Dec 2019	18 seconds
Preliminary analysis of coverage distribution	14:36:45 10 Dec 2019	14:36:59 10 Dec 2019	14 seconds
Identifying junction candidates	14:36:59 10 Dec 2019	14:37:00 10 Dec 2019	1 second
Re-alignment to junction candidates	14:37:00 10 Dec 2019	14:37:31 10 Dec 2019	31 seconds
Resolving best read alignments	14:37:31 10 Dec 2019	14:37:57 10 Dec 2019	26 seconds
Creating BAM files	14:37:57 10 Dec 2019	14:38:10 10 Dec 2019	13 seconds
Tabulating error counts	14:38:10 10 Dec 2019	14:38:14 10 Dec 2019	4 seconds
Re-calibrating base error rates	14:38:14 10 Dec 2019	14:38:15 10 Dec 2019	1 second
Examining read alignment evidence	14:38:15 10 Dec 2019	14:39:30 10 Dec 2019	1 minute 15 seconds
Polymorphism statistics	14:39:30 10 Dec 2019	14:39:31 10 Dec 2019	1 second
Output	14:39:31 10 Dec 2019	16:12:19 10 Dec 2019	1 hour 32 minutes 48 seconds
Total			1 hour 48 minutes 11 seconds