BRESEQ :: Summary Statistics

breseq version 0.32.0 revision 6ff6de7d1b87
mutation predictions | marginal predictions | summary statistics | genome diff | command line log

Read File Information

	read file	reads	bases	passed filters	average	longest	mapped
errors	pgi-8-2_S12_L001_R1_001.good.fq	1,220,674	281,979,126	100.0%	231.0 bases	239 bases	99.7%
errors	pgi-8-2_S12_L001_R2_001.good.fq	1,220,674	282,344,406	100.0%	231.3 bases	239 bases	93.1%
	total	2,441,348	564,323,532	100.0%	231.2 bases	239 bases	96.4%

		seq id	length	fit mean	fit dispersion	% mapped reads	description
coverage	distribution	NC_000913	4,641,653	121.7	7.4	100.0%	Escherichia coli str. K-12 sbstr. MG1655, complete genome. Live strain from Systems Biology Research Group
		total	4,641,653			100.0%

fit dispersion is the ratio of the variance to the mean for the negative binomial fit. It is =1 for Poisson and >1 for over-dispersed data.

option	limit	actual
Number of alignment pairs examined for constructing junction candidates	≤ 100000	4916
Coverage evenness (position-hash) score of junction candidates	≥ 2	≥ 2
Test this many junction candidates (n). May be smaller if not enough passed the coverage evenness threshold	100 ≤ n ≤ 5000	207
Total length of all junction candidates (factor times the reference genome length)	≤ 0.1	0.022

reference sequence	pr(no read start)
NC_000913	0.84884

pr(no read start) is the probability that there will not be an aligned read whose first base matches a given position on a given strand.

option	value
Coverage evenness (position-hash) score of predicted junctions must be	≥ 3
Skew score of predicted junction (−log10 probability of unusual coverage evenness) must be	≤ 0
Number of bases that at least one read must overlap each uniquely aligned side of a predicted junction	≥ 6

option	value
Mode	Full Polymorphism
Ploidy	1 (haploid)
Consensus mutation E-value cutoff	10
Consensus frequency cutoff	OFF
Consensus minimum variant coverage each strand	OFF
Consensus minimum total coverage each strand	OFF
Consensus minimum variant coverage	OFF
Consensus minimum total coverage	OFF
Polymorphism E-value cutoff	2
Polymorphism frequency cutoff	0.025
Polymorphism minimum variant coverage each strand	2
Polymorphism minimum total coverage each strand	OFF
Polymorphism minimum variant coverage	OFF
Polymorphism minimum total coverage	OFF
Polymorphism bias cutoff	OFF
Predict indel polymorphisms	YES
Skip indel polymorphisms in homopolymers runs of	≥3 bases
Skip base substitutions when they create a homopolymer flanked on each side by	≥5 bases

program	version
bowtie2	2.2.8
R	3.3.1

step	start	end	elapsed
Read and reference sequence file input	09:20:08 19 Mar 2018	09:20:43 19 Mar 2018	35 seconds
Read alignment to reference genome	09:20:43 19 Mar 2018	09:23:21 19 Mar 2018	2 minutes 38 seconds
Preprocessing alignments for candidate junction identification	09:23:21 19 Mar 2018	09:24:04 19 Mar 2018	43 seconds
Preliminary analysis of coverage distribution	09:24:04 19 Mar 2018	09:25:40 19 Mar 2018	1 minute 36 seconds
Identifying junction candidates	09:25:40 19 Mar 2018	09:25:43 19 Mar 2018	3 seconds
Re-alignment to junction candidates	09:25:43 19 Mar 2018	09:26:14 19 Mar 2018	31 seconds
Resolving best read alignments	09:26:14 19 Mar 2018	09:27:26 19 Mar 2018	1 minute 12 seconds
Creating BAM files	09:27:26 19 Mar 2018	09:28:45 19 Mar 2018	1 minute 19 seconds
Tabulating error counts	09:28:45 19 Mar 2018	09:29:31 19 Mar 2018	46 seconds
Re-calibrating base error rates	09:29:31 19 Mar 2018	09:29:32 19 Mar 2018	1 second
Examining read alignment evidence	09:29:32 19 Mar 2018	10:29:02 19 Mar 2018	59 minutes 30 seconds
Polymorphism statistics	10:29:02 19 Mar 2018	10:29:03 19 Mar 2018	1 second
Output	10:29:03 19 Mar 2018	10:29:31 19 Mar 2018	28 seconds
Total			1 hour 9 minutes 23 seconds