BRESEQ :: Summary Statistics

breseq version 0.33.1 revision 8505477f25b3
mutation predictions | marginal predictions | summary statistics | genome diff | command line log

Read File Information

	read file	reads	bases	passed filters	average	longest	mapped
errors	Plate-2-BOP-1000-SUCB-ALE-1-flask-40_S42_L001_R1_001.good.fq	211,316	25,849,771	100.0%	122.3 bases	149 bases	94.4%
errors	Plate-2-BOP-1000-SUCB-ALE-1-flask-40_S42_L001_R2_001.good.fq	211,316	25,447,752	100.0%	120.4 bases	149 bases	97.9%
errors	Plate-2-BOP-1000-SUCB-ALE-1-flask-40_S42_L002_R1_001.good.fq	170,542	20,786,133	100.0%	121.9 bases	149 bases	93.9%
errors	Plate-2-BOP-1000-SUCB-ALE-1-flask-40_S42_L002_R2_001.good.fq	170,542	20,445,426	100.0%	119.9 bases	149 bases	97.4%
errors	Plate-2-BOP-1000-SUCB-ALE-1-flask-40_S42_L003_R1_001.good.fq	192,492	23,494,393	100.0%	122.1 bases	149 bases	94.1%
errors	Plate-2-BOP-1000-SUCB-ALE-1-flask-40_S42_L003_R2_001.good.fq	192,492	23,117,754	100.0%	120.1 bases	149 bases	97.7%
errors	Plate-2-BOP-1000-SUCB-ALE-1-flask-40_S42_L004_R1_001.good.fq	167,320	20,340,109	100.0%	121.6 bases	149 bases	93.9%
errors	Plate-2-BOP-1000-SUCB-ALE-1-flask-40_S42_L004_R2_001.good.fq	167,320	20,024,267	100.0%	119.7 bases	149 bases	97.3%
	total	1,483,340	179,505,605	100.0%	121.0 bases	149 bases	95.8%

		seq id	length	fit mean	fit dispersion	% mapped reads	description
coverage	distribution	NC_000913	4,641,652	36.6	3.4	100.0%	Escherichia coli str. K-12 substr. MG1655, complete genome.
		total	4,641,652			100.0%

fit dispersion is the ratio of the variance to the mean for the negative binomial fit. It is =1 for Poisson and >1 for over-dispersed data.

option	limit	actual
Number of alignment pairs examined for constructing junction candidates	≤ 100000	17839
Coverage evenness (position-hash) score of junction candidates	≥ 2	≥ 3
Test this many junction candidates (n). May be smaller if not enough passed the coverage evenness threshold	100 ≤ n ≤ 5000	98
Total length of all junction candidates (factor times the reference genome length)	≤ 0.1	0.006

reference sequence	pr(no read start)
NC_000913	0.86651

pr(no read start) is the probability that there will not be an aligned read whose first base matches a given position on a given strand.

option	value
Coverage evenness (position-hash) score of predicted junctions must be	≥ 3
Skew score of predicted junction (−log10 probability of unusual coverage evenness) must be	≤ 3
Number of bases that at least one read must overlap each uniquely aligned side of a predicted junction	≥ 1

option	value
Mode	Consensus/Mixed Base
Ploidy	1 (haploid)
Consensus mutation E-value cutoff	10
Consensus frequency cutoff	0.75
Consensus minimum variant coverage each strand	OFF
Consensus minimum total coverage each strand	OFF
Consensus minimum variant coverage	OFF
Consensus minimum total coverage	OFF
Polymorphism E-value cutoff	10
Polymorphism frequency cutoff	0.2
Polymorphism minimum variant coverage each strand	OFF
Polymorphism minimum total coverage each strand	OFF
Polymorphism minimum variant coverage	OFF
Polymorphism minimum total coverage	OFF
Polymorphism bias cutoff	OFF
Predict indel polymorphisms	YES
Skip indel polymorphisms in homopolymers runs of	OFF
Skip base substitutions when they create a homopolymer flanked on each side by	OFF

program	version
bowtie2	2.2.6
R	3.4.4

step	start	end	elapsed
Read and reference sequence file input	05:09:55 12 Sep 2020	05:10:17 12 Sep 2020	22 seconds
Read alignment to reference genome	05:10:17 12 Sep 2020	05:12:51 12 Sep 2020	2 minutes 34 seconds
Preprocessing alignments for candidate junction identification	05:12:51 12 Sep 2020	05:13:14 12 Sep 2020	23 seconds
Preliminary analysis of coverage distribution	05:13:14 12 Sep 2020	05:13:51 12 Sep 2020	37 seconds
Identifying junction candidates	05:13:51 12 Sep 2020	05:14:37 12 Sep 2020	46 seconds
Re-alignment to junction candidates	05:14:37 12 Sep 2020	05:15:24 12 Sep 2020	47 seconds
Resolving best read alignments	05:15:24 12 Sep 2020	05:15:56 12 Sep 2020	32 seconds
Creating BAM files	05:15:56 12 Sep 2020	05:16:30 12 Sep 2020	34 seconds
Tabulating error counts	05:16:30 12 Sep 2020	05:16:43 12 Sep 2020	13 seconds
Re-calibrating base error rates	05:16:43 12 Sep 2020	05:16:45 12 Sep 2020	2 seconds
Examining read alignment evidence	05:16:45 12 Sep 2020	05:19:34 12 Sep 2020	2 minutes 49 seconds
Polymorphism statistics	05:19:34 12 Sep 2020	05:19:34 12 Sep 2020	0 seconds
Output	05:19:34 12 Sep 2020	05:19:48 12 Sep 2020	14 seconds
Total			9 minutes 53 seconds