BRESEQ :: Summary Statistics

breseq version 0.27.1 revision 87c22d663cc3
mutation predictions | marginal predictions | summary statistics | genome diff | command line log

Read File Information

	read file	reads	bases	passed filters	average	longest	mapped
errors	ALE-IL30_S38_L004_R2_001	1,769,201	266,320,346	99.7%	150.5 bases	151 bases	97.7%
errors	ALE-IL30_S38_L003_R1_001	1,784,088	268,561,662	99.6%	150.5 bases	151 bases	98.9%
errors	ALE-IL30_S38_L001_R2_001	1,769,618	266,355,348	99.7%	150.5 bases	151 bases	97.3%
errors	ALE-IL30_S38_L002_R1_001	1,752,007	263,725,525	99.3%	150.5 bases	151 bases	99.2%
errors	ALE-IL30_S38_L001_R1_001	1,762,611	265,310,643	99.3%	150.5 bases	151 bases	99.2%
errors	ALE-IL30_S38_L003_R2_001	1,787,871	269,103,897	99.8%	150.5 bases	151 bases	97.0%
errors	ALE-IL30_S38_L004_R1_001	1,766,267	265,899,592	99.6%	150.5 bases	151 bases	99.1%
errors	ALE-IL30_S38_L002_R2_001	1,758,461	264,689,988	99.7%	150.5 bases	151 bases	97.4%
	total	14,150,124	2,129,967,001	99.6%	150.5 bases	151 bases	98.2%

		seq id	length	fit mean	fit dispersion	% mapped reads	description
coverage	distribution	NC_000913	4,641,652	449.7	5.3	100.0%	Escherichia coli str. K-12 substr. MG1655, complete genome.
		total	4,641,652			100.0%

fit dispersion is the ratio of the variance to the mean for the negative binomial fit. It is =1 for Poisson and >1 for over-dispersed data.

option	limit	actual
Number of alignment pairs examined for constructing junction candidates	≤ 100000	10802
Coverage evenness (position-hash) score of junction candidates	≥ 2	≥ 2
Test this many junction candidates (n). May be smaller if not enough passed the coverage evenness threshold	100 ≤ n ≤ 5000	272
Total length of all junction candidates (factor times the reference genome length)	≤ 0.1	0.018

reference sequence	pr(no read start)
NC_000913	0.32231

pr(no read start) is the probability that there will not be an aligned read whose first base matches a given position on a given strand.

option	value
Coverage evenness (position-hash) score of predicted junctions must be	≥ 3
Skew score of predicted junction (−log10 probability of unusual coverage evenness) must be	≤ 3
Number of bases that at least one read must overlap each uniquely aligned side of a predicted junction	≥ 1

option	value
Mode	Consensus/Mixed Base
Ploidy	1 (haploid)
Consensus mutation E-value cutoff	10
Consensus frequency cutoff	0.8
Consensus minimum coverage each strand	OFF
Polymorphism E-value cutoff	10
Polymorphism frequency cutoff	0.2
Polymorphism minimum coverage each strand	OFF
Polymorphism bias cutoff	OFF
Predict indel polymorphisms	YES
Skip indel polymorphisms in homopolymers runs of	OFF
Skip base substitutions when they create a homopolymer flanked on each side by	OFF

step	start	end	elapsed
Read and reference sequence file input	19:22:24 21 Apr 2016	19:28:08 21 Apr 2016	5 minutes 44 seconds
Read alignment to reference genome	19:28:08 21 Apr 2016	19:42:53 21 Apr 2016	14 minutes 45 seconds
Preprocessing alignments for candidate junction identification	19:42:53 21 Apr 2016	19:50:57 21 Apr 2016	8 minutes 4 seconds
Preliminary analysis of coverage distribution	19:50:57 21 Apr 2016	20:12:13 21 Apr 2016	21 minutes 16 seconds
Identifying junction candidates	20:12:13 21 Apr 2016	20:12:16 21 Apr 2016	3 seconds
Re-alignment to junction candidates	20:12:16 21 Apr 2016	20:16:18 21 Apr 2016	4 minutes 2 seconds
Resolving alignments with junction candidates	20:16:18 21 Apr 2016	20:37:53 21 Apr 2016	21 minutes 35 seconds
Creating BAM files	20:37:53 21 Apr 2016	20:51:17 21 Apr 2016	13 minutes 24 seconds
Tabulating error counts	20:51:18 21 Apr 2016	21:03:12 21 Apr 2016	11 minutes 54 seconds
Re-calibrating base error rates	21:03:12 21 Apr 2016	21:03:17 21 Apr 2016	5 seconds
Examining read alignment evidence	21:03:17 21 Apr 2016	22:46:49 21 Apr 2016	1 hour 43 minutes 32 seconds
Polymorphism statistics	22:46:49 21 Apr 2016	22:46:52 21 Apr 2016	3 seconds
Output	22:46:52 21 Apr 2016	22:49:20 21 Apr 2016	2 minutes 28 seconds
Total			3 hours 26 minutes 55 seconds