breseq version 0.35.4 revision f352f80f4bc9
mutation predictions | marginal predictions | summary statistics | genome diff | command line log |
read file | reads | bases | passed filters | average | longest | mapped | |
---|---|---|---|---|---|---|---|
errors | TF_KALE_crp_A4_F20_I1_R1_S40_L001_R1_001.good.fq | 329,159 | 42,522,022 | 100.0% | 129.2 bases | 137 bases | 97.5% |
errors | TF_KALE_crp_A4_F20_I1_R1_S40_L001_R2_001.good.fq | 329,159 | 42,521,764 | 100.0% | 129.2 bases | 137 bases | 95.7% |
errors | TF_KALE_crp_A4_F20_I1_R1_S40_L002_R1_001.good.fq | 327,171 | 42,261,809 | 100.0% | 129.2 bases | 137 bases | 98.3% |
errors | TF_KALE_crp_A4_F20_I1_R1_S40_L002_R2_001.good.fq | 327,171 | 42,262,740 | 100.0% | 129.2 bases | 137 bases | 97.0% |
errors | TF_KALE_crp_A4_F20_I1_R1_S40_L003_R1_001.good.fq | 328,411 | 42,358,641 | 100.0% | 129.0 bases | 137 bases | 98.3% |
errors | TF_KALE_crp_A4_F20_I1_R1_S40_L003_R2_001.good.fq | 328,411 | 42,358,408 | 100.0% | 129.0 bases | 137 bases | 97.2% |
errors | TF_KALE_crp_A4_F20_I1_R1_S40_L004_R1_001.good.fq | 337,539 | 43,501,539 | 100.0% | 128.9 bases | 137 bases | 98.5% |
errors | TF_KALE_crp_A4_F20_I1_R1_S40_L004_R2_001.good.fq | 337,539 | 43,502,219 | 100.0% | 128.9 bases | 137 bases | 97.7% |
total | 2,644,560 | 341,289,142 | 100.0% | 129.1 bases | 137 bases | 97.5% |
seq id | length | fit mean | fit dispersion | % mapped reads | description | ||
---|---|---|---|---|---|---|---|
coverage | distribution | NC_000913 | 4,641,652 | 71.3 | 3.7 | 100.0% | Escherichia coli str. K-12 substr. MG1655, complete genome. |
total | 4,641,652 | 100.0% |
fit dispersion is the ratio of the variance to the mean for the negative binomial fit. It is =1 for Poisson and >1 for over-dispersed data.
option | limit | actual |
---|---|---|
Number of alignment pairs examined for constructing junction candidates | ≤ 100000 | 1352 |
Coverage evenness (position-hash) score of junction candidates | ≥ 2 | ≥ 2 |
Test this many junction candidates (n). May be smaller if not enough passed the coverage evenness threshold | 100 ≤ n ≤ 5000 | 87 |
Total length of all junction candidates (factor times the reference genome length) | ≤ 0.1 | 0.005 |
reference sequence | pr(no read start) |
---|---|
NC_000913 | 0.83820 |
pr(no read start) is the probability that there will not be an aligned read whose first base matches a given position on a given strand.
option | value |
---|---|
Coverage evenness (position-hash) score of predicted junctions must be | ≥ 3 |
Minimum probablilty assigned that no mapped read will start at a given position and strand for junction prediction | 0.1 |
Junction allow suboptimal matches | FALSE |
Skew score of predicted junction (−log10 probability of unusual coverage evenness) must be | ≤ 3 |
Number of bases that at least one read must overlap each uniquely aligned side of a predicted junction | ≥ 1 |
option | value |
---|---|
Mode | Consensus/Mixed Base |
Ploidy | 1 (haploid) |
Consensus mutation E-value cutoff | 10 |
Consensus frequency cutoff | 0.75 |
Consensus minimum variant coverage each strand | OFF |
Consensus minimum total coverage each strand | OFF |
Consensus minimum variant coverage | OFF |
Consensus minimum total coverage | OFF |
Polymorphism E-value cutoff | 10 |
Polymorphism frequency cutoff | 0.2 |
Polymorphism minimum variant coverage each strand | OFF |
Polymorphism minimum total coverage each strand | OFF |
Polymorphism minimum variant coverage | OFF |
Polymorphism minimum total coverage | OFF |
Polymorphism bias cutoff | OFF |
Predict indel polymorphisms | YES |
Skip indel polymorphisms in homopolymers runs of | OFF |
Skip base substitutions when they create a homopolymer flanked on each side by | OFF |
program | version |
---|---|
bowtie2 | 2.2.6 |
R | 3.4.4 |
step | start | end | elapsed |
---|---|---|---|
Read and reference sequence file input | 21:22:47 22 Apr 2021 | 21:23:32 22 Apr 2021 | 45 seconds |
Read alignment to reference genome | 21:23:32 22 Apr 2021 | 21:28:52 22 Apr 2021 | 5 minutes 20 seconds |
Preprocessing alignments for candidate junction identification | 21:28:52 22 Apr 2021 | 21:29:23 22 Apr 2021 | 31 seconds |
Preliminary analysis of coverage distribution | 21:29:23 22 Apr 2021 | 21:30:53 22 Apr 2021 | 1 minute 30 seconds |
Identifying junction candidates | 21:30:53 22 Apr 2021 | 21:30:54 22 Apr 2021 | 1 second |
Re-alignment to junction candidates | 21:30:54 22 Apr 2021 | 21:32:10 22 Apr 2021 | 1 minute 16 seconds |
Resolving best read alignments | 21:32:10 22 Apr 2021 | 21:32:56 22 Apr 2021 | 46 seconds |
Creating BAM files | 21:32:56 22 Apr 2021 | 21:34:17 22 Apr 2021 | 1 minute 21 seconds |
Tabulating error counts | 21:34:17 22 Apr 2021 | 21:34:46 22 Apr 2021 | 29 seconds |
Re-calibrating base error rates | 21:34:46 22 Apr 2021 | 21:34:48 22 Apr 2021 | 2 seconds |
Examining read alignment evidence | 21:34:48 22 Apr 2021 | 21:41:01 22 Apr 2021 | 6 minutes 13 seconds |
Polymorphism statistics | 21:41:01 22 Apr 2021 | 21:41:02 22 Apr 2021 | 1 second |
Output | 21:41:02 22 Apr 2021 | 21:41:10 22 Apr 2021 | 8 seconds |
Total | 18 minutes 23 seconds |