SeqMonk Vistory

Project Summary

Basic Project Info

ParameterValue
Project Namepluripotency_exit_input_comparison.smk
Vistory Namepluripotency_exit_input_comparison.smv
GenomeMus musculus GRCm38_v95
SeqMonk Version1.46.0

Data Sets

NameFile NameReadsImport Options
DNMT_TKOInput_1_10KO2i1/bi/seqfac/seqfac/200225_Aled_ChIP_Globus/Aligned/Project_Sanger/Sample_lane1/lane1_DNMT_TKOInput_1_10KO2i1_TAGTCTTG_L001_3536STDY8736207_GRCm38_bowtie2.bam28931827Library type Paired End Dedup=No MAPQ>=20 Primary alignments only.
DNMT_TKOInput_1_10KO2i2/bi/seqfac/seqfac/200225_Aled_ChIP_Globus/Aligned/Project_Sanger/Sample_lane1/lane1_DNMT_TKOInput_1_10KO2i2_TTCCATTG_L001_3536STDY8736208_GRCm38_bowtie2.bam21271478Library type Paired End Dedup=No MAPQ>=20 Primary alignments only.
DNMT_TKOInput_1_10KO2i3/bi/seqfac/seqfac/200225_Aled_ChIP_Globus/Aligned/Project_Sanger/Sample_lane1/lane1_DNMT_TKOInput_1_10KO2i3_TCGAAGTG_L001_3536STDY8736209_GRCm38_bowtie2.bam12312860Library type Paired End Dedup=No MAPQ>=20 Primary alignments only.
DNMT_TKOInput_1_10KOEpiLC1/bi/seqfac/seqfac/200225_Aled_ChIP_Globus/Aligned/Project_Sanger/Sample_lane1/lane1_DNMT_TKOInput_1_10KOEpiLC1_TAACGCTG_L001_3536STDY8736210_GRCm38_bowtie2.bam19880874Library type Paired End Dedup=No MAPQ>=20 Primary alignments only.
DNMT_TKOInput_1_10KOEpiLC2/bi/seqfac/seqfac/200225_Aled_ChIP_Globus/Aligned/Project_Sanger/Sample_lane1/lane1_DNMT_TKOInput_1_10KOEpiLC2_TTGGTATG_L001_3536STDY8736211_GRCm38_bowtie2.bam20670158Library type Paired End Dedup=No MAPQ>=20 Primary alignments only.
DNMT_TKOInput_1_10KOEpiLC3/bi/seqfac/seqfac/200225_Aled_ChIP_Globus/Aligned/Project_Sanger/Sample_lane1/lane1_DNMT_TKOInput_1_10KOEpiLC3_TGAACTGG_L001_3536STDY8736212_GRCm38_bowtie2.bam25195174Library type Paired End Dedup=No MAPQ>=20 Primary alignments only.
DNMT_TKOInput_1_10WT2i1/bi/seqfac/seqfac/200225_Aled_ChIP_Globus/Aligned/Project_Sanger/Sample_lane1/lane1_DNMT_TKOInput_1_10WT2i1_TGCGATCT_L001_3536STDY8736201_GRCm38_bowtie2.bam28657679Library type Paired End Dedup=No MAPQ>=20 Primary alignments only.
DNMT_TKOInput_1_10WT2i2/bi/seqfac/seqfac/200225_Aled_ChIP_Globus/Aligned/Project_Sanger/Sample_lane1/lane1_DNMT_TKOInput_1_10WT2i2_TTCCTGCT_L001_3536STDY8736202_GRCm38_bowtie2.bam23097758Library type Paired End Dedup=No MAPQ>=20 Primary alignments only.
DNMT_TKOInput_1_10WT2i3/bi/seqfac/seqfac/200225_Aled_ChIP_Globus/Aligned/Project_Sanger/Sample_lane1/lane1_DNMT_TKOInput_1_10WT2i3_TAGTGACT_L001_3536STDY8736203_GRCm38_bowtie2.bam20889522Library type Paired End Dedup=No MAPQ>=20 Primary alignments only.
DNMT_TKOInput_1_10WTEpiLC1/bi/seqfac/seqfac/200225_Aled_ChIP_Globus/Aligned/Project_Sanger/Sample_lane1/lane1_DNMT_TKOInput_1_10WTEpiLC1_TACAGGAT_L001_3536STDY8736204_GRCm38_bowtie2.bam28320642Library type Paired End Dedup=No MAPQ>=20 Primary alignments only.
DNMT_TKOInput_1_10WTEpiLC2/bi/seqfac/seqfac/200225_Aled_ChIP_Globus/Aligned/Project_Sanger/Sample_lane1/lane1_DNMT_TKOInput_1_10WTEpiLC2_TCCTCAAT_L001_3536STDY8736205_GRCm38_bowtie2.bam22496023Library type Paired End Dedup=No MAPQ>=20 Primary alignments only.
DNMT_TKOInput_1_10WTEpiLC3/bi/seqfac/seqfac/200225_Aled_ChIP_Globus/Aligned/Project_Sanger/Sample_lane1/lane1_DNMT_TKOInput_1_10WTEpiLC3_TGTGGTTG_L001_3536STDY8736206_GRCm38_bowtie2.bam20464019Library type Paired End Dedup=No MAPQ>=20 Primary alignments only.
TET_KO_2i_Input_(from_K27Ac)_rep1_valid_regionsReimported from TET_KO_2i_Input_(from_K27Ac)_rep1_sensible_regions21475967Visible stores parser Filter by feature Excluding excluded regions 2kb Remove DuplicatesNo
TET_KO_2i_Input_(from_K27Ac)_rep2_valid_regionsReimported from TET_KO_2i_Input_(from_K27Ac)_rep2_sensible_regions25888215Visible stores parser Filter by feature Excluding excluded regions 2kb Remove DuplicatesNo
TET_KO_2i_Input_(from_K27Ac)_rep3_valid_regionsReimported from TET_KO_2i_Input_(from_K27Ac)_rep3_sensible_regions23808429Visible stores parser Filter by feature Excluding excluded regions 2kb Remove DuplicatesNo
TET_KO_EpiLC_Input_(from_K27Ac)_rep1_valid_regionsReimported from TET_KO_EpiLC_Input_(from_K27Ac)_rep1_sensible_regions28776866Visible stores parser Filter by feature Excluding excluded regions 2kb Remove DuplicatesNo
TET_KO_EpiLC_Input_(from_K27Ac)_rep2_valid_regionsReimported from TET_KO_EpiLC_Input_(from_K27Ac)_rep2_sensible_regions30189166Visible stores parser Filter by feature Excluding excluded regions 2kb Remove DuplicatesNo
TET_KO_EpiLC_Input_(from_K27Ac)_rep3_valid_regionsReimported from TET_KO_EpiLC_Input_(from_K27Ac)_rep3_sensible_regions27676372Visible stores parser Filter by feature Excluding excluded regions 2kb Remove DuplicatesNo
TET_WT_2i_Input_(from_K27Ac)_rep1_valid_regionsReimported from TET_WT_2i_Input_(from_K27Ac)_rep1_sensible_regions35912967Visible stores parser Filter by feature Excluding excluded regions 2kb Remove DuplicatesNo
TET_WT_2i_Input_(from_K27Ac)_rep2_valid_regionsReimported from TET_WT_2i_Input_(from_K27Ac)_rep2_sensible_regions32417426Visible stores parser Filter by feature Excluding excluded regions 2kb Remove DuplicatesNo
TET_WT_2i_Input_(from_K27Ac)_rep3_valid_regionsReimported from TET_WT_2i_Input_(from_K27Ac)_rep3_sensible_regions33773024Visible stores parser Filter by feature Excluding excluded regions 2kb Remove DuplicatesNo
TET_WT_EpiLC_Input_(from_K27Ac)_rep1_valid_regionsReimported from TET_WT_EpiLC_Input_(from_K27Ac)_rep1_sensible_regions35148305Visible stores parser Filter by feature Excluding excluded regions 2kb Remove DuplicatesNo
TET_WT_EpiLC_Input_(from_K27Ac)_rep2_valid_regionsReimported from TET_WT_EpiLC_Input_(from_K27Ac)_rep2_sensible_regions29526185Visible stores parser Filter by feature Excluding excluded regions 2kb Remove DuplicatesNo
TET_WT_EpiLC_Input_(from_K27Ac)_rep3_valid_regionsReimported from TET_WT_EpiLC_Input_(from_K27Ac)_rep3_sensible_regions28687101Visible stores parser Filter by feature Excluding excluded regions 2kb Remove DuplicatesNo

Replicate Sets

DNMT_TKOInput_1_10KO2iDNMT_TKOInput_1_10KOEpiLCDNMT_TKOInput_1_10WT2iDNMT_TKOInput_1_10WTEpiLCTET_KO_2i_InputTET_KO_EpiLC_InputTET_WT_2i_InputTET_WT_EpiLC_Input
DNMT_TKOInput_1_10KO2i1DNMT_TKOInput_1_10KOEpiLC1DNMT_TKOInput_1_10WT2i1DNMT_TKOInput_1_10WTEpiLC1TET_KO_2i_Input_(from_K27Ac)_rep1_valid_regionsTET_KO_EpiLC_Input_(from_K27Ac)_rep1_valid_regionsTET_WT_2i_Input_(from_K27Ac)_rep1_valid_regionsTET_WT_EpiLC_Input_(from_K27Ac)_rep1_valid_regions
DNMT_TKOInput_1_10KO2i2DNMT_TKOInput_1_10KOEpiLC2DNMT_TKOInput_1_10WT2i2DNMT_TKOInput_1_10WTEpiLC2TET_KO_2i_Input_(from_K27Ac)_rep2_valid_regionsTET_KO_EpiLC_Input_(from_K27Ac)_rep2_valid_regionsTET_WT_2i_Input_(from_K27Ac)_rep2_valid_regionsTET_WT_EpiLC_Input_(from_K27Ac)_rep2_valid_regions
DNMT_TKOInput_1_10KO2i3DNMT_TKOInput_1_10KOEpiLC3DNMT_TKOInput_1_10WT2i3DNMT_TKOInput_1_10WTEpiLC3TET_KO_2i_Input_(from_K27Ac)_rep3_valid_regionsTET_KO_EpiLC_Input_(from_K27Ac)_rep3_valid_regionsTET_WT_2i_Input_(from_K27Ac)_rep3_valid_regionsTET_WT_EpiLC_Input_(from_K27Ac)_rep3_valid_regions

Quantitation

NameParentProbesDescriptionComments
2 kb tiles1362779Running windows of size 2000 with step size 2000. Quantitated with Read Count Quantitation using All Reads correcting for total count per million reads log transformed

This is to look at a) coverage outliers and b) at regions which are genetically different between the strains and therefore should be taken out of the analysis. We already know that for some reason the TET WT and the TET TKO don't have the same genetic background. After much deliberation I came up with a set of excluded regions for this comparsion. I'll see what those regions look like and if they include the coverage outliers for the DNMT samples.

Importing file excluded_regions_191112.txt as annotation 'excluded regions TKO'.

These include coverage outliers. Also DNMT TKO and WT generally have very similar coverage (as have EpiLC and 2i) which is good!

New Probe List

Not Overlapping excluded regions TKO (1231453 probes):[Feature Filter] Filter probes in 2 kb tiles on regions based on excluded regions TKO Over feature relationship is Not Overlapping

Unfortunately, not all coverage outliers were excluded, so need an additional filter. Let's to > 2

New Probe List

Value above 2 in any (146 probes):[Probe Values Filter] Filter on probes in 2 kb tiles where at least 1 of DNMT_TKOInput_1_10KO2i , DNMT_TKOInput_1_10KOEpiLC , DNMT_TKOInput_1_10WT2i , DNMT_TKOInput_1_10WTEpiLC , TET_KO_2i_Input , TET_KO_EpiLC_Input , TET_WT_2i_Input , TET_WT_EpiLC_Input had a value above 2.0. Quantitation was Read Count Quantitation using All Reads correcting for total count per million reads log transformed

New Probe List

excluded_regions_all_input_200425 (131335 probes):[Combine Filter] Overlapping excluded regions TKO OR Value above 2 in any

New Probe Set

excluded_regions_all_input_200425 (131335 probes):Running windows of size 2000 with step size 2000. Quantitated with Read Count Quantitation using All Reads correcting for total count per million reads log transformed then took subset of probes in excluded_regions_all_input_200425

Probes Quantitated

Fixed Value Quantitation using a value of 1

New Probe Set

excluded_regions_all_input_200425 (15427 probes):Deduplicated probes from excluded_regions_all_input_200425 created by Running windows of size 2000 with step size 2000. Quantitated with Read Count Quantitation using All Reads correcting for total count per million reads log transformed then took subset of probes in excluded_regions_all_input_200425. Quantitated with Fixed Value Quantitation using a value of 1Max Distance was 1 strands were merged separately

Probes Quantitated

Fixed Value Quantitation using a value of 1

This exercise was to merge adjacent exlcuded regions, then converted to an annotation track called excluded_regions_all_input_200425 and exported as probe report

New Probe Set:2 kb tiles (1362779 probes)

Running windows of size 2000 with step size 2000

Probes Quantitated

Read Count Quantitation using All Reads correcting for total count per million reads log transformed

New Probe List: Not Overlapping excluded_regions_all_input_200425 (1231444 probes)

[Feature Filter] Filter probes in 2 kb tiles on regions based on excluded_regions_all_input_200425 Over feature relationship is Not Overlapping

Actually, I should also exclude the regions that are very different between DNMT and TET cell lines

How about I go for differences of > 2

New Probe List: Difference above 2 (1751 probes)

[Differences Filter] Filter on probes in Not Overlapping excluded_regions_all_input_200425 where average difference when comparing DNMT_TKOInput_1_10KO2i to TET_WT_EpiLC_Input had a value above 2.0. Quantitation was Read Count Quantitation using All Reads correcting for total count per million reads log transformed

New Probe List: Difference below -2 (639 probes)

[Differences Filter] Filter on probes in Not Overlapping excluded_regions_all_input_200425 where average difference when comparing DNMT_TKOInput_1_10KO2i to TET_WT_EpiLC_Input had a value below -2.0. Quantitation was Read Count Quantitation using All Reads correcting for total count per million reads log transformed

adding those to the annotation and then merging

New Probe Set:All Probes (143901 probes)

Feature generator using excluded_regions_all_input_200425 duplicates removed Over feature from 0-0

Probes Quantitated

Fixed Value Quantitation using a value of 1

New Probe Set:excluded_regions_all_input_200425 (16460 probes)

Deduplicated probes from All Probes created by Feature generator using excluded_regions_all_input_200425 duplicates removed Over feature from 0-0. Quantitated with Fixed Value Quantitation using a value of 1Max Distance was 1 strands were merged separately

Probes Quantitated

Fixed Value Quantitation using a value of 1

saved as new annotation track and exported a probe report