-
Notifications
You must be signed in to change notification settings - Fork 6
Open
Description
Hi !
I am trying tu use msCENTIPEDE to generate TF footprints for several ATAC-seq data sets i've produced.
I wanted to have general advices on how to achieve the most accurate workflow.
So far, what I am doing is the following :
- Call ATAC-peaks with MACS2
- Scan the entire genome for a given PWM with STEME with a probability cut-off at 1/1000
- Select only motifs falling inside ATAC peaks
- Train msCENTIPEDE based on the top 10000 motifs falling inside ATAC peaks
- Infer on the whole motif falling inside ATAC peaks
Could you tell me please if I am on the right track ?
Or is my approach somehow biased ?
I was then wondering how to define a cut-off using the statistics reported by msCENTIPEDE to focus only on the most likely bound sites ?
Could you please provide in the README a short description of the output of msCENTIPEDE and how to use LogPosOdds, LogPriorOdds, MultLikeRatio and NegBinLikeRatio ?
Thanks a lot,
Cheers,
Pef
Metadata
Metadata
Assignees
Labels
No labels