[Genome] problem on mouse genome for promoter region prediction
Brooke Rhead
rhead at soe.ucsc.edu
Wed Nov 21 16:02:21 PST 2007
Hello Xu Jerry,
Here is a similar previously-answered question:
http://www.soe.ucsc.edu/pipermail/genome/2007-November/014963.html
(This is specific to TFBS, but you could also use the liftOver utility
for the Eponine TSS or FirstEF tracks. However, please see my note
below about using liftOver -- you might want to use pslMap instead.)
Alternatively, you could intersect one of the human promoter tracks with
the human Conservation track to get the corresponding mouse regions, as
described in this answer:
http://www.soe.ucsc.edu/pipermail/genome/2007-June/013989.html
(Note that multiz17way has been replaced by multiz28way in hg18. Also,
there is another tool run by Penn State University called Galaxy that is
very useful for working with MAF data, located here:
http://main.g2.bx.psu.edu/ -- look under the "Fetch Alignments" tool on
the left-hand side of the page.)
In either case, you could make a custom track in the mouse browser and
intersect it with the mouse regions you found.
Another option would be to go in the opposite direction: use your
compiled list of mouse coordinates to create a custom track in the human
browser (either using liftOver, pslMap or the Conservation track to
convert to human coordinates), then intersect that with one of the
promoter tracks on the human browser.
NOTE about using liftOver:
liftOver is fairly coarse (just maps start and end), which makes it more
useful for same-species mapping than cross-species mapping. The program
pslMap is much more detailed (it breaks the mapping down to the
chainLink/gapless-block level), so it is better than liftOver for
mapping gene-sized regions.
The pslMap program is available in our source tree, which is free for
academic, nonprofit and personal use:
http://hgdownload.cse.ucsc.edu/downloads.html#source_downloads
Typical pslMap options for running with one of our .over.chain files
(for example, converting mm8 to hg18 coordinates) are:
pslMap -chainMapFile -swapMap mm8.input.psl mm8ToHg18.over.chain.gz
hg18.output.psl
Note that the input to pslMap needs to be in PSL format, described here:
http://genome.ucsc.edu/FAQ/FAQformat#format2 . If you already have
coordinates in BED format, our program genePredToPsl -bedFormat can be used.
The output of pslMap can be uploaded as a custom track, and/or
translated back into BED format using the program pslToBed.
I hope this information is helpful. If you have further questions,
please feel free to contact us again at the genome mailing list address.
--
Brooke Rhead
UCSC Genome Bioinformatics Group
xu_jianzhen wrote:
> hi,genome
> I am a member of GIBH,Chinese Academy of Science. I have compiled a lists of mouse (mm8) genome coordinates.I'd like to
> use some predicton tools to search those regions to see whether they are promoter regions(or there are transcription start sites among them).
> In human genome, there are tracks of pre-computed predicted promoter region(human Mar.2006,for example:Eponine TSS or FirstEF),But
> I can find one for mouse genome.Could you help me to solve this problem?
>
>
>
>
> Xu Jerry
> GIBH,CAS
>
>
> _______________________________________________
> Genome maillist - Genome at soe.ucsc.edu
> http://www.soe.ucsc.edu/mailman/listinfo/genome
More information about the Genome
mailing list