[Genome] Precomputed TFBS for mouse
Kayla Smith
kayla at soe.ucsc.edu
Thu Nov 15 15:38:35 PST 2007
Hello Diego,
One of our developers has recommended this method:
If his files have a name column with unique values (each name is used
for only one item), then the UNIX sort and join programs could do
this. The steps would be:
1. Make a sorted-by-name version of the hg17 file. The UNIX command
sort -k can do this. For example, in a BED file the name is in the
4th column so "sort -k4,4" would do the trick.
2. Similarly, make a sorted-by-name version of the hg18 file.
3. Run join -j <nameColumnNumber>, e.g. join -j 4 for BED, with -o and
an output format suitable for the input files (man join to see the
format and other options).
I hope this information is helpful to you. Please don't hesitate to
contact us again if you require further assistance.
Kayla Smith
UCSC Genome Bioinformatics Group
Diego Miranda wrote:
> Hello,
>
> I was using the downloadable liftOver facility under Linux--the program
> works perfect. However, since n input lines may map to n-m coordinates plus
> m mismatches, how do I reconstruct it? Is there a way I could make liftOver
> give me e.g.
>
> chromosome hg17_start hg17_end mapto:hg18_start mapto:hg18_end
>
> Many thanks,
>
> With best wishes from Cambridge,
>
> diego
>
> On Nov 8 2007, Kayla Smith wrote:
>
>> Hello Diego,
>>
>> We do not have any plans to create a TFBS track for the mouse. One
>> thing you might try is our liftOver utility
>> (http://genome.ucsc.edu/cgi-bin/hgLiftOver), to convert human TFBS
>> coordinates to mouse coordinates.
>>
>> The details page from the human TFBS track has detailed information on
>> the generation of this track, and a link to Biobase, which might be
>> useful to look into:
>> http://genome.ucsc.edu/cgi-bin/hgTrackUi?g=tfbsConsSites
>>
>> I hope this information is helpful to you. Please don't hesitate to
>> contact us again if you require further assistance.
>>
>> Kayla Smith
>> UCSC Genome Bioinformatics Group
>>
>>
>> Diego Miranda wrote:
>>> Hello,
>>>
>>> I am in need of the precomputed TFBS for the mouse genome (same format
>>> as they are provided for the human genome). I guess they are not
>>> available on the UCSC Genome Browser....would you have them somewhere
>>> for me, please?
>>>
>>> Thanks a lot for your help.
>>>
>>> With best wishes from Cambridge,
>>>
>>> diego
>>>
>>>
>>
>
More information about the Genome
mailing list