[Genome] how is knowntoEnsembl generated?

Hiram Clawson hiram at soe.ucsc.edu
Wed Jun 6 10:45:22 PDT 2007


Good Morning Jean:

You can always find the sequence of building any table in the genome
browser in our doc files in the source tree:
	src/hg/makeDb/doc/*.txt

The knownToEnsembl is created via the program hgMapToGene:

hgMapToGene - Map a track to a genePred track.
usage:
    hgMapToGene database track geneTrack newMapTable
where:
    database - is a browser database (ex. hg16)
    track - the track to map to the genePred track
            This must be bed12 psl or genePred itself
    geneTrack - a genePred format track
    newMapTable - a new table to create with two columns
            <geneTrackId><trackId>
For each gene in geneTrack this will select the most
overlapping item in track to associate

And for the knownToEnsemble, typical usage is:
	hgMapToGene hg18 ensGene knownGene knownToEnsembl

--Hiram

Jean Chang wrote:
> Hi,
> 
> I'm not able to find anywhere how knowntoEnsembl is generated...
> 
> I found this:
>> The hg18 (March 2006/NCBI Build 36.1) knownToEnsembl table was  
>> built in May, 2006 using data downloaded from Ensembl BioMart at  
>> that time.  The hg17 (May 2004/NCBI Build 35.1) knownToEnsembl was  
>> built in May, 2005. We did not log specific Ensembl versions.
> 
> But what I'd really like to know is how you create the mapping of  
> genes using the Ensembl BioMart - or if you could point me to how  
> Ensembl says they do it. I took a look at the Biomart and couldn't  
> figure out where you would get those mappings but what I'm looking  
> for is the methods to those mappings.
> 
> Thanks,
> 
> Jean


More information about the Genome mailing list