[Genome] : how is knowntoEnsembl generated?
Jean Chang
jlchang at broad.mit.edu
Wed Jun 6 06:15:10 PDT 2007
> Hi,
>
> I'm not able to find anywhere how knowntoEnsembl is generated...
>
> I found this:
>> The hg18 (March 2006/NCBI Build 36.1) knownToEnsembl table was
>> built in May, 2006 using data downloaded from Ensembl BioMart at
>> that time. The hg17 (May 2004/NCBI Build 35.1) knownToEnsembl was
>> built in May, 2005. We did not log specific Ensembl versions.
>
> But what I'd really like to know is how you create the mapping of
> genes using the Ensembl BioMart - or if you could point me to how
> Ensembl says they do it. I took a look at the Biomart and couldn't
> figure out where you would get those mappings but what I'm looking
> for is the methods to those mappings.
I tried contacting Biomart about my question above and got the
following response:
> Apologies, but you will have to ask UCSC that question. BioMart is a
> tool that extracts data from databases such as Ensembl. With it, you
> could convert Ensembl gene IDs to IDs in UniProt, Refseq, etc.
> However
> we don't have a UCSC 'known gene' set. I would guess they convert
> Ensembl IDs to something else, such as UniProt IDs, then convert those
> to UCSC known genes, but you would have to ask them to be sure. We
> are
> not doing the mapping on this end.
Our group is very interested in mirroring the UCSC browser as an
internal reference source but we need to know more about the mapping
of knowntoEnsembl and the mapping methodology of a few of the other
cross-mapped data sets (Affy probe sets, in particular)
Thanks,
Jean
More information about the Genome
mailing list