[Genome] Gene symbols with spaces in kgxref table

Robert Kuhn kuhn at soe.ucsc.edu
Thu Jun 7 12:46:11 PDT 2007


Greg,

I'm afraid the issue you report is the result of our getting
our geneSymbol from the GenBank record.  The only suggestion
I can make is that you will need to do some post-processing
to combine information from similar names.  
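
As a rough sketch of that kind of post-processing (not something our
pipeline does), you could normalize each symbol before grouping.  The
example below assumes you have already pulled (kgID, geneSymbol) pairs
out of kgXref; the helper names, the sample IDs, and the rule itself
(drop all whitespace, uppercase) are assumptions for illustration, and
a rule this blunt could merge symbols that are genuinely different, so
check the results by hand.

```python
import re
from collections import defaultdict


def normalize_symbol(symbol):
    """Build a grouping key: remove all whitespace and uppercase.

    Assumed rule for illustration only; it treats "SEMA 3B" and
    "SEMA3B" as the same gene.
    """
    return re.sub(r"\s+", "", symbol).upper()


def group_by_symbol(rows):
    """Group (kgID, geneSymbol) pairs by their normalized symbol."""
    groups = defaultdict(list)
    for kg_id, symbol in rows:
        groups[normalize_symbol(symbol)].append(kg_id)
    return dict(groups)


# Hypothetical rows; the IDs are placeholders, not real kgXref entries.
rows = [("id1", "SEMA3B"), ("id2", "SEMA 3B"), ("id3", "TP53")]
grouped = group_by_symbol(rows)
```

With the placeholder rows above, both SEMA3B spellings end up under
one key, which is what a "group by" on the symbol needs.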

You could also contact the contributor of the original information
and suggest correcting the GenBank record.  Because our data are
generated by an automated pipeline, we are not in a position to
curate this kind of inconsistency.

best wishes,

			--b0b kuhn
			ucsc genome bioinformatics group


> From genome-bounces at soe.ucsc.edu  Wed Jun  6 15:50:06 2007
> To: Genome at soe.ucsc.edu
> Subject: [Genome] Gene symbols with spaces in kgxref table
> 
> Hello Genome Browser folks,
> 
> Great job with the new UCSC gene annotations, but I've noticed that
> some genes have inconsistent gene symbols in the kgXref table. For
> example, the gene SEMA3B is called "SEMA 3B" in a couple of entries.
> This makes it difficult to perform "group by" queries on the gene
> symbol. Is there some logic to this inconsistent naming, or a
> workaround that you could suggest?
> 
> Thanks,
> 
> Greg Singer
> _______________________________________________
> Genome maillist  -  Genome at soe.ucsc.edu
> http://www.soe.ucsc.edu/mailman/listinfo/genome
> 

