[Genome] data inconsistence from table browser and mysql server

Kayla Smith kayla at soe.ucsc.edu
Thu Dec 13 14:45:40 PST 2007


Hello Tao,

The two databases you are accessing in your question are not necessarily 
kept in sync with each other.   When you use any of the genbank tables, 
you should stay within one instance of a database.

To illustrate that the author is consistent between the two methods you 
mentioned below, let me paste some queries:

 From the mysql server:

mysql> select id, name from author where id = 173413;
+--------+-----------------------------------------------------------------------------------------------------------------------------------+
| id     | name 
      |
+--------+-----------------------------------------------------------------------------------------------------------------------------------+
| 173413 | Tsuritani,K., Irie,T., Yamashita,R., Sakakibara,Y., 
Wakaguri,H., Kanai,A., Mizushima-Sugano,J., Sugano,S., Nakai,K. and 
Suzuki,Y. |
+--------+-----------------------------------------------------------------------------------------------------------------------------------+
1 row in set (0.11 sec)


And from the Table Browser:

Clade: Vertebrate, Genome: Human, Assembly: May 2004, grlup: All Tables, 
database: hg17, table: author

Paste list:  173412

#id	name	crc
173412	Tsuritani,K., Irie,T., Yamashita,R., Sakakibara,Y., Wakaguri,H., 
Kanai,A., Mizushima-Sugano,J., Sugano,S., Nakai,K. and Suzuki,Y.	2746902304

You can see from this that the data in the authors table agree.

I hope this is helpful for understand the data.  Please don't hesitate 
to contact us again if you require further assistance.

Kayla Smith
UCSC Genome Bioinformatics Group

Tao Peng wrote:
> Dear colleague,
> 
> I want to retrieve cDNA information from UCSC genome browser. To do this, I
> tried two methods. One is by table browser and the other is by querying
> directly with mysql server. But I found some results with these two methods
> are inconsistent.
> 
> For example, I query the EST BP870449 in both table browser and mysql
> server.
> 
> 1) By table browser:
> Clade: Vertebrate Genome: Human  assembly: May 2004
> group: mRNA and EST Tracks          track: Human ESTs
> table: gbCdnaInfo
> Paste list:  BP870449
> result:
> #id    acc    version    moddate    type    direction    source
> organism    library    mrnaClone    sex    tissue   development    cell
> cds    keyword    description    geneName    productName    author    gi
> mol
> 13582040    BP870449    1    2007-06-17    EST    5    293    292
> 18113     9788246    0    6731    0    866    0  42740    0    0    0
> 173412    50713749    mRNA
> 
> 2) By mysql server:            mysql --user=genome --host=
> genome-mysql.cse.ucsc.edu -A
> use hg17;
> select * from gbCdnaInfo where or acc='BP870449'
> result:
> +----------+----------+---------+------------+------+-----------+--------+----------+---------+-----------+-----+--------+-------------+------+-----+---------+-------------+----------+-------------+--------+----------+------+
> | id       | acc      | version | moddate    | type | direction | source |
> organism | library | mrnaClone | sex | tissue | development | cell | cds |
> keyword | description | geneName | productName | author | gi       | mol  |
> +----------+----------+---------+------------+------+-----------+--------+----------+---------+-----------+-----+--------+-------------+------+-----+---------+-------------+----------+-------------+--------+----------+------+
> | 13582040 | BP870449 |       1 | 2007-06-17 | EST  | 5         |    293
> |      292 |   18113 |   9788246 |   0 |   6731 |           0 |  866 |   0
> |   42740 |           0 |        0 |           0 | 173413 | 50713749 | mRNA
> |
> +----------+----------+---------+------------+------+-----------+--------+----------+---------+-----------+-----+--------+-------------+------+-----+---------+-------------+----------+-------------+--------+----------+------+
> 
> You can see that the 'author' field in table browser result is 173412, while
> in mysql server result, it's 173413.
> 
> I'm not sure whether there are other inconsistence. Could anyone tell my
> why?
> 
> Thank you very much!
> 
> 
> 



More information about the Genome mailing list