[Genome] About snp126

Rachel Harte hartera at soe.ucsc.edu
Mon Jan 22 21:32:41 PST 2007


Hello,

The easiest way to do this is to use our public mySQL server to query our
database tables directly - see this page for help on doing this:

http://genome.ucsc.edu/FAQ/FAQdownloads#download29

There are a large number of SNPs in the snp126 table:
mysql> select count(*) from snp126;
+----------+
| count(*) |
+----------+
| 12351941 |
+----------+

Therefore you may want to break the query down by class. Most SNPs are
single base substitutions with a standard format:
mysql>select count(*) from snp126 where class = 'single';
+----------+
| count(*) |
+----------+
| 10090600 |
+----------+

This query will count the number of instances of each observed
substitution for the "single" class:

mysql>select observed, count(observed) from snp126 where class = "single"
group by observed;

For insertions, you may wish to limit the size of the insertion so that
you do not get too many rows of data returned:
e.g.
mysql> select distinct(observed) from snp126 where class = 'insertion' and
length(observed) < 5;

This is the most basic characterization of the snp126 data:
mysql> select count(*), class from snp126 group by class;
+----------+----------------+
| count(*) | class          |
+----------+----------------+
| 10090600 | single         |
|     5596 | in-del         |
|        4 | het            |
|     4321 | microsatellite |
|     6486 | named          |
|    51631 | mixed          |
|    16809 | mnp            |
|  1393040 | insertion      |
|   783454 | deletion       |
+----------+----------------+

I hope that this helps you. Please let us know if you have further
questions.

Rachel

Rachel Harte
UCSC Genome Bioinformatics Group
http://genome.ucsc.edu


On Tue, 23 Jan 2007, pauhsi wrote:

> How do I know the frequency or amount of every "observed" variation in snp126?
>


More information about the Genome mailing list