[Genome] question about data from Human Genome Annotation
Hiram Clawson
hiram at soe.ucsc.edu
Thu Oct 25 10:38:40 PDT 2007
Good Morning Miguel:
There are several methods you can use to extract this information
from the UCSC Genome browser.
1. From the chr*_rmsk.txt.gz files from the database:
http://hgdownload.cse.ucsc.edu/goldenPath/hg18/database/chr*_rmsk.txt.gz
or: ftp://hgdownload.cse.ucsc.edu/goldenPath/hg18/database/chr*_rmsk.txt.gz
2. From the "Tables" browser with a filter "Alu%" on the repName or repFamily columns
3. Directly from the MySQL server with a shell script, for example:
mysql --user=genome --host=genome-mysql.cse.ucsc.edu -A \
-N -e "SELECT chrom FROM chromInfo;" hg18 | while read CHR
do
mysql --user=genome --host=genome-mysql.cse.ucsc.edu -A \
-N -e 'SELECT
genoName,genoStart,genoEnd,repName,swScore,strand,repClass,repFamily
FROM '${CHR}'_rmsk
WHERE repName LIKE "Alu%"
OR repFamily LIKE "Alu%"
' hg18
done
From that information, you can run an analysis of the density of these elements
for your research.
--Hiram
Miguel.Gallach at uv.es wrote:
> Dear Mr/Ms,
>
> We are analyzing few aspects of methylation in Alu repeats. We need to
> know the distribution of each Alu subfamily in the human genome. We
> just find the output table from the program REPEATMASKER. Do you have
> any table summing up the densities and/or distribution in the human
> genome of the Alu subfamilies?
>
> Thank you very much for your attention,
>
> Miguel
> --
> ********************************
> miguel.gallach at uv.es
> ********************************
> Miguel Gallach Caballero
> Laboratorio de Genómica Comparativa
> Departamento de Genética
> Universidad de Valencia
>
> Calle Doctor Moliner, 50
> Burjassot 46100 (Valencia)
> SPAIN
>
>
> _______________________________________________
> Genome maillist - Genome at soe.ucsc.edu
> http://www.soe.ucsc.edu/mailman/listinfo/genome
>
More information about the Genome
mailing list