[Genome] GNF Atlas 2 Median Ratios...
Michael Mayhew
mmayhew at mcb.mcgill.ca
Wed Apr 11 00:21:25 PDT 2007
Hello,
I am attempting to build a distribution out of the 'expression
values' for a set of genes (say of N=200) for all tissues but one (say
all tissues but brain).
I am using the GNF Atlas 2 median ratios to build this distribution.
So, what I end up with is a vector of points with one point for each
tissue except brain for each gene (so 200*78 points).
I have noticed a large number of duplicate median ratios in the
vector of points I create to build my distribution. (in one case ~11000
/ ~16000) I have already mapped all RefSeq IDs to Entrez Gene IDs so
that each Entrez ID has its own set of median ratios and so that the
duplications are not coming from multiple median ratio vectors for
multiple isoforms of the same gene.
I have spoken to microarray users in my group and they have all
advised me that log-ratios (in general) should be very minimally
duplicated (~1% was one person's estimate).
Is there anything inherent in the median ratios that would result in
this kind of pattern (perhaps the number of significant digits)? Is
there something I could be missing in building my vector of points?
Thank you very much for your time and consideration in this matter.
Michael
More information about the Genome
mailing list