Genotyping Glossary

Summary

Displays the fundamental description of an SNP sensing Probe Set.


Physical Map

The current genomic context of the SNP, with the Allele sequence at the center.


Associated Genes

Describes transcripts and their association with the SNP. Although some redundant transcripts are removed from our library, NetAffx incorporates several transcript sources that may still describe the same record or its variants.

  1. Relationship – Describes the relationship of the listed feature to the SNP described. Four possible entries to this field are:


  2. 3UTR – SNP is located in the 3' UTR of the transcript.
    5UTR – SNP is located in the 5' UTR of the transcript.
    CDS – SNP is located in the CDS of the transcript.
    Intron – SNP is located in the intron.
    Exon – SNP is located in the exon.
    Upstream – The 3' end of the transcript lies some distance upstream from the SNP.
    Downstream – The 5' end of the transcript lies some distance downstream of the SNP.
    If you need to retrieve all the SNPs in exon, you will have to perform a search against the relationship field with the following keyword that combines multiple relationship attributes separated by "|" operator: "3UTR|5UTR|CDS|Exon"

  3. Distance – Describes the distance between the SNP and the transcript in nucleotide bases. A distance of 0 means that the SNP falls within the outer boundaries of the transcript record. There may be more than one transcript at the same distance from the SNP if there are multiple records describing the same locus.
  4. Gene – Displays a gene name derived from NCBI UniGene/Gene records when available.
  5. Transcript – The accession and link that goes out to the full entry record.
  6. Entrez Gene – The identifier from the NCBI Entrez Gene database and the link that goes out to the Locus description.
  7. Description – mRNA sequence description. This is derived from GenBank or Ensembl.

Genetic Map

Describes the genetic location of the SNP derived from three separate linkage maps (deCODE, Marshfield, or SLM). The physical distance between the markers is assumed to be linear with their genetic distance. The genetic location is computed using the linkage maps from the latest physical location of the SNP and the neighboring microsatellite markers.

Marshfield reference: Broman KW, Murray JC, Sheffield VC, et.al., (1998) Comprehensive human genetic maps: Individual and sex-specific variation in recombination. Am J Human Genetics, 63(3):861-869.

Decode Reference: Kong A, Gudbjartsson DF, Sainz J, et.al., (2002) A high-resolution recombination map of the human genome. Nat Genet. Jul;31(3):241-7. Epub 2002 Jun 10.

SLM is a Genetic Map accumulated in house at Affymetrix. It is described in the technical note: SNP Selection Criteria for the GeneChip® Human Mapping 10K Array Xba 131.


Neighboring Microsatellite

Describes the nearest microsatellite markers (upstream, downstream and overlapping) for the SNP.


Population Frequency Data From Affymetrix Validation Studies

Describes the frequency of the allele from studies using the Affymetrix genotyping arrays. The data are described in the following reference:

  1. Population – The Ethnic group used in the genotyping study. The following is a description of the population groups:
  2. Yoruba – Yoruba in Ibadan, Nigeria
    Japanese – Japanese in Tokyo, Japan
    Han Chinese – Han Chinese in Beijing, China
    CEPH – CEPH (Utah residents with ancestry from northern and western Europe)

  3. p(N) – Major allele frequency. The actual allele is provided in parenthesis.
  4. q(N) – Minor allele frequency. The actual allele is provided in parenthesis.
  5. H(2pq) – Heterozygosity frequency.
  6. # of individuals counted – The number of individuals included in the study to derive the genotype. This is an indication of the total number of individuals included in the study and the SNP call rate.

Back to Top >