NCBI Links

The National Center for Biotechnology Information (NCBI) at the National Library of Medicine maintains several databases of genetic sequence data. A full record in MEDLINE® may be linked to any of ten databases. Click the link in the record to get information about the genetic sequence data reported in the source record.

For more information about NCBI databases, go to the NCBI Web site.

Database Description
Conserved Domains Conserved Domains is a database of protein domains. The source databases for Conserved Domains are Pfam, Smart, and COG.*
Genome Database The Genome database provides views for a variety of genomes, complete chromosomes, sequence maps with contigs, and integrated genetic and physical maps. The database is organized in six major organism groups: Archaea, Bacteria, Eukaryotae, Viruses, Viroids, and Plasmids and includes complete chromosomes, organelles, and plasmids as well as draft genome assemblies.*
Nucleotide The Nucleotide database contains sequence data from GenBank, EMBL, and DDBJ, the members of the tripartite, international collaboration of sequence databases. EMBL is the European Molecular Biology Laboratory at Hinxton Hall, UK; DDBJ is the DNA Database of Japan in Mishima, Japan. Sequence data are also incorporated from the Genome Sequence Data Base (GSDB), Santa Fe, NM. Patent sequences are incorporated through arrangements with the U.S. Patent and Trademark Office (USPTO) and via the collaborating international databases from other international patent offices.*
OIMM Database

OMIM (Online Mendelian Inheritance in Man) is a database of human genes and genetic disorders authored and edited by Dr. Victor A. McKusick and his colleagues at Johns Hopkins University and elsewhere. It was developed for the World Wide Web by NCBI.

The database contains textual information and references. It contains copious links to MEDLINE and sequence records in the Entrez system. It also contains links to additional related resources at NCBI and elsewhere.*

PopSet Database The PopSet database contains aligned sequences submitted as a set resulting from a population, phylogenetic, or mutation study. These alignments describe such events as evolution and population variation. The PopSet database contains both nucleotide and protein sequence data.*
ProbSet Probe is a public registry of nucleic acid reagents designed for use in a wide variety of biomedical research applications, together with information on reagent distributors, probe effectiveness, and computed sequence similarities.*
Protein Database The Protein database contains sequence data from the translated coding regions from DNA sequences in GenBank, EMBL, and DDBJ as well as protein sequences submitted to Protein Information Resource (PIR), SWISS-PROT, Protein Research Foundation (PRF), and Protein Data Bank (PDB) (sequences from solved structures).*
SNP The Single Nucleotide Polymorphism Database (dbSNP) is a central repository database for both single-base nucleotide substitutions and short deletion and insertion polymorphisms.*
Structure The Structure database or Molecular Modeling Database (MMDB) contains experimental data from crystallographic and NMR structure determinations. The data for MMDB are obtained from the Protein Data Bank (PDB). The NCBI has cross-linked structural data to bibliographic information, to the sequence databases, and to the NCBI taxonomy.*
Taxonomy Database The Taxonomy database contains the names of all organisms that are represented in the NCBI genetic database by at least one nucleotide or protein sequence.*
UniGene

The UniGene database contains gene-oriented clusters of transcript sequences.

UniGene is an experimental system for automatically partitioning GenBank sequences into a non-redundant set of gene-oriented clusters. Each UniGene cluster contains sequences that represent a unique gene, as well as related information such as the tissue types in which the gene has been expressed and map location.*

UniSTS

The UniSTS database contains Sequence Tagged Sites (STS) derived from STS mapping data.

UniSTS integrates marker and mapping data from a variety of public resources. Data sources include dbSTS, RHdb, GDB, various human maps, and various mouse maps.*