Skip to main content
Figure 4 | Human Genomics

Figure 4

From: What the papers say: Text mining for genomics and systems biology

Figure 4

The genomic nomenclature is highly ambiguous. The plot shows the rank of a gene name against the total number of times that the gene name is found in Biothesaurus. The inset shows this only looking at human genes. The plot is in log-log coordinates. Both graphs show Zipf-like (discrete power-law) distributions. Biothesaurus is a collection of gene names mapped to Entrez Gene/Uniprot identifiers across approximately 7,000 species.

Back to article page