Figure 4From: What the papers say: Text mining for genomics and systems biologyThe genomic nomenclature is highly ambiguous. The plot shows the rank of a gene name against the total number of times that the gene name is found in Biothesaurus. The inset shows this only looking at human genes. The plot is in log-log coordinates. Both graphs show Zipf-like (discrete power-law) distributions. Biothesaurus is a collection of gene names mapped to Entrez Gene/Uniprot identifiers across approximately 7,000 species.Back to article page