Results from searching the nr and RefSeq protein databases. (A) Number of best-matching proteins by species. Blueberry transcript sequences were searched against the NCBI non-redundant (nr) protein database, and the best-matching protein for each blueberry sequence was identified. The plot shows the number of blueberry genes whose best-matching protein was from the indicated species. (B) Distribution of percent identity scores by plant RefSeq database. Blueberry transcript sequences were used to search RefSeq protein databases for plants with close-to-complete, annotated genomes. Boxplots show the distribution of percent identity scores by species. RefSeq databases include wine grape (Vitis vinifera), castor bean (Ricinus communis), poplar (Populus trichocarpa), tomato (Solanum lycopersicum), strawberry (Fragaria vesca), soybean (Glycine max), cucumber (Cucumis sativus), Arabidopsis (Arabidopsis thaliana), Medicago (Medicago truncatula), Brachypodium (Brachypodium distachyon), rice (Oryza sativa), sorghum (Sorghum bicolor), corn (Zea mays), and a spikemoss (Selaginella moellendorffii).