Table 1 Characteristics of the single nucleotide polymorphisms (SNPs) identified in the 3,000 rice genomes when aligned to the reference japonica Nipponbare genome IRGSP-1.0

From: The 3,000 rice genomes project

Chrom. Gene mRNA 5’-UTR CDS Intron 3’-UTR Intergenic Total CDS-Syn CDS-Non-syn CDS-Total Non-syn/Syn
Chr1 634,912 630,396 25,880 291,817 286,601 26,098 1,252,989 1,887,901 118,095 173,722 291,817 1.471
Chr2 528,417 524,172 20,087 243,967 238,738 21,380 1,013,475 1,541,892 97,306 146,661 243,967 1.507
Chr3 490,402 487,611 19,899 223,196 224,129 20,387 962,304 1,452,706 88,477 134,719 223,196 1.523
Chr4 730,310 727,473 19,018 388,220 301,071 19,164 1,176,274 1,906,584 160,101 228,115 388,220 1.425
Chr5 489,370 485,848 13,623 257,327 200,307 14,591 867,799 1,357,169 103,723 153,604 257,327 1.481
Chr6 560,506 557,361 16,943 280,933 242,635 16,850 1,023,473 1,583,979 114,625 166,308 280,933 1.451
Chr7 548,266 546,569 16,210 280,994 231,797 17,568 973,670 1,521,936 115,332 165,662 280,994 1.436
Chr8 582,068 580,181 16,396 302,785 244,991 16,009 998,651 1,580,719 124,025 178,759 302,785 1.441
Chr9 436,037 434,440 10,692 222,916 190,025 10,807 763,771 1,199,808 90,299 132,617 222,916 1.469
Chr10 476,710 473,603 11,735 258,013 192,214 11,641 806,940 1,283,650 109,451 148,561 258,013 1.357
Chr11 684,803 681,891 16,642 354,874 291,049 19,326 1,148,735 1,833,538 140,772 214,101 354,874 1.521
Chr12 607,336 603,783 16,549 319,401 251,103 16,730 1,055,044 1,662,380 129,296 190,105 319,401 1.470
ChrUn 19,706 19,706 0 12,615 7,091 0 26,669 46,375 5,819 6,796 12,615 1.168
ChrSy 11,463 11,463 0 7,913 3,550 0 15,043 26,506 3,846 4,067 7,913 1.057
Total 6,800,306 6,764,497 203,674 3,444,971 2,905,301 210,551 12,084,837 18,885,143 1,401,167 2,043,797 3,444,971 1.459
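The columns of the table appear to be related by simple arithmetic: the mRNA count equals the sum of the 5’-UTR, CDS, Intron and 3’-UTR counts, the first Total column equals Gene plus Intergenic, and Non-syn/Syn is the ratio of the two coding counts rounded to three decimals. The sketch below checks these relationships on three rows copied from the table. The dictionary keys and function names are illustrative choices, not part of the original publication. Note that Syn plus Non-syn falls slightly short of the CDS count in some rows (by 4 SNPs on Chr4, for example); the table does not state the reason, so the code only reports the difference.

```python
# Values copied from the Chr1, Chr4 and Total rows of Table 1.
# Key names are illustrative; they do not come from the publication.
ROWS = {
    "Chr1": dict(gene=634_912, mrna=630_396, utr5=25_880, cds=291_817,
                 intron=286_601, utr3=26_098, intergenic=1_252_989,
                 total=1_887_901, syn=118_095, nonsyn=173_722),
    "Chr4": dict(gene=730_310, mrna=727_473, utr5=19_018, cds=388_220,
                 intron=301_071, utr3=19_164, intergenic=1_176_274,
                 total=1_906_584, syn=160_101, nonsyn=228_115),
    "Total": dict(gene=6_800_306, mrna=6_764_497, utr5=203_674, cds=3_444_971,
                  intron=2_905_301, utr3=210_551, intergenic=12_084_837,
                  total=18_885_143, syn=1_401_167, nonsyn=2_043_797),
}


def mrna_from_parts(row):
    """Sum the four mRNA sub-categories (5'-UTR, CDS, intron, 3'-UTR)."""
    return row["utr5"] + row["cds"] + row["intron"] + row["utr3"]


def genome_total(row):
    """SNPs in genes plus SNPs in intergenic regions."""
    return row["gene"] + row["intergenic"]


def nonsyn_syn_ratio(row):
    """Non-synonymous to synonymous ratio, rounded as in the table."""
    return round(row["nonsyn"] / row["syn"], 3)


def cds_unclassified(row):
    """CDS SNPs not accounted for by the Syn and Non-syn columns."""
    return row["cds"] - (row["syn"] + row["nonsyn"])


for name, row in ROWS.items():
    assert mrna_from_parts(row) == row["mrna"], name
    assert genome_total(row) == row["total"], name
    print(name, nonsyn_syn_ratio(row), cds_unclassified(row))
```

For these three rows the computed ratios match the published Non-syn/Syn values (1.471, 1.425 and 1.459).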
  1. The MSU V7.0 rice gene annotation, covering 55,986 genes and 66,338 mRNAs [13], was downloaded as a raw gff3 file from the Rice Genome Project Annotation ftp site [19]. Before SNP types were categorized, the raw gff3 file was processed (1) to remove all but the primary mRNA transcript of each gene and (2) to select the gene model with the highest support wherever gene models overlap. Hence, SNP characteristics are reported here for 55,107 of the 55,986 gene models. Characteristics of SNPs in pseudogenes, or at positions where the reference base is N (unknown or missing), are not reported. Syn = synonymous; Non-syn = non-synonymous.
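The first preprocessing step in the footnote, keeping only the primary mRNA transcript, could be sketched roughly as follows. This is not the authors' script. It assumes that the primary transcript of a gene is the one whose ID ends in ".1" and that every mRNA line precedes its child features in the file; both are assumptions, and the function names and example records are hypothetical. The second step (choosing among overlapping gene models by support) is not attempted, because the footnote does not define how support was measured.

```python
def parse_attributes(field):
    """Turn a GFF3 attribute string such as 'ID=x;Parent=y' into a dict."""
    attrs = {}
    for item in field.strip().split(";"):
        if "=" in item:
            key, value = item.split("=", 1)
            attrs[key] = value
    return attrs


def is_primary(mrna_id):
    # ASSUMPTION: the primary transcript carries the suffix ".1".
    return mrna_id.endswith(".1")


def keep_primary_transcripts(lines):
    """Drop non-primary mRNA records and the features that belong to them."""
    kept = []
    kept_mrna_ids = set()
    for line in lines:
        if line.startswith("#") or not line.strip():
            kept.append(line)  # keep directives, comments and blank lines
            continue
        cols = line.rstrip("\n").split("\t")
        if len(cols) != 9:
            continue  # skip malformed records
        feature_type = cols[2]
        attrs = parse_attributes(cols[8])
        if feature_type == "gene":
            kept.append(line)
        elif feature_type == "mRNA":
            if is_primary(attrs.get("ID", "")):
                kept_mrna_ids.add(attrs["ID"])
                kept.append(line)
        else:
            # Exons, CDS and UTRs: keep only children of a retained mRNA.
            parents = attrs.get("Parent", "").split(",")
            if any(parent in kept_mrna_ids for parent in parents):
                kept.append(line)
    return kept


# Hypothetical records for illustration only.
example = [
    "##gff-version 3",
    "Chr1\tdemo\tgene\t100\t900\t.\t+\t.\tID=geneA",
    "Chr1\tdemo\tmRNA\t100\t900\t.\t+\t.\tID=geneA.1;Parent=geneA",
    "Chr1\tdemo\tCDS\t150\t800\t.\t+\t0\tID=geneA.1:cds;Parent=geneA.1",
    "Chr1\tdemo\tmRNA\t100\t700\t.\t+\t.\tID=geneA.2;Parent=geneA",
    "Chr1\tdemo\tCDS\t150\t650\t.\t+\t0\tID=geneA.2:cds;Parent=geneA.2",
]
filtered = keep_primary_transcripts(example)
```

In this example the second transcript and its CDS are dropped, leaving the directive line, the gene, the first mRNA and its CDS.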