Available Libraries

GenSAS provides the following datasets as globally available libraries for use with alignment tools or RepeatMasker. References for the GenSAS tools and databases are available in a separate file.

GenSAS-provided library      RNA number  Protein number    Release date
refseq_archaea               1,205       2,105,145 (nr)    9/16/2020
refseq_bacteria              23,260      154,106,184 (nr)  9/16/2020
refseq_fungi                 3,294,947   3,298,550         9/17/2020
refseq_invertebrate          5,551,405   5,114,272         9/16/2020
refseq_mitochondrion         NA          160,879           9/16/2020
refseq_plant                 6,345,653   5,795,592         9/16/2020
refseq_plasmid               7           1,288,743 (nr)    9/21/2020
refseq_plastid               44          486,771           9/17/2020
refseq_protozoa              1,039,327   1,091,632         9/16/2020
refseq_vertebrate_mammalian  7,806,463   6,577,182         9/16/2020
refseq_vertebrate_other      9,014,808   8,112,356         9/16/2020
refseq_viral                 NA          477,258           9/16/2020
sprot                        NA          563,082           8/12/2020
trembl                       NA          188,961,949       8/12/2020
RepBase library              NA          NA                12/24/2018