Natural or synthetic? How Addgene’s dataset reveals trends in biological innovation
Type: Blog Post
... plasmids in the repository, each verified by sequencing, which makes the repository a convenient source ... content and rare codon percentage were not good predictors. There is not much variation in GC content between ... for GC content. Rare codons are also not good predictors because they are just that: rare. “Nature has ... says. Percent sequence identity was a different story: the team found that the percent sequence identity ... a synthetic version of that gene was a clear predictor of whether the gene is synthetic or natural. They ... most common expression system within the Addgene repository is mammalian, but the largest source of unique ... is from Proteobacteria to mammalian expression vectors. And more broadly, their data revealed that the ...