Abstract
The total digital information today amounts to 3.52 × 1022 bits globally, and at its consistent exponential rate of growth is expected to reach 3 × 1024 bits by 2040. Data storage density of silicon chips is limited, and magnetic tapes used to maintain large-scale permanent archives begin to deteriorate within 20 years. Since silicon has limited data storage ability and serious limitations, such as human health hazards and environmental pollution, researchers across the world are intently searching for an appropriate alternative. Deoxyribonucleic acid (DNA) is an appealing option for such a purpose due to its endurance, a higher degree of compaction, and similarity to the sequential code of 0’s and 1’s as found in a computer. This emerging field of DNA as means of data storage has the potential to transform science fiction into reality, wherein a device that can fit in our palms can accommodate the information of the entire world, as latest research has revealed that just four grams of DNA could store the annual global digital information. DNA has all the properties to supersede the conventional hard disk, as it is capable of retaining ten times more data, has a thousandfold storage density, and consumes 108 times less power to store a similar amount of data. Although DNA has an enormous potential as a data storage device of the future, multiple bottlenecks such as exorbitant costs, excruciatingly slow writing and reading mechanisms, and vulnerability to mutations or errors need to be resolved. In this review, we have critically analyzed the emergence of DNA as a molecular storage device for the future, its ability to address the future digital data crunch, potential challenges in achieving this objective, various current industrial initiatives, and major breakthroughs.
Similar content being viewed by others
References
Ailenberg M, Rotstein OD (2009) An improved Huffman coding method for archiving text, images, and music characters in DNA. Biotechniques 47:747
Allentoft ME, Collins M, Harker D, Haile J, Oskam CL (2012) The half-life of DNA in bone: measuring decay kinetics in 158 dated fossils. Proc R Soc Lond B Bio. https://doi.org/10.1098/rspb.2012.1745
Arita M, Ohashi Y (2004) Secret signatures inside genomic DNA. Biotechnol Prog 20:1605–1607
Bancroft C, Bowler T, Bloom B, Clelland CT (2001) Long-term storage of information in DNA. Science 293:1763–1765
Benson E, Mohammed A, Gardell J, Masich S, Czeizle E (2015) DNA rendering of polyhedral meshes at the nanoscale. Nature 523:441–444
Blawat M, Gaedke K, Huetter I, Chen XM, Turczyk B (2016) Forward error correction for DNA data storage. Proc Comput Sci 80:1011–1022
Bornholt J, Lopez R, Carmean DM, Ceze L, Seelig G (2016) A DNA-based archival storage system. ACM SIGOPS Oper Syst Rev 50:637–649
Briggs AW, Stenzel U, Johnson PL, Green RE, Kelso J, Prüfer K, Pääbo S (2007) Patterns of damage in genomic DNA sequences from a Neandertal. Proc Natl Acad Sci 104(37):14616–14621
Chepesiuk R (1999) Where the chips fall: environmental health in the semiconductor industry. Environ Health Perspect 107:A452
Church GM, Gao Y, Kosuri S (2012) Next-generation digital information storage in DNA. Science 337:1628
Clelland CT, Risca V, Bancroft C (1999) Hiding messages in DNA microdots. Nature 399:533–534
Davis J (1996) Microvenus. Art J 55:70–74
Erlich Y, Zielinski D (2017) DNA fountain enables a robust and efficient storage architecture. Science 355:950–954
Extance A (2016) How DNA could store all the world’s data. Nature 537:7618
Gibson DG, Glass JI, Lartigue C, Noskov VN, Chuang R-Y (2010) Creation of a bacterial cell controlled by a chemically synthesized genome. Science 329:52–56
Goldman N, Bertone P, Chen S, Dessimoz C, LeProust EM (2013) Toward practical, high-capacity, low-maintenance information storage in synthesized DNA. Nature 494:77–80
Grass RN, Heckel R, Puddu M, Paunescu D, Stark WJ (2015) Robust chemical preservation of digital information on DNA in silica with error-correcting codes. Angew Chem Int Ed 54:2552–2555
Grigoryev Y (2012) How much information is stored in the human genome? Technical report from BitesizeBio http://bitesizebio.com/8378/how-much-information-is-stored-in-the-human-genome/
Gustafsson C (2009) For anyone who ever said there’s no such thing as a poetic gene. Nature 458:703
Hofreiter M, Serre D, Poinar HN, Kuch M, Pääbo S (2001) Ancient DNA. Nat Rev Genet 2(5):353–359
iGEM C (2010) Bacterial-based storage and encryption device. http://2010.igem.org/Team:Hong_Kong-CUHK
Jarvis CNPC (2012) Blighted by Kenning. http://www.artforeating.co.uk/restaurant/indexphp?/blighted-by-ken/project-overview
Kac E (1999) GENESIS. http://www.ekac.org/geninfo2.html
Kaku M (2012) Physics of the future: How science will shape human destiny and our daily lives by the year 2100: Anchor
Kashiwamura S, Yamamoto M, Kameda A, Shiba T, Ohuchi A (2005) Potential for enlarging DNA memory: the validity of experimental operations of scaled-up nested primer molecular memory. BioSyst 80:99–112
Keller A, Graefen A, Ball M, Matzas M, Boisguerin V, Maixner F, Stade B (2012) New insights into the Tyrolean Iceman’s origin and phenotype as inferred by whole-genome sequencing. Nat Commun 3:698
Khalifa A, Atito A (2012) High-capacity DNA-based steganography. IEEE Bio 76–80
Khalifa A, Elhadad A, Hamad S (2016) Secure blind data hiding into pseudo DNA sequences using playfair ciphering and generic complementary substitution. Appl Math 10:1483–1492
Kim C, Li M, Rodesch M, Lowe A, Richmond K (2004a) Biological lithography: improvements in DNA synthesis methods. J Vac Sci Technol B Microelectron Nanometer Struct Process Measurement Phenomena 22:3163–3167
Kim S, Soltis DE, Soltis PS, Suh Y (2004b) DNA sequences from Miocene fossils: an ndhF sequence of Magnolia latahensis (Magnoliaceae) and an rbcL sequence of Persea pseudocarolinensis (Lauraceae). Am J Bot 91:615–620
Leier A, Richter C, Banzhaf W, Rauhe H (2000) Cryptography with DNA binary strands. BioSyst 57:13–22
Miller W, Schuster SC, Welch AJ, Ratan A, Bedoya-Reina C (2012) Polar and brown bear genomes reveal ancient admixture and demographic footprints of past climate change. Proc Natl Acad Sci 109:E2382–E2390
Moore GE (1998) Cramming more components onto integrated circuits. Proc IEEE 86:82–85
Orlando L, Ginolhac A, Zhang G, Froese D, Albrechtsen A (2013) Recalibrating Equus evolution using the genome sequence of an early Middle Pleistocene horse. Nature 499:74–78
Portney NG, Wu Y, Quezada LK, Lonardi S, Ozkan M (2008) Length-based encoding of binary data in DNA. Langmuir 24:1613–1616
Patel P (2016) Scientific American https://www.scientificamerican.com/article/tech-turns-to-biology-as-data-storage-needs-explode/. Accessed 31 May 2016
Service RF (2017) DNA could store all of the world’s data in one room. Science. https://doi.org/10.1126/science.aal0852
Shendure J, Aiden EL (2012) The expanding scope of DNA sequencing. Nat Biotechnol 30:1084–1094
Shipman SL, Nivala J, Macklis JD, Church GM (2017) CRISPR–Cas encoding of a digital movie into the genomes of a population of living bacteria. Nature 547(7663):345
Shrivastava S, Badlani R (2014) Data storage in DNA. Int J Elec Energy 2:119–124
Skinner GM, Visscher K, Mansuripur M (2007) Biocompatible writing of data into DNA. J Bionanosci 1:17–21
Van Bogart, JW (1995) Life Expectancy: How long will magnetic media last?. Council on Library and Information Resources
Williams ED, Ayres RU, Heller M (2002) The 1.7 kilogram microchip: energy and material use in the production of semiconductor devices. Environ Sci Technol 36:5504–5510
Wong PC, Wong KK, Foote H (2003) Organic data memory using the DNA approach. Commun ACM 46:95–98
Amazon:https://www.amazon.com/s/ref=nb_sb_noss_2?url=search-alias%3Daps&field-keywords=DNADrive
openMoSS: Open Molecular Storage System. https://openmoss.org/
Yachie N, Sekiyama K, Sugahara J, Ohashi Y, Tomita M (2007) Alignment-based approach for durable data storage into living organisms. Biotechnol Prog 23:501–505
Yang YR, Liu Y, Yan H (2015) DNA nanostructures as programmable biomolecular scaffolds. Bioconjug Chem 26:1381–1395
Yazdi SHT, Yuan Y, Ma J, Zhao H, Milenkovic O (2015) A rewritable, random-access DNA-based storage system. Sci Rep. https://doi.org/10.1038/srep14138
Yazdi SHT, Gabrys R, Milenkovic O (2017) Portable and error-free DNA-based data storage. Sci Rep 7(1):5011
Yong E (2013) Synthetic double-helix faithfully stores Shakespeare’s sonnets. Nature. http://www.nature.com/news/synthetic-double-helix-faithfully-stores-shakespeare-s-sonnets-1.12279. Accessed 23 Jan 2013
Zhang F, Jiang S, Wu S, Li Y, Mao C (2015) Complex wireframe DNA origami nanostructures with multi-arm junction vertices. Nat Nanotechnol 10:779–784
Zhirnov V, Zadegan RM, Sandhu GS, Church GM, Hughes WL (2016) Nucleic acid memory. Nat Mater 15:366–370
Acknowledgements
We thank Justin Shih, PSU, USA for his assistance with language editing. Critical input from Chitra Lele, UK is acknowledged. We acknowledge the funding from Indian Council of Agricultural Research, New Delhi.
Author information
Authors and Affiliations
Corresponding authors
Ethics declarations
Conflict of interest
The authors declare that they have no competing interests.
Rights and permissions
About this article
Cite this article
Panda, D., Molla, K.A., Baig, M.J. et al. DNA as a digital information storage device: hope or hype?. 3 Biotech 8, 239 (2018). https://doi.org/10.1007/s13205-018-1246-7
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/s13205-018-1246-7