Salzberg was born in 1960 as one of four children to Herman Salzberg, a Distinguished Professor Emeritus of Psychology, and Adele Salzberg, a retired school teacher.[9] Salzberg did his undergraduate studies at Yale University where he received his Bachelor of Arts degree in English in 1980. In 1981 he returned to Yale, and he received his Master of Science and Master of Philosophy degrees in Computer Science in 1982 and 1984, respectively. After several years in a startup company, he enrolled at Harvard University, where he earned a Ph.D. in Computer Science in 1989.[10]
Career
After obtaining his undergraduate degree, he worked for a local power company in South Carolina, where he gained programming experience on an IBM mainframe,[11] programming in COBOL and IBM assembly language. He then joined a Boston-based AI startup upon completion of his masters degree in Computer Science.[11]
After earning his Ph.D., Salzberg joined Johns Hopkins University as an assistant professor in the Department of Computer Science, and was promoted to associate professor in 1997. From 1998 to 2005, he was the head of the Bioinformatics department at The Institute for Genomic Research, one of the world's largest genome sequencing centers. Salzberg then joined the Department of Computer Science at the University of Maryland, College Park, where he was the Horvitz Professor of Computer Science as well as the Director of the Center for Bioinformatics and Computational Biology. In 2011, Salzberg returned to Johns Hopkins University as a professor in the McKusick-Nathans Institute of Genetic Medicine and in the Department of Medicine.[8][12][13]
Salzberg has been a prominent scientist in the field of bioinformatics and computational biology since the 1990s. He has made many contributions to gene finding algorithms, notably the GLIMMER[17] program for bacterial gene finding as well as several related programs for finding genes in animals, plants, and other organisms. He has also been a leader in genome assembly research and has led the assembly of dozens of genomes, both large and small. He was a participant in the human genome project[18] as well as many other genome projects, including the malaria genome (Plasmodium falciparum) and the genome of the model plant Arabidopsis thaliana. In 2001–2002, he and his colleagues sequenced the anthrax that was used in the 2001 anthrax attacks. They published their results in the journal Science in 2002.[19] These findings helped the FBI track the source of the attacks to a single vial at Ft. Detrick in Frederick, Maryland.
Soon after the advent of next-generation sequencing (NGS) in the mid-2000s, Salzberg's research lab and his collaborators developed a suite of highly efficient, accurate programs for alignment of NGS sequences to large genomes and for assembly of sequences from RNA-Seq experiments. These include the "Tuxedo" suite, comprising the Bowtie, TopHat, and Cufflinks programs, which have been cited tens of thousands of times in the years since their publication.
Salzberg has also been a vocal advocate against pseudoscience and has authored editorials and appeared in print media on this topic. Since 2010, he has written a column at Forbes magazine[22] on science, medicine, and pseudoscience, where he has published hundreds of articles that have received tens of millions of views. His work at Forbes won the 2012 Robert P. Balles Prize in Critical Thinking.[23]
Salzberg was a charter member of the Cambridge Working Group in 2014, which was created to express alarm in the scientific community over the creation of highly transmissible and contagious viruses (also called Gain-of-function research) and the likelihood of an accidental lab release.[24]
Publications
Salzberg has authored or co-authored over 300 scientific publications.[25] He has more than 300,000 citations in Google Scholar and an h-index of 159.[26] In 2014 and every year since (through at least 2022), Salzberg was selected for inclusion in HighlyCited.com, a ranking compiled by the Institute for Scientific Information of scientists who are among the top 1% most cited for their subject field during the previous ten years. He was also chosen for this list when it was first created in 2001. This list of highly cited researchers continues under Clarivate, and Salzberg was also included in the list in 2018, 2019, 2020, 2021, 2022, and 2023.[27]
Highly cited articles (more than 10,000 citations)
2012 with B Langmead, Fast gapped-read alignment with Bowtie 2, in: Nature Methods. Vol. 9, nº 4; 357.
2009 With B Langmead, C Trapnell, M Pop, Ultrafast and memory-efficient alignment of short DNA sequences to the human genome, in: Genome Biology. Vol. 10, nº 3; 1–10.
2001 with JC Venter, MD Adams, EW Myers, PW Li, RJ Mural, et al., The sequence of the human genome, in: Science. Vol. 291, nº 5507; 1304–1351.
2015 with D Kim, B Langmead, HISAT: a fast spliced aligner with low memory requirements, in: Nature Methods Vol. 12, 357–360. (2015)
2010 with C Trapnell, BA Williams, G Pertea, A Mortazavi, G Kwan, MJ Van Baren, BJ Wold, L Pachter, Transcript assembly and quantification by RNA-Seq reveals unannotated transcripts and isoform switching during cell differentiation, in: Nature Biotechnology. Vol. 28, nº 5; 511–515.
2009 with C Trapnell, L Pachter, TopHat: discovering splice junctions with RNA-Seq, in: Bioinformatics. Vol. 25, nº 9; 1105–1111.
2013 with D Kim, G Pertea, C Trapnell, H Pimentel, R Kelley, TopHat2: accurate alignment of transcriptomes in the presence of insertions, deletions and gene fusions, in: Genome Biology. Vol. 14, nº 4; 1–13.
2012 with C Trapnell, A Roberts, L Goff, G Pertea, D Kim, DR Kelley, H Pimentel, JL Rinn, L Pachter. Differential gene and transcript expression analysis of RNA-seq experiments with TopHat and Cufflinks, in: Nature Protocols. Vol. 7, nº 3; 562-578.
2011 with T Magoč. FLASH: fast length adjustment of short reads to improve genome assemblies, in: Bioinformatics. Vol. 27, nº 21; 2957-2963.
2000 with The Arabidopsis Genome Initiative. Analysis of the genome sequence of the flowering plant Arabidopsis thaliana, in: Nature. Vol. 408, nº 6814; 796-815.
^Venter, J. C.; Adams, M.; Myers, E.; Li, P.; Mural, R.; Sutton, G.; Smith, H.; Yandell, M.; Evans, C.; Holt, R. A.; Gocayne, J. D.; Amanatides, P.; Ballew, R. M.; Huson, D. H.; Wortman, J. R.; Zhang, Q.; Kodira, C. D.; Zheng, X. H.; Chen, L.; Skupski, M.; Subramanian, G.; Thomas, P. D.; Zhang, J.; Gabor Miklos, G. L.; Nelson, C.; Broder, S.; Clark, A. G.; Nadeau, J.; McKusick, V. A.; et al. (2001). "The Sequence of the Human Genome". Science. 291 (5507): 1304–1351. Bibcode:2001Sci...291.1304V. doi:10.1126/science.1058040. PMID11181995.
^Baker, Nicholson. (4 January 2021). "The Lab-Leak Hypothesis For decades, scientists have been hot-wiring viruses in hopes of preventing a pandemic, not causing one. But what if …?". New York magazine Retrieved 18 January 2021.