Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!
Grab it
Explore the intersection of large language models and computational biology in this 49-minute lecture by Jian Ma at the Computational Genomics Summer Institute (CGSI). Delve into the recent history of large language models and foundation models, understanding their architecture and applications in genomics. Learn about the Transformer model, self-attention mechanisms, and various tokenization techniques specific to biological sequences. Discover specialized models like DNA Bird 2, nucleotide Transformer, SD Bert Model, and SCGPT, designed for genomic data analysis. Examine the concept of generative pretraining and its relevance to computational biology. Conclude with a discussion on open questions and a summary of the potential impact of these technologies on genomic research.
Large Language Models for Computational Biology - A Primer