Bioinformatics Specialization

Journey to the Frontier of Computational Biology. Master bioinformatics software and computational approaches in modern biology.

Taught in English

Some content may not be translated

Instructors: Pavel Pevzner

63,057 already enrolled

Included with Coursera Plus

Specialization - 7 course series

Get in-depth knowledge of a subject

4.4

(1,017 reviews)

Beginner level

No prior experience required

3 months at 10 hours a week

Flexible schedule

Learn at your own pace

Advance your subject-matter expertise

Learn in-demand skills from university and industry experts
Master a subject or tool with hands-on projects
Develop a deep understanding of key concepts
Earn a career certificate from University of California San Diego

Earn a career certificate

Add this credential to your LinkedIn profile, resume, or CV

Share it on social media and in your performance review

Join Us in a Top 50 MOOC of All Time!

How do we sequence and compare genomes? How do we identify the genetic basis for disease? How do we construct an evolutionary Tree of Life for all species on Earth?

By completing this Specialization, you will learn how to answer many questions in modern biology that have become inseparable from the computational approaches used to solve them. You will also obtain a toolkit of existing software resources that are built on these computational approaches and used by thousands of biologists every day in one of the fastest growing fields in science.

Although this Specialization centers on computational topics, you do not need to know how to program in order to complete it. If you are interested in programming, we feature an "Honors Track" (called "hacker track" in previous runs of the course). The Honors Track allows you to implement the bioinformatics algorithms that you will encounter along the way in dozens of automatically graded coding challenges. By completing the Honors Track, you will be a bioinformatics software professional!

Learn more about the Bioinformatics Specialization (including why we are wearing these crazy outfits) by watching our introductory video.

You can purchase the Specialization's print companion, Bioinformatics Algorithms: An Active Learning Approach, from the textbook website.

Our first course, "Finding Hidden Messages in DNA", was named a top-50 MOOC of all time by Class Central!

Finding Hidden Messages in DNA (Bioinformatics I)

Course 1, 15 hours, 4.3 (995 ratings)

What you'll learn

Named a top 50 MOOC of all time by Class Central!

This course begins a series of classes illustrating the power of computing in modern biology. Please join us on the frontier of bioinformatics to look for hidden messages in DNA without ever needing to put on a lab coat. In the first half of the course, we investigate DNA replication and ask the question: where in the genome does DNA replication begin? We will see that we can answer this question for many bacteria using only some straightforward algorithms to look for hidden messages in the genome. In the second half of the course, we examine a different biological question: which DNA patterns play the role of molecular clocks? The cells in your body manage to maintain a circadian rhythm, but how is this achieved on the level of DNA? Once again, we will see that by knowing which hidden messages to look for, we can start to understand the amazingly complex language of DNA. Perhaps surprisingly, we will apply randomized algorithms, which roll dice and flip coins in order to solve problems. Finally, you will get your hands dirty and apply existing software tools to find recurring biological motifs within genes that are responsible for helping Mycobacterium tuberculosis go "dormant" within a host for many years before causing an active infection.
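One simple way to look for a "hidden message" is to count which short substrings (k-mers) recur most often in a stretch of DNA. The sketch below is only an illustration of that idea, not the course's own code; the function name `frequent_kmers` and the sample string are assumptions made for this example.

```python
from collections import Counter


def frequent_kmers(text: str, k: int) -> list[str]:
    """Return the k-mers that occur most often in text (overlapping occurrences count)."""
    if k <= 0 or k > len(text):
        return []
    # Slide a window of width k across the text and tally every substring.
    counts = Counter(text[i:i + k] for i in range(len(text) - k + 1))
    top = max(counts.values())
    # Several k-mers may tie for the highest count; return them in sorted order.
    return sorted(kmer for kmer, n in counts.items() if n == top)


if __name__ == "__main__":
    print(frequent_kmers("ACGTTGCATGTCGCATGATGCATGAGAGCT", 4))
```

A plain count like this ignores mutations and reverse complements, which a realistic search for replication origins would likely need to handle.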

Skills you'll gain

Category: Bioinformatics

Category: Graph Theory

Category: Bioinformatics Algorithms

Category: Python Programming

Genome Sequencing (Bioinformatics II)

Course 2, 17 hours, 4.6 (307 ratings)

What you'll learn

You may have heard a lot about genome sequencing and its potential to usher in an era of personalized medicine, but what does it mean to sequence a genome?

Biologists still cannot read the 3 billion nucleotides of a human genome as you would read a book from beginning to end. However, they can read shorter fragments of DNA. In the first half of the course, we will see how graph theory can be used to assemble genomes from these short pieces in what amounts to the largest jigsaw puzzle ever put together. In the second half of the course, we will discuss antibiotics, a topic of great relevance as antimicrobial-resistant bacteria like MRSA are on the rise. You know antibiotics as drugs, but on the molecular level they are short mini-proteins that have been engineered by bacteria to kill their enemies. Determining the sequence of amino acids making up one of these antibiotics is an important research problem, and one that is similar to that of sequencing a genome by assembling tiny fragments of DNA. We will see how brute force algorithms that try every possible solution are able to identify naturally occurring antibiotics so that they can be synthesized in a lab. Finally, you will learn how to apply popular bioinformatics software tools to sequence the genome of a deadly Staphylococcus bacterium that has acquired antibiotic resistance.
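To make the graph-theory idea concrete, here is a minimal sketch of one common assembly approach: build a de Bruijn graph whose edges are the reads (k-mers), then walk an Eulerian path through it. The description above does not name a specific graph, so treating it as a de Bruijn graph is an assumption, as are the function names. The sketch also assumes error-free reads that cover the genome exactly once.

```python
from collections import defaultdict


def de_bruijn_graph(kmers):
    """Map each (k-1)-mer prefix to the (k-1)-mer suffixes of the k-mers starting with it."""
    graph = defaultdict(list)
    for kmer in kmers:
        graph[kmer[:-1]].append(kmer[1:])
    return graph


def eulerian_path(graph):
    """Hierholzer's algorithm. Assumes a path using every edge exactly once exists."""
    in_degree = defaultdict(int)
    for targets in graph.values():
        for node in targets:
            in_degree[node] += 1
    # Start at the node with one more outgoing than incoming edge, if there is one.
    start = next(
        (u for u in graph if len(graph[u]) - in_degree[u] == 1),
        next(iter(graph)),
    )
    unused = {u: list(vs) for u, vs in graph.items()}  # edges not yet walked
    stack, path = [start], []
    while stack:
        node = stack[-1]
        if unused.get(node):
            stack.append(unused[node].pop())
        else:
            path.append(stack.pop())
    return path[::-1]


def assemble(kmers):
    """Spell out the string traced by an Eulerian path through the de Bruijn graph."""
    path = eulerian_path(de_bruijn_graph(kmers))
    return path[0] + "".join(node[-1] for node in path[1:])


if __name__ == "__main__":
    print(assemble(["GTT", "ACG", "TGC", "CGT", "GCA", "TTG"]))
```

When the reads contain repeats, several Eulerian paths may exist, so the reconstructed string is one of possibly many strings with the same k-mer composition.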

Skills you'll gain

Category: Algorithms

Category: Python Programming

Category: Whole Genome Sequencing

Category: Dynamic Programming

Comparing Genes, Proteins, and Genomes (Bioinformatics III)

Course 3, 22 hours, 4.7 (130 ratings)

What you'll learn

Once we have sequenced genomes in the previous course, we would like to compare them to determine how species have evolved and what makes them different.

In the first half of the course, we will compare two short biological sequences, such as genes (i.e., short sequences of DNA) or proteins. We will encounter a powerful algorithmic tool called dynamic programming that will help us determine the number of mutations that have separated the two genes/proteins. In the second half of the course, we will "zoom out" to compare entire genomes, where we see large scale mutations called genome rearrangements, seismic events that have heaved around large blocks of DNA over millions of years of evolution. Looking at the human and mouse genomes, we will ask ourselves: just as earthquakes are much more likely to occur along fault lines, are there locations in our genome that are "fragile" and more susceptible to breakage during genome rearrangements? We will see how combinatorial algorithms will help us answer this question. Finally, you will learn how to apply popular bioinformatics software tools, including BLAST, to solve problems in sequence alignment.
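As a rough illustration of dynamic programming on two sequences, the sketch below computes the edit distance: the smallest number of single-symbol substitutions, insertions, and deletions that turns one string into the other. Using unit costs for every operation is an assumption made for this example; practical alignment tools typically use scoring matrices and gap penalties instead.

```python
def edit_distance(a: str, b: str) -> int:
    """Minimum number of substitutions, insertions, and deletions turning a into b."""
    # prev[j] holds the distance between the prefix of a seen so far and b[:j].
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        curr = [i]  # turning a[:i] into the empty string takes i deletions
        for j, cb in enumerate(b, 1):
            substitution = prev[j - 1] + (0 if ca == cb else 1)
            deletion = prev[j] + 1
            insertion = curr[j - 1] + 1
            curr.append(min(substitution, deletion, insertion))
        prev = curr
    return prev[-1]


if __name__ == "__main__":
    print(edit_distance("kitten", "sitting"))
```

Keeping only two rows of the table at a time reduces memory to the length of one sequence, at the cost of not being able to trace back the alignment itself.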

Molecular Evolution (Bioinformatics IV)

Course 4, 18 hours, 4.5 (77 ratings)

What you'll learn

In the previous course in the Specialization, we learned how to compare genes, proteins, and genomes. One way we can use these methods is in order to construct a "Tree of Life" showing how a large collection of related organisms have evolved over time.

In the first half of the course, we will discuss approaches for evolutionary tree construction that have been the subject of some of the most cited scientific papers of all time, and show how they can resolve quandaries from finding the origin of a deadly virus to locating the birthplace of modern humans. In the second half of the course, we will shift gears and examine the old claim that birds evolved from dinosaurs. How can we prove this? In particular, we will examine a result that claimed that peptides harvested from a T. rex fossil closely matched peptides found in chickens. We will use methods from computational proteomics to ask how we could assess whether this result is valid or due to some form of contamination. Finally, you will learn how to apply popular bioinformatics software tools to reconstruct an evolutionary tree of ebolaviruses and identify the source of the recent Ebola epidemic that made global headlines.

Genomic Data Science and Clustering (Bioinformatics V)

Course 5, 9 hours, 4.2 (90 ratings)

What you'll learn

How do we infer which genes orchestrate various processes in the cell? How did humans migrate out of Africa and spread around the world? In this class, we will see that these two seemingly different questions can be addressed using similar algorithmic and machine learning techniques arising from the general problem of dividing data points into distinct clusters.

In the first half of the course, we will introduce algorithms for clustering a group of objects into a collection of clusters based on their similarity, a classic problem in data science, and see how these algorithms can be applied to gene expression data. In the second half of the course, we will introduce another classic tool in data science called principal components analysis that can be used to preprocess multidimensional data before clustering in an effort to greatly reduce the number of dimensions without losing much of the "signal" in the data. Finally, you will learn how to apply popular bioinformatics software tools to solve a real problem in clustering.
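The description does not say which clustering algorithms are covered. As one well-known example, the sketch below implements Lloyd's algorithm for k-means: assign each point to its nearest center, move each center to the mean of its points, and repeat until nothing changes. The function names and the choice to pass initial centers explicitly (so the result is deterministic) are assumptions made for this illustration.

```python
def squared_distance(p, q):
    """Squared Euclidean distance between two points of equal dimension."""
    return sum((x - y) ** 2 for x, y in zip(p, q))


def kmeans(points, initial_centers, max_iterations=100):
    """Lloyd's algorithm. Returns (centers, clusters) after convergence or max_iterations."""
    centers = [tuple(c) for c in initial_centers]
    clusters = [[] for _ in centers]
    for _ in range(max_iterations):
        # Assignment step: put each point in the cluster of its nearest center.
        clusters = [[] for _ in centers]
        for p in points:
            nearest = min(range(len(centers)), key=lambda i: squared_distance(p, centers[i]))
            clusters[nearest].append(p)
        # Update step: move each center to the mean of its cluster.
        # An empty cluster keeps its previous center.
        new_centers = [
            tuple(sum(coords) / len(members) for coords in zip(*members)) if members else centers[i]
            for i, members in enumerate(clusters)
        ]
        if new_centers == centers:
            break
        centers = new_centers
    return centers, clusters


if __name__ == "__main__":
    data = [(0, 0), (0, 2), (10, 0), (10, 2)]
    print(kmeans(data, [(0, 0), (10, 0)]))
```

For gene expression data, each point would be a vector of expression levels for one gene, and the outcome can depend strongly on the initial centers.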

Skills you'll gain

Category: Bioinformatics

Category: Bioinformatics Algorithms

Category: Algorithms

Category: Python Programming

Finding Mutations in DNA and Proteins (Bioinformatics VI)

Course 6, 23 hours, 4.7 (59 ratings)

What you'll learn

In previous courses in the Specialization, we have discussed how to sequence and compare genomes. This course will cover advanced topics in finding mutations lurking within DNA and proteins.

In the first half of the course, we would like to ask how an individual's genome differs from the "reference genome" of the species. Our goal is to take small fragments of DNA from the individual and "map" them to the reference genome. We will see that the combinatorial pattern matching algorithms solving this problem are elegant and extremely efficient, requiring a surprisingly small amount of runtime and memory. In the second half of the course, we will learn how to identify the function of a protein even if it has been bombarded by so many mutations compared to similar proteins with known functions that it has become barely recognizable. This is the case, for example, in HIV studies, since the virus often mutates so quickly that researchers can struggle to study it. The approach we will use is based on a powerful machine learning tool called a hidden Markov model. Finally, you will learn how to apply popular bioinformatics software tools applying hidden Markov models to compare a protein against a related family of proteins.
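To give a feel for read mapping, the sketch below indexes a reference string with a suffix array and then uses binary search to find every exact occurrence of a short read. The description does not name the data structure the course uses, so the suffix array is an assumption, as are the function names. The naive construction here is fine for a toy example but far too slow and memory-hungry for a real genome.

```python
def suffix_array(text: str) -> list[int]:
    """Start positions of all suffixes of text, in lexicographic order (naive construction)."""
    return sorted(range(len(text)), key=lambda i: text[i:])


def find_occurrences(text: str, sa: list[int], pattern: str) -> list[int]:
    """All start positions where pattern occurs in text, found by binary search on sa."""
    m = len(pattern)
    # Lower bound: first suffix whose length-m prefix is >= pattern.
    lo, hi = 0, len(sa)
    while lo < hi:
        mid = (lo + hi) // 2
        if text[sa[mid]:sa[mid] + m] < pattern:
            lo = mid + 1
        else:
            hi = mid
    first = lo
    # Upper bound: first suffix whose length-m prefix is > pattern.
    hi = len(sa)
    while lo < hi:
        mid = (lo + hi) // 2
        if text[sa[mid]:sa[mid] + m] <= pattern:
            lo = mid + 1
        else:
            hi = mid
    return sorted(sa[first:lo])


if __name__ == "__main__":
    reference = "panamabananas"
    index = suffix_array(reference)
    print(find_occurrences(reference, index, "ana"))
```

Once the index is built, each lookup takes a number of string comparisons logarithmic in the reference length, which is what makes mapping many reads against one reference practical.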

Bioinformatics Capstone: Big Data in Biology

Course 7, 12 hours, 3.8 (24 ratings)

What you'll learn

In this course, you will learn how to use the BaseSpace cloud platform developed by Illumina (our industry partner) to apply several standard bioinformatics software approaches to real biological data.

In particular, in a series of Application Challenges, you will see how genome assembly can be used to track the source of a food poisoning outbreak and how RNA-Sequencing can help us analyze gene expression data on the tissue level, and you will compare the pros and cons of whole genome vs. whole exome sequencing for finding potentially harmful mutations in a human sample. Plus, hacker track students will have the option to build their own genome assembler and apply it to real data!

Instructors

Pavel Pevzner

University of California San Diego

16 Courses, 784,882 learners

Phillip Compeau

University of California San Diego

8 Courses, 265,940 learners

Nikolay Vyahhi

University of California San Diego

1 Course, 20,744 learners

Offered by

University of California San Diego

Frequently asked questions

Are there any suggested readings for the Specialization?

The print companion accompanying the Specialization is Bioinformatics Algorithms: An Active Learning Approach (Vols. 1 and 2).

How long does it take to complete the Specialization?

Time to completion can vary based on your schedule, but most learners are able to complete the Specialization in 4-6 months.

What background knowledge is necessary?

We require only a basic knowledge of high school-level biology and the ability to think technically.

Yes! To get started, click the course card that interests you and enroll. You can enroll and complete the course to earn a shareable certificate, or you can audit it to view the course materials for free. When you subscribe to a course that is part of a Specialization, you’re automatically subscribed to the full Specialization. Visit your learner dashboard to track your progress.