An introduction to the statistics behind the most popular genomic data science projects. This is the sixth course in the Genomic Big Data Science Specialization from Johns Hopkins University.

Loading...

来自 Johns Hopkins University 的课程

Statistics for Genomic Data Science

133 个评分

An introduction to the statistics behind the most popular genomic data science projects. This is the sixth course in the Genomic Big Data Science Specialization from Johns Hopkins University.

从本节课中

Module 4

In this week we will cover a lot of the general pipelines people use to analyze specific data types like RNA-seq, GWAS, ChIP-Seq, and DNA Methylation studies.

- Jeff Leek, PhDAssociate Professor, Biostatistics

Bloomberg School of Public Health

Welcome to the end of the class.

I hope that you've really enjoyed learning about statistics for genomic data science.

As you know, statistics is one of the main branch of genomic data science.

But sometimes its the one that gets left out a little bit, statistics is often

the one that's the least one talked about when you talk about genomic data science.

Computer science and biology often get a lot of the attention.

You can see that in, this is no joke,

a published paper where they put the statement,

(insert statistical method here), right there in the abstract of the paper.

Sort of gives you an idea that sometimes statistics isn't always on the top of

people's minds.

But it's really important and it's a huge component of the process.

In fact, the p-value that we talked about in the class is the most widely used

statistic in all of genomics, and all of science really.

If every time that the guy who wrote the p-value paper got a citation he

would have over 3 million citations, which would make it

the most highly cited paper in the history of any scientific discipline.

So statistics is incredibly important and

I hope that you've learned enough about it to get you sort of started and get you

excited on your journey towards being a statistical genomic data scientist.

So there's a lot more to learn that what we've covered in these four weeks.

Obviously we had to move pretty quickly to get through all the material in that time.

But here's a couple of other things that you could go check out.

There's an Advanced Statistics for

Life Sciences course taught by one of my buddies, Rafa Irizarry.

And you should go check that out if you have liked what you learned here and

want to learn a little bit more.

You could also take the other classes in the genomic data science specialization,

or the classes from the Johns Hopkins data science specialization which focus

more on basic statistics.

Which you might want to learn about more if you're kind of into the stuff that

you've learned in this class.

So finally, I'd just like to thank you for sticking around.

And I hope you've enjoyed the class, and good luck with your genomics career.