返回到 Introduction to Probability and Data with R

星

5,199 个评分

This course introduces you to sampling and exploring data, as well as basic probability theory and Bayes' rule. You will examine various types of sampling methods, and discuss how such methods can impact the scope of inference. A variety of exploratory data analysis techniques will be covered, including numeric summary statistics and basic data visualization. You will be guided through installing and using R and RStudio (free statistical software), and will use this software for lab exercises and a final project. The concepts and techniques in this course will serve as building blocks for the inference and modeling courses in the Specialization....

AM

Feb 7, 2021

After trying several courses to get me started with R programming, this one came to the rescue and had all the info I wanted. It also provides a great way to practice through labs and a final project!

AA

Feb 24, 2021

I always wanted to learn statistics from scratch, but I never had a good university teacher. Here I found a good teacher and also the opportunity to learn whenever I want ( and skipping parts I knew!)

筛选依据：

创建者 David K

•Mar 7, 2019

I liked:

+ The detailed Learning Objectives.

+ Good examples in the lectures.

+ The quizzes are great for testing and refreshing one's memory.

+ Overall the course seems very well-focused on the most important foundational items and hammers them in.

I'd appreciate improvement in:

+ Providing more clarity on how much R is expected to be learned for the final project, or lowering the level of R skill expected for the final project.

+ Providing a more relaxed time estimate on the final project. I spent 10+ hours on it, in addition to ~10 hours learning more R on DataCamp.

+ Getting feedback on my work from a professional, not just from fellow students. (I would be willing to pay for that.)

创建者 Yash G

•Feb 22, 2020

Good course for statistics but not for R

创建者 katie v

•Apr 11, 2019

I wish they went over how to use R for beginners in the beginning of the course. I feel like the final project was stressful and piecemealed together from google searches on the web. I think they should give us a list of all the codes in the beginning of the course that we will use throughout the entire course.

创建者 Rosalie I

•Apr 19, 2022

The video lectures were an excellent introduction/review to basic statistical concepts. I'm grateful for what I learned. However, if your goal (like mine) is to gain introductory skills with R, I *do not* recommend this course. The labs in Weeks 1-4 were very difficult and there was almost no guidance on how to complete; I struggled through them by finding solutions on StackExchange, etc. But the worst was the final project in Week 5; the material in Weeks 1-4 was absolutely inadequate to prepare the novice user of R. Furthermore, the assignment is "peer graded", meaning you will not receive instructor feedback--fellow novices will attempt to provide feedback on your work. It is the blind leading the blind. Absolutely not worth the money.

创建者 Alexander C

•Jul 16, 2020

Before you start this specialization you should check the start date for the final capstone course and make sure that it aligns with the time you anticipate wanting to take it (add a few days for the annoying bottleneck of getting peer reviews back for your assignment in course 4 since you will not be allowed to enroll in course 5 until all of the first four are completed).

When I made it to the capstone course there was a full month until the start date so all of the assignments are locked until then. I am far from the only one this happened to. Others on the forum for the course are complaining about two-month waits. This seems like a naked cash grab since the specialization is a monthly subscription. Honestly, it would be better if they did just surprise you with an extra $50 charge to do the capstone because at least you would have the option of paying and getting started on finishing up the specialization.

Otherwise, the specialization isn't really maintained any more. For the first three courses it doesn't matter much because it's all pretty easy (my rating for the first three would actually be two stars as stand-alone courses probably). In the third week of the fourth course the instructor is swapped out for a new instructor who is utterly incomprehensible, using terms and ideas that haven't actually been introduced. The accompanying text is similarly incomprehensible. I say this as a math person who has subsequently gone to other resources to learn this specific material. The problem is not that the material just gets harder, but that the quality of instruction drops through the floor. So beware of that.

创建者 Karen B

•Dec 13, 2021

This course (and specialization) was promoted as suitable for someone with no programming background. The fact is that the way it is taught, it is assumed that the learner has basic programming experience. I enrolled in the course and would now like to un-enroll since I have no programming background, but the un-enroll option is not available to me.

创建者 Julio I C T

•Jun 21, 2022

This is a statistic course, not a R course.

创建者 Richard E

•Apr 30, 2020

Lectures by Prof Çetinkaya-Rundel: Excellent content for the time alotted and well-organized. The URL to the "Distribution Calculator" was consistently erroneous in the slides (=: You can find it here: https://gallery.shinyapps.io/dist_calc/. The source code for that web calculator is part of a github R-language project and can be found here: https://github.com/ShinyEd/intro-stats/tree/master/dist_calc . One of these days, I am going to write up an issue on her use of "set.seed(12345)" and see if she takes it seriously! Her code is not bad for a statistician. (=:

Very simple Math. Depending on your background, this could be dissapointing or a relief. Too light for me.

A lot of R-language, RStudio, and R-markup. Expect to dive in. Note that RStudio is still evolving so you can expect a surprise or two along the way (E.g. crashes, hangs). The web version of RStudio has its own issues. I would stick to the desktop version since you have more control of your own desktop. Windows users can gleefully reboot when RStudio hangs - is it RStudio or Windoze?

"Let them eat cake" should not be missed: https://speakerdeck.com/minecr/let-them-eat-cake-first-0a3bbf75-f6f1-42d5-8d2f-ac2ff741611f .

Warning to physical science students: This is not in any way a criticism of the course but there are no examples from Astronomy, Chemistry, Geology, or Physics. Only health and social science stuff. I would have liked to see course content applied to something like data acquired from a telescope such as TESS or Hubble but that's me.

The forum for the last week is clogged with requests to have final projects reviewed, detracting from the intended purpose. I do not agree with having lower division students review each other's work: student maturity level to assess and give feedback is lacking. But, I knew what I was letting myself in for.

Another warning to physical science students: The data for the final project is CDC health survey responses for 2013. Roughly, 0.5 million rows and 330 columns. Lots of non-response values in the data. Not my cup of tea.

创建者 Gabriel H B

•Feb 17, 2018

Very impressive course, certainly among the top five I've ever taken online. Course design is basically flawless. The lectures are clear, concise and based on interesting examples. The course also comes with a textbook for which you can pay what you want, even a price of zero. The textbook is also very well written and contains plenty of examples to illustrate concepts that are introduced and lots of practice problems too. This is crucial for developing a good understanding of the material taught in a course like this.

I do have one warning about this course, however. The learning curve for the programming aspect of it is very steep. It says no previous knowledge of R is required, but I don't think I would've been able to finish my final project if I hadn't already taken about 15 other courses (mostly on DataCamp) that focused on R programming. At the very least, you should take the free Introduction to R course on DataCamp before you start any of the labs for this course. Ideally, you should also get a DataCamp membership and work through about 60% of the courses on the Data Scientist with R track before even starting this course.

I realize that sounds like a lot to ask, and that I am contradicting the course description that was written by the instructors, but this course makes use of a great deal of the knowledge that is taught on that track, especially the dplyr and ggplot2 packages, from the first module. Dplyr in particular is a wonderful R package that does things I've always dreamed of while struggling to do basic things in Excel, but it takes a lot of practice to get the hang of it you will get much more out of this course if you already have some experience with dplyr (and ggplot2) before starting it.

创建者 Rui Z

•May 5, 2019

I've audited several similar courses and found this one to be the best.

First of all, Dr. Mine is just so great at explaining things. There is no doubt that she's one of the best in her area, but she's also born to teach and communicate. She combines all kinds of way to make a concept vivid and clear. I've audited couple other courses, and I took relevant courses back in college a while ago, Dr. Mine is the best out of all the professors I've met at explaining things. This is not in this course but next, but just as an example of how clear she is when explaining standard deviation of sample means. She takes time to combine a specific example, visualization, and simulation, to really make all the points clear. You could try to listen to her on that part in the next course week 1.

Second, the R practice in every week is very beneficial and helpful. The cases used in those practices are fun to work with too. The hands-on experience on R and data exploring is valuable.

Overall, this is a very helpful course for me to review probability that I took a while ago in college and almost forgot, and for me to learn R and get hands-on practice.

创建者 Jenny Z

•Aug 14, 2016

This course is definitly suitable for learners who don't have any related background. Dr. Mine Cetinkaya-Rundel has an amiable speaking style and always highlighted the key points in teaching videos, which helped me understand other contents in the textbook. Besides, the time and assignment arrangement of this course are also very reasonable. The only thing I could complain about is a system bug of the Amazon AWS and the grading system. My final project file went blank after the system told me the file uploading is sucessfully completed on August 3, and I got three 0 point from three peers since they only saw a blank file, of which I had no idea, and all I could see is "Grading in progress" on the system. Until the final grading day is over, which is August 12, the system finally reminded me of this horrible thing. I came to mentors, disscussion forum as well as the help center, reuploaded my file, desperately tried to find peers who can still spare some time to review my file, and it is finally fixed today. I got my certificate in the end, but the grading process is really frustrating. Hope this bug won't happen to anyone ever ag

创建者 Hao C

•Nov 6, 2019

Teaching: I really like the clear and concise teaching style of lecturer and the wide range of simple real-life example used to explain the course content. I’m a social science student, given I’ve studied quantitative research methods before, this course is easy intro to and good refresher of data and probability theory. This course really gives me some confidence to continue to study probability theory, after finishing this specialization.

Textbook: The textbook used in this course is a good supplementary material, although it is not necessary to read the textbook. Course videos have already explained everything that we need to know at intro level. The textbook also covers some extra optional topics that are worth reading.

Course Structure: The course structure is well organized with clear focus in each week.

Assessment: The assessment of quiz in each week is relatively easy. The exploratory data analysis required in peer-reviewed assignment is relatively difficult for beginners. However, the course mentor has drafted an easy-to-follow guide in the discussion section which is really helpful for finishing this assignment.

创建者 Roel M

•Oct 30, 2020

I liked the course very much, as it really did a good job to refresh my previously learned concepts in basic statistics and probability. The use of RMarkdown was not that easy, but somehow manageable with trial and error, and the knitting to html was also a little cumbersom.

My main problem was that the week 5 assignment was not very clear and demanded more from the inspiration and creativity of the student in coming up with original research questions than on how to handle them and put them in R. As a consequence, the line of thinking risked to be reversed: first you realize what you can handle in 'R' and then, you create a question that can be handled... Perhaps, I would have preferred a first research question, formulated by the teachers, where only the concepts and programming already taught are to be used, before letting us free in the wild. Another consequence of the total open question is that the more fancy assignments that I could review, were those that used more R packages/commands than we were supposed to know already... But still, overall, it was a good course, sometimes challenging, which is good!

创建者 Erfan S

•Jul 1, 2022

The instructor (Mine Çetinkaya-Rundel) is absolutely incredible & clearly extremely thoughtful of what the students know, don't know, and should know. To get a stronger grip on statistics I took this course alongside Stanford's "Introduction to Statistics" as well as University of Michigan's "Understanding and Visualizing Data with Python". Doing all courses at the same time is definitely helpful, but by far this course had the most clear, informative, and applicable information.

1) The use of diagrams to explain concepts was a great way to visualize data.

2) The examples used are all relevant and give dynamic interaction between you and statistics.

3) The assesments were very well made and at the appropiate level when compared ot content.

One thing I will note is that the R introductions aren't extremely beginner friendly, and I still managed because I've been coding with R for the past 8 months. BUT, it's still very doable as long as you can do a crash course on R on a website like codeAcademy beforehand.

Thanks for the incredible course!

创建者 Anna D

•Apr 3, 2017

Best statistics course I've ever taken. So many Aha! moments I can't count them.

I have struggled for years to understand and get the hang of statistics, at uni, with online courses and at work. With this course (and the following courses) I think I have finally gained a DEEPER understanding of some of the basic but very important concepts of statistics. Lots of detailed examples and no overly complicated maths gibberish (although still mathematically sound!).

The R programming bits run in parallel to the statistics lectures and can be followed (necessary for a certificate) or can be ignored (if you only want to grasp the concepts), but are overall very easy to understand and follow. There is only little background to R as a programming language and the different types of data, lists, matrices etc. To me that's a good thing, as it allows you to use R right away (which in turn makes me more motivated to go back and learn more about R).

I whole-heartedly recommend this to anyone who wants to understand and use statistics!

创建者 Raw N

•Apr 23, 2017

Very well put-together course.

I like that the course has in-video quizzes as well as practice exercises to help prepare you for the weekly quizzes. The labs for the course are also very helpful.

The textbook that accompanies the course is freely available in pdf format online and the suggested exercises are a great complement to the rest of the course materials.

For those unfamiliar with R, the project is a bit of a leap from the rest of the contents in the course. To get around that, I'd suggest to both use the discussion forum (posts by mentor David Hood are particularly helpful) and to take both the R programming course and the Exploratory Data Analysis course from the Johns Hopkins data science sequence. Those 2 should together be doable in 5-6 weeks and at that point you should have sufficient background to where doing the project in this course (and those in follow-up courses in this specialization) should not be a problem.

创建者 Tural K

•Nov 25, 2020

Before starting the course, I was not expecting it to be ghat much effective. The statistics classes were informative. However, the main reason that I took this class was because of R studio and implementation of statistics via R studio. First 2 weeks were perfect, the only lacking part was the 3rd week (I think R assignment could be better. Obviously, we did not need much effort to complete Week 3 R assignment which I did not like). Week 5 project, on the other hand, was just on point. We could apply everything we learnt throughout the course; especially, reviewers were strict and gave constructive feedback which I liked a liked a lot. One more thing, some people might want to start learning R studio via video tutorials, which this course does not have. However, R markdown was very good method for learning purposes. Overall, very solid one. Thanks!

创建者 Vladimir V

•Feb 10, 2018

I think this is a very good entry level course for those who are interested in entering the realm of statistics.

The learning objectives of each week are well defined and the practice and weekly tests are based on those learning objectives. The videos explain very well each objective in a very convenient, easy to comprehend and interesting manner. For students who want more 'after class' material, the course offers a very nice book, which I personally used and helped me a lot during the course. The book also offers practice tasks at the end of each chapter.

The course project: Personally I think the course project in the 5th week of the course is interesting in a way that you have actual data to work with and use almost everything you have learned during the course.

Overall I think this is a very good course!

创建者 Natalia S

•Jun 15, 2016

This course was excellent, the teaching material top-notch and with excellent pedagogy. It's amazing that the course authors offer a statistics textbook almost exactly covering the course content for free. The idea to combine R and statistics is right on the money too, thanks to this one can learn 2 skills at the same time, with statistical analysis letting you practice coding in R and R helping you visualise your statistics. The lectures are divided into small, easy to absorb chunks and the teacher does an excellent job explaining the material, giving very good examples and analogies to help the students understand concepts. The exercises and assignments are fun to complete, and the course offers a flexibility in how much time you spend on it per week, e.g. there are non-mandatory exercises to do.

创建者 Tamir L

•Jul 25, 2016

This is a brilliant course that makes statistics and probability as approachable, engaging and clear as humanely possible.

Prof. Mine Cetinkaya-Rundel explains every subject very clearly, and has included some very effective quizzes and lab exercises.

I first encountered R markdown files in this course and have used them constantly ever since.

My only tiny point of criticism is that the non-graded exercise quizzes are way easier than the real quizzes, and do not really prepare you at all to the more complex questions in the actual quizzes. It's a petty and unimportant kind of criticism in an otherwise wonderful course.

If everyone taught stats like Prof. Cetinkaya-Rundel, this important subject would have been a whole lot better understood and utilized globally.

创建者 Yağız Y

•May 2, 2022

Mrs. Mine is the greatest teacher that I have ever seen. Her real life examples are highly benefical to grasp the each detail of the subject. She is not a professor that talks only about theoretical background and not mentions any practical case. She gives the theoretical background and more importantly she solves a plenty of real life examples and after that, it is not possible for you to not understand the subject. Thank you so much for giving this course as I supposed that statistics is only pain, but now I know the concept and I can solve problems easily.

There are rmd files in the course that teache how to use R-studio and I think they are enough to understand how to use the program. Thank you for everything.

创建者 MARIO J G M

•Mar 14, 2018

Excelente. Es un buen curso introductorio. Hace particular énfasis en las distribuciones normal y binomial. Da una pasada introductoria a R que, entre otras cosas, no es enseñado durante las clases sino que a través de los talleres que se realizan al final de cada capítulo. Son explicados con solvencia conceptos como correlación, causalidad y generalización.

Para quienes no saben, desconocen o no han tenido contacto con markdown valdría la pena ver un par de vídeos en youtube. Yo manejaba algo de R, pero nunca había tenido contacto con markdown y me pareció una herramienta muy útil, y aunque no es explicada en las lecciones o en los talleres, el proyecto de final del curso debe ser hecho en markdown.

创建者 Matthew L

•Aug 9, 2016

Professor Cetinkaya-Rundel's explanations are clear and she gives many examples, the quizzes are fair and I think it is an excellent idea to have a lab in R to get students familiar with that tool.

I recommend that students read the book chapters and do the practice problems there, it's very helpful.

My one criticism is that the amount of R taught in the course is not really enough to do a good job on the capstone project, because the data in the given database is formatted very differently. I think maybe the course staff could reformat the database to make it more user-friendly for beginning R users, but in the meantime you may want to study a little R on the side at, say, DataCamp.

创建者 Tascha S

•Apr 29, 2020

Very, very useful course. Exactly what I was looking for. You have to do some research beyond what's covered in the labs to really get acquainted with R, but there is so much available online that it wasn't an issue for me. The open-intro textbook is fantastic, and the course lectures help summarize the textbook info in a rich way that adds to the textbook content. The suggested problems, quizzes, labs and final project were all fantastic for reinforcing the content learned and actually putting it in practice (as most of us learn best by doing). All in all, I'm extremely pleased, and I'm moving onto the next course in the Statistics with R progression with much excitement!

创建者 Aaradhya G

•Nov 22, 2019

Absolutely amazing! It is clear that the professor, Ms. Mine Çetinkaya-Rundel is passionate about the subject and knows it inside out. The practical example-based approach to learning is appreciated, since a lot of statistics courses don't give learners a realistic setting to think about their knowledge, leaving them with the infamous 'how will this help me in real life?' question. The book, OpenIntro is also very helpful in this regard.

The R course has been introduced nicely too. The difficulty curve might take time to get used to, but the packages introduced and the codes used make sense, so it should not take too much time.

Wholeheartedly recommended!