Data analysis has replaced data acquisition as the bottleneck to evidence-based decision making --- we are drowning in it. Extracting knowledge from large, heterogeneous, and noisy datasets requires not only powerful computing resources, but the programming abstractions to use them effectively. The abstractions that emerged in the last decade blend ideas from parallel databases, distributed systems, and programming languages to create a new class of scalable data analytics platforms that form the foundation for data science at realistic scales.
Founded in 1861, the University of Washington is one of the oldest state-supported institutions of higher education on the West Coast and is one of the preeminent research universities in the world.
- 5 stars
- 4 stars
- 3 stars
- 2 stars
- 1 star
Great course that strikes a balance between teaching general principles and concepts, and providing hands-on technical skills and practice. The lessons are well designed and clearly conveyed.
I like the breadth of coverage of this class. Each of the exercise is a gem in that I get to learn something new also. I would highly recommend this even to experience practitioner also.
Good! I like the final (optional) project on running on a large dataset through EC2. The lectures aren't as polished and compact as they could be but certainly a very valuable course.
Well structured and nice overview of data manipulation. But the assignments should really be updated in order to use python 3.x instead of 2.7, which is not maintained anymore...
关于 大规模数据科学 专项课程
Learn scalable data management, evaluate big data technologies, and design effective visualizations.