Chevron Left
返回到 Big Data Integration and Processing

Big Data Integration and Processing, 加州大学圣地亚哥分校

4.4
1,333 个评分
287 个审阅

课程信息

At the end of the course, you will be able to: *Retrieve data from example database and big data management systems *Describe the connections between data management operations and the big data processing patterns needed to utilize them in large-scale analytical applications *Identify when a big data problem needs data integration *Execute simple big data integration and processing on Hadoop and Spark platforms This course is for those new to data science. Completion of Intro to Big Data is recommended. No prior programming experience is needed, although the ability to install applications and utilize a virtual machine is necessary to complete the hands-on assignments. Refer to the specialization technical requirements for complete hardware and software specifications. Hardware Requirements: (A) Quad Core Processor (VT-x or AMD-V support recommended), 64-bit; (B) 8 GB RAM; (C) 20 GB disk free. How to find your hardware information: (Windows): Open System by clicking the Start button, right-clicking Computer, and then clicking Properties; (Mac): Open Overview by clicking on the Apple menu and clicking “About This Mac.” Most computers with 8 GB RAM purchased in the last 3 years will meet the minimum requirements.You will need a high speed internet connection because you will be downloading files up to 4 Gb in size. Software Requirements: This course relies on several open-source software tools, including Apache Hadoop. All required software can be downloaded and installed free of charge (except for data charges from your internet provider). Software requirements include: Windows 7+, Mac OS X 10.10+, Ubuntu 14.04+ or CentOS 6+ VirtualBox 5+....

热门审阅

创建者 AA

Mar 06, 2018

It was a good course, it could have been better if some examples of Spark were also provided in other Languages like Java, people without having background of python may find it difficult.

创建者 DC

Oct 08, 2017

Very Interactive course. Theatrical classes are nicely drafted. Hands On exercises are interesting and some are challenging too. Overall very interesting course. Happy learning

筛选依据:

275 个审阅

创建者 Chetan Hirapara

Mar 12, 2019

This is awesome course for beginner who didn't have any knowledge of bigdata

创建者 EL MOUZARI

Mar 10, 2019

Les fonctions ne fonctionnent pas sur Jupyter. Il faut revoir ces TP !! j'ai perdu beaucoup de temps à chercher sur internet les bonnes fonctions.

创建者 Srishti Ramchandani

Mar 05, 2019

Great experience towards learning this course

创建者 Rafael Tardelli Pacheco dos Santos

Mar 04, 2019

The last quiz was very hard to complete. I didn't found enough content to solve que questions in the course material.

创建者 Markus.schwarz.de@gmail.com

Mar 03, 2019

With deep regrets I feel obliged to share a negative rating on the course. While the course material/video lectures are average to good (no rocket science but well done introduction into the subjects), the hands-on exercises and particularly the technical environment, i.e. Cloudera VM is entirely messed-up: - setup scripts are not working/ are outdated (e.g., anaconda requires no-check-certificate); user permissions are all set wrong and need to be corrected; firefox outdated with update function not working; countless error around spark context (SC) variables.... and so on... For a course that is so prominently promoted on the platform the least expectation is that the provided environment works and that students don´t need to spend hours on google to figure out how to debug the cloudera image.... Here, imo, a much better job can be done!

创建者 Kajal Nathani

Mar 03, 2019

Great experience towards this course

创建者 Ahmad Mostafa Elsayed Rezk

Mar 01, 2019

The most interesting part in this course dealing with spark and the final quiz is really amazing

创建者 Apurva TR

Feb 21, 2019

The quiz was a bit difficult since there was no much guidance on how to sort in descending order and how to find the total times a country was mentioned in a single tweet.

创建者 Ievgenii Martynenko

Feb 07, 2019

Final assignment is not working properly. Whatever you choose - you are right.

There is no enough information how to dump tweets into file and how to use that in assignment.

This course doesn't worth any cent and should be either reworked or excluded.

创建者 AJIT MENTA

Feb 05, 2019

Great Course! Provides a good exposure to the tools and utilities. The database part has been done well. The part with the Spark needs some more info on how to use the data frames.