Learn From the Industry’s Best

Join 276,203 registered members and put your career on the right track. Sign up now

Who We Are

We are a bunch of new and experienced Hadoop, Spark, Big Data, Analytics, and Data Science enthusiasts who want to learn, contribute, and network with others who share our interests. Our community includes open source enthusiasts, academia, professionals, and companies including IBM, MetiStream, and DataCamp.

Learn more »

Our Mission

Make Big Data and Data Science education available to everyone, and start a journey of discovery to change the world! Big data technologies such as Hadoop and Spark, paired with cloud computing, let even students explore data that can lead to important discoveries in the health industry, the environment, and any other area you can think of!

Big Data use cases »

Our Courses

Our courses are mostly free, developed by experienced professionals and teachers, and nicely structured. Most courses include hands-on labs that you can perform on the cloud, on Docker or VMware images, or by locally installing the required software. Pass the course test to print your certificate of completion and to receive an industry-recognized badge!

Course catalog »

What is Hadoop?

Akmal Chaudhri on Hadoop and Big Data



According to our members…

  • The course is a great opportunity to get some quick practical experience with Hadoop and its related subprojects. Also, the provided VM image could be used for future projects and for deepening your understanding of the Hadoop framework. Waiting for Hadoop Fundamentals II to be ready :)

  • Dear all, the course is excellent because it saves the time of reading big books to learn Hadoop. I prefer agile practice: try to achieve small results ASAP. I didn’t know anything about Hadoop two months ago. But these two months were enough for me to create a 7-node Hadoop cluster which computes recommendations for our site every day…

  • The lessons in Hadoop Fundamentals I are well organized. The lessons start with an overview of the topic being covered before going into the finer details. It would, however, have been helpful if the lessons could cover applications of each of the topics, for example, the scenarios where usage of Pig/Hive/Jaql is well suited. It would also be useful if the MapReduce lab sessions could go over the code for the wordcount program in detail before running the program. It would also be useful if the lab transcripts could give examples of what parts need to be modified to execute certain sections of the lab tutorial in the cloud. Overall, the course is well thought out and gives all the details needed to learn the fundamentals of Hadoop.