Help

Learn From the Industry’s Best

Join 158427 registered members and put your career on the right track Sign up now

Who We Are

A bunch of new and experienced Hadoop, Big Data and information management enthusiasts who want to learn, contribute, and network with others with similar interests. Our community includes open source enthusiasts, the academia, professionals, and companies including IBM, Rightscale, and Jaspersoft.

Learn more »

Our Mission

Make Big Data education available to everyone, and start a journey of discovery to change the world! Big data technologies such as Hadoop and Streams paired with Cloud Computing can let even students explore data that can lead to important discoveries in the health industry, the environment, and any other area you can think of!

Big Data use cases »

Our Courses

They are mostly free, developed by experienced professionals and teachers, and nicely structured. Most courses include hands-on labs that you can perform on the Cloud, on VMWare images, or by locally installing the required software. Pass the course test to print your certificate of completion.

Course catalog »


What is Hadoop?

Akmal Chaudhri on Hadoop and Big Data

Akmal Chaudhri on Hadoop and Big Data

Sign up

According to our members…

  • It is the perfect web site to get into new technologies and have a hint of the main and newest projects around big data, open source… The material is really well organized and easy to understand and go through. The labs provide hands-on to the technologies guide you step bu step in the different tasks.

    Joao
  • The course is very interesting, because it explains the components of this new system…The first part was very well detailed, the installation part for the different environments. It was very easy to create the machines in RightScale and register in AWS, or download the application locally… Good work, I will be waiting for the next tutorials.

    Andreas
  • The lessons in Hadoop Fundamentals I are well organized. The lessons start with an overview of the topic being covered, before going into the finer details. It would however have been helpful if the lessons could cover applications of ¬†each of the topics. For example, the scenarios where usage of pig/hive/jaql is well suited. It would also be useful if mapreduce lab sessions could go over the code for wordcount program in detail before running ¬†the program. It would also be useful if the lab transcripts could give examples of what parts need to be modified to execute certain sections in the ¬†lab tutorial, in the cloud. Overall, the course is well thought out and gives all the details needed to learn the fundamentals of Hadoop.

    Madhavi