Big Data University

Skip courses

Courses

Collapse all
Expand all

Skip available courses

Available courses

  • BD305EN (FREE course): Learn the basics of IBM BLU Acceleration for Cloud for Administrators. The materials used in this course are free. Register to the Open Beta to explore this technology.

    Audience: Data Scientists, Business Analysts

    Available in: English

    Self enrollment
  • BD304EN (FREE course): Learn the basics of IBM BLU Acceleration for Cloud specific to Data Scientists. The materials used in this course are free. Register to the Open Beta to explore this technology.

    Audience: Data Scientists

    Available in: English

    Self enrollment
  • BD301EN (FREE course): Learn the basics of IBM BLU Acceleration for Cloud. This course teaches you how to quickly work with this data warehouse offering running on the Cloud. The materials used in this course are free. Register to the Technology Preview to explore this technology.

    Audience: Data Scientists, Business Analysts, Developers, Administrators

    Available in: English

    Self enrollment
  • BD302EN (FREE course): Learn the basics of IBM BLU Acceleration for Cloud that apply to StartUp companies and developers. This course teaches you how to quickly work with this data warehouse offering running on the Cloud. The materials used in this course are free. Register to the Open Beta to explore this technology.

    Audience: Developers, StartUp companies

    Available in: English

    Self enrollment
  • BD303EN (FREE course): Learn the basics of IBM BLU Acceleration for Cloud for Business Analysts. This course teaches you how to quickly work with this data warehouse offering running on the Cloud. The materials used in this course are free. Register to the Open Beta to explore this technology.

    Audience: Business Analysts

    Available in: English

    Self enrollment
  • BD201EN (FREE course): Learn how to perform Data Analysis using R when your data is in a relational database.

    You will learn how to:

    • decide when to store data in a relational database for analysis with R
    • how to connect to relational databases from R
    • access data in DB2 with RJDBC
    • access data in DB2 and BLU Accleration for Cloud from R
    • reuse existing database assets like stored procedures

    Audience: Data Analysts, Programmers

    Available in: English

    Learn more!

    Self enrollment
  • BD500EN (FREE course): This course teaches you the basics of stream computing using IBM InfoSphere Streams. Stream computing allows you to process and analyze big data in real time. This type of computing can be applied in many industries, from health care to manufacturing, finance and more.

    Audience: Stream computing beginners.

    Available in: English

    Self enrollment
  • BD110EN (FREE course): This course teaches non-technical users how to take advantage of Big Data technologies without having to learn how to write a program to run Hadoop, JAQL, and so on. It uses BigSheets, a plug-in that can be run on top of Hadoop, and is designed for the business user who is familiar with spreadsheet tools like MS Excel.

    Audience: Business users who want to perform analytics.

    Available in: English

    Self enrollment
  • MI710EN (Free course): Java Fundamentals is a free course provided by SciSpike (www.scispike.com), a partner company of Big Data University. In this course you will learn the basics of the Java Programming Language. Having some knowledge of Java is very helpful in many areas in IT. For example, for Big Data related technologies, you can write MapReduce jobs in the Hadoop framework using Java.

    Audience: Java Beginners.

    Available in: English

    Self enrollment
  • BD200EN (FREE course): This course teaches you how to work with query and scripting languages such as Hive, Pig, and Jaql. These query and scripting languages simplify the development of map-reduce programs in Hadoop for developers with no Java expertise. Emphasis will be on Jaql.

    Audience: Hadoop Beginners.

    Available in: English

    Self enrollment
  • ZooKeeper is a coordination service that provides sets of tools to help manage distributed applications. Building distributed applications comes with challenges that are intrinsic to distributed applications itself, which includes maintaining configuration information, groups, naming, and synchronization. ZooKeeper allows developers to handle these challenges to create robust distributed applications. ZooKeeper comes with a set of guarantees: sequential consistency, atomicity, single system image, reliability, and timeliness.  This course will help you learn how to use ZooKeeper to keep your Big Data applications running smoothly despite the challenges of operating in a complex distributed environment.

    Self enrollment
  • BD030EN (FREE course): Accessing Hadoop Data Using Hive

    Writing map/reduce programs to analyze your Big Data can get complex. Hive can help make querying your data much easier. Apache Hive, first created at Facebook, is a data warehouse system for Hadoop that facilitates easy data summarization, ad-hoc queries, and the analysis of large datasets stored in Hadoop compatible file systems. Hive provides a mechanism to project structure onto this data and query the data using a SQL-like language called HiveQL.

    After completing this course, you should be able to:

    • Understand what Apache Hive is, the Hive architecture, and Hive use cases.
    • Make basic configuration changes in a Hive installation.
    • Use DDL to create new Hive databases and tables with a variety of different data types.
    • Create partitioned tables that are optimized for hadoop.
    • Create and run a variety of useful DML queries against Hive.
    • Use built in Hive operators and functions to get work done.
    • Create your own user defined functions in Hive.
    • Use a variety of different file formats and records formats with Hive.

     

    Self enrollment
  • BDO001EN (FREE course): This course includes materials used for an onsite event held in Foster City, California (March 13, 2014). It teaches you the basics of Hadoop technology.

    The materials cover Hadoop, Hadoop Architecture and HDFS, MapReduce, Hive for Warehousing and SQL for Hadoop. Join hands on labs for each of these subjects to get jump started with Hadoop and more; with guidance from the development experts. This is a unique opportunity to develop skills and learn about exciting technologies and techniques that you can immediately take back to your organization.

    Self enrollment
  • BD060EN: Introduction to Pig

    This course begins with an overview of Pig. It explains the data structures supported by Pig and how to access data using the LOAD operator. The next lesson covers the Pig relational operators. This is followed by the Pig evaluation functions, as well as math and string functions.

    Self enrollment
  • ZooKeeper is a coordination service that provides sets of tools to help manage distributed applications. Building distributed applications comes with challenges that are intrinsic to distributed applications itself, which includes maintaining configuration information, groups, naming, and synchronization. ZooKeeper allows developers to handle these challenges to create robust distributed applications. ZooKeeper comes with a set of guarantees: sequential consistency, atomicity, single system image, reliability, and timeliness.  This course will help you learn how to use ZooKeeper to keep your Big Data applications running smoothly despite the challenges of operating in a complex distributed environment.

    Self enrollment
  • BD501EN (FREE course)

    IBM will host a live Developers Conference Webcast on Thursday November 14, 2013 from 10:00AM to 6:00PM U.S. Eastern Standard Time. The Webcast will be recorded an posted here after the conference.

    Audience: Streams Beginners

    Available in: English

    Self enrollment
  • BD777SP (Fee/Precio: 65$-49€):

    Es un curso de iniciación con una orientación funcional, que utiliza el método del estudio del caso. Una vez superado el programa el alumno estará en posesión de una visión global del concepto “Big  Data”, susprincipales herramientas y aplicaciones. Asimismo sabrá el valor puede aportar a una organización el uso del big data y cuáles son las principales necesidades para implementarlo.

    Los casos de estudio deberán ser analizados siguiendo unas directrices marcadas por el profesor. Cada alumno enviará su análisis de manera individual y le será enviado a su vez su correspondiente feedback.

    Al final del curso será necesario realizar un trabajo en forma de redacción sobre un tema que será elegido entre el alumno y el profesor.

    La comunicación entre el alumno y el profesor se realizará principalmente vía email y adicionalmente mediante conferencias por Skype.

    Precio del curso: US$ 65 ó 49€

    A quién va dirigido: todas aquellas personas hispanohablantes con un perfil tecnológico que quieran adentrarse en el mundo del Big Data o bien las personas con un perfil de negocio que quieran entender globalmente el concepto de Big Data y conocer los beneficios y transformaciones en la empresa que puede aportar su aplicación.

    Available in Spanish only

    Self enrollment
  • BD070V212EN (FREE course): Big Data Fundamentals

    This course presents a holistic approach to Big Data, taking both a top-down and a bottom-up approach to questions such as: What is Big Data? How do we tackle Big Data? Why are we interested in it? What is a Big Data platform?

    The course emphasizes that we study Big Data to gain insight that will be used to get  people throughout the enterprise to run the business better and to provide better service to customers. Rather than a implementation of a single open-source systems such as Hadoop, the course recommends that Big Data should be processed in a platform that can handle the variety, velocity, and volume of data by using a family of components that require integration and data governance.  Big Data is NoHadoop (“not only Hadoop”) as well as NoSQL (“not only SQL”).

    Self enrollment
  • BD001V212EN - Hadoop Fundamentals I - Version 3

    Self enrollment
  • BD030EN (FREE course): Accessing Hadoop Data Using Hive

    Writing map/reduce programs to analyze your Big Data can get complex. Hive can help make querying your data much easier. Apache Hive, first created at Facebook, is a data warehouse system for Hadoop that facilitates easy data summarization, ad-hoc queries, and the analysis of large datasets stored in Hadoop compatible file systems. Hive provides a mechanism to project structure onto this data and query the data using a SQL-like language called HiveQL.

    After completing this course, you should be able to:

    • Understand what Apache Hive is, the Hive architecture, and Hive use cases.
    • Make basic configuration changes in a Hive installation.
    • Use DDL to create new Hive databases and tables with a variety of different data types.
    • Create partitioned tables that are optimized for hadoop.
    • Create and run a variety of useful DML queries against Hive.
    • Use built in Hive operators and functions to get work done.
    • Create your own user defined functions in Hive.
    • Use a variety of different file formats and records formats with Hive.

     

    Self enrollment
  • BD801EN (FREE course): Learn how to tackle data analysis problems using the powerful open source language R. The course will take you from learning the basics of R to using it to explore many different types of data.

    You will learn how to:

    • prepare data for analysis
    • compute various statistical measures
    • create meaningful data visualizations
    • create reusable R functions
    • create R models to predict expected future outcomes.

    Audience: Data Analysts, Programmers

    Available in: English

    Learn more!

    Self enrollment
  • BD700EN (FREE course) - Brought to you by Jaspersoft (jaspersoft.com)

    "Success in the Big Data era is about more than size. It's about getting insight from these huge data sets more quickly." - Doug Henschen, Executive Editor, InformationWeek

    This course teaches you importance of Hadoop Reporting and Analysis and provides instructions to build your own Hadoop/Big Data reports over relevant Hadoop technologies such as HBase, Hive, etc. It provides guidelines to choose between various reporting techniques: Direct Batch Reports, Live Exploration, and Indirect Batch Analysis.

    Self enrollment
  • BD506EN (FREE course): Learn the basics of IBM InfoSphere Streams, Cloud Computing and what the IBM SmartCloud Enterprise has to offer. Practice building your Streams cluster on the IBM SmartCloud Enterprise.

    Audience: Streams and IBM SmartCloud Enterprise Beginners.

    Available in: English

    Self enrollment
  • BD100EN (FREE course): This course shows you big data analytics at work. It provides real-life scenarios or demonstrations than can help you understand the value of big data analytics. It includes a collection of videos from several IBM solutions that support the IBM Smarter Planet agenda.


    Self enrollment
  • BD006EN (FREE course): Learn the basics of Hadoop, Cloud Computing and what the IBM SmartCloud Enterprise has to offer. Practice building your Hadoop cluster on the IBM SmartCloud Enterprise.

    Audience: Hadoop and IBM SmartCloud Enterprise Beginners.

    Available in: English

    Self enrollment
  • BDO001EN (FREE course): This course includes materials used for an onsite event held in Foster City, California (March 13, 2014). It teaches you the basics of Hadoop technology.

    The materials cover Hadoop, Hadoop Architecture and HDFS, MapReduce, Hive for Warehousing and SQL for Hadoop. Join hands on labs for each of these subjects to get jump started with Hadoop and more; with guidance from the development experts. This is a unique opportunity to develop skills and learn about exciting technologies and techniques that you can immediately take back to your organization.

    Self enrollment
  • BD005EN (FREE course): Learn the basics of Hadoop, Cloud Computing and what Amazon Web Services (AWS) has to offer. You will learn how virtual machines are created on the cloud using AWS EC2 (Elastic Compute Cloud) and  understand various storage options. You will also learn how to get your data in and out of the cloud, and about RightScale, a  cloud management platform. Finally, you will be learn about building your Hadoop cluster on AWS.

    Audience: Hadoop and Amazon Cloud Beginners.

    Available in: English

    Self enrollment
  • BD104EN (FREE course): The analysis of emails, blogs, tweets, forums and other forms of unstructured text data constitutes what we call text analytics.  Text analytics is applicable to most industries; for example, if your company is suspicious about company secrets being leaked to competitors by employees, text analytics can help analyze millions of employees’ emails.  If you would like to find common pain points your customers face when using your products, you can analyze their comments and questions in forums. If you would like to measure positive or negative perceptions of a company, brand, or product, you can perform sentiment analysis using text analytics. This course teaches you the basics of text analytics. It includes hands-on exercises!

    Audience: Text Analytics Beginners.

    Available in: English

    Self enrollment
  • BD105EN (FREE course): Learn the basics of Text Analytics!  This course will teach you how to retrieve relevant text from structured, semi-structured or unstructured documents based on criteria you define using a complete case study.  Apply these same criteria to big data by running them on top of a Hadoop cluster! All materials and software used are FREE!

    Self enrollment
  • BD777SP (Fee/Precio: 65$-49€):

    Es un curso de iniciación con una orientación funcional, que utiliza el método del estudio del caso. Una vez superado el programa el alumno estará en posesión de una visión global del concepto “Big  Data”, susprincipales herramientas y aplicaciones. Asimismo sabrá el valor puede aportar a una organización el uso del big data y cuáles son las principales necesidades para implementarlo.

    Los casos de estudio deberán ser analizados siguiendo unas directrices marcadas por el profesor. Cada alumno enviará su análisis de manera individual y le será enviado a su vez su correspondiente feedback.

    Al final del curso será necesario realizar un trabajo en forma de redacción sobre un tema que será elegido entre el alumno y el profesor.

    La comunicación entre el alumno y el profesor se realizará principalmente vía email y adicionalmente mediante conferencias por Skype.

    Precio del curso: US$ 65 ó 49€

    A quién va dirigido: todas aquellas personas hispanohablantes con un perfil tecnológico que quieran adentrarse en el mundo del Big Data o bien las personas con un perfil de negocio que quieran entender globalmente el concepto de Big Data y conocer los beneficios y transformaciones en la empresa que puede aportar su aplicación.

    Available in Spanish only

    Self enrollment
  • BD102EN (FREE course): This course is restricted to those who have participated in the IBM Big Data Developer Day event.  It contains the presentation materials for download and a forum you can use to ask questions about the labs.

    This course requires an enrollment key which is provided at the Developer Day event.

    Audience: Participants of the IBM Big Data Developer Day event.

    Available in: English

    Self enrollment
  • BD502EN (FREE course): This course is restricted to those who have participated in the Big Data Meetup.  It contains the presentation materials for download and a forum you can use to ask questions about the labs.

    This course requires an enrollment key which is provided at the Meetup event.

    Audience: Participants of the Big Data Meetup event.

    Available in: English

    Self enrollment
  • BD010EN Introduction to Jaql

    Jaql is primarily a query language for JavaScript Object Notation (JSON), but it supports more than just JSON. It allows you to process both structured and nontraditional data and was donated by IBM to the open source community.

    This course begins with an overview of Jaql. It describes the JSON data format and does a simple comparison between Jaql and Pig and Hive. The second lesson starts to build a foundation for the Jaql language. After setting a foundation, next the Jaql core operators are introduced.  Then there is a simple review of MapReduce and how this applies to Jaql.  This is followed by showing how SQL can be used as Jaql operators.  And finally, the input/output capabilities of Jaql are covered.

    Self enrollment
  • BD020EN (FREE course): Moving Data into Hadoop

    This course describes techniques for moving data into Hadoop. There are a variety of ways to get data into Hadoop from simple Hadoop shell commands to more sophisticated processes. Several techniques are presented but two, Sqoop and Flume, are covered in greater detail.

    Self enrollment
  • BD015EN (FREE course): Learn about MapReduce Programming!

    This course begins with an overview of MapReduce. It explains the use of the mapper and reducer classes that make up a MapReduce application and where then get invoked in the application process. Next comes the actual coding of a MapReduce application. The student is walked through the development of a simple MapReduce application, as one would do that using a development environment similar to Eclipse.  After understanding what is required to code a MapReduce application, the student then sees how much more quickly a MapReduce application can be created using the MapReduce development wizard that is part of the IBM BigInsights development environment.

    After completing this course, you should be able to:

    • Describe the term map in regard to Hadoop
    • Explain the term reduce in regard to Hadoop
    • Describe how the JobTracker and TaskTrackers work with MapReduce
    • Explain the fault tolerance capability of MapReduce
    • Describe the basic Java code required to code
      • The mapper class
      • The reducer class
      • The driver
    • Code a simple MapReduce application, taking advantage of the BigInsights development environment
    Self enrollment
  • BD050EN (FREE course): Learn how to Control Hadoop Jobs with Oozie.

    This course gives an overview of Oozie and how it is able to control Hadoop jobs. It begins with looking at the components required to code a workflow as well as optional components such as case statements, forks, and joins. That is followed by using the Oozie coordinator in order to schedule a workflow.

    One of the things that the student will quickly notice is that workflows are coded using XML which tends to get verbose. IBM BigInsights has a graphical workflow editor designed to simplify the work in generating a workflow. This course also shows how to invoke the IBM BigInsights editor via a wizard during the publication of an application.

    Self enrollment
  • BD111EN (FREE course):  This course teaches you how to take advantage of the SQL language to access big data stored in HDFS or HBase using SQL.

    The course presents the different alternatives for SQL access, such as Hive, Impala, and Big SQL.  It explains the similarities and differences between these three technologies.

    The course includes hands on exercises and access to a Hadoop cluster with Hive, HBase, HDFS and Big SQL, so you can try these technologies first hand.

    At the end of the course you will understand the different alternatives for accessing Big Data with SQL, and you will gain hands-on experience with these technologies.

    Audience: Beginners accessing Hadoop using SQL

    Available in: English

    Self enrollment
  • BD105JP (FREE course): ビッグデータ時代に必要な情報統合、情報ガバナンスとは何か? どうしたら信頼できるデータを維持し、活用できるかについて解説します。

    Audience: Developers, DB Administrators, IT Architect, IT Strategist

    Available in: Japanese

    Self enrollment
  • BD104JP (FREE course):RやSPSSによる複雑な分析もサクサク動く、ストレス・フリーな分析環境である PureData for Analyticsのテクノロジーとその効果を解説します。

    Audience: Data Scientist, Analyst, IT Strategist

    Available in: Japanese

    Self enrollment
  • BD103JP (FREE course): このコースでは、ストリーム・コンピューティングによる、新しいリアルタイム・データ処理/分析の世界を紹介します。ストリーム・コンピューティングを実現する IBM InfoSphere Streams の機能と特徴についても解説します。

    Audience: 開発者

    Available in: Japanese

    Self enrollment
  • BD102JP (FREE course): Apache Hadoopをより使いやすく、より高速化した IBM版 HadoopであるInfoShere BigInsightsと、そのアプライアンス版である IBM PureData for Hadoopについて、分かり易く解説します。
    ビッグデータを扱う次世代のエンタープライズ環境におけるHadoopの役割についても理解できます。

    Audience: Developers, StartUp companies

    Available in: Japanese

    Self enrollment
  • BD101JP (FREE course): ビッグデータ活用の5つのシナリオを解説すると共に、すべてのデータと分析を活用できる新アーキテクチャーである、IBM Watson Foundations を紹介します。

    Audience: Developers, IT Architect

    Available in: Japanese

    Self enrollment
  • UCSC - CMPS290H (FREE but restricted course):  This course is offered by UC Santa Cruz as part of the Large scale data integration - Text Analytics course.   This course will teach you the basics of Text Analytics:  how to retrieve relevant text from structured, semi-structured or unstructured documents based on criteria you define. It uses a complete case study.  Apply these same criteria to big data by running them on top of a Hadoop cluster! All materials and software used are FREE!

     

    Self enrollment
  • DB001EN (FREE course): Learn the basics of the relational database model and the SQL language using DB2 Express-C, the free version of IBM DB2 database servers. You will learn how to create, read, update and delete data using SQL. This course is part 1 of 2 courses.

    Audience: Database Beginners.

    Available in: English

    Self enrollment
  • DB101EN (FREE course): Learn the basics of DB2 using DB2 Express-C, the free version of IBM DB2 database servers. You will learn how to download, install, and use DB2 Express-C including its tools. This course is part 1 of 3 courses.

    Audience: DB2 Beginners.

    Available in: English | Spanish | Portuguese | Russian

    Learn more!

    Self enrollment
  • DB102EN (FREE course): Continue learning the basics of DB2 using DB2 Express-C, the free version of IBM DB2 database servers. This is part 2 of 3 courses. You will learn about the DB2 architecture (memory, process, and storage models), connecting to a DB2 server, working with database objects, and more!

    Audience: DB2 Beginners.

    Available in: English

    Learn more!

    Self enrollment
  • DB101PL (FREE course): Learn the basics of DB2 using DB2 Express-C, the free version of IBM DB2 database servers. You will learn how to download, install, and use DB2 Express-C including its tools. This course is part 1 of 3 courses.

    Audience: DB2 Beginners.

    Available in: English | Spanish | Portuguese | Russian

    Learn more!

    Self enrollment
  • DB101PT (Curso GRATIS): Aprenda os princípios do DB2 usando o DB2 Express-C, a versão gratuita de servidores de banco de dados IBM DB2. Você vai aprender como fazer o download, instalar e utilizar o DB2 Express-C, incluindo suas ferramentas. Este curso é parte 1 de 3 cursos.

    Audiência: DB2 iniciantes

    Disponível em: English | Spanish | Portuguese | Russian

    Mais detalhes!

    Self enrollment
  • DB101RU (Бесплатный курс): Основные сведения о DB2 с использованием DB2 Express-C, бесплатную версию DB2 IBM серверов баз данных. Вы узнаете, как загрузить, установить и использовать DB2 Express-C в том числе ее инструментов. Этот курс является частью 1 из 3 курсов.

    Аудитории: DB2 начинающих

    Имеющиеся в: English | Spanish | Portuguese | Russian

    Узнать больше!

    Self enrollment
  • DB101ES (Curso GRATUITO): Aprenda los conceptos básicos de DB2 utilizando DB2 Express-C, la versión gratuita de servidores de base de datos IBM DB2. Usted aprenderá cómo descargar, instalar y utilizar DB2 Express-C, incluyendo sus herramientas. Este curso es parte 1 de 3 cursos.

    Audiencia: Principiantes en DB2   

    Disponible en: English | Spanish | Portuguese | Russian

    Más detalles!

    Self enrollment
  • AA001EN (Free course): This course prepares you to take the IBM DB2 Academic Associate exam.  Those passing this exam earn the title IBM DB2 Academic Associates: DB2 Database and Application Fundamentals

    Content for this course is similar to the one offered as an onsite classroom course at selected universities.

    Audience: DB2 and database beginners.

    Available in: English

    Learn more!

    Self enrollment
  • This course has been set up temporarily to offer the DB2 Academic Training - Pre-qualification Test.  This course requires an enrollment key to be provided onsite the day of the test.

    Self enrollment
  • MI001EN (FREE course):  Learn how to create and host a course in Big Data University!

    This is a free, open course for anyone interested in developing and posting his course on our site. The course explains how to create the course, including the creation of videos, transcripts, and more!

    All topics are welcomed, not just Big Data or DB2-related topics!

    Self enrollment
  • BD001EN (FREE course): Learn the basics of Hadoop. You will learn about the Hadoop architecture, HDFS, MapReduce, Pig, Hive, JAQL, Flume, and many other related Hadoop technologies.  Practice with hands-on labs on a Hadoop cluster using any of these methods: On the Cloud, with the supplied VMWare image, or install locally. This course is part 1 of 2 courses.

    Audience: Hadoop Beginners.

    Available in: English

    Self enrollment
  • DB111EN (FREE course): Learn the basics of IBM Data Studio, an eclipse-based free tool for administration and development of databases. You will learn how to download, install, and use IBM Data Studio with DB2 Express-C.

    Audience: IBM Data Studio Beginners.

    Available in: English

    Self enrollment
  • BD035EN (FREE course): Using HBase for Real-time Access to Your Big Data

    HBase is the open source Hadoop database used for random, real-time read/writes to your Big Data. HBase runs on a distributed architecture on top of commodity hardware. HBase has the following features:

    • Linear and modular scalability
    • Strictly consistent read and writes
    • Automatic and configurable sharding of tables
    • Automatic failover support between RegionServers
    • Easy to use Java API for client access
    • And more…

    This course will get you started so that you can use HBase to access your Big Data in real-time.

    After completing this course, you should be able to:

    • Explain the HBase system and architecture
    • Use the Client API to perform data operations on HBase
    • Describe the various HBase client used to communicate with HBase
    • Integrate HBase with a MapReduce job
    • Configure HBase for a pseudo-distributed environment
    • Backup and restore data from your HBase
    • Run performance tuning techniques (optional)
    Self enrollment
  • BD001V2EN (FREE course): Learn the basics of Hadoop. You will learn about the Hadoop architecture, HDFS, MapReduce, Pig, Hive, JAQL, Flume, and many other related Hadoop technologies.  Practice with hands-on labs on a Hadoop cluster using any of these methods: On the Cloud, with the supplied VMWare image, or install locally. This is the second version of this course.

    Audience: Hadoop Beginners.

    Available in: English

    Self enrollment
  • BD302EN (FREE course): Learn the basics of IBM BLU Acceleration for Cloud that apply to StartUp companies and developers. This course teaches you how to quickly work with this data warehouse offering running on the Cloud. The materials used in this course are free. Register to the Open Beta to explore this technology.

    Audience: Developers, StartUp companies

    Available in: English

    Self enrollment
  • BD302EN (FREE course): Learn the basics of IBM BLU Acceleration for Cloud that apply to StartUp companies and developers. This course teaches you how to quickly work with this data warehouse offering running on the Cloud. The materials used in this course are free. Register to the Open Beta to explore this technology.

    Audience: Developers, StartUp companies

    Available in: English

    Self enrollment
  • BD035EN (FREE course): Using HBase for Real-time Access to Your Big Data

    HBase is the open source Hadoop database used for random, real-time read/writes to your Big Data. HBase runs on a distributed architecture on top of commodity hardware. HBase has the following features:

    • Linear and modular scalability
    • Strictly consistent read and writes
    • Automatic and configurable sharding of tables
    • Automatic failover support between RegionServers
    • Easy to use Java API for client access
    • And more…

    This course will get you started so that you can use HBase to access your Big Data in real-time.

    After completing this course, you should be able to:

    • Explain the HBase system and architecture
    • Use the Client API to perform data operations on HBase
    • Describe the various HBase client used to communicate with HBase
    • Integrate HBase with a MapReduce job
    • Configure HBase for a pseudo-distributed environment
    • Backup and restore data from your HBase
    • Run performance tuning techniques (optional)
    Self enrollment
  • MI000EN (TEMPLATE for course MI001EN)

    <Enter text as required.  Example below:>

    MI001EN (FREE course): Learn how to create and host a course in BigData or DB2 University!

    Send a e-mail to administrator@db2university.com with the title of the course, table of contents, and let us know if you will be offering it for free or for a cost. If it looks interesting, we will approve it and set you up to create your course!

    This course will teach you how to create the course, including the creation of videos, transcripts, live sessions, and more!  The course requires anenrollment key that will be provided when the course is approve for development.

    All topics are welcomed, not just Big Data or DB2-related!

    Self enrollment
  • MI000EN (TEMPLATE for course MI001EN)

    <Enter text as required.  Example below:>

    MI001EN (FREE course): Learn how to create and host a course in BigData or DB2 University!

    Send a e-mail to administrator@db2university.com with the title of the course, table of contents, and let us know if you will be offering it for free or for a cost. If it looks interesting, we will approve it and set you up to create your course!

    This course will teach you how to create the course, including the creation of videos, transcripts, live sessions, and more!  The course requires anenrollment key that will be provided when the course is approve for development.

    All topics are welcomed, not just Big Data or DB2-related!

    Self enrollment