Hadoop Courses
Introduction
- Big Data
- 3Vs
- Role of Hadoop in Big data
- Hadoop and its ecosystem
- Overview of other Big Data Systems
- Requirements in Hadoop
- UseCases of Hadoop>
HDFS
- Design
- Architecture
- Data Flow
- CLI Commands
- Java API
- Data Flow Archives
- Data Integrity
- WebHDFS
- Compression
Mapreduce
- Theory
- Data Flow (Map-Shuffle-Reduce)
- Programming [Mapper, Reducer, Combiner, Partitioner]
- Writables
- InputFormat
- Outputformat
- Streaming API
Advanced Mapreduce
- Counters
- CustomInputFormat
- Distributed Cache
- Side Data Distribution
- Joins
- Sorting
- ToolRunner
- Debugging
HBase
- NoSQL vs SQL
- CAP Theorem
- Architecture
- Configuration
- Role of Zookeeper
- Java Based APIs
- MapReduce Integration
- Performance Tuning
HIVE
- Architecture
- Tables
- DDL-DML-UDF-UDAF
- Partitioning
- Bucketing
- Hive-Hbase Integration
- Hive Web Interface
- Hive Server
Duration
6 week| 3 months| 6 Months| 1 year stipend based
Pre-requisite
Basic computer knowledge| R & D
Career Options
After completing your full course stipend or Job on behalf of performance during Training period
Project Work
Two project will be covered in the class and then individual projects will be assigned to students. As the project is desktop application and students will be asked to give professional Look, Feel and Functionality to applications.