Professional Diploma in Big Data & Data Analytics

Eligibility: BE, B.Tech, ME, M.Tech, MCA, BCA, MSc, BSc
Duration: 3 Months

From advertising to healthcare, almost every industry is now adopting data science to gain an edge over its competitors. Over the past couple of years, data science has reportedly grown six times faster than the IT industry average. According to market experts, it will sustain this momentum and continue to outpace other IT sectors by a significant margin in the years to come. If you are looking to build a career in one of the fastest-growing IT sectors, there is no better choice than data science.

Using massive datasets to guide decisions is becoming more and more important for modern businesses. Hadoop and MapReduce are fundamental tools for working with big data. By knowing how to deploy your own Hadoop clusters, you’ll be able to start exploring big data on your own.
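To give a flavour of the MapReduce idea mentioned above, here is a minimal sketch in plain Python that counts words in three steps: map, shuffle and reduce. It runs on a single machine and does not use Hadoop itself; the function names (`map_phase`, `shuffle_phase`, `reduce_phase`, `word_count`) and the sample sentences are illustrative assumptions, not part of the course material or of any Hadoop API.

```python
from collections import defaultdict


def map_phase(lines):
    """Map: emit a (word, 1) pair for every word in every input line."""
    for line in lines:
        for word in line.lower().split():
            yield word, 1


def shuffle_phase(pairs):
    """Shuffle: group all emitted values under their key."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(groups):
    """Reduce: sum the grouped values for each key."""
    return {key: sum(values) for key, values in groups.items()}


def word_count(lines):
    """Chain the three phases, as a MapReduce job would."""
    return reduce_phase(shuffle_phase(map_phase(lines)))


if __name__ == "__main__":
    # Hypothetical sample input, for illustration only.
    sample = ["big data needs big tools", "data guides decisions"]
    print(word_count(sample))
```

On a real cluster the map and reduce steps would run in parallel across many machines, with the framework handling the shuffle between them; the sketch only shows how the work is divided.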

Modules

  • Programming in C and Advanced C
  • Database Concepts
  • Programming in Python
  • Data Analytics using Python
  • Cloud Computing
  • Big Data and Hadoop

Programming in C and Advanced C - 18 days

  • Introduction to C
  • Preprocessor Directives
  • Pointers
  • File I/O: Sequential and Random Access
  • Formatted I/O
  • Decision control statements & Loops
  • Storage Classes (Internal Linkage & External Linkage)
  • Dynamic Memory
  • Function Pointers
  • Command Line Arguments
  • Modular programming using functions
  • Arrays and Strings
  • User Defined Data Types
  • Variable number of arguments

Database: SQL Fundamentals - 5 days

  • Introduction to Database Concepts
  • Using DDL Statements to Create and Manage Tables
  • Reporting Aggregated data using Group Functions
  • Displaying data from Multiple tables using Joins
  • Retrieving Data with the SQL SELECT Statement
  • Using Single-Row functions to Customize Output
  • Using Subqueries to Solve Queries
  • Manipulating Data: Insert, Update, Delete
  • Restricting and Sorting Data
  • Set Operators

Python Programming - 10 days

  • Python Introduction
  • Python Lists
  • Exception Handling
  • Flow Control
  • Set, Tuple and Dictionary
  • File Handling
  • Functions
  • List Comprehensions
  • Object Oriented Programming in Python

Data Analysis and Visualization in Python - 6 days

  • Mathematical Computing with Python (NumPy)
  • Importing Data in Python
  • Data Manipulation with Pandas
  • Data Ingestion & Inspection
  • Exploratory Data Analysis
  • Data Visualization using Matplotlib

Cloud Computing - 7 days

  • Fundamentals of Cloud
  • Virtualization concepts
  • Cloud Platforms in Industry
  • Overview of Amazon Storage Services
  • Cloud Service Models: IaaS, PaaS, SaaS
  • Limitations and Challenges of Cloud Environments
  • Cloud Applications
  • Cloud Deployment Models
  • Attacks on Public Clouds
  • Case Study: AWS

Big Data & Hadoop - 15 days

  • Introduction to Big Data & Hadoop
  • Hadoop Installation and Configuration
  • Hadoop Streaming
  • Pig Latin statements and programming
  • Introduction to the Hive Data Warehouse
  • DDL & DML with HiveQL
  • Overview of HBase
  • How Hadoop Works
  • HDFS
  • Programming with RHadoop
  • Pig native functions
  • Hive Architecture
  • Hive functions
  • HBase Architecture and Commands
  • Hadoop Architecture
  • MapReduce
  • Pig commands and control structures
  • HiveQL
  • UDFs
  • CRUD operations