[Xxxx] Syllabus - Big Data Analytics - Business Improvement Strategy 090914

Embed Size (px)

DESCRIPTION

zza

Citation preview

Course OverviewBig data analytics is the process of examining large amounts of data of a variety of types to uncover hidden patterns, unknown correlations and other useful information. Pentahos Business Intelligence (BI) platform layered on the Hortonworks Data Platform provides big data visualization and exploration with full data discovery and predictive analytics for improving business strategy.

This 3-day course provides an in-depth understanding of analyzing unstructured Big Data to draw important predictions for businesses using Pentaho BI Suite. After completing this course, students will be able to optimize business decisions and create competitive advantage with Big Data analytics, analyze and visualize website clickstream data, and securing big data infrastructres. Duration: 3 days

Who Should Attend

Business analysts, database professionals, and others involved in forecasting and trends managementPrerequisites Basic understanding of relational database is helpful.

Course Content

Suggested Next Course

Implementing BI Concepts with Pentaho Data Mining - User Profile & Analysis Implementing XBRL for Financial Reporting

Big Data Analytics - Business Improvement Strategy

Introduction to Big Data

/ Defining and describing Big Data and its role in the corporate world

Defining Big Data Why Big Data and why now? Big Data Example - Social Media Posts Big Data Example - Survey Data Big Data Characteristics Big Data Processing Architectures The Lifecycle of Big Data Hadoop Distributed File System (HDFS) MapReduce Zookeeper Coordination Service Apache Pig Platform HBase Distributed Database Hive Data Warehousing Solution Sqoop 2 Technology

Integrating Big Data with Enterprise RDBMS Using Sqoop 2 /

Mastering Sqoop 2 for Data Transfer within Big Data

Integration Strategies Hadoop & RDBMS Integrating Hadoop with RDBMS using Sqoop 2 Sqoop 2 Architecture How Sqoop 2 Works? Importing and Exporting Data using Sqoop 2 Data Import in Hive and HBase with Sqoop 2 Import from Microsoft SQL Server into the Hortonworks Sandbox using Sqoop 2

big data analytics tools, trends and best practices

big data analytics: Introduction to Methods, Tools, and Techniques

What is Analytics? Different Kinds of Analytics The Benefits of Big Data Analytics The Requirements for Being Successful with Big Data Analytics Techniques for Analyzing Big Data A New Approach Tools for Analyzing Big Data

Big Data Analytics with Hortonworks Data Platform /

Simplifying Big Data Analytics with Hortonworks

Run a MapReduce Job on YARN Write a Pig Script to Explore and Transform Data in HDFS Use Pig to Organize and Analyze Big Data Use Hive to Run SQL-like queries to Perform Data Analysis Use HCatalog with Pig and Hive

Modeling and Predictive Data Analysis

What is Predictive Analytics Applying Appropriate Mining Techniques Most Common Predictive Modeling Tasks Predict Website Visitors Evaluating Predictive Models

Mining Unstructured Data

What is Unstructured Data? Types of Unstructured Data Integrating Unstructured Data Images Implementing Text Mining and Social Network Analysis Analyze Machine and Sensor Data Analyze Website Clickstream Data with Hortonworks

Big Data Reporting using Pentaho

Understanding Why Reporting is Important Approaches to Big Data Reporting Big Data Access Technologies for Reporting Business Intelligence and Big Data Architecture Understanding Direct Batch Reporting Understanding Live Exploration of Big Data Understanding Indirect Batch Analysis

Big Data Visualization with Pentaho /

Visualizing Big Data with Pentaho

Generating the Best Visualizations for Your Data The Basics: Charting Large Data Volumes Different Varieties of Data Visualization Velocity Filtering Big Data Data Visualization Made Easy with Autocharting Visualizing Trends using a Pentaho

Securing Big Data Infrastructures

Hadoop File Permission MapReduce ACLs Securing a Cluster through a Gateway Prevent Accidental Access Encryption for Data at Rest Hive Security Using HDFS Snapshots to Protect Important Enterprise Datasets