Big Data 2 Days Bootcamp in Alexandria, VA
Offers: Group of 5 - 10 people 10% Discount, Group of 11 - 20 people 15% Discount
Select date and time
Location
For venue details reach us at info@academyforpros.com, PH: +1 469 666 9332
Alexandria Alexandria, VARefund Policy
Agenda
Day 1
Introduction to Big Data
Introduction to Hadoop
Hadoop Distributed File System (HDFS)
MapReduce
YARN
Day 2
Pig
Hive
Sqoop
Oozie
About this event
Certificate: Course Completion Certificate
Language: English
Duration: 2 Days
Credits: 16
Refreshments: Snacks, Beverages and Lunch included in a classroom session
Course Delivery: Classroom
Course Description:
This big data training course will provide a technical overview of Apache Hadoop for project managers, business managers and data analysts. Students will understand the overall big data space, technologies involved and will get a detailed overview of Apache Hadoop. The course will expose students to real world use cases to comprehend the capabilities of Apache Hadoop. Students will also learn about YARN and HDFS and how to develop applications and analyze Big Data stored in Apache Hadoop using Apache Pig and Apache Hive. Each topic will provide hands-on experience to the students.
Course Outline:
Introduction to Big Data
- Big Data - beyond the obvious trends
- Exponentially increasing data
- Big data sources
- Data warehousing, business intelligence, analytics, predictive statistics, data science
Survey of Big Data technologies
- First generation systems
- Second generation systems
- Enterprise search
- Visualizing and understanding data with processing
- NOSQL databases
- Apache Hadoop
Introduction to Hadoop
- What is Hadoop? Who are the major vendors?
- A dive into the Hadoop Ecosystem
- Benefits of using Hadoop
- How to use Hadoop within your infrastructure?
Introduction to MapReduce
- What is MapReduce?
- Why do you need MapReduce?
- Using Mapreduce with Java and Ruby
Introduction to Yarn
- What is Yarn?
- What are the advantages of using Yarn over classical MapReduce?
- Using Yarn with Java and Ruby
Introduction to HDFS
- What is HDFS?
- Why do you need a distributed file system?
- How is a distributed file system different from a traditional file system?
- What is unique about HDFS when compared to other file systems?
- HDFS and reliability?
- Does it offer support for compressions, checksums and data integrity?
Data Transformation
- Why do you need to transform data?
- What is Pig?
- Use cases for Pig
Structured Data Analysis?
- How do you handle structured data with Hadoop?
- What is Hive/HCatalog?
- Use cases for Hive/HCatalog
Loading data into Hadoop
- How do you move your existing data into Hadoop?
- What is Sqoop?
Automating workflows in Hadoop
- Benefits of Automation
- What is oozie?
- Automatically running workflows
- Setting up workflow triggers
Exploring opportunities in your own organization
- Framing scenarios
- Understanding how to ask questions
- Tying possibilities to your own business drivers
- Common opportunities
- Real world examples
Hands-on Exercises
- How to use MapReduce in Hadoop?
- How to use Yarn within Hadoop?
- Overview of HDFS commands
- Hands-on activities with Pig
- Hands-on activities with Hive/HCatalog
- Hands-on activities with Sqoop
- Demonstration of Oozie
Learning Objectives:
- Learn about the big data ecosystem
- Understand the benefits and ROI you can get from your existing data
- Learn about Hadoop and how it is transforming the workspace
- Learn about MapReduce and Hadoop Distributed File system
- Learn about using Hadoop to identify new business opportunities
- Learn about using Hadoop to improve data management processes
- Learn about using Hadoop to clarify results
- Learn about using Hadoop to expand your data sources
- Learn about scaling your current workflow to handle more users and lower your overall performance cost
- Learn about the various technologies that comprise the Hadoop ecosystem
- Learn how to write a simple mapreduce job from Java or your favorite programming language
- Learn how to use a very simple scripting language to transform your data
- Learn how to use a SQL like declarative language to analyze large quantities of data
- Learn how to connect your existing data warehouse to the Hadoop ecosystem
- Learn how to move your data to the Hadoop ecosystem
- Learn how to move the results of your data analysis to Business Intelligence Tools like Tableaux
- Learn how to automate your workflow using oozie
- Learn about polyglot persistence and identifying the right tool for the right job
- Learn about future trends in Big data and technologies to keep an eye on
- Discover tips and tricks behind successful Hadoop deployments
Target Audience:
Anybody who is involved with databases, data analysis, wondering how to deal with the mountains of data (anywhere gigabytes of user/log data etc to petabytes will benefit from this program). This course is perfect for:
- Business Analysts
- Software Engineers
- Project Managers
- Data Analysts
- Business Customers
- Team Leaders
- System Analysts
No prior knowledge of big data and/or Hadoop is required for this class. Some prior programming experience is a plus for this class, but not necessary.
We also offer a variety of other courses:-
Big Data Overview 1-Day Training
This is an awareness course designed to provide you with an understanding of Big Data, the potential sources of Big Data that can be used for solving real business problems. The course also introduces Big Data technologies, such as Hadoop and MongoDB, and provides the overview of data mining and the tools used in it.
Big Data Strategy 1-Day Training
Developing and implementing a Big Data strategy is vital if you want to stay in business in the coming years. Big Data offers so many benefits to organizations and research indicated that companies leveraging Big Data financially outperform their peers by 20% or more. So, if you do not want to be left behind, you should focus on Big Data now.
But what is Big Data? How should you develop a Big Data strategy? What can Big Data do for your organization and how should you deal with the privacy aspect of Big Data? Important questions to ask that can be difficult to answer without sufficient knowledge on Big Data.
This unique Big Data strategy training focuses on Big Data from a business perspective and will provide you with all the knowledge and valuable insights to develop a successful and winning Big Data strategy. This is the only training available that focuses on Big Data from a strategic point of view.
Microsoft Power BI 2-Days Training
Power BI is the newest Microsoft Business Intelligence and Data Analysis tool. In this module we will go through basics of this product, and introduce all five components of Power BI (Power Query, Power Pivot, Power View, Power Map, and Power Q&A).
You will see some demos and introduction about Power BI desktop, Office 365 Power BI subscription, and Power BI website, and mobile apps. You will see some basic demos of how easy to use is Power BI in some scenarios.
Mastering in DAX and Data Modelling 1-Day Training
Learn Microsoft Power BI with our comprehensive training courses. Our courses range from the basics of Power BI to complex data modelling using DAX and creating live dashboards for reporting. These courses are for anyone interested in business intelligence, reporting and complex data modelling.
Microsoft Power BI for Business Users 1-Day Training
Learn Microsoft Power BI with our comprehensive training courses. Our courses range from the basics of Power BI to complex data modelling using DAX and creating live dashboards for reporting. These courses are for anyone interested in business intelligence, reporting and complex data modelling.
Microsoft Power BI for Report Developers 1-Day Training
Learn Microsoft Power BI with our comprehensive training courses. Our courses range from the basics of Power BI to complex data modelling using DAX and creating live dashboards for reporting. These courses are for anyone interested in business intelligence, reporting and complex data modelling.
CCC-Big Data Foundation 2-Days Training
The Big Data foundation course provides you with an understanding of Big Data, potential data sources that can be used for solving real business problems, and an overview of data mining and the tools used in it.
This is a fundamental course with practical exercises designed to provide you with hands-on experience in using two of the most popular technologies in Big Data processing – Hadoop and MongoDB. You will get the opportunity to practice installing these two technologies through lab exercises. The exercises expose you to real-life Big Data technologies with the purpose of obtaining results from real datasets from Twitter.
After completing the course, you will be equipped not only with fundamental Big Data knowledge, but will also be introduced to a working development environment containing Hadoop and MongoDB, installed by yourself. This practical knowledge can be used as a starting point in the organizational Big Data journey.
Microsoft Power BI Comprehensive 2-Days Training
Learn Microsoft Power BI with our comprehensive training courses. Our courses range from the basics of Power BI to complex data modelling using DAX and creating live dashboards for reporting. These courses are for anyone interested in business intelligence, reporting and complex data modelling.
Power BI Dashboard and Data Analysis 2-Days Training
Create stunning interactive reports and share amazing insights! Power BI gives Excel users the power to extract data from multiple sources, link it together, perform calculations and create powerful visualizations.
Power BI is an AMAZING cloud-based business analytics service provided by Microsoft that allows users to transform data into rich, interactive visual reports that present a 360-degree business view and improve business decision making.
Power BI Desktop is a development platform from Microsoft that allows users to connect to a huge range of data sources, clean and transform messy data, create relationships between data sets, perform calculations and prepare stunning interactive reports.
Power BI.com allows users to publish their interactive reports and share certain features of those reports with others. Reports published to PowerBI.com can be set to refresh automatically and can be interacted with online using any browser or mobile device.
Data Analysis 3-Days Bootcamp
The Data Analysis Boot Camp equips candidates with the knowledge, techniques and models to transform data into usable insights for making business decisions. The course simplifies complex concepts, breaks down math jargon and helps navigate complex symbols and equations. These skills enable candidates to zoom in on the most useful data and apply it in the real world. It also provides practical techniques for presenting findings to quickly make decisions that drive organizations forward. These tools include graphic presentation techniques and simplified models to transform the results of data analysis into digestible, easy-to-understand insights and usable recommendations.
Frequently asked questions
We provide Course Materials, Lunch, Beverages and Course Completion Certificate.
You can request a refund by sending an email to info@academyforpros.com and within 7-14 working days you get your money back.
You can reach us at info@academyforpros.com or enroll through our website.
We host the training through both the platform, Online and Classroom. The virtual training option can be chosen by busy professionals.
The duration of the training is 16 hours. The training will run from 9 AM to 5 PM.
Yes, we do provide great discount for the group registration. To enquire, reach us at info@academyforpros.com
Once you complete the training, you will receive a globally recognized Course Completion Certificate.
Yes. You can switch your registration to a different course with a week prior notice.
Our subject matter experts are from relevant industries and are certified.
You will be credited with 16 PDUs on completion of this training.
Organized by
We deliver training solutions to Corporate, Government Agencies, Public sectors, Multinational organizations and Private Individuals. Our Primary focus is to train in a wide range of areas from IT Technical, Personal Development, Human Resources and Management Courses to Project, Program and IT Service Management.
We have most experienced trainers in the Industry. Our Trainers are highly skilled in their subject areas and are uniquely positioned to provide participants with deep industry experience. They are motivated to transfer knowledge through practical support post and pre-training to provide participants with additional support outside the classroom.